What Is Gaussian Distribution?

The Gaussian distribution is commonly used to understand probabilities.

••• Comstock Images/Comstock/Getty Images

How to Calculate the Coefficient of Variation

Updated April 24, 2017

By Paul Dohrman

In statistics, the Gaussian, or normal, distribution is used to characterize complex systems with many factors. As described in Stephen Stigler’s The History of Statistics, Abraham De Moivre invented the distribution that bears Karl Fredrick Gauss’s name. Gauss’s contribution lay in his application of the distribution to the least squares approach to minimizing error in fitting data with a line of best fit. He thus made it the most important error distribution in statistics.

Motivation

What is the distribution of a sample of data? What if you don’t know the data’s underlying distribution? Is there any way to test hypotheses about the data without knowing the underlying distribution? Thanks to the Central Limit Theorem, the answer is yes.

Statement of the Theorem

It states that a sample mean from an infinite population is approximately normal, or Gaussian, with mean the same as the underlying population, and variance equal to the population variance divided by the sample size. The approximation improves as the sample size gets large.

The approximation statement is sometimes misstated as a conclusion about convergence to a normal distribution. Since the approximating normal distribution changes as the sample size increases, such a statement is misleading.

The theorem was developed by Pierre Simon Laplace.

Why It's Everywhere

Normal distributions are omnipresent. The reason comes from the Central Limit Theorem. Oftentimes, when a value is measured, it is the sum effect of many independent variables. Therefore, the value being measured itself has a sample-mean quality to it. For example, a distribution of athlete’s performances may have a bell-shape, as a result of differences in diet, training, genetics, coaching and psychology. Even men's heights has a normal distribution, being a function of many biological factors.

Gaussian Copulas

What is called a “copula function” with a Gaussian distribution was in the news in 2009 because of its use in assessing the risk of investing in collateralized bonds. The misuse of the function was instrumental in the financial crisis of 2008-2009. Although there were many causes of the crisis, in hindsight Gaussian distributions likely should not have been used. A function with a thicker tail would have assigned greater probability to adverse events.

Derivation

The Central Limit Theorem can be proven in many lines by analyzing the moment generating function (mgf) of (sample mean - population mean)/?(population variance / sample size) as a function of the mgf of the underlying population. The approximation part of the theorem is introduced by expanding the underlying population’s mgf as a power series, then showing most terms are insignificant as the sample size gets large.

It can be proven in far fewer lines by using a Taylor expansion on the characteristic equation of the same function and making the sample size large.

Computational Convenience

Some statistical models presume the errors to be Gaussian. This enables distributions of functions of normal variables, like the chi-square- and F-distribution, to be used in hypothesis testing. Specifically, in the F-test, the F statistic is composed of a ratio of chi-square distributions, which themselves are functions of a normal variance parameter. The ratio of the two causes the variance to cancel out, enabling hypothesis testing without knowledge of the variances aside from their normality and constancy.

References

About the Author

Photo Credits

Sciencing_Icons_Cells Cells

Sciencing_Icons_Molecular Molecular

Sciencing_Icons_Microorganisms Microorganisms

Sciencing_Icons_Genetics Genetics

Sciencing_Icons_Human Body Human Body

Sciencing_Icons_Ecology Ecology

Sciencing_Icons_Atomic &amp; Molecular Structure Atomic & Molecular Structure

Sciencing_Icons_Bonds Bonds

Sciencing_Icons_Reactions Reactions

Sciencing_Icons_Stoichiometry Stoichiometry

Sciencing_Icons_Solutions Solutions

Sciencing_Icons_Acids &amp; Bases Acids & Bases

Sciencing_Icons_Thermodynamics Thermodynamics

Sciencing_Icons_Organic Chemistry Organic Chemistry

Sciencing_Icons_Fundamentals-Physics Fundamentals

Mechanics

Sciencing_Icons_Electronics Electronics

Sciencing_Icons_Waves Waves

Sciencing_Icons_Energy Energy

Sciencing_Icons_Fluid Fluid

Sciencing_Icons_Astronomy Astronomy

Sciencing_Icons_Fundamentals-Geology Fundamentals

Sciencing_Icons_Minerals &amp; Rocks Minerals & Rocks

Sciencing_Icons_Earth Scructure Earth Structure

Sciencing_Icons_Fossils Fossils

Sciencing_Icons_Natural Disasters Natural Disasters

Sciencing_Icons_Ecosystems Ecosystems

Sciencing_Icons_Environment Environment

Sciencing_Icons_Insects Insects

Sciencing_Icons_Plants &amp; Mushrooms Plants & Mushrooms

Sciencing_Icons_Animals Animals

Sciencing_Icons_Addition &amp; Subtraction Addition & Subtraction

Sciencing_Icons_Multiplication &amp; Division Multiplication & Division

Sciencing_Icons_Decimals Decimals

Sciencing_Icons_Fractions Fractions

Sciencing_Icons_Conversions Conversions

Sciencing_Icons_Working with Units Working With Units

Sciencing_Icons_Equations &amp; Expressions Equations & Expressions

Sciencing_Icons_Ratios &amp; Proportions Ratios & Proportions

Sciencing_Icons_Inequalities Inequalities

Sciencing_Icons_Exponents &amp; Logarithms Exponents & Logarithms

Sciencing_Icons_Factorization Factorization

Sciencing_Icons_Functions Functions

Sciencing_Icons_Linear Equations Linear Equations

Sciencing_Icons_Graphs Graphs

Sciencing_Icons_Quadratics Quadratics

Sciencing_Icons_Polynomials Polynomials

Sciencing_Icons_Fundamentals-Geometry Fundamentals

Sciencing_Icons_Cartesian Cartesian

Sciencing_Icons_Circles Circles

Sciencing_Icons_Solids Solids

Sciencing_Icons_Trigonometry Trigonometry

Sciencing_Icons_Mean-Median-Mode Mean/Median/Mode

Sciencing_Icons_Independent-Dependent Variables Independent/Dependent Variables

Sciencing_Icons_Deviation Deviation

Sciencing_Icons_Correlation Correlation

Sciencing_Icons_Sampling Sampling

Sciencing_Icons_Distributions Distributions

Sciencing_Icons_Probability Probability

Sciencing_Icons_Differentiation-Integration Differentiation/Integration

Sciencing_Icons_Application Application

What Is Gaussian Distribution?

How to Calculate the Coefficient of Variation

Motivation

Statement of the Theorem

Why It's Everywhere

Gaussian Copulas

Derivation

Computational Convenience

Related Articles

How to Calculate the Coefficient of Variation

How to Calculate the Cumulative Probabilities in SPSS

The Pros & Cons of Queueing Theory

Advantages & Disadvantages of Finding Variance

How to Calculate Binomial Probability

How to Use the Pearson Correlation Coefficient

The Relationship Between Standard Deviations & Percentiles

The Effects of a Small Sample Size Limitation

How to Interpret a Beta Coefficient

What Is PPS Sampling?

Cells

Molecular

Microorganisms

Genetics

Human Body

Ecology

Atomic & Molecular Structure

Bonds

Reactions

Stoichiometry

Solutions

Acids & Bases

Thermodynamics

Organic Chemistry

Fundamentals

Electronics

Waves

Energy

Fluid

Astronomy

Fundamentals

Minerals & Rocks

Earth Structure

Fossils

Natural Disasters

Ecosystems

Environment

Insects

Plants & Mushrooms

Animals

Addition & Subtraction

Multiplication & Division

Decimals

Fractions

Conversions

Working With Units

Equations & Expressions

Ratios & Proportions

Inequalities

Exponents & Logarithms

Factorization

Functions

Linear Equations

Graphs

Quadratics

Polynomials

Fundamentals

Cartesian

Circles

Solids

Trigonometry

Mean/Median/Mode

Independent/Dependent Variables

Deviation

Correlation

Sampling

Distributions

Probability

Differentiation/Integration

Application