How to Calculate Sample Proportion?

Baseball batting averages provide a good example of how to calculate a sample proportion.
••• bmcent1/iStock/GettyImages

Calculating a sample proportion in probability statistics is straightforward. Not only is such a calculation a handy tool in its own right, but it is also a useful way to illustrate how sample sizes in normal distributions affect the standard deviations of those samples.

Say that a baseball player is batting .300 over a career that includes many thousands of plate appearances, meaning that the probability he will get a base hit any time he faces a pitcher is 0.3. From this, it is possible to determine how close to .300 he will hit in a smaller number of plate appearances.

Definitions and Parameters

For these problems, it is important that the sample sizes be sufficiently large to produce meaningful results. The product of the sample size n and the probability p of the event in question occurring must be greater than or equal to 10, and similarly, the product of the sample size and one minus the probability of the event in occurring must also greater than or equal to 10. In mathematical language, this means that

np ≥ 10

and

n(1 - p) ≥ 10

The sample proportion is simply the number of observed events x divided by the sample size n, or

p̂ = \frac{x}{n}

Mean and Standard Deviation of the Variable

The mean of x is simply np, the number of elements in the sample multiplied by the probability of the event occurring. The standard deviation of x is:

\sqrt{np(1 - p)}

Returning to the example of the baseball player, assume he has 100 plate appearances in his first 25 games. What are the mean and standard deviation of the number of hits he is expected to get?

np = 100 × 0.3 = 30

and

\begin{aligned} \sqrt{np(1 - p)} &= \sqrt{100×0.3×0.7} \\ &= 10 \sqrt{0.21} \\ &= 4.58 \end{aligned}

This means that the player getting as few as 25 hits in his 100 plate appearances or as many as 35 would not be considered statistically anomalous.

Mean and Standard Deviation of the Sample Proportion

The mean of any sample proportion is just p. The standard deviation of is:

\frac{\sqrt{p(1 - p)}}{\sqrt{n}}

For the baseball player, with 100 tries at the plate, the mean is simply 0.3 and the standard deviation is:

\begin{aligned} \frac{\sqrt{0.3 × 0.7}}{\sqrt{100}} &= \frac{\sqrt{0.21}}{10} \\ &= 0.0458 \end{aligned}

Note that the standard deviation of is far smaller than the standard deviation of x.

Related Articles

How to Calculate CV Values
How to Calculate Variance From Standard Error
How to Calculate Standard Errors
Different Kinds of Probability
How to Calculate a P-Value
How to Calculate Cumulative Relative Frequency
How to Use a Binomial Table
How to Compute a Population Mean
How to Make a Relative Frequency Table
How to Calculate Incidence & Prevalence
How to Calculate P-hat
How to Estimate the True Proportion
How to Calculate Unexplained Variance
The Physics of a Playground Slide
How to Calculate Statistical Difference
How to Find Standard Deviation on a TI 84 Plus
How to Calculate Correlation
Science Project Ideas & the Scientific Method