Simple Random Sample Probability Calculator

Estimate the probability of drawing a specific number of successes in a simple random sample without replacement. This calculator uses the hypergeometric model, the standard probability framework for simple random sampling from a finite population.

Calculator Inputs

Population size (N)

Total number of items or people in the population.

Number of successes in population (K)

How many population members have the characteristic of interest.

Sample size (n)

Number selected in the simple random sample.

Target successes in sample (k)

The count of successes you want to evaluate.

Probability type

Choose whether you need an exact probability or a cumulative probability.

Probability Distribution Chart

The chart displays the full probability distribution for the number of successes in your sample. The highlighted bar corresponds to your selected k value.

Results Summary

Expected successes –

Standard deviation –

Sample fraction –

Population success rate –

Enter your values and click Calculate Probability to see the exact result, cumulative probability, and interpretation.

Expert Guide to Simple Random Sample Probability Calculation

Simple random sampling is one of the most important ideas in statistics. It gives every possible sample of a fixed size the same chance of being selected from a finite population. Because of that property, simple random sampling is the foundation for many survey designs, audit procedures, clinical selection methods, educational experiments, and quality control workflows. When analysts want to know the probability of drawing a certain number of observations with a given trait, they are almost always working with the probability model behind sampling without replacement: the hypergeometric distribution.

This calculator is designed to help you compute probabilities for that exact situation. If a population contains a known number of successes and failures, and you select a sample without replacement, the number of successes observed in the sample does not follow a binomial distribution. Instead, it follows the hypergeometric distribution because each draw changes the composition of the remaining population. That dependency matters. It is the reason simple random sample probability calculations are slightly more involved than independent-trial calculations.

Core idea: If your population size is N, the number of successes in the population is K, your sample size is n, and the observed number of successes in the sample is k, then the probability of exactly k successes is based on how many favorable samples exist divided by how many total samples are possible.

When this calculator should be used

You should use a simple random sample probability calculation when all of the following are true:

You are sampling from a finite, known population.
The sample is drawn without replacement.
You know or can estimate how many members of the population are classified as successes.
You want the probability of seeing exactly, at least, or at most a certain number of successes in the sample.

Typical use cases include selecting defective parts from a production lot, drawing households from a small municipality, choosing files for an audit, testing a subset of medications from a batch, or sampling student records from a school district where a known count shares a particular characteristic.

The formula behind the calculator

The exact probability of observing k successes in a simple random sample is:

P(X = k) = [C(K, k) × C(N – K, n – k)] / C(N, n)

Here, C(a, b) means the number of combinations of a items taken b at a time. The numerator counts all samples with exactly k successes and n – k failures. The denominator counts all samples of size n from the population of size N.

This formula is elegant because it mirrors the logic of simple random sampling. If every sample of size n is equally likely, then probability becomes a counting problem. How many equally likely samples satisfy your condition? That count, divided by the total number of possible samples, gives the answer.

Understanding each input

Population size (N): The total number of units in the population. In a quality-control problem, this may be the total number of products in a batch.
Number of successes (K): The number of population members having the trait of interest. For example, how many products in the lot are defective.
Sample size (n): The number selected from the population. This is the actual size of the simple random sample.
Target successes (k): The number of observed successes in the sample that you want to evaluate.
Probability type: Whether you want exactly k, at least k, or at most k successes.

Why without replacement matters

If you were sampling with replacement, each draw would be independent and the binomial model would be appropriate. But simple random sampling from a finite population is almost always without replacement. Once an item is selected, it cannot be selected again, and the pool changes. That means probabilities shift on each draw. For example, if a batch contains 60 defective units among 500 total units, drawing one defective item slightly lowers the defective rate in the remaining pool. Drawing a nondefective unit slightly increases it. The hypergeometric model accounts for that changing composition exactly.

Worked example: quality control lot sampling

Suppose a manufacturer has a lot of 500 components. Inspection records show 60 are defective. If an auditor selects a simple random sample of 40 components, what is the probability of finding exactly 5 defective items?

Set the values as follows:

N = 500
K = 60
n = 40
k = 5

The expected number of defectives in the sample is n × (K/N), which equals 40 × (60/500) = 4.8. So seeing 5 defectives is very close to the center of the distribution. As a result, the exact probability tends to be relatively high compared with more extreme values like 0, 10, or 12 defectives.

Scenario	Population Size (N)	Successes in Population (K)	Sample Size (n)	Target (k)	Probability Focus
Manufacturing audit	500 parts	60 defectives	40 sampled	5 defectives	Exactly 5 defectives in sample
School records review	1,200 students	180 with program participation	75 selected	12 participants	At least 12 program participants
Inventory check	220 units	18 damaged	25 inspected	2 damaged	At most 2 damaged units

Expected value and standard deviation

A strong probability analysis does not stop at one probability. You should also understand the center and spread of the distribution. For a hypergeometric random variable:

Expected value: E(X) = n(K/N)
Variance: Var(X) = n(K/N)(1 – K/N)[(N – n)/(N – 1)]
Standard deviation: the square root of the variance

The finite population correction factor, (N – n)/(N – 1), is what distinguishes this spread from the spread of a binomial model. As the sample becomes a larger fraction of the population, the correction becomes more important and the sampling variability decreases. In plain language, when you sample a large share of a population without replacement, there is less uncertainty left in what remains.

How to interpret exact, at least, and at most probabilities

These three output types serve different decision goals:

Exactly k: Useful for textbook problems and precise event modeling.
At least k: Common in threshold decisions, such as the chance of detecting at least a minimum number of defects.
At most k: Useful for acceptance sampling and compliance checks, where too many observed failures would trigger concern.

For example, an auditor may ask, “What is the probability that our random review finds at least 12 noncompliant files?” That is a cumulative probability. A quality manager may ask, “What is the probability that we find at most 2 damaged units?” That is also cumulative, but on the lower tail of the distribution.

Realistic comparison table

The table below shows how changing the sample fraction can alter interpretation, even when the population success rate is similar. These are realistic finite-population examples frequently seen in operations, audit, and institutional research.

Case	Population Rate	Sample Fraction	Expected Successes	Why It Matters
Batch inspection: 60 defectives in 500, sample 40	12.0%	8.0%	4.8	Moderate sample fraction, finite population correction is noticeable but not dominant.
Program evaluation: 180 participants in 1,200, sample 75	15.0%	6.25%	11.25	Sample is small relative to population, so hypergeometric and binomial results may be close.
Inventory damage review: 18 damaged in 220, sample 25	8.18%	11.36%	2.05	Larger sample fraction makes the without-replacement model more important.

Common mistakes to avoid

Using the binomial distribution when sampling without replacement: This is the most common error. If the sample fraction is not negligible, the binomial approximation can be meaningfully off.
Entering a target value outside the feasible range: The number of successes in the sample cannot exceed the sample size or the number of successes in the population.
Ignoring finite population structure: In small or medium-sized populations, exact finite population methods are often preferable to large-sample approximations.
Confusing percentages and counts: The calculator needs actual counts for N, K, n, and k.
Forgetting that “at least” and “at most” are cumulative: These probabilities sum across multiple values, not just one value.

How the chart helps decision-making

The distribution chart provides more than visual appeal. It shows the full range of plausible success counts in the sample. If your selected k falls near the peak, the event is relatively common. If it falls in the far tail, the event is rare. This visual context is especially useful for executives, auditors, researchers, and students who need to explain why an outcome is ordinary or unusual under the assumptions of simple random sampling.

When to use approximations

In some settings, analysts approximate the hypergeometric distribution with a binomial or normal distribution. That can be reasonable when the population is very large relative to the sample and when the event rate is not too extreme. However, if your data come from a finite list, roster, lot, or batch, exact hypergeometric probabilities are usually better. Since modern browsers can compute them instantly, there is little reason to rely on rough approximations unless you are working analytically by hand.

Practical applications across industries

Public administration: Randomly sampling claims, permits, or files from a finite administrative inventory.
Healthcare operations: Reviewing charts from a known panel of patients to estimate the likelihood of finding cases with a specific condition.
Manufacturing: Acceptance sampling from a finite production lot where the defect count is known or modeled.
Education: Selecting student records from a finite school population to assess program participation or test accommodations.
Research design: Teaching finite-population sampling theory using exact probabilities rather than asymptotic approximations.

Authoritative references for further study

If you want a deeper statistical treatment of simple random sampling and finite-population inference, review these authoritative sources:

Final takeaway

Simple random sample probability calculation is fundamentally about counting how many valid samples satisfy your condition out of all equally likely samples from a finite population. The correct model is hypergeometric because the sample is drawn without replacement. By entering the population size, the number of successes in the population, the sample size, and the target number of successes, you can compute exact or cumulative probabilities, compare the result with the expected value, and visualize the full distribution. This makes the calculator useful not just for classroom exercises, but for real-world decision support in auditing, quality control, surveys, inventory review, and institutional analysis.