Skip to content

Multivariate Hypergeometric Distribution

  • Computes the probability of obtaining specified counts for multiple categories when sampling without replacement from a finite population.
  • Use binomial coefficients for each category, multiply them, and divide by the binomial coefficient for the total sample.
  • Generalizes the two-group hypergeometric case to any number of distinct groups.

The multivariate hypergeometric distribution is a probability distribution that describes the possible outcomes of drawing samples from a finite population without replacement. It generalizes the standard hypergeometric distribution (which covers two groups) to cases with more than two distinct groups.

Given a finite population and sampling without replacement, the probability of a particular combination of counts across categories is obtained by counting the ways to choose the required number from each category and dividing by the number of ways to choose the total sample. For the two-group case with population size N, K items of type A, N-K items of type B, and a draw of n items yielding X of type A and Y of type B, the probability is computed by multiplying the binomial coefficients for each group and dividing by the binomial coefficient for the total draw.

Suppose a population of 10 items contains 5 red and 5 blue items. Drawing 3 items without replacement, the probability of obtaining 2 red and 1 blue is:

Probability=(52)(51)(103)\text{Probability} = \frac{\binom{5}{2}\binom{5}{1}}{\binom{10}{3}}

As presented:

(52)(51)(103)=(5 choose 2)(5 choose 1)/(10 choose 3)\frac{\binom{5}{2}\binom{5}{1}}{\binom{10}{3}} = ( 5\ \text{choose}\ 2 ) * ( 5\ \text{choose}\ 1 ) / ( 10\ \text{choose}\ 3 ) =(10)(5)/(120)= ( 10 ) * ( 5 ) / ( 120 ) =1/12= 1/12

Suppose a population of 20 items contains 8 red, 6 blue, and 6 green items. Drawing 6 items without replacement, the probability of obtaining 2 red, 2 blue, and 2 green is:

Probability=(82)(62)(62)(206)\text{Probability} = \frac{\binom{8}{2}\binom{6}{2}\binom{6}{2}}{\binom{20}{6}}

As presented:

(82)(62)(62)(206)=(8 choose 2)(6 choose 2)(6 choose 2)/(20 choose 6)\frac{\binom{8}{2}\binom{6}{2}\binom{6}{2}}{\binom{20}{6}} = ( 8\ \text{choose}\ 2 ) * ( 6\ \text{choose}\ 2 ) * ( 6\ \text{choose}\ 2 ) / ( 20\ \text{choose}\ 6 ) =(28)(15)(15)/(38760)= ( 28 ) * ( 15 ) * ( 15 ) / ( 38760 ) =1/3240= 1/3240
  • Calculating the probability of drawing a specific combination of items from a finite population when sampling without replacement.
  • Hypergeometric distribution