Population Genetics. Matthew B. Hamilton. Читать онлайн. Newlib. NEWLIB.NET

Автор: Matthew B. Hamilton
Издательство: John Wiley & Sons Limited
Серия:
Жанр произведения: Биология
Год издания: 0
isbn: 9781118436899
Скачать книгу
of the green labeled amelogenin locus). A full color version of this figure is available on the textbook website.

GCCCCATAGGTTTTGAACTCACAGATTAAACTGTAACCAAAATAAAATTAGGCATTATTTACAAGCTAGTTT CTTT CTTT CTTT TTTCT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTTT CTC CTTC CTTC CTTT CTTC CTTT CTTT TTTGCTGGCA ATTACAGACAAATCAA

Frequency of M = images = 0.4184 Frequency of N = images = 0.5816
Genotype Observed Expected number of genotypes Observed – Expected
MM 165 images = 1066 × (0.4184)2 = 186.61 −21.6
MN 562 images = 1066 × 2(0.4184)(0.5816) = 518.80 43.2
NN 339 images = 1066 × (0. 5816)2 = 360.58 −21.6

      In more general terms, the expected frequency of an event, p, times the number of trials or samples, n, gives the expected number of events or np. To test the hypothesis that p is the frequency of an event in an actual population, we compare np with images. Close agreement suggests that the parameter and the estimate are the same quantity. But a large disagreement instead suggests that p and images are likely to be different probabilities. The chi‐squared (χ2) distribution is a statistical test commonly used to compare np and images. The χ2 test provides the probability of obtaining the difference (or more) between the observed images and expected (np) number of outcomes by chance alone if the null hypothesis is true. As the difference between the observed and expected grows larger, it becomes less probable that the parameter and the parameter estimate are actually the same but differ in a given sample due to chance. The χ2 statistic is:

      (2.7)equation

      where ∑ (pronounced “sigma”) indicates taking the sum of multiple terms.

      (2.8)equation

      We need to compare our statistic to values from the χ2 distribution. But, first, we need to know how much information, or the degrees of freedom (commonly abbreviated as df), was used to estimate the χ2 statistic. In general, degrees of freedom are based on the number of categories of data: df = no. of classes compared − no. of parameters estimated −1 for the χ2 test itself. In this case, df = 3–1 − 1 = 1 for three genotypes and one estimated allele frequency (with two alleles: the other allele frequency is fixed once the first has been estimated).

Graph depicts x power 2 distribution with one degree of freedom. The x power 2 value for the Hardy–Weinberg test with MN blood group genotypes as well as the critical value to reject the null hypothesis are shown. The area under the curve to the right of the arrow indicates the probability of observing that much or more difference between the observed and expected outcomes.