Difference between revisions of "AP Statistics Curriculum 2007 Contingency Fit"

From SOCR
Jump to: navigation, search
( General Advance-Placement (AP) Statistics Curriculum - Multinomial Experiments: Chi-Square Goodness-of-Fit)
m (Text replacement - "{{translate|pageName=http://wiki.stat.ucla.edu/socr/" to ""{{translate|pageName=http://wiki.socr.umich.edu/")
 
(25 intermediate revisions by 4 users not shown)
Line 1: Line 1:
 
==[[AP_Statistics_Curriculum_2007 | General Advance-Placement (AP) Statistics Curriculum]] - Multinomial Experiments: Chi-Square Goodness-of-Fit ==
 
==[[AP_Statistics_Curriculum_2007 | General Advance-Placement (AP) Statistics Curriculum]] - Multinomial Experiments: Chi-Square Goodness-of-Fit ==
  
The Chi-Square Test is used to test if a data sample comes from a population with specific characteristics. The Chi-Square Goodness-of-Fit Test is applied to binned data (data put into classes or categories). In most situations, the data histogram or frequency histogram may be obtained and the Chi-Square Test may be applied to these (frequency) values. This test requires a sufficient sample size in order for the Chi-Square approximation to be valid.
+
Chi-Square Test is used to test if a data sample comes from a population with specific characteristics. The Chi-Square Goodness-of-Fit Test is applied to binned data (data put into classes or categories). In most situations, the data histogram or frequency histogram may be obtained and the Chi-Square Test may be applied to these (frequency) values. This test requires a sufficient sample size in order for the Chi-Square approximation to be valid.
  
 
The [http://en.wikipedia.org/wiki/Kolmogorov-Smirnov_test Kolmogorov-Smirnov] is an alternative to the Chi-Square Goodness-of-Fit Test. The Chi-Square Goodness-of-Fit Test may also be applied to discrete distributions such as the Binomial and the Poisson. The Kolmogorov-Smirnov Test is restricted to continuous distributions.
 
The [http://en.wikipedia.org/wiki/Kolmogorov-Smirnov_test Kolmogorov-Smirnov] is an alternative to the Chi-Square Goodness-of-Fit Test. The Chi-Square Goodness-of-Fit Test may also be applied to discrete distributions such as the Binomial and the Poisson. The Kolmogorov-Smirnov Test is restricted to continuous distributions.
  
==Motivational example==
+
==Motivational Example==
[http://en.wikipedia.org/wiki/Mendelian_inheritance Mendel's pea experiment] relates to the transmission of hereditary characteristics from parent organisms to their offspring; it underlies much of genetics. Suppose a ''tall offspring'' is the event of interest and that the true proportion of tall peas (based on a 3:1 phenotypic ratio) is 3/4 or ''p = 0.75''.  He would like to show that Mendel's data follow this 3:1 phenotypic ratio.  
+
[http://en.wikipedia.org/wiki/Mendelian_inheritance Mendel's Pea Experiment] relates to the transmission of hereditary characteristics from parent organisms to their offspring; it underlies much of genetics. Suppose a ''tall offspring'' is the event of interest and that the true proportion of tall peas (based on a 3:1 phenotypic ratio) is 3/4 or ''p = 0.75''.  He would like to show that Mendel's data follow this 3:1 phenotypic ratio.  
  
 
<center>
 
<center>
Line 21: Line 21:
 
==Calculations==
 
==Calculations==
  
Suppose there were ''N = 1064'' data measurements with ''Observed(Tall) = 787'' and ''Observed(Dwarf) = 277''. These are the O’s (observed values). To calculate the E’s (expected values), we will take the hypothesized proportions under <math>H_o</math> and multiply them by the total sample size ''N''. Expected(Tall) = (0.75)(1064) = 798 and Expected(Dwarf) = (0.25)(1064) = 266. Quickly check to see if the expected total = N = 1064.
+
Suppose there were ''N = 1064'' data measurements with ''Observed(Tall) = 787'' and ''Observed(Dwarf) = 277''. These are the O’s (observed values). To calculate the E’s (expected values), we will take the hypothesized proportions under $H_o$ and multiply them by the total sample size ''N''. Expected(Tall) = (0.75)(1064) = 798 and Expected(Dwarf) = (0.25)(1064) = 266. Quickly check to see if the expected total = N = 1064.
  
* The hypotheses:
+
* The Hypotheses:
: <math>H_o</math>:P(tall) = 0.75 (No effect, follows a 3:1phenotypic ratio)
+
: $H_o$:P(tall) = 0.75 (No effect, follows a 3:1phenotypic ratio)
 
:: P(dwarf) = 0.25   
 
:: P(dwarf) = 0.25   
: <math>H_a</math>: P(tall)  ≠  0.75
+
: $H_a$: P(tall)  ≠  0.75
 
::P(dwarf) ≠ 0.25
 
::P(dwarf) ≠ 0.25
  
* Test statistics:
+
* Test Statistics:
:<math>\chi_o^2 = \sum_{all-categories}{(O-E)^2 \over E} \sim \chi_{(df=number\_of\_categories - 1)}^2</math>
+
:$\chi_o^2 = \sum_{all-categories}{(O-E)^2 \over E} \sim \chi_{(df=number\_of\_categories - 1)}^2$
  
* P-values and critical values for the [http://socr.stat.ucla.edu/htmls/SOCR_Distributions.html Chi-Square distribution may be easily computed using SOCR Distributions].
+
* P-values and Critical values for the [http://socr.stat.ucla.edu/htmls/SOCR_Distributions.html Chi-Square Distribution may be easily computed using SOCR Distributions].
  
 
* Results:
 
* Results:
For the Mendel's pea experiment, we can compute the Chi-square test statistics to be:
+
For the Mendel's Pea Experiment, we can compute the Chi-Square Test Statistics to be:
: <math>\chi_o^2 = {(787-798)^2 \over 798}  + {(277-266)^2 \over 266} = 0.152+0.455=0.607</math>.
+
: $\chi_o^2 = {(787-798)^2 \over 798}  + {(277-266)^2 \over 266} = 0.152+0.455=0.607$.
: p-value=<math>P(\chi_{(df=1)}^2 > \chi_o^2)=0.436</math>
+
: p-value=$P(\chi_{(df=1)}^2 > \chi_o^2)=0.436$
  
* [[SOCR_EduMaterials_AnalysisActivities_Chi_Goodness |SOCR Chi-square Calculations]]:
+
* [[SOCR_EduMaterials_AnalysisActivities_Chi_Goodness |SOCR Chi-Square Calculations]]:
  
 
<center>[[Image:SOCR_EBook_Dinov_ChiSquare_030108_Fig1.jpg|500px]]</center>
 
<center>[[Image:SOCR_EBook_Dinov_ChiSquare_030108_Fig1.jpg|500px]]</center>
 +
 +
==Assumptions==
 +
The chi-square goodness-of-fit test requires that the data is divided into ''k'' bins and the test statistic is defined as
 +
 +
:$\chi_o^2 = \sum_{i=1}^k{(O_i-E_i)^2 \over E_i} \sim \chi_{(df=k - 1)}^2$,
 +
where $O_i$ is the observed frequency and $E_i$ is the expected frequency for bin $1\leq i\leq k$. The expected counts may often be calculated by
 +
 +
: $E_i = k\times(F(U_i) - F(L_i))$,
 +
where ''k'' is the total sample size, ''F'' is the cumulative distribution function (CDF) for the distribution being tested, $U_i$ is the upper limit and $L_i$ is the lower limit for class ''i''.
 +
 +
The chi-square test is sensitive to the choice of bins and the optimal choice for the bin-width may depend on the choice of the distribution. The chi-square test is valid if the data represent a random sample of at least 20 observations and the expected frequencies at each bin are at least 5. Otherwise, the distribution of the $\chi_o^2$ statistics is not guaranteed to be $\chi_{k - 1}^2$, in general. In particular, the test may not be valid for small samples. If the expected counts are less than five for some bins, you may need to combine bins together to increase these counts.
 +
 +
* [http://socr.ucla.edu/Applets.dir/SOCRCurveFitter.html Try the SOCR Polynomial curve modeling applet] to see how the chi-square test can be used to assess model quality.
  
 
==Examples==
 
==Examples==
 +
 +
===ApoE and Alzheimer's disease (AD)===
 +
 +
ApoE (Apolipoprotein E) is a strong genetic risk factor for AD. About 40-65% of AD patients have at least one copy of the 4 allele, ApoE4, yet, at least 30% of patients with AD are ApoE4 negative and some ApoE4 homozygotes never develop the disease. People with two e4 alleles have up to 20 times the risk of developing AD. There is also evidence that the ApoE2 allele may serve a protective role in AD. Thus, the genotype most at risk for Alzheimer's disease and at an earlier age is the homozygous ApoE 4,4. The ApoE 3,4 genotype is at increased risk, genotype ApoE 3,3 is considered at normal risk for Alzheimer disease, and genotype ApoE 2,3 lowers the risk for Alzheimer disease.
 +
 +
Suppose we have a random sample of 100 AD patients and 100 asymptomatic age-matched controls. The table below illustrates the [http://en.wikipedia.org/wiki/Apolipoprotein_E#Alzheimer_disease expected distribution of the ApoE traits].
 +
 +
<center>
 +
{| class="wikitable"
 +
|-
 +
| colspan="6"  align="center" | '''Estimated worldwide human allele frequencies of ApoE '''
 +
|-
 +
| Allele || ε2 ||ε3 ||ε4||Total
 +
|-
 +
| General Frequency||8||78||14||100
 +
|-
 +
| AD Frequency||4||59||37||100
 +
|-
 +
| Total||12||137||51||200
 +
|}
 +
</center>
 +
 +
* Is there evidence of an association between genotype (alleles) and phenotype (disease)? You can use the [http://socr.umich.edu/html/ana/SOCR_Analyses.html SOCR Chi-Square Test for Association/Contingency Applet].
 +
* What is the probability $P(AD|ε2)$?
 +
* [[SMHS_NonParamInference|See also SMHS EBook section on non-parametric tests]].
  
 
===Butterfly Hotspots===
 
===Butterfly Hotspots===
A hotspot is defined as a <math>10 km^2</math> area that is species rich (heavily populated by the species of interest).  Suppose in a study of butterfly hotspots in a particular region, the number of butterfly hotspots in a sample of 2,588, <math>10 km^2</math> areas is 165.  In theory, 5% of the areas should be butterfly hotspots.  Do the data provide evidence to suggest that the number of butterfly hotspots is increasing from the theoretical standards?  Test using <math>\alpha= 0.01</math>.
+
A hotspot is defined as a $10 km^2$ area that is species rich (heavily populated by the species of interest).  Suppose in a study of butterfly hotspots in a particular area of $10 km^2$, the number of butterfly hotspots in a sample of 2,588 is 165.  In theory, 5% of the areas should be butterfly hotspots.  Does the data provide evidence to suggest that the number of butterfly hotspots is increasing from the theoretical standards?  Test using $\alpha= 0.01$.
  
 
===Cell-Phone Usage===
 
===Cell-Phone Usage===
Of 250 randomly selected cell phone users, is there evidence to show that there is a difference in area of home residence, defined as: Northern California (North); Southern California (South); or Out of State (Out)? Without further information suppose we have P(North) = 0.24, P(South) = 0.45, and P(Out) = 0.31. Is there any evidence suggesting different use of cell phones in these three groups of users?
+
Of 250 randomly selected cell phone users, is there any evidence to show that there is a difference in area of home residence, defined as: Northern California (North); Southern California (South); or Out of State (Out)? Without further information suppose we have P(North) = 0.24, P(South) = 0.45, and P(Out) = 0.31. Is there any evidence suggesting different use of cell phones in these three groups of users?
  
 
===Brain Cancer===
 
===Brain Cancer===
Suppose 200 randomly selected cancer patients were asked if their primary diagnosis was Brain cancer and if they owned a cell phone before their diagnosis.  The results are presented in the table below:
+
Suppose 200 randomly selected cancer patients were asked if their primary diagnosis was brain cancer and if they owned a cell phone before their diagnosis.  The results are presented in the table below:
  
 
<center>
 
<center>
Line 70: Line 108:
  
 
Does it seem like there is an association between brain cancer and cell phone use?   
 
Does it seem like there is an association between brain cancer and cell phone use?   
Of the brain cancer patients 18/25 = 0.72, owned a cell phone before their diagnosis.     
+
Of the brain cancer patients, 18 out of 25 (about 0.72) owned a cell phone before their diagnosis.     
''P(CP|BC) = 0.72''estimated probability of owning a cell phone given that the patient has brain cancer.
+
''P(CP|BC) = 0.72'' is the estimated probability of patients owning a cell phone given that he or she has brain cancer.
  
Of the other cancer patients, 80/175 = 0.46, owned a cell phone before their diagnosis.   
+
Of the other cancer patients, 80 out of 175 (about 0.46) owned a cell phone before their diagnosis.   
''P(CP|NBC) = 0.46'', estimated probability of owning a cell phone given that the patient has another cancer.
+
''P(CP|NBC) = 0.46'' is the estimated probability of patients owning a cell phone given that he or she has a different type of cancer.
 +
 
 +
===Chi-Square Die Experiment===
 +
The [[SOCR_EduMaterials_Activities_ChiSquareDiceExperiment | SOCR Chi-square die experiment]] illustrates the Chi-square goodness-of-fit test using two-dice. Suppose we are trying to prove that one of two dice is loaded. Let's call the first die the ''sampling-die'' and the second one the ''testing-die''. Using the [http://socr.ucla.edu/htmls/SOCR_Experiments.html Chi-Square dice applet] you can manually select the loading of the two dice (these can be the same or different loadings). The figure below illustrates this situation. Note that in this experiment, we've intentionally loaded the two dice in the opposite ways (look at the probability distributions in the testing and sampling tables in the image). Thus, we expect that the ''sampling-die'' will be a poor model for the (oppositely loaded) ''testing-die''. This is reflected in the results of the 10 experiments. All of them indicate statistically significant differences between the model (sampling-die) and the data (testing-die), which we can expect in this case. On the other hand, if we make the probability distributions of the two dice the same, a rejection of the null-hypothesis will only appear at the rate of the preset for &alpha; (0.02).
 +
 
 +
<center>[[Image:SOCR_EBook_Dinov_ChiSquare_042908_Fig2.jpg|500px]]</center>
 +
 
 +
===Iris sepal and petal length===
 +
The [[SOCR_Data_052511_IrisSepalPetalClasses |Fisher's multivariate dataset on iris sepal and petal length]] provides another interesting example where we can look for how close are the petal or the sepal lengths of iris plant of different types.
  
 
==Applications==
 
==Applications==
Line 82: Line 128:
  
 
<hr>
 
<hr>
==References==
+
==[[EBook_Problems_Contingency_Fit|Problems]]==
* TBD
+
 
 +
==See also==
 +
* [[SOCR_EduMaterials_Activities_BMI_Modeling_Activity|SOCR BMI Modeling Activity]]
 +
* [[SOCR_EduMaterials_AnalysisActivities_Chi_Goodness| SOCR Chi-Square Goodness-of-Fit Test]]
  
 
<hr>
 
<hr>
 
* SOCR Home page: http://www.socr.ucla.edu
 
* SOCR Home page: http://www.socr.ucla.edu
  
{{translate|pageName=http://wiki.stat.ucla.edu/socr/index.php?title=AP_Statistics_Curriculum_2007_Contingency_Fit}}
+
"{{translate|pageName=http://wiki.socr.umich.edu/index.php?title=AP_Statistics_Curriculum_2007_Contingency_Fit}}

Latest revision as of 15:24, 3 March 2020

General Advance-Placement (AP) Statistics Curriculum - Multinomial Experiments: Chi-Square Goodness-of-Fit

Chi-Square Test is used to test if a data sample comes from a population with specific characteristics. The Chi-Square Goodness-of-Fit Test is applied to binned data (data put into classes or categories). In most situations, the data histogram or frequency histogram may be obtained and the Chi-Square Test may be applied to these (frequency) values. This test requires a sufficient sample size in order for the Chi-Square approximation to be valid.

The Kolmogorov-Smirnov is an alternative to the Chi-Square Goodness-of-Fit Test. The Chi-Square Goodness-of-Fit Test may also be applied to discrete distributions such as the Binomial and the Poisson. The Kolmogorov-Smirnov Test is restricted to continuous distributions.

Motivational Example

Mendel's Pea Experiment relates to the transmission of hereditary characteristics from parent organisms to their offspring; it underlies much of genetics. Suppose a tall offspring is the event of interest and that the true proportion of tall peas (based on a 3:1 phenotypic ratio) is 3/4 or p = 0.75. He would like to show that Mendel's data follow this 3:1 phenotypic ratio.

Observed (O) Expected (E)
Tall 787 798
Dwarf 277 266

Calculations

Suppose there were N = 1064 data measurements with Observed(Tall) = 787 and Observed(Dwarf) = 277. These are the O’s (observed values). To calculate the E’s (expected values), we will take the hypothesized proportions under $H_o$ and multiply them by the total sample size N. Expected(Tall) = (0.75)(1064) = 798 and Expected(Dwarf) = (0.25)(1064) = 266. Quickly check to see if the expected total = N = 1064.

  • The Hypotheses:
$H_o$:P(tall) = 0.75 (No effect, follows a 3:1phenotypic ratio)
P(dwarf) = 0.25
$H_a$: P(tall) ≠ 0.75
P(dwarf) ≠ 0.25
  • Test Statistics:
$\chi_o^2 = \sum_{all-categories}{(O-E)^2 \over E} \sim \chi_{(df=number\_of\_categories - 1)}^2$
  • Results:

For the Mendel's Pea Experiment, we can compute the Chi-Square Test Statistics to be:

$\chi_o^2 = {(787-798)^2 \over 798} + {(277-266)^2 \over 266} = 0.152+0.455=0.607$.
p-value=$P(\chi_{(df=1)}^2 > \chi_o^2)=0.436$
SOCR EBook Dinov ChiSquare 030108 Fig1.jpg

Assumptions

The chi-square goodness-of-fit test requires that the data is divided into k bins and the test statistic is defined as

$\chi_o^2 = \sum_{i=1}^k{(O_i-E_i)^2 \over E_i} \sim \chi_{(df=k - 1)}^2$,

where $O_i$ is the observed frequency and $E_i$ is the expected frequency for bin $1\leq i\leq k$. The expected counts may often be calculated by

$E_i = k\times(F(U_i) - F(L_i))$,

where k is the total sample size, F is the cumulative distribution function (CDF) for the distribution being tested, $U_i$ is the upper limit and $L_i$ is the lower limit for class i.

The chi-square test is sensitive to the choice of bins and the optimal choice for the bin-width may depend on the choice of the distribution. The chi-square test is valid if the data represent a random sample of at least 20 observations and the expected frequencies at each bin are at least 5. Otherwise, the distribution of the $\chi_o^2$ statistics is not guaranteed to be $\chi_{k - 1}^2$, in general. In particular, the test may not be valid for small samples. If the expected counts are less than five for some bins, you may need to combine bins together to increase these counts.

Examples

ApoE and Alzheimer's disease (AD)

ApoE (Apolipoprotein E) is a strong genetic risk factor for AD. About 40-65% of AD patients have at least one copy of the 4 allele, ApoE4, yet, at least 30% of patients with AD are ApoE4 negative and some ApoE4 homozygotes never develop the disease. People with two e4 alleles have up to 20 times the risk of developing AD. There is also evidence that the ApoE2 allele may serve a protective role in AD. Thus, the genotype most at risk for Alzheimer's disease and at an earlier age is the homozygous ApoE 4,4. The ApoE 3,4 genotype is at increased risk, genotype ApoE 3,3 is considered at normal risk for Alzheimer disease, and genotype ApoE 2,3 lowers the risk for Alzheimer disease.

Suppose we have a random sample of 100 AD patients and 100 asymptomatic age-matched controls. The table below illustrates the expected distribution of the ApoE traits.

Estimated worldwide human allele frequencies of ApoE
Allele ε2 ε3 ε4 Total
General Frequency 8 78 14 100
AD Frequency 4 59 37 100
Total 12 137 51 200

Butterfly Hotspots

A hotspot is defined as a $10 km^2$ area that is species rich (heavily populated by the species of interest). Suppose in a study of butterfly hotspots in a particular area of $10 km^2$, the number of butterfly hotspots in a sample of 2,588 is 165. In theory, 5% of the areas should be butterfly hotspots. Does the data provide evidence to suggest that the number of butterfly hotspots is increasing from the theoretical standards? Test using $\alpha= 0.01$.

Cell-Phone Usage

Of 250 randomly selected cell phone users, is there any evidence to show that there is a difference in area of home residence, defined as: Northern California (North); Southern California (South); or Out of State (Out)? Without further information suppose we have P(North) = 0.24, P(South) = 0.45, and P(Out) = 0.31. Is there any evidence suggesting different use of cell phones in these three groups of users?

Brain Cancer

Suppose 200 randomly selected cancer patients were asked if their primary diagnosis was brain cancer and if they owned a cell phone before their diagnosis. The results are presented in the table below:

Brain cancer
Yes No Total
Cell Phone Use Yes 18 80 98
No 7 95 102
Total 25 175 200

Does it seem like there is an association between brain cancer and cell phone use? Of the brain cancer patients, 18 out of 25 (about 0.72) owned a cell phone before their diagnosis. P(CP|BC) = 0.72 is the estimated probability of patients owning a cell phone given that he or she has brain cancer.

Of the other cancer patients, 80 out of 175 (about 0.46) owned a cell phone before their diagnosis. P(CP|NBC) = 0.46 is the estimated probability of patients owning a cell phone given that he or she has a different type of cancer.

Chi-Square Die Experiment

The SOCR Chi-square die experiment illustrates the Chi-square goodness-of-fit test using two-dice. Suppose we are trying to prove that one of two dice is loaded. Let's call the first die the sampling-die and the second one the testing-die. Using the Chi-Square dice applet you can manually select the loading of the two dice (these can be the same or different loadings). The figure below illustrates this situation. Note that in this experiment, we've intentionally loaded the two dice in the opposite ways (look at the probability distributions in the testing and sampling tables in the image). Thus, we expect that the sampling-die will be a poor model for the (oppositely loaded) testing-die. This is reflected in the results of the 10 experiments. All of them indicate statistically significant differences between the model (sampling-die) and the data (testing-die), which we can expect in this case. On the other hand, if we make the probability distributions of the two dice the same, a rejection of the null-hypothesis will only appear at the rate of the preset for α (0.02).

SOCR EBook Dinov ChiSquare 042908 Fig2.jpg

Iris sepal and petal length

The Fisher's multivariate dataset on iris sepal and petal length provides another interesting example where we can look for how close are the petal or the sepal lengths of iris plant of different types.

Applications

Polynomial Model Fitting

This applet demonstrated the use of the Chi-Square test to assess quality of fitting a polynomial model (of any degree) to manually drawn curves.


Problems

See also


"-----


Translate this page:

(default)
Uk flag.gif

Deutsch
De flag.gif

Español
Es flag.gif

Français
Fr flag.gif

Italiano
It flag.gif

Português
Pt flag.gif

日本語
Jp flag.gif

България
Bg flag.gif

الامارات العربية المتحدة
Ae flag.gif

Suomi
Fi flag.gif

इस भाषा में
In flag.gif

Norge
No flag.png

한국어
Kr flag.gif

中文
Cn flag.gif

繁体中文
Cn flag.gif

Русский
Ru flag.gif

Nederlands
Nl flag.gif

Ελληνικά
Gr flag.gif

Hrvatska
Hr flag.gif

Česká republika
Cz flag.gif

Danmark
Dk flag.gif

Polska
Pl flag.png

România
Ro flag.png

Sverige
Se flag.gif