Difference between revisions of "AP Statistics Curriculum 2007 NonParam VarIndep"

From SOCR
Jump to: navigation, search
m (Approach)
m (Text replacement - "{{translate|pageName=http://wiki.stat.ucla.edu/socr/" to ""{{translate|pageName=http://wiki.socr.umich.edu/")
 
(11 intermediate revisions by 2 users not shown)
Line 2: Line 2:
  
 
==Differences of Variances of Independent Samples==
 
==Differences of Variances of Independent Samples==
It is frequently necessary to test if ''k'' samples have equal variances. Equal variances across samples is called ''homogeneity of variances''. Some statistical tests, for example the [[AP_Statistics_Curriculum_2007_ANOVA_1Way | analysis of variance]], assume that variances are equal across groups or samples.  
+
It is frequently necessary to test if ''k'' samples have equal variances. ''Homogeneity of variances'' is often a reference to equal variances across samples. Some statistical tests, for example the [[AP_Statistics_Curriculum_2007_ANOVA_1Way | analysis of variance]], assume that variances are equal across groups or samples.  
  
 
==Approach==
 
==Approach==
The (modified) Fligner-Killeen test for homogeneity of variances of ''k'' populations jointly ranks the absolute values <math>|X_{i,j}-\tilde{X_j}|</math> and assigns increasing '''scores''' <math>a_{N,i}=\Phi^{-1}({1 + {i\over N+1} \over 2})</math>, based on the ranks of all observations, see the Conover, Johnson, and Johnson (1981) reference below.
+
The (modified) Fligner-Killeen test provides the means for studying the homogeneity of variances of ''k'' populations { <math>X_{i,j}</math>, for <math>1\leq i \leq n_j</math> and <math>1\leq j \leq k</math>}. The test jointly ranks the absolute values of <math>|X_{i,j}-\tilde{X_j}|</math> and assigns increasing '''scores''' <math>a_{N,i}=\Phi^{-1}\bigg ({1 + {i\over N+1} \over 2} \bigg )</math>, based on the ranks of all observations, see the [[AP_Statistics_Curriculum_2007_NonParam_VarIndep#References | Conover, Johnson, and Johnson (1981) reference below]].
  
In this test, <math>\tilde{X_j}</math> is the sample median of the ''j<sup>th</sup>'' population, and <math>\Phi(.)</math> is the [[AP_Statistics_Curriculum_2007_Normal_Std | cummulative distribution function for Normal distirbution]]. The Fligner-Killeen test is sometimes also called the ''median-centering Fligner-Killeen test''.
+
In this test, <math>\tilde{X_j}</math> is the sample median of the ''j<sup>th</sup>'' population, and <math>\Phi(.)</math> is the [[AP_Statistics_Curriculum_2007_Normal_Std | cumulative distribution function for Normal distribution]]. The Fligner-Killeen test is sometimes also called the ''median-centering Fligner-Killeen test''.
  
 
* '''Fligner-Killeen test statistics''':
 
* '''Fligner-Killeen test statistics''':
: <math>x_o^2 = {\sum_{j=1}^k {n_j(\bar{A_j} -\bar{a})^2} \over V^2}</math>,
+
: <math>x_o^2 = {\sum_{j=1}^k {n_j(\bar{A_j} -\bar{a})^2} \over V^2},</math>
: where <math>\bar{A_j}</math> is the mean score for the ''j<sup>th</sup>'' sample, ''a'' is the overall mean score of all <math>a_{N,i}</math>, and <math>V^2</math> is the sample variance of all scores.
+
: where <math>\bar{A_j}</math> is the mean score for the ''j<sup>th</sup>'' sample, <math>\bar{a}</math> is the overall mean score of all <math>a_{N,i}</math>, and <math>V^2</math> is the sample variance of all scores.
  
 
That is:
 
That is:
: <math>N=\sum_{j=1}^k{n_j}</math>,
+
: <math>N=\sum_{j=1}^k{n_j},</math>
: <math>\bar{A_j} = {1\over n_j}\sum_{i=1}^{n_j}{a_{N,m_i}}</math>,
+
: <math>\bar{A_j} = {1\over n_j}\sum_{i=1}^{n_j}{a_{N,m_i}},</math> where <math>a_{N,m_i}</math> is the increasing rank score for the ''i<sup>th</sup>''-observation in the ''j<sup>th</sup>''-sample,
: <math>\bar{a} = {1\over N}\sum_{i=1}^{N}{a_{N,i}}</math>,
+
: <math>\bar{a} = {1\over N}\sum_{i=1}^{N}{a_{N,i}},</math>
: <math>V^2 = {1\over N-1}\sum_{i=1}^{N}{(a_{N,i}-\bar{a})^2}</math>.
+
: <math>V^2 = {1\over N-1}\sum_{i=1}^{N}{(a_{N,i}-\bar{a})^2}.</math>
  
 
* '''Fligner-Killeen probabilities''':
 
* '''Fligner-Killeen probabilities''':
 
For large sample sizes, the modified Fligner-Killeen test statistic has an asymptotic chi-square distribution with ''(k-1)'' degrees of freedom
 
For large sample sizes, the modified Fligner-Killeen test statistic has an asymptotic chi-square distribution with ''(k-1)'' degrees of freedom
: <math>x_o^2 \sim \chi_{(k-1)}^2</math>.
+
: <math>x_o^2 \sim \chi_{(k-1)}^2.</math>
  
 
* '''Note''':
 
* '''Note''':
 
: Conover, Johnson, and Johnson (1981) carried a simulation comparing different ''variance homogeneity tests'' and reported that the ''modified Fligner-Killeen test'' is most robust against departures from normality.
 
: Conover, Johnson, and Johnson (1981) carried a simulation comparing different ''variance homogeneity tests'' and reported that the ''modified Fligner-Killeen test'' is most robust against departures from normality.
 
==Model Validation==
 
TBD
 
  
 
==Computational Resources: Internet-based SOCR Tools==
 
==Computational Resources: Internet-based SOCR Tools==
TBD
+
* See the [http://www.socr.ucla.edu/htmls/ana/FlignerKilleen_Analysis.html SOCR Fligner-Killeen Analysis applet].
 +
* See the [[SOCR_EduMaterials_AnalysisActivities_FlignerKilleen | SOCR Fligner-Killeen Activity]].
  
 
==Examples==
 
==Examples==
TBD
+
Suppose we wanted to study whether the variances in certain time period (e.g., 1981 to 2006) of the consumer-price-indices (CPI) of several items were significantly different. We can use the [[SOCR_Data_Dinov_021808_ConsumerPriceIndex | SOCR CPI Dataset]] to answer this question for the '''Fuel''', '''Oil''', '''Bananas''', '''Tomatoes''', '''Orange Juice''', '''Beef''' and '''Gasoline''' items.
  
==Hands-on Activities==
+
==See also==
TBD
+
* [[SOCR_EduMaterials_AnalysisActivities_FlignerKilleen | SOCR Fligner-Killeen Activity]] provides more hands-on examples.
 +
* [[AP_Statistics_Curriculum_2007_Infer_BiVar | Parametric Variance Homogeneity test]].
 +
* [http://www.socr.ucla.edu/htmls/ana/FlignerKilleen_Analysis.html SOCR Fligner-Killeen Applet].
  
 
<hr>
 
<hr>
  
==Alternative tests of Variance Homegeneity==
+
==Alternative tests of Variance Homogeneity==
 
* [http://en.wikipedia.org/wiki/Levene_test Levene Test]
 
* [http://en.wikipedia.org/wiki/Levene_test Levene Test]
 
* [http://en.wikipedia.org/wiki/Bartlett's_test Bartlett Test]
 
* [http://en.wikipedia.org/wiki/Bartlett's_test Bartlett Test]
* [http://en.wikipedia.org/wiki/Brown-Forsythe_test Brown-Forsythe_test]
+
* [http://en.wikipedia.org/wiki/Brown-Forsythe_test Brown-Forsythe test]
  
 
==References==
 
==References==
Line 51: Line 51:
 
* SOCR Home page: http://www.socr.ucla.edu
 
* SOCR Home page: http://www.socr.ucla.edu
  
{{translate|pageName=http://wiki.stat.ucla.edu/socr/index.php?title=AP_Statistics_Curriculum_2007_NonParam_VarIndep}}
+
"{{translate|pageName=http://wiki.socr.umich.edu/index.php?title=AP_Statistics_Curriculum_2007_NonParam_VarIndep}}

Latest revision as of 12:52, 3 March 2020

General Advance-Placement (AP) Statistics Curriculum - Variances of Two Independent Samples

Differences of Variances of Independent Samples

It is frequently necessary to test if k samples have equal variances. Homogeneity of variances is often a reference to equal variances across samples. Some statistical tests, for example the analysis of variance, assume that variances are equal across groups or samples.

Approach

The (modified) Fligner-Killeen test provides the means for studying the homogeneity of variances of k populations { \(X_{i,j}\), for \(1\leq i \leq n_j\) and \(1\leq j \leq k\)}. The test jointly ranks the absolute values of \(|X_{i,j}-\tilde{X_j}|\) and assigns increasing scores \(a_{N,i}=\Phi^{-1}\bigg ({1 + {i\over N+1} \over 2} \bigg )\), based on the ranks of all observations, see the Conover, Johnson, and Johnson (1981) reference below.

In this test, \(\tilde{X_j}\) is the sample median of the jth population, and \(\Phi(.)\) is the cumulative distribution function for Normal distribution. The Fligner-Killeen test is sometimes also called the median-centering Fligner-Killeen test.

  • Fligner-Killeen test statistics:

\[x_o^2 = {\sum_{j=1}^k {n_j(\bar{A_j} -\bar{a})^2} \over V^2},\]

where \(\bar{A_j}\) is the mean score for the jth sample, \(\bar{a}\) is the overall mean score of all \(a_{N,i}\), and \(V^2\) is the sample variance of all scores.

That is: \[N=\sum_{j=1}^k{n_j},\] \[\bar{A_j} = {1\over n_j}\sum_{i=1}^{n_j}{a_{N,m_i}},\] where \(a_{N,m_i}\) is the increasing rank score for the ith-observation in the jth-sample, \[\bar{a} = {1\over N}\sum_{i=1}^{N}{a_{N,i}},\] \[V^2 = {1\over N-1}\sum_{i=1}^{N}{(a_{N,i}-\bar{a})^2}.\]

  • Fligner-Killeen probabilities:

For large sample sizes, the modified Fligner-Killeen test statistic has an asymptotic chi-square distribution with (k-1) degrees of freedom \[x_o^2 \sim \chi_{(k-1)}^2.\]

  • Note:
Conover, Johnson, and Johnson (1981) carried a simulation comparing different variance homogeneity tests and reported that the modified Fligner-Killeen test is most robust against departures from normality.

Computational Resources: Internet-based SOCR Tools

Examples

Suppose we wanted to study whether the variances in certain time period (e.g., 1981 to 2006) of the consumer-price-indices (CPI) of several items were significantly different. We can use the SOCR CPI Dataset to answer this question for the Fuel, Oil, Bananas, Tomatoes, Orange Juice, Beef and Gasoline items.

See also


Alternative tests of Variance Homogeneity

References

  • Conover, W. J., Johnson, M.E., and Johnson M. M. (1981), A comparative study of tests for homogeneity of variances, with applications to the outer continental shelf bidding data. Technometrics 23, 351-361.

"-----


Translate this page:

(default)
Uk flag.gif

Deutsch
De flag.gif

Español
Es flag.gif

Français
Fr flag.gif

Italiano
It flag.gif

Português
Pt flag.gif

日本語
Jp flag.gif

България
Bg flag.gif

الامارات العربية المتحدة
Ae flag.gif

Suomi
Fi flag.gif

इस भाषा में
In flag.gif

Norge
No flag.png

한국어
Kr flag.gif

中文
Cn flag.gif

繁体中文
Cn flag.gif

Русский
Ru flag.gif

Nederlands
Nl flag.gif

Ελληνικά
Gr flag.gif

Hrvatska
Hr flag.gif

Česká republika
Cz flag.gif

Danmark
Dk flag.gif

Polska
Pl flag.png

România
Ro flag.png

Sverige
Se flag.gif