AP Statistics Curriculum 2007 IntroDesign

==General Advance-Placement (AP) Statistics Curriculum - Design and Experiments==

===Design and Experiments===

Design of experiments refers to the blueprint for planning a study or experiment, performing the data collection protocol and controlling the study parameters for accuracy and consistency. Design of experiments only makes sense in studies where variation, chance and uncertainty are present and unavoidable. Data, or information, is typically collected about a specific process or phenomenon being studied, in order to investigate the effects of some controlled variables (independent variables or predictors) on other observed measurements (responses or dependent variables). Both types of variables are associated with specific observational units (living beings, components, objects, materials, etc.).

===Approach===

The following are the most common components used in Experimental Design.
  
* '''Comparison''': To make inferences about effects, associations or predictions, one typically has to compare different groups subjected to distinct conditions. This allows us to contrast the observed responses and the underlying group differences, which may ultimately lead to inference about the relations and influences between the controlled and the observed variables.
  
* '''Randomization''': The second fundamental design principle is randomization. It requires that we allocate treatments (controlled variables) to units using some [[SOCR_EduMaterials_Activities_RNG | random mechanism]]. This simply guarantees that effects that may be present in the units, but are not incorporated in the model, are equally distributed amongst all groups and are therefore unlikely to significantly affect our group comparisons at the end of the statistical inference or analysis (as these effects, if present, will be similar within each group). A minimal randomized-allocation sketch is shown after this list.
  
* '''Replication''': All measurements we make, observations we acquire and data we collect are subject to variation, as [[SOCR_EduMaterials_Activities_RNG | there are no completely deterministic processes]]. As we try to make inference about the process that generated the observed data (not about the sample data itself, even though our statistical analyses are data-driven and therefore based on the observed measurements), the more data we collect without bias, the stronger our inference is likely to be. Therefore, repeated measurements intuitively allow us to tame the variability associated with the phenomenon we study.
  
* '''Blocking''': Blocking is related to randomization. The difference is that we use blocking when we know ''a priori'' that certain characteristics of the observational units affect the response measurements (e.g., when studying the effects of hormonal treatments on humans, gender plays a significant role). We arrange the units into groups (blocks) that are similar to one another. Blocking reduces known but irrelevant sources of variation between units and thus allows greater precision in the estimation of the source of variation under study. The sketch following this list allocates treatments at random within each block.
  
* '''Orthogonality''': Orthogonality allows us to divide complex relations and variation into separate (independent/orthogonal) contrasts, or factors, that can be studied efficiently and autonomously. Often these contrasts may be represented by vectors, where sets of orthogonal contrasts are uncorrelated and may be independently distributed. Independence implies that each orthogonal contrast provides information complementary to the other contrasts (i.e., other treatments). The goal is to completely decompose the variance, or the relations among the observed measurements, into independent components (e.g., just as a [http://en.wikipedia.org/wiki/Taylor_expansion Taylor expansion] allows polynomial decomposition of smooth functions, where the polynomial basis functions are easy to differentiate, integrate, etc.). This, of course, allows easier interpretation of the statistical analysis and of the findings of the study.
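Randomization and blocking can be made concrete with a short simulation. The following is a minimal sketch in Python (not a SOCR tool; the unit labels, block labels and treatment names are all hypothetical) that shuffles the units within each block and then cycles through the treatment arms, so that every treatment is replicated roughly equally often inside every block:

<pre>
import random

def randomized_block_design(units, blocks, treatments, seed=0):
    """Randomly assign treatments to units, separately within each block."""
    rng = random.Random(seed)
    assignment = {}
    for block in sorted(set(blocks)):
        # Units belonging to this block (e.g., all female or all male subjects).
        members = [u for u, b in zip(units, blocks) if b == block]
        rng.shuffle(members)
        # Cycle through the treatment arms so each arm is replicated
        # (nearly) equally often inside the block.
        for i, unit in enumerate(members):
            assignment[unit] = treatments[i % len(treatments)]
    return assignment

# Hypothetical study: 8 subjects blocked by gender, two treatment arms.
units = ["s1", "s2", "s3", "s4", "s5", "s6", "s7", "s8"]
blocks = ["F", "F", "F", "F", "M", "M", "M", "M"]
print(randomized_block_design(units, blocks, ["hormone", "placebo"]))
</pre>

Because the shuffling happens separately inside each block, any block-level effect (here, gender) is balanced across the treatment groups by construction.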
 
 
Orthogonality concerns the forms of comparison (contrasts) that can be legitimately and efficiently carried out. Contrasts can be represented by vectors, and sets of orthogonal contrasts are uncorrelated and independently distributed if the data are normal. Because of this independence, each orthogonal treatment contrast provides information different from that of the others. If there are T treatments, there are T - 1 mutually orthogonal contrasts, and all the information that can be captured from the experiment is obtainable from this set of contrasts.
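For instance, with T = 3 treatments there are T - 1 = 2 mutually orthogonal contrasts. The following minimal sketch (plain Python; the coefficient choices and group means are hypothetical) checks that each set of coefficients sums to zero, that the two contrasts are orthogonal, and then applies them to the group means:

<pre>
# c1 compares treatment 1 against treatment 2; c2 compares their average
# against treatment 3. These coefficient choices are illustrative, not unique.
c1 = [1, -1, 0]
c2 = [1, 1, -2]

def is_contrast(c):
    """A contrast's coefficients must sum to zero."""
    return sum(c) == 0

def orthogonal(a, b):
    """Two contrasts are orthogonal when their dot product is zero
    (assuming equal group sizes)."""
    return sum(x * y for x, y in zip(a, b)) == 0

means = [4.2, 5.1, 7.3]  # hypothetical treatment-group means

assert is_contrast(c1) and is_contrast(c2) and orthogonal(c1, c2)
for c in (c1, c2):
    estimate = sum(w * m for w, m in zip(c, means))
    print(c, "estimates", round(estimate, 2))
</pre>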
 
 
 
* '''Factorial experiments''': Use of factorial experiments instead of the one-factor-at-a-time method. These are efficient at evaluating the effects and possible interactions of several factors (independent variables).
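A minimal sketch of a full factorial layout (Python; the factor names and levels are hypothetical) enumerates every combination of factor levels, which is exactly what allows interactions between factors to be estimated:

<pre>
from itertools import product

# Full factorial design for three two-level factors: every combination
# of levels is run.
factors = {
    "temperature": ["low", "high"],
    "pressure": ["low", "high"],
    "catalyst": ["A", "B"],
}

runs = [dict(zip(factors, combo)) for combo in product(*factors.values())]
for run in runs:
    print(run)
print(len(runs), "runs cover all main effects and interactions")

# A one-factor-at-a-time plan would vary each factor against a fixed
# baseline (1 + 3 = 4 runs here), but it cannot detect interactions.
</pre>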
 
 
 
The analysis of designed experiments was built on the foundation of the analysis of variance (ANOVA), a collection of models in which the observed variance is partitioned into components attributable to the different factors, which are then estimated and/or tested.
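The following is a minimal sketch of this variance partition for a single factor (plain Python, hypothetical data), illustrating the fundamental ANOVA identity SS<sub>total</sub> = SS<sub>between</sub> + SS<sub>within</sub>:

<pre>
# Partition the total variation of hypothetical data into a between-group
# component (due to the factor) and a within-group (residual) component.
groups = {
    "treatment_A": [4.1, 4.3, 3.9, 4.4],
    "treatment_B": [5.0, 5.4, 5.1, 4.9],
    "control":     [4.5, 4.6, 4.2, 4.7],
}

all_values = [x for g in groups.values() for x in g]
grand_mean = sum(all_values) / len(all_values)

ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2
                 for g in groups.values())
ss_within = sum((x - sum(g) / len(g)) ** 2
                for g in groups.values() for x in g)
ss_total = sum((x - grand_mean) ** 2 for x in all_values)

print(f"SS_between = {ss_between:.3f}")
print(f"SS_within  = {ss_within:.3f}")
print(f"SS_total   = {ss_total:.3f}  (equals between + within)")
</pre>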
 
 
  
 
===Model Validation===

Checking/affirming underlying assumptions.

* TBD
  
 
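As one concrete illustration (using the general-purpose SciPy library rather than a SOCR tool, and hypothetical data), the sketch below checks two assumptions common in comparative designs, normality of the residuals and equal spread across the groups:

<pre>
from scipy import stats

# Hypothetical residuals from a fitted model; many classical analyses
# (e.g., ANOVA) assume they are roughly normal with similar spread per group.
residuals_by_group = {
    "treatment": [0.2, -0.1, 0.4, -0.3, 0.1, -0.2],
    "control":   [-0.4, 0.3, 0.0, 0.2, -0.1, 0.1],
}

# Shapiro-Wilk test of normality for each group's residuals.
for name, res in residuals_by_group.items():
    stat, p = stats.shapiro(res)
    print(f"{name}: Shapiro-Wilk p = {p:.3f}")

# Levene's test of equal variances across the groups.
stat, p = stats.levene(*residuals_by_group.values())
print(f"Levene p = {p:.3f}  (a small p suggests unequal variances)")
</pre>
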
===Computational Resources: Internet-based SOCR Tools===

* [http://www.socr.ucla.edu/htmls/SOCR_ChoiceOfStatisticalTest.html How to choose a statistical test based on a study design?]
  
===Examples & Hands-on activities===

Computer simulations, real observed data, and step-by-step practice problems.

* TBD
 
  
 

===References===

* TBD


