Difference between revisions of "SOCR Courses 2012 2013 Stat13 1 Lab1"

From SOCR
Jump to: navigation, search
(Created page with '== Stats 13.1 - Laboratory Activity 1== ===Histogram Activity=== This is an exploratory data analysis SOCR activity that illustrates the g…')
 
(3. Histogram from Simulated Data)
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
== [[SOCR_Courses_2012_2013_Stat13_1 | Stats 13.1]] - Laboratory Activity 1==
+
 
 +
== [[SOCR_Courses_2011_2012_Stat13_1 | Stats 13.1]] - Laboratory Activity 1==
  
 
===Histogram Activity===
 
===Histogram Activity===
Line 9: Line 10:
 
====1. Histogram from Categories and Frequencies====
 
====1. Histogram from Categories and Frequencies====
  
* In the area on the left, click the arrow next to Bar Charts then XYPlots. Then click on '''HistogramChartDemo3'''
+
* In the area on the left, click the arrow next to [http://socr.ucla.edu/htmls/SOCR_Charts.html Bar Charts then XYPlots]. Then click on [http://socr.ucla.edu/htmls/chart/HistogramDemoChart3_Chart.html HistogramChartDemo3].
 +
* Click on the SNAPSHOT button and save a copy of this original histogram.
 
* Click on the DATA tab to view the default data. Notice that the chart requires the user to enter the counts/frequencies of observations within each of the range categories (in this default data case, year).
 
* Click on the DATA tab to view the default data. Notice that the chart requires the user to enter the counts/frequencies of observations within each of the range categories (in this default data case, year).
* Using the SHOW ALL tab you can see all three (graph, data and mapping) in the same view.
+
* Using the SHOW ALL tab you can see all three (graph, data and mapping) in the same view.  
* Try revising some of the numbers in the second (frequency) column and click UPDATE button to see the effect of these changes on the histogram. Do this several times. Take a SNAPSHOT and print off one new histogram.
+
* Try revising some of the numbers in the second (frequency) column and click the UPDATE CHART button to see the effect of these changes on the histogram. Change the frequency for at least 6 years.
 +
* Click on the GRAPH button to return to view just the histogram. Take a SNAPSHOT and
 +
print off your new histogram.
  
 
====2. Simple Histogram from Raw Data====
 
====2. Simple Histogram from Raw Data====
  
* Click on HistogramChartDemo1
+
* Click on [http://socr.ucla.edu/htmls/chart/HistogramDemoChart3_Chart.html HistogramChartDemo1] and take a SNAPSHOT of the original histogram.
 
* Scroll down to find the Bin Size adjustment bar
 
* Scroll down to find the Bin Size adjustment bar
* Change the bin size
+
* Make the size of the bins both smaller and larger. Take a SNAPSHOT of each.
  
 
====3. Histogram from Simulated Data====
 
====3. Histogram from Simulated Data====
  
* Lets first get some data: Go to the Modeler tab at the top of the page. It is best if you open a new page for this.
+
* Let's first get some data: Go to the [http://socr.ucla.edu/htmls/SOCR_Modeler.html Modeler tab] at the top of the page (It's best if you open a new page for this).
* Click on the Data Generation button in the center of the screen. From the drop down bar choose the Normal Distribution and change the number of samples to 20 and the standard deviation to 100. Make sure the Raw Data box on the left is checked.
+
* Click on the [[SOCR_EduMaterials_Activities_RNG|Data Generation button]] in the center of the screen. From the drop down bar choose the Normal Distribution and change the number of samples to 20 and the standard deviation to 100. Make sure the Raw Data box on the left is checked.
 
* Hit Sample then click on the Data tab to see the data you generated.
 
* Hit Sample then click on the Data tab to see the data you generated.
* Copy these data using the Copy button at the top. (select all using Apple+a)
+
*Copy these data by highlighting them with the mouse and then using the Copy button at the top.
* Go back to the data section of the histogram chart and replace the current data with the data you just copied. Make sure to use the PASTE button on the left.
+
* Go back to the your histogram plots window and click on the DATA section. Wipe out the old data by pressing on the CLEAR button and then click on the first cell under the column C1. Click on the PASTE button on the left to paste your new data into the window. Double click on 'C1' and relabel your column of data as 'Data' then hit the RETURN key on your keyboard.
* Click UPDATE CHART
+
* Click UPDATE CHART. Go back to your histogram and take SNAPSHOT of your new histogram.
* Change the bin size smaller and larger and observe how the graph changes
+
* Change the bin size of your histogram and take SNAPSHOTs of it with both a smaller and larger bin size.
* Repeat but change the number of samples to 100
+
* Redo the previous steps again but this time generate 150 samples.
* Take a SNAPSHOT and print off one new histogram
+
* Remember to take a SNAPSHOT of your new histogram with the original bin size, with a smaller bin size and with a larger bin size.
 
 
====4. Questions====
 
  
Answer each question fully using appropriate terminology and references to snapshots if
+
====4. Exercise Questions====
appropriate.
 
  
1) What is the effect of the width/size of the histogram bin on the shape of the resulting histogram? Would the shape of the histogram change significantly if we alter the bin-size?
+
'''Directions''': Answer each question by using complete sentences and appropriate terminology. Remember to word your solutions in such a way so that the reader will understand the question you're being asked to solve. You should include appropriately sized snapshots to help explain your answers.
  
2) How do the sample size and bin size interact?
+
* From the Histogram from Categories and Frequencies section:
 +
** How many data points are included in the original histogram (before you made your changes).
 +
** Describe the distribution (Where is it centered? How is it shaped? How much spread is there?). Would using the mean or median be more appropriate for describing the center of the distribution? Why?
 +
** How does the snapshot of the original distribution compare to the snapshot of the one you changed (in terms of the center, shape and spread of the distributions)?
 +
*From the Simple Histogram from Raw Data section:
 +
** Describe the original distribution (with bin size = 0.1).
 +
** Compare the original distribution with the other two snapshots with smaller and larger bin sizes. How does the Y-axis change when bins are smaller or larger? Why does this occur?
 +
** As the bin size gets smaller or larger, how is the shape of the distribution afected?
 +
* From the Histogram from Simulated Data section:
 +
** Describe the distributions of the histograms using both 20 and 150 samples (and the original bin size). How do they compare?
 +
** What happens to the shape of the histogram as we take larger samples?
 +
** Look at your snapshots of the histograms with 150 samples. What happens as the size of the bins get smaller and larger? Of the three snapshots you took, which bin size do you think best describes the data?
 +
** Under what circumstances would you use smaller or larger bin sizes?
  
3) Would you expect the shape of the sample histogram to look like the shape of the population distribution the data sample came from?
 
  
 
<hr>
 
<hr>

Latest revision as of 18:16, 11 April 2013

Stats 13.1 - Laboratory Activity 1

Histogram Activity

This is an exploratory data analysis SOCR activity that illustrates the generation and interpretation of the histogram of quantitative data. In a nutshell, a histogram of a dataset is a graphical visualization of tabulated frequencies or counts of data within equispaced partition of the range of the data. A histogram shows what proportion of measurements fall into each of the categories defined by the partition of the data range space.

Go to the SOCR Charts (use the Charts tab at the top of the page). Once the page comes up, go to the left side of the page and drag the gray bar to the right.

1. Histogram from Categories and Frequencies

  • In the area on the left, click the arrow next to Bar Charts then XYPlots. Then click on HistogramChartDemo3.
  • Click on the SNAPSHOT button and save a copy of this original histogram.
  • Click on the DATA tab to view the default data. Notice that the chart requires the user to enter the counts/frequencies of observations within each of the range categories (in this default data case, year).
  • Using the SHOW ALL tab you can see all three (graph, data and mapping) in the same view.
  • Try revising some of the numbers in the second (frequency) column and click the UPDATE CHART button to see the effect of these changes on the histogram. Change the frequency for at least 6 years.
  • Click on the GRAPH button to return to view just the histogram. Take a SNAPSHOT and

print off your new histogram.

2. Simple Histogram from Raw Data

  • Click on HistogramChartDemo1 and take a SNAPSHOT of the original histogram.
  • Scroll down to find the Bin Size adjustment bar
  • Make the size of the bins both smaller and larger. Take a SNAPSHOT of each.

3. Histogram from Simulated Data

  • Let's first get some data: Go to the Modeler tab at the top of the page (It's best if you open a new page for this).
  • Click on the Data Generation button in the center of the screen. From the drop down bar choose the Normal Distribution and change the number of samples to 20 and the standard deviation to 100. Make sure the Raw Data box on the left is checked.
  • Hit Sample then click on the Data tab to see the data you generated.
  • Copy these data by highlighting them with the mouse and then using the Copy button at the top.
  • Go back to the your histogram plots window and click on the DATA section. Wipe out the old data by pressing on the CLEAR button and then click on the first cell under the column C1. Click on the PASTE button on the left to paste your new data into the window. Double click on 'C1' and relabel your column of data as 'Data' then hit the RETURN key on your keyboard.
  • Click UPDATE CHART. Go back to your histogram and take SNAPSHOT of your new histogram.
  • Change the bin size of your histogram and take SNAPSHOTs of it with both a smaller and larger bin size.
  • Redo the previous steps again but this time generate 150 samples.
  • Remember to take a SNAPSHOT of your new histogram with the original bin size, with a smaller bin size and with a larger bin size.

4. Exercise Questions

Directions: Answer each question by using complete sentences and appropriate terminology. Remember to word your solutions in such a way so that the reader will understand the question you're being asked to solve. You should include appropriately sized snapshots to help explain your answers.

  • From the Histogram from Categories and Frequencies section:
    • How many data points are included in the original histogram (before you made your changes).
    • Describe the distribution (Where is it centered? How is it shaped? How much spread is there?). Would using the mean or median be more appropriate for describing the center of the distribution? Why?
    • How does the snapshot of the original distribution compare to the snapshot of the one you changed (in terms of the center, shape and spread of the distributions)?
  • From the Simple Histogram from Raw Data section:
    • Describe the original distribution (with bin size = 0.1).
    • Compare the original distribution with the other two snapshots with smaller and larger bin sizes. How does the Y-axis change when bins are smaller or larger? Why does this occur?
    • As the bin size gets smaller or larger, how is the shape of the distribution afected?
  • From the Histogram from Simulated Data section:
    • Describe the distributions of the histograms using both 20 and 150 samples (and the original bin size). How do they compare?
    • What happens to the shape of the histogram as we take larger samples?
    • Look at your snapshots of the histograms with 150 samples. What happens as the size of the bins get smaller and larger? Of the three snapshots you took, which bin size do you think best describes the data?
    • Under what circumstances would you use smaller or larger bin sizes?





Translate this page:

(default)
Uk flag.gif

Deutsch
De flag.gif

Español
Es flag.gif

Français
Fr flag.gif

Italiano
It flag.gif

Português
Pt flag.gif

日本語
Jp flag.gif

България
Bg flag.gif

الامارات العربية المتحدة
Ae flag.gif

Suomi
Fi flag.gif

इस भाषा में
In flag.gif

Norge
No flag.png

한국어
Kr flag.gif

中文
Cn flag.gif

繁体中文
Cn flag.gif

Русский
Ru flag.gif

Nederlands
Nl flag.gif

Ελληνικά
Gr flag.gif

Hrvatska
Hr flag.gif

Česká republika
Cz flag.gif

Danmark
Dk flag.gif

Polska
Pl flag.png

România
Ro flag.png

Sverige
Se flag.gif