Difference between revisions of "SOCR EduMaterials Activities BoxPlot"

From SOCR
Jump to: navigation, search
m
 
(5 intermediate revisions by 3 users not shown)
Line 1: Line 1:
== [[SOCR_EduMaterials_Activities | SOCR Educational Materials - Activities]] - SOCR Box-and-Whisker Plot Activity ==
+
== [[SOCR_EduMaterials_ChartsActivities | SOCR Charts Activities]] - SOCR Box-and-Whisker Plot Activity ==
  
=== Summary===
+
== Summary==
 
This activity describes the construction of the box-and-whisker plot (or simply box plot) in SOCR.  The applets can be accessed at [http://www.socr.ucla.edu/htmls/SOCR_Charts.html SOCR Charts] under the Miscellaneous folder.
 
This activity describes the construction of the box-and-whisker plot (or simply box plot) in SOCR.  The applets can be accessed at [http://www.socr.ucla.edu/htmls/SOCR_Charts.html SOCR Charts] under the Miscellaneous folder.
  
===Goals===
+
==Goals==
 
The aims of this activity are to:
 
The aims of this activity are to:
* show the importance of the box plot in explonatory data analysis (EDA)
+
* show the importance of the box plot in exploratory data analysis (EDA)
 
* illustrate how to use SOCR to construct a box plot
 
* illustrate how to use SOCR to construct a box plot
 
* present some unusual pathologies of a box plot
 
* present some unusual pathologies of a box plot
  
===Background & Motivation===
+
==Background & Motivation==
'''The boxplot (or box-and-whisker-plot), invented by John Tukey in 1977, is an efficient way for presenting data, especially for comparing multiple groups of data.  In the box plot we can mark-off the five-number summay of a data set (minimum, 25th percentile, median, 75th percentile, maximum).  The box contains the <math> 50 % </math> of the data.  The upper edge of the box represents the 75th percentile, while the lower edge the 35th percentile.  The median is represented by a line drawn in the middle of the box.  If the median is not in the middle of the box then the data are skewed.  The ends of the lines (called whiskers) represent the minimum and maximum values of the data set, unless there are outliers.  Outliers are observations below <math> Q_1 -1.5 (IQR) </math> or above <math> Q_3 + 1.5(IQR) </math>, where <math> Q_1</math> is the 25th percentile, <math> Q_3</math> is the 75th percentile, and <math> IQR=Q_3-Q_1 </math> (called the interquartile range).  The advantage of a box plot is that it provides grahically the location and the spread of the data set, it provides an idea about the skewness of the data set, and can provide a comparison between variables by constructing a side-by-side box plots.
+
The box plot (or box-and-whisker-plot), invented by John Tukey in 1977, is an efficient way for presenting data, especially for comparing multiple groups of data.  In the box plot, we can mark-off the five-number summary of a data set (minimum, 25th percentile, median, 75th percentile, maximum).  The box contains the <math> 50 % </math> of the data.  The upper edge of the box represents the 75th percentile, while the lower edge is the 25th percentile.  The median is represented by a line drawn in the middle of the box.  If the median is not in the middle of the box then the data are skewed.  The ends of the lines (called whiskers) represent the minimum and maximum values of the data set, unless there are outliers.  Outliers are observations below <math> Q_1 -1.5 (IQR) </math> or above <math> Q_3 + 1.5(IQR) </math>, where <math> Q_1</math> is the 25th percentile, <math> Q_3</math> is the 75th percentile, and <math> IQR=Q_3-Q_1 </math> (called the interquartile range).  The advantage of a box plot is that it provides graphically the location and the spread of the data set, it provides an idea about the skewness of the data set, and can provide a comparison between variables by constructing a side-by-side box plots.
  
===Examples & Exercises===
+
==Examples & Exercises==
* '''Example 1''': Go to the [http://www.socr.ucla.edu/htmls/SOCR_Charts.html SOCR Charts] and first, click on the '''Miscellaneous''' folder and then on '''BoxAndWhiskerChartDemo1'''.  In the Demo1 boxplot we can see side-by-side box plots of two categories for each of three series.  These demonstration data can be viewed by clicking on DATA.  Clicking on MAPPING you can choose the variables.  Clicking on SHOW ALL the applet will present the graph, the data, and the mapping environment.  Let’s clear this data set (click on CLEAR) so that we can enter our own data.  After you click on CLEAR click on DATA to enter into the spreadsheet.  The following data will be entered (don’t forget to separate the data by commas!):
+
* '''Example 1''': Go to the [http://www.socr.ucla.edu/htmls/SOCR_Charts.html SOCR Charts] and first, click on the '''Miscellaneous''' folder and then on '''BoxAndWhiskerChartDemo1'''.  In the Demo1 box plot, we can see side-by-side box plots of two categories for each of three series.  These demonstration data can be viewed by clicking on DATA.  Clicking on MAPPING you can choose the variables.  Clicking on SHOW ALL the applet will present the graph, the data, and the mapping environment.  Let’s clear this data set (click on CLEAR) so that we can enter our own data.  After you click on CLEAR button, click on the DATA tab to enter data into the spreadsheet.  The following data will be entered (don’t forget to separate the data by commas!):
  
<TABLE BORDER=1>
+
<center><TABLE BORDER=1>
 
<TR> <TD>C1</TD>        <TD>  C2</TD>            <TD>  C3 </TD>    </TR>             
 
<TR> <TD>C1</TD>        <TD>  C2</TD>            <TD>  C3 </TD>    </TR>             
 
<TR><TD>Series 1</TD>  <TD>1,2,3,4,5,6</TD>      <TD>2,4,6,8,10,12</TD>    </TR>  
 
<TR><TD>Series 1</TD>  <TD>1,2,3,4,5,6</TD>      <TD>2,4,6,8,10,12</TD>    </TR>  
 
<TR><TD>Series 2</TD>    <TD>3,4,5,6,7,8</TD>      <TD>6,8,10,12,14,16,18</TD>  </TR>
 
<TR><TD>Series 2</TD>    <TD>3,4,5,6,7,8</TD>      <TD>6,8,10,12,14,16,18</TD>  </TR>
 
<TR><TD>Series 3</TD>    <TD>5,6,7,8,9</TD>        <TD>10,16,18,20,22</TD>  </TR>
 
<TR><TD>Series 3</TD>    <TD>5,6,7,8,9</TD>        <TD>10,16,18,20,22</TD>  </TR>
</TABLE>
+
</TABLE></center>
  
 
When you finish entering your data, click on MAPPING to select the series and categories, and finally click on UPDATE_CHART to view the box plots.
 
When you finish entering your data, click on MAPPING to select the series and categories, and finally click on UPDATE_CHART to view the box plots.
Line 40: Line 40:
  
 
* '''Example 2''':  
 
* '''Example 2''':  
If we are working with a single variable we can use the '''BoxAndWhiskerChartDemo2'''.  Double click this link to see the demonstration of the contruction of the box plot with one variable.  As we did in example 1, we will enter our own data.  Click on CLEAR to enter your data in the spreadsheet.  The data we want to enter are the following: 60, 95, 72, 87, 88, 75, 76, 91, 100, 58, 78, 81, 73, 94, 65.   
+
If we are working with a single variable, we can use the '''BoxAndWhiskerChartDemo2'''.  Double click this link to see the demonstration of the construction of the box plot with one variable.  As we did in example 1, we will enter our own data.  Click on CLEAR to enter your data in the spreadsheet.  The data we want to enter are the following: 60, 95, 72, 87, 88, 75, 76, 91, 100, 58, 78, 81, 73, 94, 65.   
  
 
When you finish entering your data, click on MAPPING to select the category (here only C1), and finally click on UPDATE_CHART to view the box plot.
 
When you finish entering your data, click on MAPPING to select the category (here only C1), and finally click on UPDATE_CHART to view the box plot.
Line 49: Line 49:
 
The following snapshot shows the mapping of the data:
 
The following snapshot shows the mapping of the data:
 
<center>[[Image: SOCR_Activities_More_Examples_Christou_box2_map.jpg|600px]]</center>
 
<center>[[Image: SOCR_Activities_More_Examples_Christou_box2_map.jpg|600px]]</center>
 
  
 
The following snapshot shows the box plot:
 
The following snapshot shows the box plot:
Line 58: Line 57:
  
 
===Box Plot Pathologies===
 
===Box Plot Pathologies===
Box plots can show unusual pathologies.  For the following box plots enter the data in the SOCR Charts spreadsheet that created them.
+
Box plots can show unusual pathologies.  For the following box plots, enter the data in the SOCR Charts spreadsheet that created them.
  
 
* '''Example 1''':  
 
* '''Example 1''':  
Line 80: Line 79:
  
  
===Other Forms of Data===
+
==Other Forms of Data==
Alternatively, the user can access data by clicking on FILE OPEN.  The data must be saved previously as a comma delimited (CSV) file before accessed in SOCR.
+
Alternatively, the user can import data by clicking on FILE OPEN.  Note here that the data must be saved previously as a comma delimited (CSV) in order to be accessed in SOCR.
  
  
Line 87: Line 86:
 
* SOCR Home page: http://www.socr.ucla.edu
 
* SOCR Home page: http://www.socr.ucla.edu
  
{{translate|pageName=http://wiki.stat.ucla.edu/socr/index.php?title=SOCR_EduMaterials_Activities_BoxPlot}}</math>
+
{{translate|pageName=http://wiki.stat.ucla.edu/socr/index.php?title=SOCR_EduMaterials_Activities_BoxPlot}}

Latest revision as of 11:19, 2 July 2009

SOCR Charts Activities - SOCR Box-and-Whisker Plot Activity

Summary

This activity describes the construction of the box-and-whisker plot (or simply box plot) in SOCR. The applets can be accessed at SOCR Charts under the Miscellaneous folder.

Goals

The aims of this activity are to:

  • show the importance of the box plot in exploratory data analysis (EDA)
  • illustrate how to use SOCR to construct a box plot
  • present some unusual pathologies of a box plot

Background & Motivation

The box plot (or box-and-whisker-plot), invented by John Tukey in 1977, is an efficient way for presenting data, especially for comparing multiple groups of data. In the box plot, we can mark-off the five-number summary of a data set (minimum, 25th percentile, median, 75th percentile, maximum). The box contains the \( 50 % \) of the data. The upper edge of the box represents the 75th percentile, while the lower edge is the 25th percentile. The median is represented by a line drawn in the middle of the box. If the median is not in the middle of the box then the data are skewed. The ends of the lines (called whiskers) represent the minimum and maximum values of the data set, unless there are outliers. Outliers are observations below \( Q_1 -1.5 (IQR) \) or above \( Q_3 + 1.5(IQR) \), where \( Q_1\) is the 25th percentile, \( Q_3\) is the 75th percentile, and \( IQR=Q_3-Q_1 \) (called the interquartile range). The advantage of a box plot is that it provides graphically the location and the spread of the data set, it provides an idea about the skewness of the data set, and can provide a comparison between variables by constructing a side-by-side box plots.

Examples & Exercises

  • Example 1: Go to the SOCR Charts and first, click on the Miscellaneous folder and then on BoxAndWhiskerChartDemo1. In the Demo1 box plot, we can see side-by-side box plots of two categories for each of three series. These demonstration data can be viewed by clicking on DATA. Clicking on MAPPING you can choose the variables. Clicking on SHOW ALL the applet will present the graph, the data, and the mapping environment. Let’s clear this data set (click on CLEAR) so that we can enter our own data. After you click on CLEAR button, click on the DATA tab to enter data into the spreadsheet. The following data will be entered (don’t forget to separate the data by commas!):
C1 C2 C3
Series 1 1,2,3,4,5,6 2,4,6,8,10,12
Series 2 3,4,5,6,7,8 6,8,10,12,14,16,18
Series 3 5,6,7,8,9 10,16,18,20,22

When you finish entering your data, click on MAPPING to select the series and categories, and finally click on UPDATE_CHART to view the box plots. The following snapshot shows how the above data entered into SOCR:

SOCR Activities More Examples Christou box1 data.jpg

The following snapshot shows the mapping of the data:

SOCR Activities More Examples Christou box1 map.jpg


The following snapshot shows the side-by-side box plots:

SOCR Activities More Examples Christou box1 graph.jpg

The following snapshot shows the data, the mapping, and the box plots in one screen:

SOCR Activities More Examples Christou box1 show all.jpg


  • Example 2:

If we are working with a single variable, we can use the BoxAndWhiskerChartDemo2. Double click this link to see the demonstration of the construction of the box plot with one variable. As we did in example 1, we will enter our own data. Click on CLEAR to enter your data in the spreadsheet. The data we want to enter are the following: 60, 95, 72, 87, 88, 75, 76, 91, 100, 58, 78, 81, 73, 94, 65.

When you finish entering your data, click on MAPPING to select the category (here only C1), and finally click on UPDATE_CHART to view the box plot.

The following snapshot shows how the above data entered into SOCR:

SOCR Activities More Examples Christou box2 data.jpg

The following snapshot shows the mapping of the data:

SOCR Activities More Examples Christou box2 map.jpg

The following snapshot shows the box plot:

SOCR Activities More Examples Christou box2 graph.jpg

The following snapshot shows the data, the mapping, and the box plots in one screen:

SOCR Activities More Examples Christou box2 show all.jpg

Box Plot Pathologies

Box plots can show unusual pathologies. For the following box plots, enter the data in the SOCR Charts spreadsheet that created them.

  • Example 1:
SOCR Activities More Examples Christou box2 pathology1.jpg


  • Example 2:
SOCR Activities More Examples Christou box2 pathology2.jpg


  • Example 3:
SOCR Activities More Examples Christou box2 pathology3.jpg


  • Example 4:
SOCR Activities More Examples Christou box2 pathology4.jpg


  • Example 5:
SOCR Activities More Examples Christou box2 pathology5.jpg


Other Forms of Data

Alternatively, the user can import data by clicking on FILE OPEN. Note here that the data must be saved previously as a comma delimited (CSV) in order to be accessed in SOCR.





Translate this page:

(default)
Uk flag.gif

Deutsch
De flag.gif

Español
Es flag.gif

Français
Fr flag.gif

Italiano
It flag.gif

Português
Pt flag.gif

日本語
Jp flag.gif

България
Bg flag.gif

الامارات العربية المتحدة
Ae flag.gif

Suomi
Fi flag.gif

इस भाषा में
In flag.gif

Norge
No flag.png

한국어
Kr flag.gif

中文
Cn flag.gif

繁体中文
Cn flag.gif

Русский
Ru flag.gif

Nederlands
Nl flag.gif

Ελληνικά
Gr flag.gif

Hrvatska
Hr flag.gif

Česká republika
Cz flag.gif

Danmark
Dk flag.gif

Polska
Pl flag.png

România
Ro flag.png

Sverige
Se flag.gif