Difference between revisions of "SOCR MotionCharts CAOzoneData"
(→References) |
|||
(40 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
− | == [[SOCR_MotionCharts| SOCR MotionCharts Activities]] - California Ozone | + | == [[SOCR_MotionCharts| SOCR MotionCharts Activities]] - California Ozone Pollution: Geographic and Time Effects on Health Activity == |
== Summary== | == Summary== | ||
− | This activity demonstrates the usage and functionality of [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionCharts] using the [[ | + | This activity demonstrates a complete, interactive, self-contained and technology-enhanced pedagogical study of geographic and time effects of Ozone pollution on human health. In addition, this activity illustrates the usage and functionality of [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionCharts] for exploratory multivariate data analysis using the [[SOCR_Data_121608_OzoneData | SOCR California Ozone dataset]]. |
==Goals== | ==Goals== | ||
− | The aims of this activity | + | The aims of this activity are to: |
− | * | + | * Use MotionChart to address 2 specific health-related case-studies |
− | * | + | * Demonstrate data import, MotionChart data manipulations and graphical data interpretation |
− | * | + | * Explore the interactive graphical visualization of real-life multidimensional datasets |
− | * | + | * Data navigation from different directions (using data mappings). |
+ | |||
+ | ==Activity videos== | ||
+ | There are several videos that demonstrate the mechanics of using the SOCR Motion Charts to analyze longitudinal multivariate data: | ||
+ | * A 6-minute video of this California Ozone Pollution Activity: Geographic and Time Effects on Health, which is available [http://www.youtube.com/watch?v=4rPLixYLwgw YouTube Stream (low resolution)], and a [http://www.socr.ucla.edu/docs/videos/SOCR_CA_OzoneActivity_July2010.mp4 high resolution analogue (80MB)] | ||
+ | * [http://www.socr.ucla.edu/docs/videos/SOCR_OzoneVideoActivity_2010_old.mp4 A video describing the health and environmental implications of this data and activity]. | ||
==Background == | ==Background == | ||
[[Image:SOCR_OzoneData_GeoMap_Dinov_121608_Fig1.png|200px|thumbnail|right| [http://socr.ucla.edu/docs/resources/SOCR_Data/SOCR_OzoneData_GoogleMap.html Ozone Geo-Map] ]] | [[Image:SOCR_OzoneData_GeoMap_Dinov_121608_Fig1.png|200px|thumbnail|right| [http://socr.ucla.edu/docs/resources/SOCR_Data/SOCR_OzoneData_GoogleMap.html Ozone Geo-Map] ]] | ||
− | Suppose we are asked to analyze a complex dataset that included observational multivariate ozone depletion data. The data included [[ | + | Suppose we are asked to analyze a complex dataset that included observational multivariate ozone depletion data. The data included [[SOCR_Data_121608_OzoneData |California Ozone measurements from 20 locations between 1980 and 2006]]. The figure on the right illustrates a dynamic interactive map of the geographic locations of the data measurements. This dataset consists of 540 rows and 22 variables. The goals of the study were to identify relationships and associations between the variables and map geographically the significant ozone layer effects. Any such quantitative study requires a preliminary exploratory data analysis. The complexity of the dataset and the intrinsic measurement characteristics of the ozone data demands a new approach to visualization and exploration of these heterogeneous measurements. |
==Case Study== | ==Case Study== | ||
Line 20: | Line 25: | ||
This Ozone pollution case study addresses the following specific driving environmental challenges: | This Ozone pollution case study addresses the following specific driving environmental challenges: | ||
* ''Are there temporal changes in California Ozone?'' | * ''Are there temporal changes in California Ozone?'' | ||
− | * ''What is the geographic distribution of the California Ozone pollution and | + | * ''What is the geographic distribution of the California Ozone pollution and is it changing with time?'' |
− | The following chart illustrates the health-related interpretation of the Ozone data in terms of the particulate (particles per million, '''ppm''') recordings, according to the National Oceanic and Atmospheric Administration's (NOAA) Air Quality Index (AQI). | + | The following chart illustrates the health-related interpretation of the Ozone data in terms of the particulate (particles per million, '''ppm''') recordings, according to the National Oceanic and Atmospheric Administration's (NOAA) Air Quality Index (AQI). [http://www.epa.gov EPA] guidelines on air quality are as follows: |
+ | * (2010) The Obama administration’s proposal sets a primary standard for ground-level ozone of no more than 0.060 to 0.070 ppm, to be phased in over two decades. | ||
+ | * (2005) The Bush administration imposed a limit of 0.075 ppm. | ||
+ | * (1997) The Clinton administration introduced a standard of 0.084 ppm. | ||
===Temporal changes in California Ozone=== | ===Temporal changes in California Ozone=== | ||
* Observation: Notice the annualized increase of the ozone pollution with time (increase of the proportion of ''hot-colored bubbles'' with time). | * Observation: Notice the annualized increase of the ozone pollution with time (increase of the proportion of ''hot-colored bubbles'' with time). | ||
− | * Motion Chart: Use the following variable-mapping to demonstrate the significant time effect on the increase of the ozone pollution as measured by ppm recordings: | + | * Motion Chart: Use the following ''variable-mapping'' to demonstrate the significant time effect on the increase of the ozone pollution as measured by ppm recordings: |
<center> | <center> | ||
{| class="wikitable" | {| class="wikitable" | ||
Line 34: | Line 42: | ||
! [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionChart Property] || Key || X-Axis || Y-Axis || Size || Color || Category | ! [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionChart Property] || Key || X-Axis || Y-Axis || Size || Color || Category | ||
|- | |- | ||
− | ! [[ | + | ! [[SOCR_Data_121608_OzoneData | Data Column Name]] || Year || MTH_1|| MTH_8 || HI_COVER || ANNUAL || Location |
|} | |} | ||
</center> | </center> | ||
+ | * Note that the variable mapping we used above allows us to track seasonal changes of ozone pollution (Winter, month1, on the X-axis; and Summer, month8, on the Y-axis). If we play this motion chart, the following interpretation of the motion chart is appropriate: For a given bubble (representing one measurement location): | ||
+ | ** ''up-down movement'' indicate annual Winter (January) increase or decrease of pollution levels, respectively; | ||
+ | ** ''left-right movement'' indicate annual Summer (August) increase or decrease of pollution levels, respectively; | ||
+ | ** ''bubble-size increase'' or ''decrease'' indicate corresponding changes in the annual percent coverage during typical periods of high concentration; | ||
+ | ** ''color change'' from ''cool-to-hot'' indicates an ''annual increase'' of the ozone measurement for that specific location from one year to the next. | ||
+ | |||
: You should see an image like this one shown below. Play this motion charts by clicking the '''Play''' button and observe the increase of hot-colored bubbles in the chart as time goes from 1980 to 2006. | : You should see an image like this one shown below. Play this motion charts by clicking the '''Play''' button and observe the increase of hot-colored bubbles in the chart as time goes from 1980 to 2006. | ||
<center>[[Image:SOCR_OzoneData_AQI_Ozone_Chart2.png|600px]]</center> | <center>[[Image:SOCR_OzoneData_AQI_Ozone_Chart2.png|600px]]</center> | ||
− | ===Geographic distribution | + | ===Geographic distribution of California Ozone pollution=== |
* Observation: The ozone pollution appears to be a more geographically spread out phenomenon in the 2000's, compared to the 1980's -- most of the bubbles cluster together in later years, whereas there were wider geographic-driven fluctuations in the ozone particles in the earlier years. The size of the bubbles reflects the maximum annual pollution and the bubble color indicates the average annual ozone pollution -- ''hot-colors'' represent high and ''cool-colors'' represent low ozone pollution levels, respectively. | * Observation: The ozone pollution appears to be a more geographically spread out phenomenon in the 2000's, compared to the 1980's -- most of the bubbles cluster together in later years, whereas there were wider geographic-driven fluctuations in the ozone particles in the earlier years. The size of the bubbles reflects the maximum annual pollution and the bubble color indicates the average annual ozone pollution -- ''hot-colors'' represent high and ''cool-colors'' represent low ozone pollution levels, respectively. | ||
* Motion Chart: Use the following variable-mapping to demonstrate the significant geographic temporal re-distribution of the ozone pollution as measured by ppm recordings: | * Motion Chart: Use the following variable-mapping to demonstrate the significant geographic temporal re-distribution of the ozone pollution as measured by ppm recordings: | ||
Line 50: | Line 64: | ||
! [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionChart Property] || Key || X-Axis || Y-Axis || Size || Color || Category | ! [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionChart Property] || Key || X-Axis || Y-Axis || Size || Color || Category | ||
|- | |- | ||
− | ! [[ | + | ! [[SOCR_Data_121608_OzoneData | Data Column Name]] || Year || LONGITUDE || LATITUDE || HI_COVER || ANNUAL || Location |
|} | |} | ||
</center> | </center> | ||
− | : You should see an image like this one shown below. In this mapping, each bubble corresponds spatially to a geographic location, just like in the [http://socr.ucla.edu/docs/resources/SOCR_Data/SOCR_OzoneData_GoogleMap.html geographic-map above]. Play this motion charts by clicking the '''Play''' button and observe the increase of hot-colored bubbles in later years at geographic locations which did not show unhealthy ozone pollution levels | + | * This second variable mapping allows us to track seasonal changes of ozone pollution for each geographic location. Here the X and Y locations of bubbles are locked at the [http://en.wikipedia.org/wiki/Geographic_coordinate_system GIS longitude and latitude coordinates]. By playing this motion chart, we have the following interpretation: For a given bubble (representing one GIS location): |
+ | ** ''bubble-size increase'' or ''decrease'' indicate corresponding changes in the annual percent coverage during typical periods of high concentration; | ||
+ | ** ''color change'' from ''cool-to-hot'' or ''hot-to-cool'' indicate ''annual increase'' or ''annual decrease'', respectively, of the ozone measurement for that specific location from one year to the next. | ||
+ | |||
+ | : You should see an image like this one shown below. In this mapping, each bubble corresponds spatially to a geographic location, just like in the [http://socr.ucla.edu/docs/resources/SOCR_Data/SOCR_OzoneData_GoogleMap.html geographic-map above]. Play this motion charts by clicking the '''Play''' button and observe the increase of hot-colored bubbles in later years at geographic locations which did not show unhealthy ozone pollution levels in the early years. | ||
<center>[[Image:SOCR_OzoneData_AQI_Ozone_Chart3.png|600px]]</center> | <center>[[Image:SOCR_OzoneData_AQI_Ozone_Chart3.png|600px]]</center> | ||
==Initial setting== | ==Initial setting== | ||
− | In addition to this activity, open 2 more browser tabs - one pointing to the [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionCharts applet] and the other displaying the [[ | + | In addition to this activity, open 2 more browser tabs - one pointing to the [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionCharts applet] and the other displaying the [[SOCR_Data_121608_OzoneData | SOCR California Ozone dataset]]. The image below shows this setting. |
<center>[[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig1.png|250px]] [[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig2.png|250px]] [[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig3.png|250px]] | <center>[[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig1.png|250px]] [[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig2.png|250px]] [[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig3.png|250px]] | ||
</center> | </center> | ||
==Hands-on activity== | ==Hands-on activity== | ||
− | * Using the mouse, copy the [[ | + | * Using the mouse, copy the [[SOCR_Data_121608_OzoneData | SOCR California Ozone dataset]], click on the first cell (top-left) in the DATA tab of the [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR Motion Charts applet], and paste the data in the spreadsheet. |
<center>[[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig2.png|400px]]</center> | <center>[[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig2.png|400px]]</center> | ||
Line 74: | Line 92: | ||
! [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionChart Property] || Key || X-Axis || Y-Axis || Size || Color || Category | ! [http://socr.ucla.edu/SOCR_MotionCharts/ SOCR MotionChart Property] || Key || X-Axis || Y-Axis || Size || Color || Category | ||
|- | |- | ||
− | ! [[ | + | ! [[SOCR_Data_121608_OzoneData | Data Column Name]] || Year || Longitude || Latitude || MTH_1|| MTH_7 || Location |
|} | |} | ||
</center> | </center> | ||
Line 91: | Line 109: | ||
[[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig10.png|150px]] | [[Image:SOCR_Activities_MotionCharts_Ozone_070109_Fig10.png|150px]] | ||
</center> | </center> | ||
+ | |||
+ | We can also overlay the Motion-chart above on the [http://socr.ucla.edu/docs/resources/SOCR_Data/SOCR_OzoneData_GoogleMap.html Geo-map of the 20 locations] of the ozone recording stations. This of course, stretches vertically the motion chart, as longitude and latitude coordinates are not isotropic. | ||
+ | <center>[[Image:SOCR_Activities_MotionCharts_Ozone_032510_Fig11.png|400px]]</center> | ||
+ | |||
+ | == Other types of exploratory and statistical analyses of the Ozone data== | ||
+ | Various [http://socr.ucla.edu/ SOCR online tools] can also be used to analyze (visually or quantitatively) the Ozone pollution data. The table below contains a summary of the annual pollution rates (ppm) for each of the 20 locations across the span of 27 years: | ||
+ | <center> | ||
+ | {| class="wikitable" | ||
+ | |- | ||
+ | ! LOCATION || 1980 || 1981 || 1982 || 1983 || 1984 || 1985 || 1986 || 1987 || 1988 || 1989 || 1990 || 1991 || 1992 || 1993 || 1994 || 1995 || 1996 || 1997 || 1998 || 1999 || 2000 || 2001 || 2002 || 2003 || 2004 || 2005 || 2006 | ||
+ | |- | ||
+ | | 2008 || 0.12 || 0.11 || 0.15 || 0.14 || 0.14 || 0.13 || 0.11 || 0.17 || 0.11 || 0.19 || 0.11 || 0.11 || 0.13 || 0.11 || 0.106 || 0.135 || 0.12 || 8.7 || 9.8 || 0.088 || 9.3 || 9.2 || 7.4 || 9.7 || 0.109 || 8.2 || 8.2 | ||
+ | |- | ||
+ | | 2040 || 0.18 || 0.2 || 0.23 || 0.28 || 0.28 || 0.22 || 0.15 || 0.17 || 0.22 || 0.23 || 0.2 || 0.18 || 0.16 || 0.146 || 0.102 || 0.12 || 0.12 || 0.115 || 0.125 || 0.104 || 0.118 || 0.135 || 0.112 || 0.107 || 0.105 || 8.3 || 0.108 | ||
+ | |- | ||
+ | | 2102 || 0.13 || 0.11 || 0.1 || 0.14 || 0.16 || 0.14 || 0.1 || 0.15 || 0.12 || 0.11 || 0.11 || 7.9 || 0.11 || 0.13 || 0.111 || 0.124 || 0.121 || 8.7 || 0.097 || 9.7 || 0.107 || 0.118 || 0.111 || 9.3 || 8.9 || 9.3 || 0.105 | ||
+ | |- | ||
+ | | 2125 || 0.15 || 0.13 || 0.1 || 0.17 || 0.11 || 0.13 || 0.1 || 0.12 || 0.1 || 0.1 || 7.9 || 7.9 || 8.9 || 0.1 || 8.3 || 0.14 || 0.097 || 8.9 || 6.5 || 8.2 || 0.083 || 0.105 || 8.9 || 0.113 || 0.097 || 8.3 || 8.4 | ||
+ | |- | ||
+ | | 2199 || 0.21 || 0.19 || 0.19 || 0.19 || 0.2 || 0.24 || 0.18 || 0.17 || 0.2 || 0.19 || 0.17 || 0.18 || 0.15 || 0.17 || 0.165 || 0.16 || 0.16 || 0.155 || 0.173 || 0.126 || 0.124 || 0.137 || 0.136 || 0.141 || 0.125 || 0.139 || 0.126 | ||
+ | |- | ||
+ | | 2249 || 0.31 || 0.27 || 0.32 || 0.27 || 0.32 || 0.34 || 0.25 || 0.24 || 0.29 || 0.26 || 0.21 || 0.21 || 0.21 || 0.19 || 0.252 || 0.16 || 0.15 || 0.134 || 0.182 || 0.116 || 0.137 || 0.114 || 0.121 || 0.165 || 9.8 || 9.3 || 0.146 | ||
+ | |- | ||
+ | | 2293 || 0.19 || 0.16 || 0.14 || 0.16 || 0.15 || 0.15 || 0.14 || 0.16 || 0.13 || 0.12 || 0.13 || 0.12 || 0.12 || 0.13 || 0.12 || 0.153 || 0.1 || 0.109 || 0.115 || 0.133 || 0.102 || 0.109 || 0.11 || 0.123 || 8.9 || 0.105 || 0.102 | ||
+ | |- | ||
+ | | 2410 || 0.14 || 0.12 || 0.1 || 0.13 || 0.14 || 0.12 || 8.9 || 0.11 || 0.12 || 0.12 || 0.11 || 0.11 || 0.1 || 0.11 || 0.1 || 0.133 || 0.112 || 0.103 || 0.119 || 0.113 || 0.079 || 9.1 || 0.109 || 0.101 || 0.104 || 8.7 || 7.9 | ||
+ | |- | ||
+ | | 2420 || 0.38 || 0.25 || 0.22 || 0.26 || 0.26 || 0.25 || 0.22 || 0.22 || 0.25 || 0.23 || 0.19 || 0.22 || 0.17 || 0.19 || 0.14 || 0.145 || 0.205 || 0.121 || 0.161 || 0.1 || 0.109 || 0.14 || 0.152 || 0.179 || 0.131 || 0.138 || 0.158 | ||
+ | |- | ||
+ | | 2460 || 0.23 || 0.19 || 0.18 || 0.18 || 0.16 || 0.22 || 0.16 || 0.17 || 0.19 || 0.2 || 0.17 || 0.15 || 0.17 || 0.187 || 0.147 || 0.146 || 0.138 || 0.136 || 0.164 || 0.124 || 0.121 || 0.135 || 0.121 || 0.125 || 0.106 || 0.113 || 0.121 | ||
+ | |- | ||
+ | | 2484 || 0.41 || 0.35 || 0.36 || 0.39 || 0.31 || 0.36 || 0.31 || 0.3 || 0.3 || 0.33 || 0.23 || 0.28 || 0.27 || 0.24 || 0.251 || 0.212 || 0.196 || 0.162 || 0.195 || 0.137 || 0.174 || 0.189 || 0.136 || 0.15 || 0.134 || 0.145 || 0.165 | ||
+ | |- | ||
+ | | 2492 || 0.35 || 0.27 || 0.25 || 0.31 || 0.26 || 0.3 || 0.28 || 0.23 || 0.24 || 0.2 || 0.2 || 0.22 || 0.22 || 0.18 || 0.167 || 0.165 || 0.142 || 0.134 || 0.177 || 0.12 || 0.152 || 0.129 || 0.128 || 0.134 || 0.137 || 0.142 || 0.166 | ||
+ | |- | ||
+ | | 2499 || 0.32 || 0.35 || 0.32 || 0.28 || 0.34 || 0.3 || 0.26 || 0.29 || 0.29 || 0.27 || 0.33 || 0.27 || 0.28 || 0.24 || 0.265 || 0.256 || 0.234 || 0.205 || 0.244 || 0.174 || 0.176 || 0.17 || 0.161 || 0.163 || 0.163 || 0.182 || 0.164 | ||
+ | |- | ||
+ | | 2525 || 0.29 || 0.24 || 0.28 || 0.26 || 0.22 || 0.29 || 0.22 || 0.2 || 0.23 || 0.21 || 0.19 || 0.2 || 0.21 || 0.2 || 0.184 || 0.202 || 0.18 || 0.136 || 0.149 || 0.112 || 0.164 || 0.152 || 0.147 || 0.155 || 0.128 || 0.126 || 0.169 | ||
+ | |- | ||
+ | | 2589 || 0.16 || 0.17 || 0.2 || 0.21 || 0.15 || 0.2 || 0.14 || 0.16 || 0.22 || 0.16 || 0.15 || 0.15 || 0.15 || 0.133 || 9.8 || 0.14 || 9.7 || 0.117 || 9.8 || 0.105 || 9.1 || 0.102 || 0.115 || 7.4 || 0.097 || 9.2 || 8.3 | ||
+ | |- | ||
+ | | 2596 || 0.37 || 0.3 || 0.31 || 0.36 || 0.32 || 0.35 || 0.25 || 0.29 || 0.28 || 0.27 || 0.29 || 0.24 || 0.26 || 0.26 || 0.253 || 0.213 || 0.203 || 0.187 || 0.195 || 0.142 || 0.14 || 0.143 || 0.155 || 0.169 || 0.141 || 0.144 || 0.151 | ||
+ | |- | ||
+ | | 2655 || 0.12 || 0.11 || 8.9 || 0.11 || 0.11 || 0.11 || 8.9 || 0.11 || 0.1 || 0.1 || 8.9 || 0.11 || 8.9 || 0.12 || 9.2 || 0.13 || 8.9 || 8.3 || 0.125 || 0.115 || 7.7 || 9.8 || 0.116 || 0.105 || 9.2 || 9.1 || 9.6 | ||
+ | |- | ||
+ | | 2898 || 0.37 || 0.33 || 0.31 || 0.34 || 0.31 || 0.33 || 0.27 || 0.29 || 0.29 || 0.25 || 0.24 || 0.24 || 0.26 || 0.21 || 0.241 || 0.215 || 0.187 || 0.156 || 0.184 || 0.141 || 0.152 || 0.144 || 0.15 || 0.161 || 0.131 || 0.14 || 0.151 | ||
+ | |- | ||
+ | | 2899 || 0.29 || 0.32 || 0.4 || 0.26 || 0.29 || 0.3 || 0.22 || 0.22 || 0.21 || 0.25 || 0.2 || 0.19 || 0.2 || 0.16 || 0.193 || 0.167 || 0.144 || 0.12 || 0.148 || 0.128 || 0.136 || 0.116 || 0.122 || 0.152 || 0.11 || 0.121 || 0.108 | ||
+ | |- | ||
+ | | 2991 || 0.13 || 0.16 || 0.15 || 0.15 || 0.14 || 0.15 || 0.18 || 0.18 || 0.16 || 0.19 || 0.12 || 0.12 || 0.141 || 0.138 || 0.115 || 0.124 || 0.121 || 0.102 || 0.106 || 0.103 || 8.3 || 9.3 || 8.6 || 8.0 || 8.3 || 7.5 || 8.8 | ||
+ | |} | ||
+ | </center> | ||
+ | |||
+ | === Spider Chart=== | ||
+ | This [[SOCR_EduMaterials_Activities_SpiderWebChart | spider chart]] visually demonstrates the rapid increase of the pollution levels across the years (radial spokes). For each site the 27 annual measurements are connected with lines of the same color as shown on the 20-locations color-mapping below. | ||
+ | <center>[[Image:SOCR_Activities_MotionCharts_Ozone_032510_Fig12.jpg|400px]] | ||
+ | [[Image:SOCR_Activities_MotionCharts_Ozone_032510_Fig13.png|400px]]</center> | ||
+ | |||
+ | === Box-and-whisker plot === | ||
+ | This [[ SOCR_EduMaterials_Activities_BoxPlot | Box and Whiskers Plot]] illustrates the same annual (across locations) averages of the ozone pollution for the 27 years on record. Notice the increase in (high-level) outliers, denoted by colored triangles on the top, and the atypically average high pollution levels in the last few years. | ||
+ | |||
+ | <center>[[Image:SOCR_Activities_MotionCharts_Ozone_032510_Fig14.png|400px]]</center> | ||
+ | |||
+ | === Simple Linear Regression analysis=== | ||
+ | We can employ the [[SOCR_EduMaterials_AnalysisActivities_SLR | SOCR simple linear regression analysis]] to determine the relation between the (average) '''Annual''' Ozone pollution (response variable) relative to '''Year''' (time predictor variable). The quantitative results and graphs of this analysis, using only the [[SOCR_Data_121608_OzoneData | Year and Annual variables in the Ozone dataset]], are included below: | ||
+ | *Quantitative analysis: | ||
+ | ** Regression Line: <math>OzonePollution = -191.6095127656 + 0.09667\times YEAR</math> | ||
+ | *** Intercept: | ||
+ | ****Parameter Estimate: -191.6095127656 | ||
+ | ****Standard Error: 28.4291851368 | ||
+ | ****T-Statistics: -6.7398876135 | ||
+ | ****P-Value: 0.0000000000 | ||
+ | *** Slope: | ||
+ | ****Parameter Estimate: 0.0966959040 | ||
+ | ****Standard Error: 0.0142644095 | ||
+ | ****T-Statistics: 6.7788228014 | ||
+ | ****P-Value: 0.0000000000 | ||
+ | ** Correlation(YEAR, OzonePollution) = 0.2805211066 | ||
+ | * The prediction interval and the regression line have a slight (but statistically significant) upward trend, which is mostly due to the sporadic extremely high pollution levels in the last few years. | ||
+ | <center>[[Image:SOCR_Activities_MotionCharts_Ozone_032510_Fig15.png|400px]]</center> | ||
== Data type and format == | == Data type and format == | ||
Line 98: | Line 196: | ||
The SOCR MotionCharts can be used in a variety of applications to visualize dynamic relationships in multidimensional data in up to four dimensions and a fifth temporal component. Its design and implementation allow for extensions allowing and supporting higher dimensions plug-ins. The overall purpose of SOCR MotionCharts is to provide users with a way to visualize the relationships between multiple variables over a period of time in a simple, intuitive and animated fashion. | The SOCR MotionCharts can be used in a variety of applications to visualize dynamic relationships in multidimensional data in up to four dimensions and a fifth temporal component. Its design and implementation allow for extensions allowing and supporting higher dimensions plug-ins. The overall purpose of SOCR MotionCharts is to provide users with a way to visualize the relationships between multiple variables over a period of time in a simple, intuitive and animated fashion. | ||
+ | ==See also== | ||
+ | * [http://www.nytimes.com/2010/01/08/science/earth/08smog.html E.P.A. Seeks Stricter Rules to Curb Smog (New York Times article)]. | ||
+ | * [http://www.ioe.ucla.edu/reportcard/article.asp?parentid=1700 Health effects of ambient and traffic related air pollution on pregnant women, their infants, and young children: Southern California Environmental Report Card (Fall 2008)]. | ||
+ | * [http://wiki.stat.ucla.edu/socr/uploads/7/7e/SOCR_MotionCharts_CAOzone_Activity_2010.pdf 2MB PDF file containing this activity]. | ||
+ | * [[062510 | Middle school SOCR Motion Charts Activity using the CA Ozone Data]] | ||
+ | |||
+ | ==References== | ||
+ | * Al-Aziz, J, Christou, N, Dinov, ID. (2010). [http://www.amstat.org/publications/jse/v18n3/dinov.pdf SOCR Motion Charts: An Efficient, Open-Source, Interactive and Dynamic Applet for Visualizing Longitudinal Multivariate Data], [http://www.amstat.org/publications/jse/contents_2010.htm JSE, 18(3)], 1-29. | ||
+ | * Dinov, ID, Christou, N. (2011) [http://www.informaworld.com/smpp/content%7Edb=all?content=10.1080/0020739X.2011.562315 Web-based tools for modelling and analysis of multivariate data: California ozone pollution activity], [http://www.informaworld.com/smpp/title%7Edb=all%7Econtent=t713736815 International Journal of Mathematical Education in Science and Technology (JMEST)], iFIRST:1-17, [http://www.informaworld.com/smpp/content%7Edb=all?content=10.1080/0020739X.2011.562315 DOI: 10.1080/0020739X.2011.562315]. | ||
+ | |||
+ | |||
+ | <hr> | ||
{{translate|pageName=http://wiki.stat.ucla.edu/socr/index.php?title=SOCR_MotionCharts_CAOzoneData}} | {{translate|pageName=http://wiki.stat.ucla.edu/socr/index.php?title=SOCR_MotionCharts_CAOzoneData}} |
Latest revision as of 12:27, 25 May 2011
Contents
- 1 SOCR MotionCharts Activities - California Ozone Pollution: Geographic and Time Effects on Health Activity
- 2 Summary
- 3 Goals
- 4 Activity videos
- 5 Background
- 6 Case Study
- 7 Initial setting
- 8 Hands-on activity
- 9 Other types of exploratory and statistical analyses of the Ozone data
- 10 Data type and format
- 11 Applications
- 12 See also
- 13 References
SOCR MotionCharts Activities - California Ozone Pollution: Geographic and Time Effects on Health Activity
Summary
This activity demonstrates a complete, interactive, self-contained and technology-enhanced pedagogical study of geographic and time effects of Ozone pollution on human health. In addition, this activity illustrates the usage and functionality of SOCR MotionCharts for exploratory multivariate data analysis using the SOCR California Ozone dataset.
Goals
The aims of this activity are to:
- Use MotionChart to address 2 specific health-related case-studies
- Demonstrate data import, MotionChart data manipulations and graphical data interpretation
- Explore the interactive graphical visualization of real-life multidimensional datasets
- Data navigation from different directions (using data mappings).
Activity videos
There are several videos that demonstrate the mechanics of using the SOCR Motion Charts to analyze longitudinal multivariate data:
- A 6-minute video of this California Ozone Pollution Activity: Geographic and Time Effects on Health, which is available YouTube Stream (low resolution), and a high resolution analogue (80MB)
- A video describing the health and environmental implications of this data and activity.
Background
Suppose we are asked to analyze a complex dataset that included observational multivariate ozone depletion data. The data included California Ozone measurements from 20 locations between 1980 and 2006. The figure on the right illustrates a dynamic interactive map of the geographic locations of the data measurements. This dataset consists of 540 rows and 22 variables. The goals of the study were to identify relationships and associations between the variables and map geographically the significant ozone layer effects. Any such quantitative study requires a preliminary exploratory data analysis. The complexity of the dataset and the intrinsic measurement characteristics of the ozone data demands a new approach to visualization and exploration of these heterogeneous measurements.
Case Study
This Ozone pollution case study addresses the following specific driving environmental challenges:
- Are there temporal changes in California Ozone?
- What is the geographic distribution of the California Ozone pollution and is it changing with time?
The following chart illustrates the health-related interpretation of the Ozone data in terms of the particulate (particles per million, ppm) recordings, according to the National Oceanic and Atmospheric Administration's (NOAA) Air Quality Index (AQI). EPA guidelines on air quality are as follows:
- (2010) The Obama administration’s proposal sets a primary standard for ground-level ozone of no more than 0.060 to 0.070 ppm, to be phased in over two decades.
- (2005) The Bush administration imposed a limit of 0.075 ppm.
- (1997) The Clinton administration introduced a standard of 0.084 ppm.
Temporal changes in California Ozone
- Observation: Notice the annualized increase of the ozone pollution with time (increase of the proportion of hot-colored bubbles with time).
- Motion Chart: Use the following variable-mapping to demonstrate the significant time effect on the increase of the ozone pollution as measured by ppm recordings:
Variables | ||||||
---|---|---|---|---|---|---|
SOCR MotionChart Property | Key | X-Axis | Y-Axis | Size | Color | Category |
Data Column Name | Year | MTH_1 | MTH_8 | HI_COVER | ANNUAL | Location |
- Note that the variable mapping we used above allows us to track seasonal changes of ozone pollution (Winter, month1, on the X-axis; and Summer, month8, on the Y-axis). If we play this motion chart, the following interpretation of the motion chart is appropriate: For a given bubble (representing one measurement location):
- up-down movement indicate annual Winter (January) increase or decrease of pollution levels, respectively;
- left-right movement indicate annual Summer (August) increase or decrease of pollution levels, respectively;
- bubble-size increase or decrease indicate corresponding changes in the annual percent coverage during typical periods of high concentration;
- color change from cool-to-hot indicates an annual increase of the ozone measurement for that specific location from one year to the next.
- You should see an image like this one shown below. Play this motion charts by clicking the Play button and observe the increase of hot-colored bubbles in the chart as time goes from 1980 to 2006.
Geographic distribution of California Ozone pollution
- Observation: The ozone pollution appears to be a more geographically spread out phenomenon in the 2000's, compared to the 1980's -- most of the bubbles cluster together in later years, whereas there were wider geographic-driven fluctuations in the ozone particles in the earlier years. The size of the bubbles reflects the maximum annual pollution and the bubble color indicates the average annual ozone pollution -- hot-colors represent high and cool-colors represent low ozone pollution levels, respectively.
- Motion Chart: Use the following variable-mapping to demonstrate the significant geographic temporal re-distribution of the ozone pollution as measured by ppm recordings:
Variables | ||||||
---|---|---|---|---|---|---|
SOCR MotionChart Property | Key | X-Axis | Y-Axis | Size | Color | Category |
Data Column Name | Year | LONGITUDE | LATITUDE | HI_COVER | ANNUAL | Location |
- This second variable mapping allows us to track seasonal changes of ozone pollution for each geographic location. Here the X and Y locations of bubbles are locked at the GIS longitude and latitude coordinates. By playing this motion chart, we have the following interpretation: For a given bubble (representing one GIS location):
- bubble-size increase or decrease indicate corresponding changes in the annual percent coverage during typical periods of high concentration;
- color change from cool-to-hot or hot-to-cool indicate annual increase or annual decrease, respectively, of the ozone measurement for that specific location from one year to the next.
- You should see an image like this one shown below. In this mapping, each bubble corresponds spatially to a geographic location, just like in the geographic-map above. Play this motion charts by clicking the Play button and observe the increase of hot-colored bubbles in later years at geographic locations which did not show unhealthy ozone pollution levels in the early years.
Initial setting
In addition to this activity, open 2 more browser tabs - one pointing to the SOCR MotionCharts applet and the other displaying the SOCR California Ozone dataset. The image below shows this setting.
Hands-on activity
- Using the mouse, copy the SOCR California Ozone dataset, click on the first cell (top-left) in the DATA tab of the SOCR Motion Charts applet, and paste the data in the spreadsheet.
- Next, you need to map the column-variables to different properties it the SOCR MotionChart. For example, you can us the following mapping:
Variables | ||||||
---|---|---|---|---|---|---|
SOCR MotionChart Property | Key | X-Axis | Y-Axis | Size | Color | Category |
Data Column Name | Year | Longitude | Latitude | MTH_1 | MTH_7 | Location |
The figures below represent snapshots of the generated dynamic SOCR motion chart. In the real applet, you can play (animate) or scroll (1-year steps) through the years (1980, ..., 2006). Notice the position change between different snapshots of the time slider on the bottom of these figures. Also, mouse-over a blob triggers a dynamic graphical pop-up providing additional information about the data for the specified blob in the chart.
You can also change what variables are mapped to the following SOCR MotionCharts properties:
- Key, X-Axis, Y-Axis, Size, Color and Category.
We can also overlay the Motion-chart above on the Geo-map of the 20 locations of the ozone recording stations. This of course, stretches vertically the motion chart, as longitude and latitude coordinates are not isotropic.
Other types of exploratory and statistical analyses of the Ozone data
Various SOCR online tools can also be used to analyze (visually or quantitatively) the Ozone pollution data. The table below contains a summary of the annual pollution rates (ppm) for each of the 20 locations across the span of 27 years:
LOCATION | 1980 | 1981 | 1982 | 1983 | 1984 | 1985 | 1986 | 1987 | 1988 | 1989 | 1990 | 1991 | 1992 | 1993 | 1994 | 1995 | 1996 | 1997 | 1998 | 1999 | 2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2008 | 0.12 | 0.11 | 0.15 | 0.14 | 0.14 | 0.13 | 0.11 | 0.17 | 0.11 | 0.19 | 0.11 | 0.11 | 0.13 | 0.11 | 0.106 | 0.135 | 0.12 | 8.7 | 9.8 | 0.088 | 9.3 | 9.2 | 7.4 | 9.7 | 0.109 | 8.2 | 8.2 |
2040 | 0.18 | 0.2 | 0.23 | 0.28 | 0.28 | 0.22 | 0.15 | 0.17 | 0.22 | 0.23 | 0.2 | 0.18 | 0.16 | 0.146 | 0.102 | 0.12 | 0.12 | 0.115 | 0.125 | 0.104 | 0.118 | 0.135 | 0.112 | 0.107 | 0.105 | 8.3 | 0.108 |
2102 | 0.13 | 0.11 | 0.1 | 0.14 | 0.16 | 0.14 | 0.1 | 0.15 | 0.12 | 0.11 | 0.11 | 7.9 | 0.11 | 0.13 | 0.111 | 0.124 | 0.121 | 8.7 | 0.097 | 9.7 | 0.107 | 0.118 | 0.111 | 9.3 | 8.9 | 9.3 | 0.105 |
2125 | 0.15 | 0.13 | 0.1 | 0.17 | 0.11 | 0.13 | 0.1 | 0.12 | 0.1 | 0.1 | 7.9 | 7.9 | 8.9 | 0.1 | 8.3 | 0.14 | 0.097 | 8.9 | 6.5 | 8.2 | 0.083 | 0.105 | 8.9 | 0.113 | 0.097 | 8.3 | 8.4 |
2199 | 0.21 | 0.19 | 0.19 | 0.19 | 0.2 | 0.24 | 0.18 | 0.17 | 0.2 | 0.19 | 0.17 | 0.18 | 0.15 | 0.17 | 0.165 | 0.16 | 0.16 | 0.155 | 0.173 | 0.126 | 0.124 | 0.137 | 0.136 | 0.141 | 0.125 | 0.139 | 0.126 |
2249 | 0.31 | 0.27 | 0.32 | 0.27 | 0.32 | 0.34 | 0.25 | 0.24 | 0.29 | 0.26 | 0.21 | 0.21 | 0.21 | 0.19 | 0.252 | 0.16 | 0.15 | 0.134 | 0.182 | 0.116 | 0.137 | 0.114 | 0.121 | 0.165 | 9.8 | 9.3 | 0.146 |
2293 | 0.19 | 0.16 | 0.14 | 0.16 | 0.15 | 0.15 | 0.14 | 0.16 | 0.13 | 0.12 | 0.13 | 0.12 | 0.12 | 0.13 | 0.12 | 0.153 | 0.1 | 0.109 | 0.115 | 0.133 | 0.102 | 0.109 | 0.11 | 0.123 | 8.9 | 0.105 | 0.102 |
2410 | 0.14 | 0.12 | 0.1 | 0.13 | 0.14 | 0.12 | 8.9 | 0.11 | 0.12 | 0.12 | 0.11 | 0.11 | 0.1 | 0.11 | 0.1 | 0.133 | 0.112 | 0.103 | 0.119 | 0.113 | 0.079 | 9.1 | 0.109 | 0.101 | 0.104 | 8.7 | 7.9 |
2420 | 0.38 | 0.25 | 0.22 | 0.26 | 0.26 | 0.25 | 0.22 | 0.22 | 0.25 | 0.23 | 0.19 | 0.22 | 0.17 | 0.19 | 0.14 | 0.145 | 0.205 | 0.121 | 0.161 | 0.1 | 0.109 | 0.14 | 0.152 | 0.179 | 0.131 | 0.138 | 0.158 |
2460 | 0.23 | 0.19 | 0.18 | 0.18 | 0.16 | 0.22 | 0.16 | 0.17 | 0.19 | 0.2 | 0.17 | 0.15 | 0.17 | 0.187 | 0.147 | 0.146 | 0.138 | 0.136 | 0.164 | 0.124 | 0.121 | 0.135 | 0.121 | 0.125 | 0.106 | 0.113 | 0.121 |
2484 | 0.41 | 0.35 | 0.36 | 0.39 | 0.31 | 0.36 | 0.31 | 0.3 | 0.3 | 0.33 | 0.23 | 0.28 | 0.27 | 0.24 | 0.251 | 0.212 | 0.196 | 0.162 | 0.195 | 0.137 | 0.174 | 0.189 | 0.136 | 0.15 | 0.134 | 0.145 | 0.165 |
2492 | 0.35 | 0.27 | 0.25 | 0.31 | 0.26 | 0.3 | 0.28 | 0.23 | 0.24 | 0.2 | 0.2 | 0.22 | 0.22 | 0.18 | 0.167 | 0.165 | 0.142 | 0.134 | 0.177 | 0.12 | 0.152 | 0.129 | 0.128 | 0.134 | 0.137 | 0.142 | 0.166 |
2499 | 0.32 | 0.35 | 0.32 | 0.28 | 0.34 | 0.3 | 0.26 | 0.29 | 0.29 | 0.27 | 0.33 | 0.27 | 0.28 | 0.24 | 0.265 | 0.256 | 0.234 | 0.205 | 0.244 | 0.174 | 0.176 | 0.17 | 0.161 | 0.163 | 0.163 | 0.182 | 0.164 |
2525 | 0.29 | 0.24 | 0.28 | 0.26 | 0.22 | 0.29 | 0.22 | 0.2 | 0.23 | 0.21 | 0.19 | 0.2 | 0.21 | 0.2 | 0.184 | 0.202 | 0.18 | 0.136 | 0.149 | 0.112 | 0.164 | 0.152 | 0.147 | 0.155 | 0.128 | 0.126 | 0.169 |
2589 | 0.16 | 0.17 | 0.2 | 0.21 | 0.15 | 0.2 | 0.14 | 0.16 | 0.22 | 0.16 | 0.15 | 0.15 | 0.15 | 0.133 | 9.8 | 0.14 | 9.7 | 0.117 | 9.8 | 0.105 | 9.1 | 0.102 | 0.115 | 7.4 | 0.097 | 9.2 | 8.3 |
2596 | 0.37 | 0.3 | 0.31 | 0.36 | 0.32 | 0.35 | 0.25 | 0.29 | 0.28 | 0.27 | 0.29 | 0.24 | 0.26 | 0.26 | 0.253 | 0.213 | 0.203 | 0.187 | 0.195 | 0.142 | 0.14 | 0.143 | 0.155 | 0.169 | 0.141 | 0.144 | 0.151 |
2655 | 0.12 | 0.11 | 8.9 | 0.11 | 0.11 | 0.11 | 8.9 | 0.11 | 0.1 | 0.1 | 8.9 | 0.11 | 8.9 | 0.12 | 9.2 | 0.13 | 8.9 | 8.3 | 0.125 | 0.115 | 7.7 | 9.8 | 0.116 | 0.105 | 9.2 | 9.1 | 9.6 |
2898 | 0.37 | 0.33 | 0.31 | 0.34 | 0.31 | 0.33 | 0.27 | 0.29 | 0.29 | 0.25 | 0.24 | 0.24 | 0.26 | 0.21 | 0.241 | 0.215 | 0.187 | 0.156 | 0.184 | 0.141 | 0.152 | 0.144 | 0.15 | 0.161 | 0.131 | 0.14 | 0.151 |
2899 | 0.29 | 0.32 | 0.4 | 0.26 | 0.29 | 0.3 | 0.22 | 0.22 | 0.21 | 0.25 | 0.2 | 0.19 | 0.2 | 0.16 | 0.193 | 0.167 | 0.144 | 0.12 | 0.148 | 0.128 | 0.136 | 0.116 | 0.122 | 0.152 | 0.11 | 0.121 | 0.108 |
2991 | 0.13 | 0.16 | 0.15 | 0.15 | 0.14 | 0.15 | 0.18 | 0.18 | 0.16 | 0.19 | 0.12 | 0.12 | 0.141 | 0.138 | 0.115 | 0.124 | 0.121 | 0.102 | 0.106 | 0.103 | 8.3 | 9.3 | 8.6 | 8.0 | 8.3 | 7.5 | 8.8 |
Spider Chart
This spider chart visually demonstrates the rapid increase of the pollution levels across the years (radial spokes). For each site the 27 annual measurements are connected with lines of the same color as shown on the 20-locations color-mapping below.
Box-and-whisker plot
This Box and Whiskers Plot illustrates the same annual (across locations) averages of the ozone pollution for the 27 years on record. Notice the increase in (high-level) outliers, denoted by colored triangles on the top, and the atypically average high pollution levels in the last few years.
Simple Linear Regression analysis
We can employ the SOCR simple linear regression analysis to determine the relation between the (average) Annual Ozone pollution (response variable) relative to Year (time predictor variable). The quantitative results and graphs of this analysis, using only the Year and Annual variables in the Ozone dataset, are included below:
- Quantitative analysis:
- Regression Line\[OzonePollution = -191.6095127656 + 0.09667\times YEAR\]
- Intercept:
- Parameter Estimate: -191.6095127656
- Standard Error: 28.4291851368
- T-Statistics: -6.7398876135
- P-Value: 0.0000000000
- Slope:
- Parameter Estimate: 0.0966959040
- Standard Error: 0.0142644095
- T-Statistics: 6.7788228014
- P-Value: 0.0000000000
- Intercept:
- Correlation(YEAR, OzonePollution) = 0.2805211066
- Regression Line\[OzonePollution = -191.6095127656 + 0.09667\times YEAR\]
- The prediction interval and the regression line have a slight (but statistically significant) upward trend, which is mostly due to the sporadic extremely high pollution levels in the last few years.
Data type and format
SOCR Motion Charts currently accepts three types of data: numbers, dates/time, and strings. With these data types, we feel that the application is able to handle the majority of data out here. We use the natural ordering of these types as defined by Java however. While many types of data can be interpreted as a string, it may not make sense to use lexicological ordering on all the different types. When designing SOCR Motion Charts, we took this into consideration and designed the application so that it can easily be extended to provide a greater variety of interpreted types. Thus, a developer should be able to easily provide better type interpretation for particular types of data.
Applications
The SOCR MotionCharts can be used in a variety of applications to visualize dynamic relationships in multidimensional data in up to four dimensions and a fifth temporal component. Its design and implementation allow for extensions allowing and supporting higher dimensions plug-ins. The overall purpose of SOCR MotionCharts is to provide users with a way to visualize the relationships between multiple variables over a period of time in a simple, intuitive and animated fashion.
See also
- E.P.A. Seeks Stricter Rules to Curb Smog (New York Times article).
- Health effects of ambient and traffic related air pollution on pregnant women, their infants, and young children: Southern California Environmental Report Card (Fall 2008).
- 2MB PDF file containing this activity.
- Middle school SOCR Motion Charts Activity using the CA Ozone Data
References
- Al-Aziz, J, Christou, N, Dinov, ID. (2010). SOCR Motion Charts: An Efficient, Open-Source, Interactive and Dynamic Applet for Visualizing Longitudinal Multivariate Data, JSE, 18(3), 1-29.
- Dinov, ID, Christou, N. (2011) Web-based tools for modelling and analysis of multivariate data: California ozone pollution activity, International Journal of Mathematical Education in Science and Technology (JMEST), iFIRST:1-17, DOI: 10.1080/0020739X.2011.562315.
Translate this page: