# SOCR EduMaterials Activities Histogram Graphs

## Contents

## SOCR Educational Materials - SOCR Histogram Generation Graphing Activity

## Summary

This is an exploratory data analysis SOCR activity that illustrates the generation and interpretation of the histogram of quantitative data. The complete details about histograms can be found here. In a nutshell, a histogram of a dataset is a graphical visualization of tabulated frequencies or counts of data within equal spaced partition of the range of the data. A histogram shows what proportion of measurements that fall into each of the categories defined by the partition of the data range space.

## Exercises

**Exercise 1**: Simple Histogram from Raw Data

- This exercise demonstrates the construction of a histogram plot from raw quantitative data.
- First, point your browser to SOCR Charts and select the
**HistogramChartDemo**(under BarCharts --> XYChart). There are three different ways to select data for this histogram chart:- Use the default data provided for this chart (
**DEMO**button); - Enter your own data. This can be done by copying to the mouse buffer data from external spreadsheet/table, clicking on the top-left cell in the SOCR Histogram Data table, and pasting (
**Paste**button) the data into the histogram data table. Remember to**MAP**the data - this indicates what columns rows, parts of the data need to be used in the histogram calculations. Then you click**UPDATE**chart to have the new graph drawn in the**Graph**tab-pane; - Obtain SOCR simulated data from the
**Data-Generation**tab of the SOCR Modeler (an example is shown in exercise 3, as well as in the SOCR Power Transform Activity).

- Use the default data provided for this chart (

**Exercise 2**: Histogram from Categories and Frequencies

- Again, point your browser to SOCR Charts. This time select the
**HistogramChartDemo3**chart (under BarCharts --> XYChart). Use the default data provided for this chart (**DEMO**button). - Notice that this time, the chart requires the user to enter the counts/frequencies of observations within each of the range categories (in this default data case,
*year*). - Try revising some of the numbers in the second (frequency) column and click
**UPDATE**button to see the effect of these changes on the histogram. - Remember that if you enter your own data you need to go to the
**MAP**tab-pane and select the columns that contain your histogram bin and frequency columns. - Using the
**SHOW_ALL**tab-pane you can see all three (graph, data and mapping) in the same view.

**Exercise 3**: Histogram from Simulated Data

- Let’s first get some data: Go to SOCR Modeler and generate 100 Cauchy Distributed variables. Copy these data in your mouse buffer (CNT-C). Of course, you may use your own data throughout this exercise.
- Next, paste (CNT-V) these 100 observations in SOCR Charts
**HistogramChartDemo**(BarCharts -> XYChart). Go to the**MAP**tab-pane and select the first column (where you pasted your data) in the**XValue**bin. Click**Update Chart**to see the histogram plot of these 100 Cauchy observations in RED! - Note that the shape of this data histogram resembles the shape of the Cauchy distribution that we sampled this data from.

Error creating thumbnail: File missing

**Questions**

- What is the effect of the width/size of the histogram bin on the shape of the resulting histogram? If we alter the bin-size, would the shape of the histogram change significantly? Does the sample-size play role in this?
- Would you expect the shape of the sample histogram to
*look like*the shape of the population distribution the data sample came from?

- SOCR Home page: http://www.socr.ucla.edu

Translate this page: