SOCR Modeler Activities - SOCR Mixture Model Fitting Activity
This is a SOCR Activity that demonstrates random sampling and the fitting of mixture models to data.
- Data Generation: Typically you start with investigator-acquired data and need to fit a model to it. In this activity we instead generate the data by random sampling with the SOCR resource. Go to the SOCR Modeler and select the Data Generation tab in the right panel.
- Now click the Raw Data check-box in the left panel, select Laplace Distribution (or any other distribution you want to sample from), set the sample size to 100 (keep the center, mu, at zero) and click Sample. Then go to the Data tab in the right panel. There you should see the 100 random Laplace observations stored as a column vector.
- Next, go back to the Data Generation tab in the right panel and change the center of the Laplace distribution (set Mu=20, say). Click Sample again and the list of randomly generated data in the Data tab will grow to 200 values, because you have sampled a second set of 100 random Laplace observations.
- Exploratory Data Analysis (EDA): Go to the Data tab and select all observations in the data column (use Ctrl-A, or select and copy with the mouse). Then open another web browser window and go to SOCR Charts. Choose HistogramChartDemo2, say, clear the default data (Data tab) and paste (Ctrl-V or mouse paste) the 200 observations that you sampled in the SOCR Modeler Data Generator (above) into the first column. Then you need to map the values: go to the Mapping tab, select the first column, where you pasted the data (C1), and click XValue. This will move the C1 column label from the right bin to the bottom-right bin. Finally, click Update Chart and go to the Graph tab to see your histogram of the 200 (bimodal) Laplace observations. Notice that you can change the width of the histogram bins to see the bimodality of these 200 measurements more clearly. The bimodality arises because we sampled from two distinct Laplace distributions, one with a mean of zero and the other with a mean of 20.
- Model Fitting: Now go back to the SOCR Modeler browser (where you did the data sampling). Choose Mixed-Model-Fit from the drop-down list in the left panel.
We will now try to fit a 2-component mixture of Gaussian (Normal) distributions to the bimodal sample generated from the two Laplace distributions.
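For readers who want to reproduce the idea outside the SOCR applets, the sampling and fitting steps above can be sketched in plain Python. This is only a minimal illustration, not the SOCR Modeler's own implementation: the helper names (`sample_laplace`, `normal_pdf`, `fit_two_gaussians`) are hypothetical, the Laplace scale of 1 and the fixed random seed are assumptions (the activity does not state them), and the fit uses a basic expectation-maximization (EM) loop, which may differ from whatever the Mixed-Model-Fit option does internally.

```python
import math
import random


def sample_laplace(rng, mu, scale, n):
    """Draw n Laplace(mu, scale) values; Exp(1) - Exp(1) is standard Laplace."""
    return [mu + scale * (rng.expovariate(1.0) - rng.expovariate(1.0))
            for _ in range(n)]


def normal_pdf(x, mean, sd):
    """Density of a Normal(mean, sd) distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def fit_two_gaussians(data, iterations=200):
    """Basic EM for a 2-component Gaussian mixture; returns (weights, means, sds)."""
    xs = sorted(data)
    half = len(xs) // 2
    # Crude start: means of the lower and upper halves, with a shared spread.
    means = [sum(xs[:half]) / half, sum(xs[half:]) / (len(xs) - half)]
    grand = sum(xs) / len(xs)
    spread = math.sqrt(sum((x - grand) ** 2 for x in xs) / len(xs))
    sds = [spread, spread]
    weights = [0.5, 0.5]
    for _ in range(iterations):
        # E-step: posterior probability that each point came from each component.
        resp = []
        for x in data:
            p = [weights[k] * normal_pdf(x, means[k], sds[k]) for k in range(2)]
            total = (p[0] + p[1]) or 1e-300  # guard against division by zero
            resp.append((p[0] / total, p[1] / total))
        # M-step: re-estimate weight, mean and standard deviation per component.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            weights[k] = nk / len(data)
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            sds[k] = max(math.sqrt(var), 1e-6)  # keep the spread strictly positive
    return weights, means, sds


rng = random.Random(0)  # assumed fixed seed so the sketch is repeatable
# Two samples of 100, centered at 0 and 20 as in the activity (scale 1 assumed).
data = sample_laplace(rng, 0.0, 1.0, 100) + sample_laplace(rng, 20.0, 1.0, 100)
weights, means, sds = fit_two_gaussians(data)
for w, m, s in zip(weights, means, sds):
    print(f"weight={w:.2f}  mean={m:.2f}  sd={s:.2f}")
```

Under these assumptions the two fitted means should land near 0 and 20 with roughly equal weights. A Laplace distribution with scale 1 has variance 2, so the fitted standard deviations should be in the neighborhood of 1.4, although a sample of 100 heavy-tailed values can deviate noticeably from that.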
- SOCR Home page: http://www.socr.ucla.edu