Difference between revisions of "SOCR Data"

From SOCR
Jump to: navigation, search
(reorganized the classification of the data by type)
(US Census Data)
 
(38 intermediate revisions by 2 users not shown)
Line 17: Line 17:
 
* [[SOCR_Data_Dinov_071108_OilGasData | Energy Resources, Production and Consumption Dataset]]
 
* [[SOCR_Data_Dinov_071108_OilGasData | Energy Resources, Production and Consumption Dataset]]
 
* [[SOCR_Data_121608_OzoneData | California Ozone Data (1980-2006)]]
 
* [[SOCR_Data_121608_OzoneData | California Ozone Data (1980-2006)]]
* [http://www.stat.ucla.edu/~nchristo/statistics_c173_c273/ca_ozone.txt CA Ozone] and [http://www.stat.ucla.edu/~nchristo/statistics_c173_c273/ozone.txt US Ozone]
+
* [[SOCR_Data_121608_CA_US_OzoneData | California and US Ozone Data Snapshot]]
  
 
=== Population Data===
 
=== Population Data===
Line 28: Line 28:
  
 
=== Economic, Business and Stock Market Data===
 
=== Economic, Business and Stock Market Data===
* Consumer Price Index (CPI)
+
==== Consumer Price Index (CPI)====
** [[SOCR_Data_Dinov_021808_ConsumerPriceIndex | Consumer Price Index (1981-2006) - Fuel and Food Data]]
+
* [[SOCR_Data_Dinov_021808_ConsumerPriceIndex | Consumer Price Index (1981-2006) - Fuel and Food Data]]
** [[SOCR_Data_Dinov_021808_ConsumerPriceIndex3Way | Consumer Price Index (1981-2007) - One-, Two- or Three-Way ANOVA Data by items, months and years]]
+
* [[SOCR_Data_Dinov_021808_ConsumerPriceIndex3Way | Consumer Price Index (1981-2007) - One-, Two- or Three-Way ANOVA Data by items, months and years]]
** [[SOCR_Data_Dinov_010309_HousingPriceIndex | Housing Price Index (2000-2006) (motion charts)]]
+
* [[SOCR_Data_Dinov_010309_HousingPriceIndex | Housing Price Index (2000-2006) (motion charts)]]
** [[SOCR_Data_Dinov_091609_SnP_HomePriceIndex | S&P Home Price Index (1991-2009) (motion charts)]]
+
* [[SOCR_Data_Dinov_091609_SnP_HomePriceIndex | S&P Home Price Index (1991-2009) (motion charts)]]
* [http://www.eoddata.com Stock Market Data]
+
==== [http://www.eoddata.com Stock Market Data]====
** [[SOCR_Data_Dinov_070108_JAVA | Sun Microsystems (Java) Stock price (2007-2008)]]
+
* [[SOCR_Data_Dinov_070108_JAVA | Sun Microsystems (Java) Stock price (2007-2008)]]
** [[SOCR_Data_Dinov_070108_SP500_0608 | S&P 500 (2007-2008)]]
+
* [[SOCR_Data_Dinov_070108_SP500_0608 | S&P 500 (2007-2008)]]
** [[SOCR_Data_Dinov_101709_USEconomy | US Economy by Sectors (1997-2007) and 2007-2009 Recession Data]]
+
* [[SOCR_Data_Dinov_101709_USEconomy | US Economy by Sectors (1997-2007) and 2007-2009 Recession Data]]
* Monetary-Base Data
+
* [[SOCR_Data_Fortune500_1955_2008 | Ranking, Profits and Income of Fortune500 Companies (1955-2008) Dataset]]
** [[SOCR_Data_MonetaryBase1959_2009 | US Federal Reserve monetary-base data (1959-2009)]]
+
==== Monetary-Base Data====
** [[SOCR_Data_MonetaryBaseStocksInterest1959_2009 | Monthly US Economics data including monetary-base data, interest, CPI, S&P, Unemployment, Inflation, etc. (1959-2009)]]
+
* [[SOCR_Data_MonetaryBase1959_2009 | US Federal Reserve monetary-base data (1959-2009)]]
* Budgets and Deficits Data
+
* [[SOCR_Data_MonetaryBaseStocksInterest1959_2009 | Monthly US Economics data including monetary-base data, interest, CPI, HPI, S&P, Unemployment, Inflation, etc. (1959-2009)]]
** [[SOCR_Data_US_BudgetsDeficits_1849_2016 | US Federal Budget and Deficit data (1849-2016)]]
+
* [[SOCR_Data_WorldInflation2002_2012 | Monthly Monetary Inflation for Several Countries (2002-2012)]]
 +
 
 +
==== Budgets and Deficits Data====
 +
* [[SOCR_Data_US_BudgetsDeficits_1849_2016 | US Federal Budget and Deficit data (1849-2016)]]
 +
====Sector Data, Population Perception Trends data====
 +
* [[SOCR_Data_GoogleTrends_2005_2011|Google Web-Search Trends and Stock Market Data (2005-2011)]]
 +
====World Peace====
 +
* [[SOCR_Data_GlobalPeaceIndex_2001_2011|Global Peace Index Data (2001-2011)]]
 +
* [[SOCR_Data_WealthOfNations_1800_2009|Wealth of Nations Data (1800-2009)]]
  
 
=== Neuroimaging Data===
 
=== Neuroimaging Data===
Line 49: Line 57:
 
* [[SOCR_Data_April2009_ID_NI | Neuroimaging study of Prefrontal Cortex Volume across Species and Tissue Types]]
 
* [[SOCR_Data_April2009_ID_NI | Neuroimaging study of Prefrontal Cortex Volume across Species and Tissue Types]]
 
* [[SOCR_Data_Oct2009_ID_NI | Normal and Schizophrenia Children Neuroimaging study]]
 
* [[SOCR_Data_Oct2009_ID_NI | Normal and Schizophrenia Children Neuroimaging study]]
 +
* [[SOCR_Data_April2011_NI_IBS_Pain | A large Neuroimaging study of pain including visceral pain, irritable bowel syndrome, ulcerative colitis, and Crohn's disease]]
 +
* [[SOCR_Data_N46_TBI_ROI_Volumes | A Neuroimaging study of Traumatic Brain Injury (TBI) including global and local volumetric measures of brain integrity at acute and chronic states]]
  
 
=== Biomedical Data===
 
=== Biomedical Data===
 +
* [https://www.healthdatagym.org/datasets Health Data Gymnasium]
 +
* [[SOCR_Data_PD_BiomedBigMetadata|Human Health: Predictive Big Data Analytics, Modeling and Visualization of Clinical, Genetic and Imaging Data for Parkinson’s Disease]]
 +
* [[SOCR_Data_AMI_NY_1993_HeartAttacks| 1993 New York State Heart Attack Patients: Acute Myocardial Infarction (AMI), N=12,844]]
 +
* [[SOCR_Data_AD_BiomedBigMetadata|Human Health: Modeling and Analysis of Clinical, Genetic and Imaging Data of Alzheimer’s Disease]]
 
* [[SOCR_Data_Dinov_032708_AllometricPlanRels | Allometric  relationship between population density, body mass and metabolic activity in Plants]]
 
* [[SOCR_Data_Dinov_032708_AllometricPlanRels | Allometric  relationship between population density, body mass and metabolic activity in Plants]]
 +
* [[SOCR_Data_052511_IrisSepalPetalClasses | Fisher's multivariate dataset on iris sepal and petal length]]
 
* [[SOCR_Data_BMI_Regression | Body Density & Body Mass Index (BMI) Data]]
 
* [[SOCR_Data_BMI_Regression | Body Density & Body Mass Index (BMI) Data]]
 
* [[SOCR_Data_KneePainData_041409 | Knee Pain Centroid Locations Data]]
 
* [[SOCR_Data_KneePainData_041409 | Knee Pain Centroid Locations Data]]
 +
* [[SOCR_Data_NIPS_InfantVitK_ShotData | Neonate Infant Pain Score (NIPS) Data (Vitamin K shots)]]
 +
* [[SOCR_Simulated_HELP_Data | Simulated Health Evaluation and Linkage to Primary (HELP) Care Dataset]]
 +
* [[SMHS_MissingData#Example| Demographic, clinical and cognitive variables in a cohort of traumatic brain injury (TBI) patients]]
 +
 +
===Healthcare and Health Science Data===
 +
* [https://umich.instructure.com/courses/38100/files/ A number of case-studies including Big and Heterogeneous clinical, nursing, and healthcare datasets].
 +
* [https://dataverse.harvard.edu/dataverse/harvard Harvard Dataverse (1,000's of case-studies)].
  
 
=== [[SOCR_US_CensusData | US Census Data]]===
 
=== [[SOCR_US_CensusData | US Census Data]]===
 
* [[SOCR_Data_LA_Neighborhoods_Data | Los Angeles County Neighborhoods Data (from US Census)]]
 
* [[SOCR_Data_LA_Neighborhoods_Data | Los Angeles County Neighborhoods Data (from US Census)]]
 
* [[SOCR_Data_2011_US_JobsRanking | 2011 US Jobs Ranking (200 Best to Worst Jobs in the USA for 2011)]]
 
* [[SOCR_Data_2011_US_JobsRanking | 2011 US Jobs Ranking (200 Best to Worst Jobs in the USA for 2011)]]
 +
* [[SOCR_Data_2019_US_JobsRanking | 2019 US Jobs Ranking (200 Best to Worst Jobs in the USA for 2019)]]
  
 
=== [http://www.presidency.ucsb.edu/ US Elections Data]===
 
=== [http://www.presidency.ucsb.edu/ US Elections Data]===
Line 64: Line 87:
  
 
===Other Data===
 
===Other Data===
 +
* [[SOCR_TurkiyeStudentEvalData  | Turkiye Student Evaluation Data Set]]
 +
* [[SOCR_Data_Brain2BodyWeight | Brain to Body Weight Dataset]]
 
* [[SOCR_Data_Dinov_021708_Earthquakes | California Earthquakes Data]] (1969-2007)
 
* [[SOCR_Data_Dinov_021708_Earthquakes | California Earthquakes Data]] (1969-2007)
 +
* [[SOCR_Data_CaliforniaLottery2011 | California Lottery]] (1992-2011)
 
* [[SOCR_Data_Dinov_072108_H_Index_Pubs | Faculty Publications]]
 
* [[SOCR_Data_Dinov_072108_H_Index_Pubs | Faculty Publications]]
 
* [[SOCR_Data_Dinov_030708_APExamScores | 2007 Advanced Placement (AP) Exam Scores by Discipline]]
 
* [[SOCR_Data_Dinov_030708_APExamScores | 2007 Advanced Placement (AP) Exam Scores by Discipline]]
Line 73: Line 99:
 
* [[SOCR_061708_NC_Data_Aquifer | Texas Wolfcamp aquifer data]]
 
* [[SOCR_061708_NC_Data_Aquifer | Texas Wolfcamp aquifer data]]
 
* [[SOCR_012708_ID_Data_HotDogs | Hot Dog Calorie and Sodium Dataset]]
 
* [[SOCR_012708_ID_Data_HotDogs | Hot Dog Calorie and Sodium Dataset]]
 +
 +
===SOCR Course Data and Case-Studies===
 +
* [https://umich.instructure.com/courses/38100 UMich HS 853, Fall 2015]
 +
** [https://umich.instructure.com/courses/38100/files General Resources]
 +
** [https://umich.instructure.com/courses/38100/files/folder/data Small Datasets]
 +
** [https://umich.instructure.com/courses/38100/files/folder/Case_Studies Biomedical and Health Science Case-Studies]
 +
* [https://umich.instructure.com/courses/90136 UMich HS 853, Fall 2016]
 +
** [https://umich.instructure.com/courses/90136/files General Resources including data, case-studies, lecture notes and code]
 +
* [https://umich.instructure.com/courses/143011/ UMich Data Science and Predictive Analytics (HS 650)]
 +
** [https://umich.instructure.com/courses/143011/files General Resources including data, case-studies, lecture notes and code]
 +
 +
===External Data Archives===
 +
* [https://ihpi.umich.edu/member-resources/data-and-methods/available-datasets University of Michigan Institute for Healthcare Policy and Innovation (IHPI)]
 +
* [https://www.ai.gov/ai-researchers-portal/data-resources/ AI.gov Data Resources]
 +
* [http://www.nature.com/sdata/archive Nature Scientific Data]
 +
* [https://toolbox.google.com/datasetsearch Google Data Search]
 +
* [https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/Physician-and-Other-Supplier.html Centers for Medicare & Medicaid Services (CMS)]
 +
* [http://www.census.gov/developers/ US Census] and [https://usa.ipums.org/usa/ US Census Graphical API Interface]
 +
* [http://www.ncbi.nlm.nih.gov/gap database of Genotypes and Phenotypes (dbGaP)]
 +
 +
==Machine Interfaces to Downloading [[SOCR Data]]==
 +
In addition to human interactions with the [[SOCR Data]], we provide several machine interfaces to consume and process these data.
 +
 +
* [[SOCR Data]] can be copy pasted directly from the Wiki HTML pages into any of the [http://socr.umich.edu/html/ana/ SOCR Java applets].
 +
* [http://socr.umich.edu/HTML5/SOCRAT/ SOCR Analytical Toolbox (SOCRAT)] provides a scalable web platform for in-browser applications for interactive data analysis and visualization.
 +
* SOCR Data can also be loaded into an R computational environment automatically using the protocol below illustrated with the case of a [[SOCR_Data_PD_BiomedBigMetadata|Parkinson's Disease dataset]]:
 +
 +
library(rvest)
 +
# Loading required package: xml2
 +
 +
wiki_url <- read_html("http://wiki.socr.umich.edu/index.php/SOCR_Data_PD_BiomedBigMetadata")
 +
html_nodes(wiki_url, "#content")
 +
 +
pd_data <- html_table(html_nodes(wiki_url,"table")\([[1]]\))
 +
head(pd_data); summary(pd_data)
  
 
<hr>
 
<hr>
* SOCR Home page: http://www.socr.ucla.edu
+
* SOCR Home page: https://www.socr.umich.edu
  
{{translate|pageName=http://wiki.stat.ucla.edu/socr/index.php?title=SOCR_Data}}
+
{{translate|pageName=https://wiki.socr.umich.edu/index.php?title=SOCR_Data}}

Latest revision as of 13:43, 21 October 2023

SOCR Educational Materials - SOCR Data

The links below contain a number of datasets that may be used for demonstration purposes in probability and statistics education. There are two types of data - simulated (computer-generated using random sampling) and observed (research, observationally or experimentally acquired).

SOCR Data

Simulated data

The SOCR resources provide a number of mechanisms to simulate data using computer random-number generators. Here are some of the most commonly used SOCR generators of simulated data:

Observed data

The following collections include a number of real observed datasets from different disciplines, acquired using different techniques and applicable in different situations.

Climate Change Data

Population Data

Economic, Business and Stock Market Data

Consumer Price Index (CPI)

Stock Market Data

Monetary-Base Data

Budgets and Deficits Data

Sector Data, Population Perception Trends data

World Peace

Neuroimaging Data

Biomedical Data

Healthcare and Health Science Data

US Census Data

US Elections Data

Other Data

SOCR Course Data and Case-Studies

External Data Archives

Machine Interfaces to Downloading SOCR Data

In addition to human interactions with the SOCR Data, we provide several machine interfaces to consume and process these data.

library(rvest)
# Loading required package: xml2

wiki_url <- read_html("http://wiki.socr.umich.edu/index.php/SOCR_Data_PD_BiomedBigMetadata")
html_nodes(wiki_url, "#content")
pd_data <- html_table(html_nodes(wiki_url,"table")\([[1]]\))
head(pd_data); summary(pd_data)



Translate this page:

(default)
Uk flag.gif

Deutsch
De flag.gif

Español
Es flag.gif

Français
Fr flag.gif

Italiano
It flag.gif

Português
Pt flag.gif

日本語
Jp flag.gif

България
Bg flag.gif

الامارات العربية المتحدة
Ae flag.gif

Suomi
Fi flag.gif

इस भाषा में
In flag.gif

Norge
No flag.png

한국어
Kr flag.gif

中文
Cn flag.gif

繁体中文
Cn flag.gif

Русский
Ru flag.gif

Nederlands
Nl flag.gif

Ελληνικά
Gr flag.gif

Hrvatska
Hr flag.gif

Česká republika
Cz flag.gif

Danmark
Dk flag.gif

Polska
Pl flag.png

România
Ro flag.png

Sverige
Se flag.gif