Difference between revisions of "SOCR News UMich SPH MLEED 2019"

From SOCR
Jump to: navigation, search
(References)
(Presentation)
 
(2 intermediate revisions by the same user not shown)
Line 3: Line 3:
  
 
==Logistics==
 
==Logistics==
[[Image:SOCR_Hat_2019.png|150px|thumbnail|right| [http://socr.umich.edu SOCR Resource] ]]
+
[[Image:SOCR_BeanieHat_2020.png|150px|thumbnail|right| [http://socr.umich.edu SOCR Resource] ]]
* '''Seminar Series''': [http://mleead.umich.edu/Events.php University of Michigan SPH Environmental Epidemiology Seminar Series]
+
* '''Seminar Series''': [https://sph.umich.edu/events/event.php?ID=8233 University of Michigan SPH Environmental Epidemiology Seminar Series]
 
* '''Date/Times''': Tuesday, Dec. 3, 2019, 12 PM (Noon).
 
* '''Date/Times''': Tuesday, Dec. 3, 2019, 12 PM (Noon).
 
* '''Place/Time''': [https://maps.studentlife.umich.edu/building/henry-frieze-vaughan-public-health-building 3755 SPH I] (School of Public Health, 1415 Washington Heights)
 
* '''Place/Time''': [https://maps.studentlife.umich.edu/building/henry-frieze-vaughan-public-health-building 3755 SPH I] (School of Public Health, 1415 Washington Heights)
Line 15: Line 15:
 
:  
 
:  
  
: To address this problem, the SOCR lab developed [http://DataSifter.org a novel statistical method (DataSifter)] that provides on-the-fly obfuscation of high-dimensional structured and unstructured sensitive data, e.g., clinical data from electronic health records (EHR). This technique provides complete administrative control over the balance between risk of data re-identification and preservation of the data information. Under careful set up of user-defined privacy levels, our simulation experiments suggest that the DataSifter protects privacy while maintaining data utility for different types of outcomes of interest. The application of DataSifter on ABIDE data provides a realistic demonstration of how to employ the proposed algorithm on EHR with more than 500 features. We are extending the DataSifter to desensitize longitudinal data and free-text. Time-permitting, some additional SOCR tools and resources may be demonstrated (http://www.socr.umich.edu).  
+
: To address this problem, the SOCR lab developed [http://DataSifter.org a novel statistical method (DataSifter)] that provides on-the-fly obfuscation of high-dimensional structured and unstructured sensitive data, e.g., clinical data from electronic health records (EHR). This technique provides complete administrative control over the balance between risk of data re-identification and preservation of the data information. Under careful set up of user-defined privacy levels, our simulation experiments suggest that the DataSifter protects privacy while maintaining data utility for different types of outcomes of interest. The application of DataSifter on ABIDE data provides a realistic demonstration of how to employ the proposed algorithm on EHR with more than 500 features. We are extending the DataSifter to desensitize longitudinal data and free-text. Time-permitting, some additional SOCR tools and resources may be demonstrated (http://www.socr.umich.edu).
 
 
  
 +
* [http://socr.umich.edu/docs/uploads/2019/Dinov_DataSifter_SPH_MLEED_2019.pdf Presentation Slides (PDF)]
  
 
==Demos==
 
==Demos==

Latest revision as of 16:53, 27 November 2019

SOCR News & Events: SOCR DataSifter: A Statistical Obfuscation Technique enabling Effective Data Sharing

Logistics

Presentation

  • Presenter: Ivo D. Dinov, joint work with Nina Zhou, Simeone Marino, Yi Zhao, Lu Wei, Lu Wang.
  • Title: SOCR DataSifter: A Statistical Obfuscation Technique enabling Effective Data Sharing
  • Abstract: Effective and pragmatic sharing of data that includes sensitive information is difficult. The validation and reproducibility of findings in many health, financial, intelligence, socioeconomic, and other high-dimensional case-studies is inhibited when the data can’t be shared and the results independently confirmed. Either the utility of the data may be compromised by significant masking of the data or alternatively there may be a high risk of exposing private personal or secure organizational information. Excessive scrambling or encoding of the information makes the information less useful for modeling, or analytical processing. Insufficient preprocessing may compromise sensitive information and introduce a substantial risk for re-identification of individuals by various stratification techniques.
To address this problem, the SOCR lab developed a novel statistical method (DataSifter) that provides on-the-fly obfuscation of high-dimensional structured and unstructured sensitive data, e.g., clinical data from electronic health records (EHR). This technique provides complete administrative control over the balance between risk of data re-identification and preservation of the data information. Under careful set up of user-defined privacy levels, our simulation experiments suggest that the DataSifter protects privacy while maintaining data utility for different types of outcomes of interest. The application of DataSifter on ABIDE data provides a realistic demonstration of how to employ the proposed algorithm on EHR with more than 500 features. We are extending the DataSifter to desensitize longitudinal data and free-text. Time-permitting, some additional SOCR tools and resources may be demonstrated (http://www.socr.umich.edu).

Demos


References




Translate this page:

(default)
Uk flag.gif

Deutsch
De flag.gif

Español
Es flag.gif

Français
Fr flag.gif

Italiano
It flag.gif

Português
Pt flag.gif

日本語
Jp flag.gif

България
Bg flag.gif

الامارات العربية المتحدة
Ae flag.gif

Suomi
Fi flag.gif

इस भाषा में
In flag.gif

Norge
No flag.png

한국어
Kr flag.gif

中文
Cn flag.gif

繁体中文
Cn flag.gif

Русский
Ru flag.gif

Nederlands
Nl flag.gif

Ελληνικά
Gr flag.gif

Hrvatska
Hr flag.gif

Česká republika
Cz flag.gif

Danmark
Dk flag.gif

Polska
Pl flag.png

România
Ro flag.png

Sverige
Se flag.gif