SOCR News FSU DataImpact Symposium 2019

From SOCR
Revision as of 08:40, 8 March 2019 by Dinov (talk | contribs) (Created page with "== SOCR News & Events: Statistics, the Impact of Big Data Conference== ===Logistics=== * Event: Statistics, the Impact of Big Data - 60<sup>th</sup> Annivers...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

SOCR News & Events: Statistics, the Impact of Big Data Conference

Logistics

Presenter

Ivo Dinov

Title

DataSifter: Sharing of Sensitive Data via Statistical Obfuscation

Abstract

There are no practical, reliable, and effective mechanisms to share sensitive information to inspire novel methodological developments without compromising intellectual property, confidentiality, personal data. In many fields, like health, financial, intelligence, socioeconomics, high-dimensional data is prevalent and there is a profound need to develop advanced data interrogation techniques to extracting useful and actionable information the balancing the utility of the data with the risk of exposing private, personal, or secure organizational information. Excessive scrambling or encoding of the information makes it less useful for modelling, or analytical processing. Insufficient preprocessing may uncover sensitive information and introduce a substantial risk for re-identification of individuals or trade secrets by various stratification techniques. To address this problem, we developed a novel statistical method (DataSifter) that provides on-the-fly de-identification of sensitive structured and unstructured high dimensional data, such as clinical data from electronic health records (EHR). DataSifter technology enables administrative control over the balance between risk of data re-identification and preservation of the data information content. Under careful set up of user-defined privacy levels, our simulation experiments and real biomedical case-studies suggest that the DataSifter protects privacy while maintaining data utility for different types of outcomes of interest. The application of DataSifter on ABIDE data provides a realistic demonstration of how to employ the proposed algorithm on EHR with more than 500 features (DataSifter.org).





Translate this page:

(default)
Uk flag.gif

Deutsch
De flag.gif

Español
Es flag.gif

Français
Fr flag.gif

Italiano
It flag.gif

Português
Pt flag.gif

日本語
Jp flag.gif

България
Bg flag.gif

الامارات العربية المتحدة
Ae flag.gif

Suomi
Fi flag.gif

इस भाषा में
In flag.gif

Norge
No flag.png

한국어
Kr flag.gif

中文
Cn flag.gif

繁体中文
Cn flag.gif

Русский
Ru flag.gif

Nederlands
Nl flag.gif

Ελληνικά
Gr flag.gif

Hrvatska
Hr flag.gif

Česká republika
Cz flag.gif

Danmark
Dk flag.gif

Polska
Pl flag.png

România
Ro flag.png

Sverige
Se flag.gif