Quantifying Sensitive Information Leakage from Functinal Genomics Data

Arif Ozgun Harmanci
Yale University

The increase in the size of personal genomics data creates new challenges in safely sharing these large datasets. While these data have great potential for advancing disease research, the impact of these data with respect to implications on personal privacy. One of the significant challenges is the huge increase in the number of genomics data associated with functional annotation of the genome, such as RNA-sequencing. Unlike the variant datasets, these data aim at characterizing the dynamics of gene activity and gene regulation. In this talk, I will review these sources and the scenarios under which they may leak sensitive information such as genetic variant genotypes. Specifically, I will summarize the sources of leakage from different representations of RNA-sequencing datasets and the several future challenges that we must tackle to safely share these datasets.


Back to Algorithmic Challenges in Protecting Privacy for Biomedical Data