TY - JOUR
T1 - AnomiGAN
T2 - 25th Pacific Symposium on Biocomputing, PSB 2020
AU - Bae, Ho
AU - Jung, Dahuin
AU - Choi, Hyun Soo
AU - Yoon, Sungroh
N1 - Funding Information:
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (Ministry of Science and ICT) [2014M3C9A3063541, 2018R1A2B3001628], the Brain Korea 21 Plus Project, and Strategic Initiative for Micro-biomes in Agriculture and Food, Ministry of Agriculture, Food and Rural Affairs, Republic of Korea (as part of the multi-ministerial Genome Technology to Business Translation Program, grant number 918013-4).
Publisher Copyright:
© 2019 The Authors.
PY - 2020
Y1 - 2020
N2 - Typical personal medical data contains sensitive information about individuals. Storing or sharing the personal medical data is thus often risky. For example, a short DNA sequence can provide information that can identify not only an individual, but also his or her relatives. Nonetheless, most countries and researchers agree on the necessity of collecting personal medical data. This stems from the fact that medical data, including genomic data, are an indispensable resource for further research and development regarding disease prevention and treatment. To prevent personal medical data from being misused, techniques to reliably preserve sensitive information should be developed for real world applications. In this paper, we propose a framework called anonymized generative adversarial networks (AnomiGAN), to preserve the privacy of personal medical data, while also maintaining high prediction performance. We compared our method to state-of-The-Art techniques and observed that our method preserves the same level of privacy as differential privacy (DP) and provides better prediction results. We also observed that there is a trade-off between privacy and prediction results that depends on the degree of preservation of the original data. Here, we provide a mathematical overview of our proposed model and demonstrate its validation using UCI machine learning repository datasets in order to highlight its utility in practice. The code is available at https://github.com/hobae/AnomiGAN/.
AB - Typical personal medical data contains sensitive information about individuals. Storing or sharing the personal medical data is thus often risky. For example, a short DNA sequence can provide information that can identify not only an individual, but also his or her relatives. Nonetheless, most countries and researchers agree on the necessity of collecting personal medical data. This stems from the fact that medical data, including genomic data, are an indispensable resource for further research and development regarding disease prevention and treatment. To prevent personal medical data from being misused, techniques to reliably preserve sensitive information should be developed for real world applications. In this paper, we propose a framework called anonymized generative adversarial networks (AnomiGAN), to preserve the privacy of personal medical data, while also maintaining high prediction performance. We compared our method to state-of-The-Art techniques and observed that our method preserves the same level of privacy as differential privacy (DP) and provides better prediction results. We also observed that there is a trade-off between privacy and prediction results that depends on the degree of preservation of the original data. Here, we provide a mathematical overview of our proposed model and demonstrate its validation using UCI machine learning repository datasets in order to highlight its utility in practice. The code is available at https://github.com/hobae/AnomiGAN/.
KW - Anonymization
KW - Deep neural networks
KW - Differential privacy
KW - Generative adversarial networks
UR - http://www.scopus.com/inward/record.url?scp=85075975591&partnerID=8YFLogxK
M3 - Conference article
C2 - 31797628
AN - SCOPUS:85075975591
SN - 2335-6928
VL - 25
SP - 563
EP - 574
JO - Pacific Symposium on Biocomputing
JF - Pacific Symposium on Biocomputing
IS - 2020
Y2 - 3 January 2020 through 7 January 2020
ER -