TY - JOUR
T1 - DNA steganalysis using deep recurrent neural networks
AU - Bae, Ho
AU - Lee, Byunghan
AU - Kwon, Sunyoung
AU - Yoon, Sungroh
N1 - Funding Information:
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (Ministry of Science and ICT) [2014M3C9A3063541, 2018R1A2B3001628], and the Brain Korea 21 Plus Project in 2018.
Publisher Copyright:
© 2018 The Authors.
PY - 2019
Y1 - 2019
N2 - Recent advances in next-generation sequencing technologies have facilitated the use of deoxyribonucleic acid (DNA) as a novel covert channels in steganography. There are vari- ous methods that exist in other domains to detect hidden messages in conventional covert channels. However, they have not been applied to DNA steganography. The current most common detection approaches, namely frequency analysis-based methods, often overlook important signals when directly applied to DNA steganography because those methods de- pend on the distribution of the number of sequence characters. To address this limitation, we propose a general sequence learning-based DNA steganalysis framework. The proposed approach learns the intrinsic distribution of coding and non-coding sequences and detects hidden messages by exploiting distribution variations after hiding these messages. Using deep recurrent neural networks (RNNs), our framework identifies the distribution varia- tions by using the classification score to predict whether a sequence is to be a coding or non-coding sequence. We compare our proposed method to various existing methods and biological sequence analysis methods implemented on top of our framework. According to our experimental results, our approach delivers a robust detection performance compared to other tools.
AB - Recent advances in next-generation sequencing technologies have facilitated the use of deoxyribonucleic acid (DNA) as a novel covert channels in steganography. There are vari- ous methods that exist in other domains to detect hidden messages in conventional covert channels. However, they have not been applied to DNA steganography. The current most common detection approaches, namely frequency analysis-based methods, often overlook important signals when directly applied to DNA steganography because those methods de- pend on the distribution of the number of sequence characters. To address this limitation, we propose a general sequence learning-based DNA steganalysis framework. The proposed approach learns the intrinsic distribution of coding and non-coding sequences and detects hidden messages by exploiting distribution variations after hiding these messages. Using deep recurrent neural networks (RNNs), our framework identifies the distribution varia- tions by using the classification score to predict whether a sequence is to be a coding or non-coding sequence. We compare our proposed method to various existing methods and biological sequence analysis methods implemented on top of our framework. According to our experimental results, our approach delivers a robust detection performance compared to other tools.
KW - Deep recurrent neural network
KW - DNA steganalysis
KW - DNA steganography
KW - DNA watermarking
UR - http://www.scopus.com/inward/record.url?scp=85062761229&partnerID=8YFLogxK
M3 - Conference article
C2 - 30864313
AN - SCOPUS:85062761229
SN - 2335-6928
VL - 24
SP - 88
EP - 99
JO - Pacific Symposium on Biocomputing
JF - Pacific Symposium on Biocomputing
IS - 2019
T2 - 24th Pacific Symposium on Biocomputing, PSB 2019
Y2 - 3 January 2019 through 7 January 2019
ER -