TY - GEN
T1 - Two stage pattern clustering analysis in cross-over experimental design
AU - Huh, Iksoo
AU - Choi, Sunghoon
AU - Kim, Youjin
AU - Park, Soo Yeon
AU - Kwon, Oran
AU - Park, Taesung
N1 - Funding Information:
This research was supported by the Bio & Medical Technology Development Program of the National Research Foundation (NRF) funded by the Korean government (MSIT) (2013M3A9C4078158) results of these AUC measurements do not reflect dynamic patterns of biomarkers across time points; such patterns can be an important piece of information, with respect to reactivity to experimental conditions.
Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - In interventional studies, biomarkers such as metabolites, are usually measured across serial time points. And when the interest lies in comparing expression levels between different experimental conditions, summary measures such as area under curve (AUC), have been widely used. Although the summary measure based approaches have been successful in identifying novel biomarkers, they do not reveal anything about time-dependent changing patterns of biomarkers which can demonstrate the reactivity of biomarkers to various physiological conditions. To account for such patterns, all measurements across time points need to be used, and clustering analysis with the measurements can group together biomarkers having similar changing patterns. Some such popularly used clustering methods include hierarchical- and K-means clustering. While these may provide some well-clustered results, their patterns are quite dependent on input data sets, making it difficult to obtain consistent patterns across different interventional studies. In addition, it is problematic for these methods to discriminate biomarkers with weakly active patterns that need to be grouped as static, compared to those having strongly active patterns, when their patterns are highly similar. To address these issues, we propose a new clustering method for improving identification of changing patterns. Our approach is based on a two-stage process: the first is elimination of stable markers using Euclidean distances, while the second stage assigns the remaining biomarkers to predefined patterns using 1-correlation distance measure. By simulation studies, we showed that our proposed method had superior classification performances, compared to other unsupervised clustering methods. We expect that this approach can complement the existing summary measure based approaches.
AB - In interventional studies, biomarkers such as metabolites, are usually measured across serial time points. And when the interest lies in comparing expression levels between different experimental conditions, summary measures such as area under curve (AUC), have been widely used. Although the summary measure based approaches have been successful in identifying novel biomarkers, they do not reveal anything about time-dependent changing patterns of biomarkers which can demonstrate the reactivity of biomarkers to various physiological conditions. To account for such patterns, all measurements across time points need to be used, and clustering analysis with the measurements can group together biomarkers having similar changing patterns. Some such popularly used clustering methods include hierarchical- and K-means clustering. While these may provide some well-clustered results, their patterns are quite dependent on input data sets, making it difficult to obtain consistent patterns across different interventional studies. In addition, it is problematic for these methods to discriminate biomarkers with weakly active patterns that need to be grouped as static, compared to those having strongly active patterns, when their patterns are highly similar. To address these issues, we propose a new clustering method for improving identification of changing patterns. Our approach is based on a two-stage process: the first is elimination of stable markers using Euclidean distances, while the second stage assigns the remaining biomarkers to predefined patterns using 1-correlation distance measure. By simulation studies, we showed that our proposed method had superior classification performances, compared to other unsupervised clustering methods. We expect that this approach can complement the existing summary measure based approaches.
KW - Biomarker expression
KW - Pattern Clustering
UR - http://www.scopus.com/inward/record.url?scp=85084341498&partnerID=8YFLogxK
U2 - 10.1109/BIBM47256.2019.8983016
DO - 10.1109/BIBM47256.2019.8983016
M3 - Conference contribution
AN - SCOPUS:85084341498
T3 - Proceedings - 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019
SP - 1977
EP - 1981
BT - Proceedings - 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019
A2 - Yoo, Illhoi
A2 - Bi, Jinbo
A2 - Hu, Xiaohua Tony
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 18 November 2019 through 21 November 2019
ER -