Abstract
In a crossover design, individuals usually undergo all experimental conditions, and the measurements of biomarkers are repeatedly observed at serial time points for each experimental condition. To analyse time-dependent changing patterns of biomarkers, clustering algorithms are commonly used across time points to group together subjects having similar changing patterns. Among the clustering methods, hierarchical- and K-means clustering have been popularly used. However, since they are originally unsupervised approaches, they do not identify different changing patterns between experimental conditions. Therefore, we propose a new two-stage clustering method focusing on changing patterns. The first stage is to eliminate non-informative biomarkers using Euclidean distances, and the second stage is to allocate the remaining biomarkers to predefined patterns using a correlation-based distance. We demonstrate the advantages of our proposed method by simulation and real data analysis.
Original language | English |
---|---|
Pages (from-to) | 299-317 |
Number of pages | 19 |
Journal | International Journal of Data Mining and Bioinformatics |
Volume | 23 |
Issue number | 4 |
DOIs | |
State | Published - 2020 |
Bibliographical note
Publisher Copyright:Copyright © 2020 Inderscience Enterprises Ltd.
Keywords
- Biomarker expression
- Cross-over design
- Intervention study
- Pattern clustering
- Two-stage