Abstract
In a crossover design, individuals usually undergo all experimental conditions, and the measurements of biomarkers are repeatedly observed at serial time points for each experimental condition. To analyse time-dependent changing patterns of biomarkers, clustering algorithms are commonly used across time points to group together subjects having similar changing patterns. Among the clustering methods, hierarchical- and K-means clustering have been popularly used. However, since they are originally unsupervised approaches, they do not identify different changing patterns between experimental conditions. Therefore, we propose a new two-stage clustering method focusing on changing patterns. The first stage is to eliminate non-informative biomarkers using Euclidean distances, and the second stage is to allocate the remaining biomarkers to predefined patterns using a correlation-based distance. We demonstrate the advantages of our proposed method by simulation and real data analysis.
| Original language | English |
|---|---|
| Pages (from-to) | 299-317 |
| Number of pages | 19 |
| Journal | International Journal of Data Mining and Bioinformatics |
| Volume | 23 |
| Issue number | 4 |
| DOIs | |
| State | Published - 2020 |
Bibliographical note
Publisher Copyright:Copyright © 2020 Inderscience Enterprises Ltd.
Keywords
- Biomarker expression
- Cross-over design
- Intervention study
- Pattern clustering
- Two-stage