Sparse partial least-squares regression for high-throughput survival data analysis

Donghwan Lee, Youngjo Lee, Yudi Pawitan, Woojoo Lee

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

The partial least-square (PLS) method has been adapted to the Cox's proportional hazards model for analyzing high-dimensional survival data. But because the latent components constructed in PLS employ all predictors regardless of their relevance, it is often difficult to interpret the results. In this paper, we propose a new formulation of sparse PLS (SPLS) procedure for survival data to allow simultaneous sparse variable selection and dimension reduction. We develop a computing algorithm for SPLS by modifying an iteratively reweighted PLS algorithm and illustrate the method with the Swedish and the Netherlands Cancer Institute breast cancer datasets. Through the numerical studies, we find that our SPLS method generally performs better than the standard PLS and sparse Cox regression methods in variable selection and prediction.

Original languageEnglish
Pages (from-to)5340-5352
Number of pages13
JournalStatistics in Medicine
Volume32
Issue number30
DOIs
StatePublished - 30 Dec 2013

Keywords

  • High-dimensional problem
  • Partial least-squares
  • Penalized likelihood
  • Sparsity
  • Survival analysis

Fingerprint

Dive into the research topics of 'Sparse partial least-squares regression for high-throughput survival data analysis'. Together they form a unique fingerprint.

Cite this