Sparse partial least-squares regression for high-throughput survival data analysis

Donghwan Lee, Youngjo Lee, Yudi Pawitan, Woojoo Lee

Research output: Contribution to journalArticlepeer-review

9 Scopus citations


The partial least-square (PLS) method has been adapted to the Cox's proportional hazards model for analyzing high-dimensional survival data. But because the latent components constructed in PLS employ all predictors regardless of their relevance, it is often difficult to interpret the results. In this paper, we propose a new formulation of sparse PLS (SPLS) procedure for survival data to allow simultaneous sparse variable selection and dimension reduction. We develop a computing algorithm for SPLS by modifying an iteratively reweighted PLS algorithm and illustrate the method with the Swedish and the Netherlands Cancer Institute breast cancer datasets. Through the numerical studies, we find that our SPLS method generally performs better than the standard PLS and sparse Cox regression methods in variable selection and prediction.

Original languageEnglish
Pages (from-to)5340-5352
Number of pages13
JournalStatistics in Medicine
Issue number30
StatePublished - 30 Dec 2013


  • High-dimensional problem
  • Partial least-squares
  • Penalized likelihood
  • Sparsity
  • Survival analysis


Dive into the research topics of 'Sparse partial least-squares regression for high-throughput survival data analysis'. Together they form a unique fingerprint.

Cite this