PPtree: Projection pursuit classification tree

Yoon Dong Lee, Dianne Cook, Ji Won Park, Eun Kyung Lee

Research output: Contribution to journal › Article › peer-review

15 Scopus citations

Abstract

In this paper, we propose a new classification tree, the projection pursuit classification tree (PPtree), which combines tree-structured methods with projection pursuit dimension reduction. The tree originates from the projection pursuit method for classification. At each node, one of the projection pursuit indices that use class information (the LDA, Lr, or PDA index) is maximized to find the projection that best separates the groups. The tree splitting criteria are then applied to this optimized projection of the data to separate the groups. These steps are iterated until the last two classes are separated. The main advantages of this tree are that it effectively uses correlations between variables to find separations and that it provides a visual representation of the differences between groups in a 1-dimensional space, which can be used to interpret the results. In addition, at each node of the tree, the projection coefficients represent the importance of each variable for the group separation, which is helpful for selecting variables in classification problems.
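
The node-by-node procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it uses only an LDA-style index (the leading eigenvector of W^{-1}B), and the grouping rule (split the classes at the largest gap between projected class means, with the cutoff at the midpoint) is an assumption made for illustration. The names `lda_direction`, `build_node`, and `predict_one` are hypothetical.

```python
import numpy as np

def lda_direction(X, y):
    """Unit vector maximizing between-class over within-class scatter in 1-D."""
    p = X.shape[1]
    mean_all = X.mean(axis=0)
    W = np.zeros((p, p))  # within-class scatter
    B = np.zeros((p, p))  # between-class scatter
    for g in np.unique(y):
        Xg = X[y == g]
        d = Xg - Xg.mean(axis=0)
        W += d.T @ d
        m = (Xg.mean(axis=0) - mean_all)[:, None]
        B += len(Xg) * (m @ m.T)
    W += 1e-8 * np.eye(p)  # small ridge for numerical stability (assumption)
    vals, vecs = np.linalg.eig(np.linalg.solve(W, B))
    a = np.real(vecs[:, np.argmax(np.real(vals))])
    return a / np.linalg.norm(a)

def build_node(X, y):
    """Recursively split the set of classes until each leaf holds one class."""
    classes = np.unique(y)
    if len(classes) == 1:
        return {"label": classes[0]}
    a = lda_direction(X, y)
    z = X @ a  # 1-D projection of the data at this node
    means = np.array([z[y == g].mean() for g in classes])
    order = np.argsort(means)
    # Assumed rule: separate the classes at the largest gap between projected means.
    k = int(np.argmax(np.diff(means[order])))
    cut = (means[order[k]] + means[order[k + 1]]) / 2
    in_left = np.isin(y, classes[order[: k + 1]])
    return {
        "proj": a,  # coefficients indicate each variable's contribution to the split
        "cut": cut,
        "left": build_node(X[in_left], y[in_left]),
        "right": build_node(X[~in_left], y[~in_left]),
    }

def predict_one(node, x):
    """Follow the projections and cutoffs down to a leaf label."""
    while "label" not in node:
        node = node["left"] if x @ node["proj"] < node["cut"] else node["right"]
    return node["label"]

# Synthetic three-class data (illustrative only, not from the paper).
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [5.0, 5.0], [10.0, 0.0]])
X = np.vstack([c + 0.5 * rng.standard_normal((50, 2)) for c in centers])
y = np.repeat(np.arange(3), 50)

tree = build_node(X, y)
pred = np.array([predict_one(tree, x) for x in X])
acc = float((pred == y).mean())
print("training accuracy:", acc)
```

Because every split assigns whole classes to one side, a problem with G classes yields at most G - 1 internal nodes in this sketch, and each node stores a one-dimensional projection that can be plotted or inspected for variable importance.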

Original language: English
Pages (from-to): 1369-1386
Number of pages: 18
Journal: Electronic Journal of Statistics
Volume: 7
Issue number: 1
State: Published - 2013

Keywords

  • Classification tree
  • Projection pursuit
  • Variable selection
