Abstract
In this paper, we investigate methods of estimating the mixing proportion in the case when one of the probability densities is not specified analytically in a mixture model. The methodology we propose is motivated by a sequential clustering algorithm. After a sequential clustering algorithm finds the center of a cluster, the next step is to identify observations belonging to that cluster. If we assume that the center of the cluster is known and that the distribution of observations not belonging to the cluster is unknown, the problem of identifying observations in the cluster is similar to the problem of estimating the mixing proportion in a special two-component mixture model. The mixing proportion can be considered as the proportion of observations belonging to the cluster. We propose two estimators for parameters in the model and compare the performance of these two estimators in several different cases.
Original language | English |
---|---|
Pages (from-to) | 2276-2283 |
Number of pages | 8 |
Journal | Computational Statistics and Data Analysis |
Volume | 54 |
Issue number | 10 |
DOIs | |
State | Published - 1 Oct 2010 |
Bibliographical note
Funding Information:S. Song’s research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (No. 2009-0065128 ).
Keywords
- Clustering
- Semiparametric mixture model