A Comparison of Information Criteria in Clustering Based on Mixture of Multivariate Normal Distributions
[摘要] Clustering analysis based on a mixture of multivariate normal distributions is commonly used in the clustering of multidimensional data sets. Model selection is one of the most important problems in mixture cluster analysis based on the mixture of multivariate normal distributions. Model selection involves the determination of the number of components (clusters) and the selection of an appropriate covariance structure in the mixture cluster analysis. In this study, the efficiency of information criteria that are commonly used in model selection is examined. The effectiveness of information criteria has been determined according to the success in the selection of the number of components and in the selection of an appropriate covariance matrix.
[发布日期] [发布机构]
[效力级别] [学科分类] 计算数学
[关键词] cluster analysis;mixture models;information criteria [时效性]