已收录 268921 条政策
 政策提纲
  • 暂无提纲
A Comparison of Six Methods for Missing Data Imputation
[摘要] Missing data are part of almost all research and introduce an element of ambiguity into data analysis. It follows that we need to consider them appropriately in order to provide an efficient and valid analysis. In the present study, we compare 6 different imputation methods: Mean, K-nearest neighbors (KNN), fuzzy K-means (FKM), singular value decomposition (SVD), bayesian principal component analysis (bPCA) and multiple imputations by chained equations (MICE). Comparison was performed on four real datasets of various sizes (from 4 to 65 variables), under a missing completely at random (MCAR) assumption, and based on four evaluation criteria: Root mean squared error (RMSE), unsupervised classification error (UCE), supervised classification error (SCE) and execution time. Our results suggest that bPCA and FKM are two imputation methods of interest which deserve further consideration in practice.
[发布日期]  [发布机构] 
[效力级别]  [学科分类] 
[关键词] Missing data;Imputation methods;Comparison study;Missing completely at random;bPCA [时效性] 
   浏览次数:16      统一登录查看全文      激活码登录查看全文