已收录 272962 条政策
 政策提纲
  • 暂无提纲
Contributions to Effect Size Analysis with Large Scale Data.
[摘要] Large and complex data are common to the modern life. These data sets are mines of information, statisticians are now developing the new statistical techniques to explore information from them. This dissertation contributes statistical methods to explore such challenging types of data sets.The second chapter estimates the dissimilarity among effect sizes in a regression model. A natural summary is the the ratio of the maximum magnitude to the minimum magnitude among the effects. For this nonstandard quantity, some standard techniques cannot be applied directly. Some procedures are discussed to improve the performance of point estimation and confidence intervals. We apply our procedures to the National Health and Nutrition Examination Survey (NHANES) from 2011 to 2012.The third chapter investigates functional summaries for a p by p covariance structure in an accessible and easily visualized form. The summaries reflect interpretable patterns in the data and are unaffected by relabeling of the variables. The proposed functional summaries allow us to visualize differences in the covariance structures between two data sets, even when they have different dimensions. Our summaries emphasize the degree by which each variable is predictable from the others, with a special focus on the number of variables required to predict another variable. We apply the functional summaries to two gene expression data sets, 108 normal heart tissue from the Cleveland Clinic Kaufman Center and 734 whole-blood RNA samples the from Estonian Biobank, to compare structures with different dimensions.The fourth chapter studies a projection-based approach for exploring conditional correlation paths. We propose a graphical tool that enables us to explore the change in dependence structure from marginal correlations to partial correlations. This path is built via adding information from others gradually to reach partial correlations. The projection-based proposed approach can be applied to another type of conditional correlation matrix which is conditioned on linear statistics of the data. We can explore the change in correlation matrices when the values of a linear statistics varied. We apply the approach to gene expression data set with 108 normal heart tissue from the Cleveland Clinic Kaufman Center.
[发布日期]  [发布机构] University of Michigan
[效力级别] functional summary [学科分类] 
[关键词] dissimilarity among effect sizes;functional summary;correlation path;Statistics and Numeric Data;Science;Statistics [时效性] 
   浏览次数:57      统一登录查看全文      激活码登录查看全文