Techniques for Automated Performance Analysis
[摘要] The performance of a particular HPC code depends on a multitude of variables, including compiler selection, optimization flags, OpenMP pool size, file system load, memory usage, MPI configuration, etc. As a result of this complexity, current predictive models have limited applicability, especially at scale. We present a formulation of scientific codes, nodes, and clusters that reduces complex performance analysis to well-known mathematical techniques. Building accurate predictive models and enhancing our understanding of scientific codes at scale is an important step towards exascale computing.
[发布日期] 2014-09-02 [发布机构]
[效力级别] [学科分类] 数学(综合)
[关键词] COMPUTER SCIENCE [时效性]