已收录 273123 条政策
 政策提纲
  • 暂无提纲
Automatic Thread-Level Parallelization in the Chombo AMR Library
[摘要] The increasing on-chip parallelism has some substantial implications for HPC applications. Currently, hybrid programming models (typically MPI+OpenMP) are employed for mapping software to the hardware in order to leverage the hardware?s architectural features. In this paper, we present an approach that automatically introduces thread level parallelism into Chombo, a parallel adaptive mesh refinement framework for finite difference type PDE solvers. In Chombo, core algorithms are specified in the ChomboFortran, a macro language extension to F77 that is part of the Chombo framework. This domain-specific language forms an already used target language for an automatic migration of the large number of existing algorithms into a hybrid MPI+OpenMP implementation. It also provides access to the auto-tuning methodology that enables tuning certain aspects of an algorithm to hardware characteristics. Performance measurements are presented for a few of the most relevant kernels with respect to a specific application benchmark using this technique as well as benchmark results for the entire application. The kernel benchmarks show that, using auto-tuning, up to a factor of 11 in performance was gained with 4 threads with respect to the serial reference implementation.
[发布日期] 2011-05-26 [发布机构] 
[效力级别]  [学科分类] 数学(综合)
[关键词] auto-tuning;Chombo;ChomboFortran;HPC;OpenMP;hybrid;AMR [时效性] 
   浏览次数:56      统一登录查看全文      激活码登录查看全文