Efficient call path profiles on unmodified, optimized code
[Abstract] Identifying performance bottlenecks and their associated calling contexts is critical for tuning high-performance applications. This thesis presents a new approach to measuring resource utilization together with the calling context in which it occurs. Previous instrumentation-based approaches for reporting calling context introduce overhead proportional to the number of function calls performed. We describe a new design for a call path profiler based on stack sampling. Our design enables profiling of unmodified binaries, provides low and controllable overhead, and accurately attributes context-dependent costs of calls. We use a special trampoline function that improves the efficiency of stack sampling and enables the association of unique invocation counts with sampled call sites. We evaluate a Tru64/Alpha implementation of our design and show that on call-intensive codes, the overhead of our approach is over two orders of magnitude lower than the overhead of an instrumentation-based approach, with comparable overhead on other codes.
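As a rough illustration of why stack sampling can attribute context-dependent costs, the sketch below merges sampled call stacks into a calling context tree, so the same function reached through different call paths is accounted for separately. It is not the thesis's implementation: the names `CCTNode` and `record_sample` and the sample data are hypothetical, and the trampoline mechanism and invocation counts described in the abstract are not modeled. Work is done once per sample, not once per function call.

```python
from dataclasses import dataclass, field


@dataclass
class CCTNode:
    """One calling context: a function reached through a specific call path."""
    name: str
    self_samples: int = 0  # samples taken while this frame was innermost
    children: dict = field(default_factory=dict)

    def child(self, name):
        # Reuse the node for an already-seen callee, or create it on first sight.
        if name not in self.children:
            self.children[name] = CCTNode(name)
        return self.children[name]

    def total_samples(self):
        # Inclusive cost: this context plus everything called from it.
        return self.self_samples + sum(
            c.total_samples() for c in self.children.values()
        )


def record_sample(root, stack):
    """Attribute one sample to the context named by `stack` (outermost frame first)."""
    node = root
    for frame in stack:
        node = node.child(frame)
    node.self_samples += 1


# Hypothetical data: each list is one sampled call stack.
samples = [
    ["main", "solve", "dot"],
    ["main", "solve", "dot"],
    ["main", "report", "dot"],
    ["main", "solve"],
]

root = CCTNode("<root>")
for stack in samples:
    record_sample(root, stack)

main = root.children["main"]
solve_dot = main.children["solve"].children["dot"].self_samples
report_dot = main.children["report"].children["dot"].self_samples
print(solve_dot, report_dot, main.total_samples())
```

Here `dot` is charged two samples under `main -> solve` and one under `main -> report`, which a flat per-function profile would merge into a single count of three.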
[Publication date]  [Publisher] Rice University
[Level] science [Subject classification] 
[Keywords]  [Timeliness] 