Based on the COBRA framework, we implemented two dynamic compiler optimizations to reduce the impact of coherent memory accesses in OpenMP NAS parallel benchmarks. Using OpenMP NAS parallel benchmarks, we show how COBRA can adaptively choose appropriate optimizations according to observed changing runtime program behavior. The optimizations improve the performance of OpenMP NAS parallel benchmarks (BT, SP, LU, FT, MG, CG) up to 15% with an average of 4.79 on a 4-way Itanium 2 SMP server, and up to 68% with an average of 17.5% on a SGI Altix cc-NUMA system.The target address of the branch instruction is the start of the loop and the PC address of the branch instruction is the end of the loop 44 5.4 Conditions checked in the phase detection hardware 46 5.5 Schematic diagram of the hardwareanbsp;...
|Title||:||COBRA: A Framework for Continuous Profiling and Binary Re-adaptation|
|Publisher||:||ProQuest - 2008|