Virtual Institute — High Productivity Supercomputing

ParCo Minisymposium on Scalability and Usability of HPC Programming Tools


Sep 5, 2007


At the ParCo Conference 2007 in Aachen/Jülich


Facing increasing power dissipation and little instruction-level parallelism left to exploit, computer architects are realizing further performance gains by placing multiple "slower" processor cores on a chip rather than by building faster uni-processors. As a consequence, numerical simulations are being required to harness much higher degrees of parallelism in order to satisfy their growing demand for computing power. However, writing code that runs correctly and efficiently on large numbers of processors and cores is extraordinarily challenging and requires adequate tool support for debugging and performance analysis. Unfortunately, increased concurrency levels place higher scalability demands not only on applications but also on software tools. When applied to larger numbers of processors, familiar tools often cease to work in a satisfactory manner (e.g., due to escalating memory requirements or limited I/O bandwidth). In addition, tool usage is often complicated and requires the user to learn a separate interface. This minisymposium will serve as a forum to discuss methods and experiences related to scalability and usability of HPC programming tools, including their integration with compilers and the overall development environment.



  • Benchmarking the Stack Trace Analysis Tool for BlueGene/L
    Gregory L. Lee, Lawrence Livermore National Laboratory [PDF]
  • Scalable, Automated Performance Analysis with TAU and PerfExplorer
    Kevin A. Huck, University of Oregon [PDF]
  • Developing Scalable Applications with Vampir
    Matthias S. Müller, Technische Universität Dresden [PDF]


  • Scalable Collation and Presentation of Call-Path Profile Data with CUBE
    Markus Geimer, Forschungszentrum Jülich [PDF]
  • Coupling DDT and Marmot for Debugging of MPI Applications
    Bettina Krammer, HLRS - High Performance Computing Center Stuttgart [PDF]
  • Compiler Support for Efficient Profiling and Tracing
    Barbara Chapman, University of Houston [PDF]


  • Comparing Intel Thread Checker and Sun Thread Analyzer
    Christian Terboven, RWTH Aachen University [PDF]
  • Continuous Runtime Profiling of OpenMP Applications
    Karl Fürlinger, University of Tennessee [PDF]
  • Understanding Memory Access Bottlenecks on Multi-core
    Josef Weidendorfer, Technische Universität München [PDF]


Forschungszentrum Jülich, JSC, Rotunde

Programm Committee

  • Felix Wolf, Forschungszentrum Jülich (chair)
  • Daniel Becker, Forschungszentrum Jülich
  • Bettina Krammer, University of Stuttgart
  • Allen Malony, University of Oregon
  • Dieter an Mey, RWTH Aachen University
  • Shirley Moore, University of Tennessee
  • Matthias Müller, Technical University of Dresden
  • Martin Schulz, Lawrence Livermore National Lab