Virtual Institute — High Productivity Supercomputing

45th VI-HPS Tuning Workshop (LRZ, Garching, Germany)

Date

Monday 10th - Thursday 13th June 2024

Location

The workshop will take place at the Leibniz Supercomputing Centre (LRZ) on the university campus in Garching near Munich, Germany.

Organising Institutions

LRZ

Goals

This workshop organised by VI-HPS and LRZ will:

  • give an overview of the VI-HPS programming tools suite
  • explain the functionality of individual tools, and how to use them effectively
  • offer hands-on experience and expert assistance using the tools

On completion, participants should be familiar with common performance analysis and diagnosis techniques and how they can be employed in practice (on a range of HPC systems). Those who prepared their own application test cases will have been coached in the tuning of their measurement and analysis, and provided with optimization suggestions.

Programme Overview

Presentations and hands-on sessions are planned on the following topics:

A brief overview of the capabilities of these and associated tools is provided in the VI-HPS Tools Guide.

The workshop will be held in English and run from 09:00 to no later than 18:00 each day, with breaks for lunch and refreshments.

Participants are encouraged to prepare their own MPI, OpenMP and hybrid MPI+OpenMP parallel application codes for analysis.

Programme in Detail (provisional) - all times given as CEST (UTC+2)

Day 1: Monday 10 June
09:00 Welcome [TBD]
  • Introduction to VI-HPS & overview of tools [Cédric Valensi, UVSQ]
  • Introduction to parallel performance engineering
  • CooLMUC2 computer system and software environment [Volker Weinberg, LRZ]
  • Building and running NPB/BT-MZ on CooLMUC2 [TBD]
10:30 (break)
11:00 Callgrind/kCachegrind cache utilisation analysis [Josef Weidendorfer, LRZ]
  • Hands-on exercises
12:30 (lunch)
14:00 Caliper performance analysis toolbox in a library [David Böhme, LLNL]
15:30 (break)
16:00 Hands-on coaching to apply Callgrind & Caliper to analyze participants' own code(s).
17:30 (adjourn)

Day 2: Tuesday 11 June
09:00 Linaro MAP profiling and performance reports [Rudy Shand, Linaro]
  • Linaro hands-on exercises
10:30 (break)
11:00 MAQAO performance analysis framework [Cédric Valensi & Emmanuel Oseret, UVSQ]
  • MAQAO hands-on exercises (MAQAO quick reference)
12:30 (lunch)
14:00 Hands-on coaching to apply Linaro Tools & MAQAO to analyze participants' own code(s).
15:30 (break)
16:00 Hands-on coaching to apply Linaro Tools & MAQAO to analyze participants' own code(s).
17:30 (adjourn)

Day 3: Wednesday 12 June
09:00 Score-P instrumentation & measurement toolset [Jan André Reuter & Ilya Zhukhov, JSC]
  • Score-P analysis scoring & measurement filtering
  • Score-P hands-on exercises
  • CUBE profile explorer hands-on exercises
  • Score-P specialized instrumentation and measurement [Jan André Reuter & Ilya Zhukhov, JSC]
10:30 (break)
11:00 Scalasca automated trace analysis [Jan André Reuter & Ilya Zhukhov, JSC]
  • Scalasca hands-on exercises
  • mpiP lightweight MPI profiling [Amir Raoofy, LRZ]
  • mpiP hands-on exercises
12:30 (lunch)
14:00 Hands-on coaching to apply Score-P/Scalasca and mpiP to analyze participants' own code(s).
15:30 (break)
16:00 Hands-on coaching to apply Score-P/Scalasca and mpiP to analyze participants' own code(s).
17:30 (adjourn)

Day 4: Thursday 13 June
09:00 BSC performance tools [Germán Llort & Lau Mercadal, BSC]
  • Tools installation
  • BSC tools hands-on exercises
10:30 (break)
11:00 TAU performance system [Sameer Shende, UOregon]
  • TAU hands-on exercises
  • Review [TBD]
12:30 (lunch)
14:00 Hands-on coaching to apply BSC tools & TAU to analyze participants' own code(s).
15:30 (break)
16:00 Hands-on coaching to apply BSC tools & TAU to analyze participants' own code(s).
17:30 (adjourn)

Hardware and Software Platforms

CooLMUC2: Haswell-based x86 Linux cluster system:

  • 28-way Haswell-based compute nodes and FDR14 InfiniBand interconnect, used for both serial and parallel processing

The local HPC system CooLMUC2 is the primary platform for the workshop and will be used for the hands-on exercises. Course accounts will be provided during the workshop to participants without existing accounts. Other systems with up-to-date installations of the tools can also be used if preferred, though support may be limited and participants are expected to already have user accounts on non-local systems. Whichever systems they intend to use, participants should be familiar with the relevant procedures for compiling and running their parallel applications (via batch queues where appropriate).

Registration

Register via the course website.

The workshop will be held at the Leibniz Rechenzentrum (LRZ) on the university campus outside Garching near Munich, approximately 25 minutes north of Munich city centre. The U-Bahn line U6 (station: Garching-Forschungszentrum) provides a direct connection from the campus to both Munich and Garching.
Getting to/from LRZ

It is recommended to choose a hotel in Garching or Munich city centre and use the U-Bahn to reach LRZ.
Accommodation in Garching
Accommodation in Munich

Contact

Local Arrangements

Volker Weinberg
Leibniz Supercomputing Centre
Garching-bei-München
Email: education[at]lrz.de

Tuning Workshop Series

Cédric Valensi
Université de Versailles Paris Saclay
Email: cedric.valensi[at]uvsq.fr