Virtual Institute — High Productivity Supercomputing

32nd VI-HPS Tuning Workshop (Uni. Bristol, England)

Date

Wednesday 24th - Friday 26th April, 2019.

Location

The workshop will take place at the University of Bristol, Dept. Computer Science, room 1.11a, Merchant Venturers Building, Woodland Road, Bristol, BS8 1UB, England, UK.

Co-organizing Institutions

EPCC PRACE Uni Bristol

Goals

This workshop is organised by VI-HPS for the UK PRACE Training Centre to:

  • give an overview of the VI-HPS programming tools suite
  • explain the functionality of individual tools, and how to use them effectively
  • offer hands-on experience and expert assistance using the tools

On completion participants should be familiar with common performance analysis and diagnosis techniques and how they can be employed in practice (on a range of HPC systems). Those who prepared their own application test cases will have been coached in the tuning of their measurement and analysis, and provided optimization suggestions.

Programme Overview

Presentations and hands-on sessions are on the following topics:

  • BSC tools for trace analysis and performance prediction
  • Score-P instrumentation and measurement
  • Scalasca automated trace analysis
  • TAU performance system

A brief overview of the capabilities of these and associated tools is provided in the VI-HPS Tools Guide.

The workshop will be held in English and run from 09:00 to not later than 17:30 each day, with breaks for lunch and refreshments. There is no fee for participation, however, participants are responsible for their own travel and accommodation.

Classroom capacity is limited, therefore priority will be given to applicants with MPI, OpenMP and hybrid OpenMP+MPI parallel codes already running on the workshop computer systems, and those bringing codes from similar systems to work on. Attendees will need to bring their own notebook computers (with SSH and X11 configured) and use (eduroam) wifi to connect to the workshop computer systems.

Outline

The workshop introduces tools that provide a practical basis for portable performance analysis of parallel application execution, covering both profiling and tracing. It will be delivered as a series of presentations with associated hands-on practical exercises using the ARM-based Isambard Cray XC50 computer.

While analysis of provided example codes will be used to guide the class through the relevant steps and familiarise with usage of the tools, coaching will also be available to assist participants to analyse their own parallel application codes and may suggest opportunities for improving their execution performance and scalability.

Programme (preliminary)

Day 1: Wednesday 24th April
09:30 Welcome messages [James Price, UBristol]
  • ARCHER Training Courses [Evgenij Belikov, EPCC]
  • 09:45 Introduction
  • Introduction to VI-HPS & overview of tools [Markus Geimer, JSC]
  • Introduction to parallel performance engineering
  • Lab setup
  • Isambard Cray XC50 computer system and software environment
  • [James Price, UBristol]
  • Building and running NPB-MZ-MPI/BT-MZ on Isambard Cray XC50
  • [Markus Geimer, JSC]
  • Archer Cray XC30 computer system and software environment
  • Building and running NPB-MZ-MPI/BT-MZ on Archer Cray XC30
  • 11:00 (break)
    11:30 Cray tools [Kevin Roy, Cray]
  • Building and running your own code on Isambard
  • 13:00
    (lunch)
    14:00 BSC performance tools [Judit Giménez & Lau Mercadal, BSC]
  • BSC tools hands-on exercises
  • 15:30 (break)
    16:00 Hands-on coaching to apply tools to analyze participants' own code(s).
    17:15 Review of day and schedule for remainder of workshop
    17:30 (adjourn)

    Day 2: Thursday 25th April
    09:30 Instrumentation & measurement with Score-P [Markus Geimer, JSC]
  • Score-P hands-on exercises
    Execution profile analysis report exploration with CUBE [JSC]
  • CUBE hands-on exercises
  • Score-P analysis scoring & measurement filtering [Markus Geimer, JSC]
  • Score-P hands-on exercises
    Score-P specialised measurement
  • 11:00 (break)
    11:30 TAU performance system [Sameer Shende, UOregon]
  • TAU hands-on exercises
  • 13:00
    (lunch)
    14:00 Hands-on coaching to apply tools to analyze participants' own code(s).
    17:30 (adjourn)

    Day 3: Friday 26th April
    09:30 Automated trace analysis with Scalasca [Markus Geimer, JSC]
  • Scalasca hands-on exercises
  • 11:00 (break)
    11:30 TAU PerfExplorer [Sameer Shende, UOregon]
  • PerfExplorer hands-on exercises
  • Review of workshop
    13:00
    (lunch)
    14:00 Hands-on coaching to apply tools to analyze participants' own code(s).
    17:00 (adjourn)

    Hardware and Software Platforms

    Isambard: Cray XC50 with 164 dual Marvell ThunderX2 32-core 2.1 GHz nodes (64-bit ARMv8-A cores) with 256GB DRAM and Aries dragonfly interconnect, Cray MPI, Cray, GCC & ARM toolchains. Training accounts will be provided!

    ARCHER: Cray XC30 with 3008 compute nodes consisting of two 12-core Intel E5-2697 (IvyBridge) processors sharing 64GB (or 128GB) of NUMA memory, Aries dragonfly interconnect, Cray MPI, Cray, GCC & Intel compilers, PBS Pro job management system. Training accounts will be provided!

    Other systems where up-to-date versions of the tools are installed can also be used when preferred, though support may be limited. Participants are expected to already possess user accounts on non-local systems they intend to use, and should be familiar with the procedures for compiling and running parallel applications.

    Registration

    Registration is via the PRACE training portal.

    Contact

    Local Arrangements

    James Price
    University of Bristol
    E-mail: j.price[at]bristol.ac.uk
        Evgenij Belikov
    EPCC, University of Edinburgh
    Email: E.Belikov[at]epcc.ed.ac.uk

    Tuning Workshop Series

            Brian Wylie
            Jülich Supercomputing Centre
            Email: b.wylie[at]fz-juelich.de