MARMOT

OrganizationTechnische Universität Dresden and Universität Stuttgart
DescriptionMARMOT is a tool for analyzing and checking of MPI programs. During runtime it surveys the MPI-calls made and automatically checks the correct usage of these calls and their arguments. The observations are saved for later post-mortem analysis.
Programming modelsMPI
LicenseOpen source: GPL
URLhttp://www.hlrs.de/organization/amt/projects/marmot/


ompP

OrganizationUniversity of Tennessee
DescriptionompP is a profiling tool for OpenMP applications written in C/C++ or FORTRAN. ompP works with most UNIX-like operating systems and OpenMP compilers. The profiling report of ompP becomes available immediately after program termination in a human-readable ASCII text format. ompP supports the measurement of hardware performance counters using PAPI and contains several advanced productivity features such as overhead analysis and detection of common inefficiency situations (performance properties).
Programming modelsOpenMP
LicenseGNU General Public License (GPL v.2)
URLhttp://www.ompp-tool.com


OPARI

OrganizationForschungszentrum Jülich
DescriptionOPARI is a source-to-source translation tool that automatically inserts calls to the POMP profiling interface into the source code of OpenMP applications. OPARI works with Fortran, C, and C++ programs. The POMP interface can be implemented by tool builders who want, for example, to monitor the performance of OpenMP applications. OPARI is based on the idea of OpenMP pragma / directive rewriting. OPARI is used by many performance analysis tools (e.g. TAU, ompP, KOJAK, Scalasca, VampirTrace).
Programming modelsOpenMP
LicenseOpen source: BSD
URLhttp://www.fz-juelich.de/zam/kojak/opari


Open Trace Format (OTF)

OrganizationTechnische Universität Dresden
DescriptionThe Open Trace Format (OTF) is an open source API and library for reading and writing of event traces. It supports all common record types as used in most tools as well as a number of additional ones and it is open to future extensions. OTF's underlying storage scheme is able to use multiple streams, i.e. multiple files, for a parallel trace which allows efficient parallel I/O.
Programming modelsgeneral/all
LicenseOpen source: BSD
URLhttp://www.tu-dresden.de/zih/otf/


PAPI

OrganizationUniversity of Tennessee
DescriptionPAPI is a cross-platform interface to the hardware performance counters available on most modern microprocessors. In addition to defining a standard set of routines for configuring and accessing the counters, PAPI defines a common set of performance events considered most useful for application performance tuning. These events include operation and cycle counts, cache and memory access events, and branch behavior events. Most recently, PAPI has been extended to PAPI-C (component PAPI), which provides simultaneous access to multiple counter domains, including the previous on-processor counters as well as off-processor counters and sensors such as network counters and temperature sensors.
Programming modelsFortran and C calling interfaces
LicenseOpen source: New BSD
URLhttp://icl.cs.utk.edu/papi/


Periscope

OrganizationTechnische Universität München
DescriptionPeriscope is a distributed automatic on-line performance analysis system for large-scale parallel systems. It consists of a frontend and a hierarchy of communication and analysis agents. Each of the analysis agents searches autonomously for inefficiencies in a subset of the application processes. Using a convenient graphical user interface, users can start up the analysis process and inspect the resulting performance data. The GUI is developed as a plug-in for Eclipse so that the developer can also take advantage of other available programming tools within the IDE.
Programming modelsMPI and OpenMP
LicenseOpen source: BSD
URLhttp://www.lrr.in.tum.de/~petkovve/psc/


Scalasca

OrganizationForschungszentrum Jülich
DescriptionScalasca is an open-source toolset that can be used to analyze the performance behavior of parallel applications and to identify opportunities for optimization. It has been specifically designed for use on large-scale systems including IBM Blue Gene and Cray XT, but is also well-suited for small- and medium-scale HPC platforms. Scalasca integrates runtime summaries with in-depth studies of concurrent behavior via event tracing. A distinctive feature is the ability to identify wait states that occur, for example, as a result of unevenly distributed workloads.
Programming modelsMPI and OpenMP
LicenseOpen source: New BSD
URLhttp://www.scalasca.org


VAMPIR

OrganizationTechnische Universität Dresden
DescriptionThe VAMPIR software tool provides an easy-to-use framework that enables developers to quickly display and analyze arbitrary program behavior at any level of detail. The tool suite implements optimized event analysis algorithms and customizable displays that enable fast and interactive rendering of very complex performance monitoring data.
Programming modelsMPI and OpenMP
LicenseCommercial
URLhttp://www.vampir.eu


VampirTrace

OrganizationTechnische Universität Dresden
DescriptionDuring a program run of an application, VampirTrace generates a trace file, which can be analyzed and visualized by the visualization tool Vampir. The VampirTrace library allows MPI communication events of a parallel program to be recorded as a trace file. Additionally, certain program-specific events can be included. VampirTrace was derived from the KOJAK trace library EPILOG.
Programming modelsMPI and OpenMP
LicenseOpen source: BSD
URLhttp://www.tu-dresden.de/zih/vampirtrace