All

Jan Eitzinger from the HPC group is co-organising the WPMVP 2020 Workshop on Programming Models for SIMD/Vector Processing, which is co-located with the PPoPP Conference 2020 (Principles and Practice of Parallel Programming). The conference takes place from Saturday, 22 February to Wednesday, 26 February 2020 in San Diego...

Category: Allgemein, HPC

This year at SC19 in Denver, CO, members of our group will be involved in numerous contributions: Our master's student Jan Laukemann will present the paper “Automatic Throughput and Critical Path Analysis of x86 and ARM Assembly Kernels” at the PMBS 2019 workshop. It describes recent improvements to ...

Category: Allgemein, HPC

The first "Performance Evaluation Workshop" of the EoCoE-II project was successfully conducted at FAU from October 7-10, 2019. The workshop gave an introduction to node-level performance engineering and to parallel performance analysis using the Paraver tool. EoCoE-II is the second phase of t...

Category: HPC, Teaching

In the "HPC Village" at the Lange Nacht der Wissenschaften (Long Night of the Sciences) on October 19, 2019, RRZE and its partners present the latest technologies along with a range of fascinating real-world HPC application fields. Biology: How does a cell membrane work? Climate research: How...

Category: Allgemein, HPC

Our paper "Performance Engineering for a Tall & Skinny Matrix Multiplication Kernel on GPUs" by Dominik Ernst, Georg Hager, Jonas Thies, and Gerhard Wellein just received the best workshop paper award at PPAM 2019, the 13th International Conference on Parallel Processing and Applied Mathematics,...

Category: Allgemein, HPC

We are pleased to announce an invited talk by Benjamin Huth from the University of Regensburg about "Vectorized matrix-multiplication on the NEC SX-Aurora." The NEC SX-Aurora "Tsubasa" is NEC's latest vector processor architecture. Last year, RRZE acquired a system with two "Tsubasa" vector a...

Category: All, Systems

The Bandwidth Benchmark is a collection of simple streaming kernels. While it may be used for microbenchmarking, we mainly want to use it in teaching. It is heavily inspired by John McCalpin's STREAM benchmark. It comprises eight typical streaming kernels with different data access patterns for m...

Category: HPC, Tools
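
To illustrate what a "simple streaming kernel" in the Bandwidth Benchmark entry above can look like, here is a minimal sketch in C of a triad-style loop in the spirit of McCalpin's STREAM. It is not code from the Bandwidth Benchmark itself: the function names `triad`, `triad_bytes`, and `bandwidth_gbs` are hypothetical, and the traffic estimate assumes two loads and one store per element, ignoring any write-allocate transfers.

```c
#include <stddef.h>

/* Triad-style streaming kernel: a[i] = b[i] + s * c[i].
 * Hypothetical helper for illustration, not taken from the benchmark. */
static void triad(double *restrict a, const double *restrict b,
                  const double *restrict c, double s, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        a[i] = b[i] + s * c[i];
    }
}

/* Bytes moved by one triad sweep over n elements, assuming two loads
 * and one store of a double per element (no write-allocate traffic). */
static double triad_bytes(size_t n)
{
    return 3.0 * (double)sizeof(double) * (double)n;
}

/* Convert a data volume in bytes and a runtime in seconds to GB/s. */
static double bandwidth_gbs(double bytes, double seconds)
{
    return bytes / seconds / 1.0e9;
}
```

In a measurement one would time many sweeps of `triad` over arrays much larger than the last-level cache and pass the total data volume and elapsed time to `bandwidth_gbs`. Which timer to use and how many repetitions to run are left open here.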

LIKWID 4.3.4 is a bugfix release. The relevant changes are: for systems using Intel Cluster-on-Die (CoD) or Sub-NUMA Clustering (SNC), a fix for detecting PCI devices and a workaround for topology detection (the Linux kernel sometimes does not detect it properly); don't pin accessDa...

Category: HPC, Tools