The objective of this course is to learn how /Paraver /and/Dimemas /tools can be used to analyze the performance of parallel applications and to familiarize with the tools usage as well as instrumenting applications.
The tutorial will motivate the audience on the need for portable, efficient programming models that put less pressure on program developers while still getting good performance for clusters and clusters with GPUs.
This tutorial will introduce the audience to the BSC tools for heterogenous programming on FPGA devices. It describes OmpSs@FPGA, as a productive programming environment for compute systems with FPGAs.
The objective of this course is to explain the different components of MareNostrum 4 and to understand the design decisions taken. We also analyze how the system administration is taken in this Petaflop system.
The aim of this course is to provide students with knowledge and hands-on experience in developing applications software for processors with massively parallel computing resources.
This is an expansion of the topic "OpenACC and other approaches to GPU computing" covered on last years editions of the Introduction to CUDA Programming.