Posts by Jiri Kraus
Simulation / Modeling / Design
Jan 22, 2021
Accelerating NVSHMEM 2.0 Team-Based Collectives Using NCCL
NVSHMEM 2.0 is introducing a new API for performing collective operations based on the Team Management feature of the OpenSHMEM 1.5 specification. A team is a...
9 MIN READ
Simulation / Modeling / Design
Nov 17, 2014
Increase Performance with GPU Boost and K80 Autoboost
NVIDIA® GPU Boost™ is a feature available on NVIDIA® GeForce® and Tesla® GPUs that boosts application performance by increasing GPU core and memory...
11 MIN READ
Simulation / Modeling / Design
Jun 19, 2014
CUDA Pro Tip: Profiling MPI Applications
When I profile MPI+CUDA applications, sometimes performance issues only occur for certain MPI ranks. To fix these, it's necessary to identify the MPI rank where...
4 MIN READ
Simulation / Modeling / Design
Jun 03, 2014
Accelerating a C++ CFD Code with OpenACC
Computational Fluid Dynamics (CFD) is a valuable tool to study the behavior of fluids. Today, many areas of engineering use CFD. For example, the automotive...
12 MIN READ
Simulation / Modeling / Design
Sep 03, 2013
CUDA Pro Tip: Generate Custom Application Profile Timelines with NVTX
The last time you used the timeline feature in the NVIDIA Visual Profiler, Nsight VSE or the new Nsight Systems to analyze a complex application, you might have...
9 MIN READ
Simulation / Modeling / Design
Mar 27, 2013
Benchmarking CUDA-Aware MPI
I introduced CUDA-aware MPI in my last post, with an introduction to MPI and a description of the functionality and benefits of CUDA-aware MPI. In this post I...
8 MIN READ