Parallel Forall blog posts

CUDA pro tips

Compilers

Tools

nvvp does not support newer GPUs (starting with Volta/Turing, sm_7*). Both nvvp (GUI) and nvprof (CLI) are marked as "deprecated" in the documentation; they are being replaced by Nsight Compute (for profiling individual CUDA kernels) and Nsight Systems (for system-wide profiling, including CPU activity and the timeline view). The new tools are finally Java-free, but they still have some quirks (e.g. remote connections ignore .ssh/config). I also could not find a way to export a report to PDF. Both tools require at least sm_7*. Their GUIs can be launched with the ncu-ui and nsys-ui commands.
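
As a rough illustration of the rule of thumb above, the following sketch maps a GPU's compute capability to the profilers worth trying. The function name `suggested_profilers` is hypothetical, and the sm_7* cutoff is taken directly from this note rather than from the official support matrix, so treat it as an assumption and check the documentation for your CUDA version.

```python
from typing import List


def suggested_profilers(major: int, minor: int = 0) -> List[str]:
    """Return the profiler commands this note suggests for a compute capability.

    Assumption (from the note above, not from official docs): GPUs with
    compute capability 7.x or newer use the Nsight tools, older ones fall
    back to the deprecated nvprof/nvvp pair.
    """
    if major < 0 or minor < 0:
        raise ValueError("compute capability components must be non-negative")
    if major >= 7:
        # ncu = Nsight Compute CLI, nsys = Nsight Systems CLI
        return ["ncu", "nsys"]
    # Deprecated tools, still the fallback for pre-sm_7* devices
    return ["nvprof", "nvvp"]


if __name__ == "__main__":
    for cc in [(6, 1), (7, 0), (7, 5), (8, 6)]:
        tools = ", ".join(suggested_profilers(*cc))
        print(f"sm_{cc[0]}{cc[1]}: {tools}")
```

The compute capability itself has to come from elsewhere (for example from the deviceQuery CUDA sample); the sketch only encodes the decision.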

Miscellaneous

Benchmarks