Updates in 2020.2
General
- Added support for the NVIDIA Ampere GPUs with compute capability 8.6 and CUDA toolkit 11.1. 
- Added support for application replay to collect metric results across multiple application runs, instead of replaying individual kernels. 
- Added new - launch__device_idmetric.
- Added support for NVLink ( - nvl*) metrics for GPUs with compute capabilities 7.0, 7.5 and 8.0
- Added documentation for memory charts and tables in the Profiling Guide. 
NVIDIA Nsight Compute
- Updated menu and toolbar layout. 
- Added support for zoom and pan on roofline charts. 
- The Resources tool window shows the current CUDA stream attributes. 
- The memory chart shows a heatmap for link and port utilization. 
- The hot-spot tables in the Source Counters section now show values as percentages, too. 
- On-demand resolve of remote CUDA-C source is now available for MacOS hosts. 
- Metric columns in the Summary and Raw pages are now sortable. 
- Added a new option to set the number of recent API calls shown in the API Stream tool window. 
NVIDIA Nsight Compute CLI
- CLI output now shows NVTX payload information. 
- CSV output now shows NVTX states. 
- Added a new - --replay-modeoption to select the mechanism used for replaying a kernel launch multiple times.
- Added a new - --killoption to terminate the application once all requested kernels were profiled.
- Added a new - --log-fileoption to decide the output stream for printing tool output.
- Added a new - --check-exit-codeoption to decide if the child application exit code should be checked.
Resolved Issues
- The profiling progress dialog is not dismissed automatically anymore after an error. 
- The inter-process lock is now automatically given write permissions for all users. 
- All project extensions are enabled in the default dialog filter. 
- Fixed handling of targets using tcsh during remote profiling. 
- Fixed handling of quoted application arguments on Windows.