<!DOCTYPE html>
<html class="writer-html5" lang="en" >
<head>
<meta charset="utf-8" /><meta name="generator" content="Docutils 0.17.1: http://docutils.sourceforge.net/" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
<title>Updates in 2019.5 — NsightCompute 12.4 documentation</title>
<link rel="stylesheet" href="../../_static/pygments.css" type="text/css" />
<link rel="stylesheet" href="../../_static/css/theme.css" type="text/css" />
<link rel="stylesheet" href="../../_static/design-style.b7bb847fb20b106c3d81b95245e65545.min.css" type="text/css" />
<link rel="stylesheet" href="../../_static/omni-style.css" type="text/css" />
<link rel="stylesheet" href="../../_static/api-styles.css" type="text/css" />
<link rel="shortcut icon" href="../../_static/nsight-compute.ico"/>
<!--[if lt IE 9]>
<script src="../../_static/js/html5shiv.min.js"></script>
<![endif]-->
<script data-url_root="../../" id="documentation_options" src="../../_static/documentation_options.js"></script>
<script src="../../_static/jquery.js"></script>
<script src="../../_static/underscore.js"></script>
<script src="../../_static/doctools.js"></script>
<script src="../../_static/mermaid-init.js"></script>
<script src="../../_static/design-tabs.js"></script>
<script src="../../_static/version.js"></script>
<script src="../../_static/social-media.js"></script>
<script src="../../_static/js/theme.js"></script>
<link rel="index" title="Index" href="../../genindex.html" />
<link rel="search" title="Search" href="../../search.html" />
</head>
<body class="wy-body-for-nav">
<div class="wy-grid-for-nav">
<nav data-toggle="wy-nav-shift" class="wy-nav-side">
<div class="wy-side-scroll">
<div class="wy-side-nav-search" >
<a href="../../index.html">
<img src="../../_static/nsight-compute.png" class="logo" alt="Logo"/>
</a>
<div role="search">
<form id="rtd-search-form" class="wy-form" action="../../search.html" method="get">
<input type="text" name="q" placeholder="Search docs" />
<input type="hidden" name="check_keywords" value="yes" />
<input type="hidden" name="area" value="default" />
</form>
</div>
</div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
<p class="caption" role="heading"><span class="caption-text">Nsight Compute</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../index.html">1. Release Notes</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../ProfilingGuide/index.html">2. Kernel Profiling Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightCompute/index.html">3. Nsight Compute</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightComputeCli/index.html">4. Nsight Compute CLI</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Developer Interfaces</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CustomizationGuide/index.html">1. Customization Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NvRulesAPI/index.html">2. NvRules API</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Training</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Training/index.html">Training</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Release Information</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Archives/index.html">Archives</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Copyright and Licenses</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CopyrightAndLicenses/index.html">Copyright and Licenses</a></li>
</ul>
</div>
</div>
</nav>
<section data-toggle="wy-nav-shift" class="wy-nav-content-wrap"><nav class="wy-nav-top" aria-label="Mobile navigation menu" >
<i data-toggle="wy-nav-top" class="fa fa-bars"></i>
<a href="../../index.html">NsightCompute</a>
</nav>
<div class="wy-nav-content">
<div class="rst-content">
<div role="navigation" aria-label="Page navigation">
<ul class="wy-breadcrumbs">
<li><a href="../../index.html" class="icon icon-home"></a> »</li>
<li>Updates in 2019.5</li>
<li class="wy-breadcrumbs-aside">
</li>
<li class="wy-breadcrumbs-aside">
<span>v2024.1.1 |</span>
<a href="https://developer.nvidia.com/nsight-compute-history" class="reference external">Archive</a>
<span> </span>
</li>
</ul>
<hr/>
</div>
<div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
<div itemprop="articleBody">
<section id="updates-in-2019-5">
<h1>Updates in 2019.5<a class="headerlink" href="#updates-in-2019-5" title="Permalink to this headline"></a></h1>
<p><strong>General</strong></p>
<ul class="simple">
<li><p>Added <em>section sets</em> to reduce the default overhead and make it easier to configure metric sets for profiling</p></li>
<li><p>Reduced the size of the installation</p></li>
<li><p>Added support for the CUDA Graphs Recapture API</p></li>
<li><p>The NvRules API now supports accessing correlation IDs for instanced metrics</p></li>
<li><p>Added breakdown tables for <em>SOL SM</em> and <em>SOL Memory</em> in the Speed Of Light section for Volta+ GPUs</p></li>
</ul>
<p><strong>NVIDIA Nsight Compute</strong></p>
<ul class="simple">
<li><p>Added a snap-select feature to the Source page heatmap to help navigate large files</p></li>
<li><p>Added support for loading remote CUDA-C source files via SSH on demand for Linux x86_64 targets</p></li>
<li><p>Charts on the Details page provide better help in tool tips when hovering over metric names</p></li>
<li><p>Improved the performance of the Source page when scrolling or collapsing</p></li>
<li><p>The charts for Warp States and Compute pipelines are now sorted by value</p></li>
</ul>
<p><strong>NVIDIA Nsight Compute CLI</strong></p>
<ul class="simple">
<li><p>Added support for GPU cache control, see <code class="docutils literal notranslate"><span class="pre">--cache-control</span></code></p></li>
<li><p>Added support for setting the kernel name base in command line output, see <code class="docutils literal notranslate"><span class="pre">--kernel-base</span></code></p></li>
<li><p>Added support for listing the available names for <code class="docutils literal notranslate"><span class="pre">--chips</span></code>, see <code class="docutils literal notranslate"><span class="pre">--list-chips</span></code></p></li>
<li><p>Improved the stability on Windows when using <code class="docutils literal notranslate"><span class="pre">--target-processes</span> <span class="pre">all</span></code></p></li>
<li><p>Reduced the profiling overhead for small metric sets in applications with many kernels</p></li>
</ul>
<p><strong>Resolved Issues</strong></p>
<ul class="simple">
<li><p>Reduced the overhead caused by demangling kernel names multiple times</p></li>
<li><p>Fixed an issue where kernel names were not demangled in the CUDA Graph Nodes resources window</p></li>
<li><p>The connection dialog better disables unsupported combinations or warns of invalid entries</p></li>
<li><p>Fixed metric <em>thread_inst_executed_true</em> to derive from <em>smsp_not_predicated_off_thread_inst_executed</em> on Volta+ GPUs</p></li>
<li><p>Fixed an issue with computing the theoretical occupancy on GV100</p></li>
<li><p>Selecting an entry on the Source page heatmap no longer selects the respective source line, to avoid losing the current selection</p></li>
<li><p>Fixed the current view indicator of the Source page heatmap to be line-accurate</p></li>
<li><p>Fixed an issue when comparing metrics from Pascal and later architectures on the Summary page</p></li>
<li><p>Fixed an issue where metrics representing constant values on Volta+ couldn’t be collected without non-constant metrics</p></li>
</ul>
</section>
</div>
</div>
<footer>
<hr/>
<div role="contentinfo">
<p>© Copyright 2018-2024, NVIDIA Corporation & Affiliates. All rights reserved.
<span class="lastupdated">Last updated on Mar 06, 2024.
</span></p>
</div>
</footer>
</div>
</div>
</section>
</div>
<script>
jQuery(function () {
SphinxRtdTheme.Navigation.enable(true);
});
</script>
</body>
</html>