1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184
|
<!DOCTYPE html>
<html class="writer-html5" lang="en" >
<head>
<meta charset="utf-8" /><meta name="generator" content="Docutils 0.17.1: http://docutils.sourceforge.net/" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
<title>Updates in 2023.1 — NsightCompute 12.4 documentation</title>
<link rel="stylesheet" href="../../_static/pygments.css" type="text/css" />
<link rel="stylesheet" href="../../_static/css/theme.css" type="text/css" />
<link rel="stylesheet" href="../../_static/design-style.b7bb847fb20b106c3d81b95245e65545.min.css" type="text/css" />
<link rel="stylesheet" href="../../_static/omni-style.css" type="text/css" />
<link rel="stylesheet" href="../../_static/api-styles.css" type="text/css" />
<link rel="shortcut icon" href="../../_static/nsight-compute.ico"/>
<!--[if lt IE 9]>
<script src="../../_static/js/html5shiv.min.js"></script>
<![endif]-->
<script data-url_root="../../" id="documentation_options" src="../../_static/documentation_options.js"></script>
<script src="../../_static/jquery.js"></script>
<script src="../../_static/underscore.js"></script>
<script src="../../_static/doctools.js"></script>
<script src="../../_static/mermaid-init.js"></script>
<script src="../../_static/design-tabs.js"></script>
<script src="../../_static/version.js"></script>
<script src="../../_static/social-media.js"></script>
<script src="../../_static/js/theme.js"></script>
<link rel="index" title="Index" href="../../genindex.html" />
<link rel="search" title="Search" href="../../search.html" />
</head>
<body class="wy-body-for-nav">
<div class="wy-grid-for-nav">
<nav data-toggle="wy-nav-shift" class="wy-nav-side">
<div class="wy-side-scroll">
<div class="wy-side-nav-search" >
<a href="../../index.html">
<img src="../../_static/nsight-compute.png" class="logo" alt="Logo"/>
</a>
<div role="search">
<form id="rtd-search-form" class="wy-form" action="../../search.html" method="get">
<input type="text" name="q" placeholder="Search docs" />
<input type="hidden" name="check_keywords" value="yes" />
<input type="hidden" name="area" value="default" />
</form>
</div>
</div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
<p class="caption" role="heading"><span class="caption-text">Nsight Compute</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../index.html">1. Release Notes</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../ProfilingGuide/index.html">2. Kernel Profiling Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightCompute/index.html">3. Nsight Compute</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightComputeCli/index.html">4. Nsight Compute CLI</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Developer Interfaces</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CustomizationGuide/index.html">1. Customization Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NvRulesAPI/index.html">2. NvRules API</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Training</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Training/index.html">Training</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Release Information</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Archives/index.html">Archives</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Copyright and Licenses</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CopyrightAndLicenses/index.html">Copyright and Licenses</a></li>
</ul>
</div>
</div>
</nav>
<section data-toggle="wy-nav-shift" class="wy-nav-content-wrap"><nav class="wy-nav-top" aria-label="Mobile navigation menu" >
<i data-toggle="wy-nav-top" class="fa fa-bars"></i>
<a href="../../index.html">NsightCompute</a>
</nav>
<div class="wy-nav-content">
<div class="rst-content">
<div role="navigation" aria-label="Page navigation">
<ul class="wy-breadcrumbs">
<li><a href="../../index.html" class="icon icon-home"></a> »</li>
<li>Updates in 2023.1</li>
<li class="wy-breadcrumbs-aside">
</li>
<li class="wy-breadcrumbs-aside">
<span>v2024.1.1 |</span>
<a href="https://developer.nvidia.com/nsight-compute-history" class="reference external">Archive</a>
<span> </span>
</li>
</ul>
<hr/>
</div>
<div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
<div itemprop="articleBody">
<section id="updates-in-2023-1">
<h1>Updates in 2023.1<a class="headerlink" href="#updates-in-2023-1" title="Permalink to this headline"></a></h1>
<p><strong>General</strong></p>
<ul class="simple">
<li><p>Added support for the CUDA toolkit 12.1.</p></li>
<li><p>Added a new <a class="reference external" href="../ProfilingGuide/index.html#application-range-replay">app-range</a> replay mode to profile ranges without API capture by relaunching the entire application multiple times.</p></li>
<li><p>Added <em>sharedBankConflicts</em> sample CUDA application and document to show how NVIDIA Nsight Compute can be used to analyze and identify the shared memory bank conflicts which result in inefficient shared memory accesses. Refer to the <code class="docutils literal notranslate"><span class="pre">README.TXT</span></code> file, sample code and document under <code class="docutils literal notranslate"><span class="pre">extras/samples/sharedBankConflicts</span></code>.</p></li>
<li><p>Jupyter notebook samples are available in the Nsight training <a class="reference external" href="https://github.com/NVIDIA/nsight-training/blob/master/cuda/nsight_compute/python_report_interface">github repository</a>.</p></li>
<li><p>The equivalent of the <a class="reference external" href="../CustomizationGuide/index.html#python-report-interface-high-level">high-level Python report interface</a> is now available in rule files.</p></li>
</ul>
<p><strong>NVIDIA Nsight Compute</strong></p>
<ul class="simple">
<li><p>Added support for profiling individual metrics in <a class="reference external" href="../NsightCompute/index.html#connection-activity-interactive">Interactive Profile activity</a>. A new input field for metrics was added in the <a class="reference external" href="../NsightCompute/index.html#tool-window-sections-info">Metric Selection</a> tool window.</p></li>
<li><p>Files on remote systems can be opened directly from the <a class="reference external" href="../NsightCompute/index.html#main-menu">menu</a>.</p></li>
<li><p>Metric- and section-related entries in the menu, <a class="reference external" href="../NsightCompute/index.html#connection-activity-non-interactive">Profile activity</a> and <a class="reference external" href="../NsightCompute/index.html#tool-window-sections-info">Metric Selection</a> tool window were renamed to make them more clear.</p></li>
<li><p>CPU and GPU <a class="reference external" href="../ProfilingGuide/index.html#metrics-reference">NUMA topology metrics</a> can be collected on applicable systems. Topology information is shown in a new <a class="reference external" href="../ProfilingGuide/index.html#sections-and-rules">NUMA Affinity section</a>.</p></li>
<li><p>Added content-aware suggestions to the Details page to provide suggestions based on the selected profiling options.</p></li>
<li><p>Added support for <a class="reference external" href="../NsightCompute/index.html#profiler-report-source-page-navigation">re-resolving source files</a> on the Source page.</p></li>
<li><p>Not-issued warp stall reasons are removed from the Source Counters section tables and hidden by default on the Source page. Users should focus on regular warp stall reasons by default and only inspect not-issued samples if this distinction is needed.</p></li>
<li><p>Added support to search missing CUDA source files to permanently import into the report using <a class="reference external" href="../NsightCompute/index.html#options-source-lookup">Source Lookup options</a> in the <a class="reference external" href="../NsightCompute/index.html#connection-activity-interactive">Interactive Profile activity</a>.</p></li>
<li><p>The <a class="reference external" href="../NsightCompute/index.html#profiler-report-source-page-metrics">source page</a> now shows metric values as percentages by default. New buttons are added to support switching between different value modes.</p></li>
</ul>
<p><strong>NVIDIA Nsight Compute CLI</strong></p>
<ul class="simple">
<li><p>Added support for config files in the current working or user directory to set default ncu parameters. See the <a class="reference external" href="../NsightComputeCli/index.html#command-line-options-general">General options</a> for more details.</p></li>
<li><p>Added <code class="docutils literal notranslate"><span class="pre">--range-filter</span></code><a class="reference external" href="../NsightComputeCli/index.html#command-line-options-console-output">command line option</a> which allows to select subset of enabled profile ranges.</p></li>
<li><p>Added new <code class="docutils literal notranslate"><span class="pre">--source-folders</span></code><a class="reference external" href="../NsightComputeCli/index.html#command-line-options-profile">command line option</a> that allows to recursively search for missing CUDA source files to permanently import into the report.</p></li>
</ul>
<p><strong>Resolved Issues</strong></p>
<ul class="simple">
<li><p>Fixed performance issues on the Summary and Raw pages for large reports.</p></li>
<li><p>Improved support for non-ASCII characters in filenames.</p></li>
<li><p>Fixed an issue with delayed updates of assembly analysis information on the Source page’s Source and PTX views.</p></li>
<li><p>Fixed potential crashes when using the Python report interface.</p></li>
</ul>
</section>
</div>
</div>
<footer>
<hr/>
<div role="contentinfo">
<p>© Copyright 2018-2024, NVIDIA Corporation & Affiliates. All rights reserved.
<span class="lastupdated">Last updated on Mar 06, 2024.
</span></p>
</div>
</footer>
</div>
</div>
</section>
</div>
<script>
jQuery(function () {
SphinxRtdTheme.Navigation.enable(true);
});
</script>
</body>
</html>
|