<!DOCTYPE html>
<html class="writer-html5" lang="en" >
<head>
  <meta charset="utf-8" /><meta name="generator" content="Docutils 0.17.1: http://docutils.sourceforge.net/" />

  <meta name="viewport" content="width=device-width, initial-scale=1.0" />
  <title>Updates in 2021.3 &mdash; NsightCompute 12.4 documentation</title>
      <link rel="stylesheet" href="../../_static/pygments.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/css/theme.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/design-style.b7bb847fb20b106c3d81b95245e65545.min.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/omni-style.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/api-styles.css" type="text/css" />
    <link rel="shortcut icon" href="../../_static/nsight-compute.ico"/>
  <!--[if lt IE 9]>
    <script src="../../_static/js/html5shiv.min.js"></script>
  <![endif]-->
  
        <script data-url_root="../../" id="documentation_options" src="../../_static/documentation_options.js"></script>
        <script src="../../_static/jquery.js"></script>
        <script src="../../_static/underscore.js"></script>
        <script src="../../_static/doctools.js"></script>
        <script src="../../_static/mermaid-init.js"></script>
        <script src="../../_static/design-tabs.js"></script>
        <script src="../../_static/version.js"></script>
        <script src="../../_static/social-media.js"></script>
    <script src="../../_static/js/theme.js"></script>
    <link rel="index" title="Index" href="../../genindex.html" />
    <link rel="search" title="Search" href="../../search.html" />
 


</head>

<body class="wy-body-for-nav"> 
  <div class="wy-grid-for-nav">
    <nav data-toggle="wy-nav-shift" class="wy-nav-side">
      <div class="wy-side-scroll">
        <div class="wy-side-nav-search" >


  <a href="../../index.html">
  <img src="../../_static/nsight-compute.png" class="logo" alt="Logo"/>
</a>

<div role="search">
  <form id="rtd-search-form" class="wy-form" action="../../search.html" method="get">
    <input type="text" name="q" placeholder="Search docs" />
    <input type="hidden" name="check_keywords" value="yes" />
    <input type="hidden" name="area" value="default" />
  </form>
</div>
        </div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
              <p class="caption" role="heading"><span class="caption-text">Nsight Compute</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../index.html">1. Release Notes</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../ProfilingGuide/index.html">2. Kernel Profiling Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightCompute/index.html">3. Nsight Compute</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightComputeCli/index.html">4. Nsight Compute CLI</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Developer Interfaces</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CustomizationGuide/index.html">1. Customization Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NvRulesAPI/index.html">2. NvRules API</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Training</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Training/index.html">Training</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Release Information</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Archives/index.html">Archives</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Copyright and Licenses</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CopyrightAndLicenses/index.html">Copyright and Licenses</a></li>
</ul>

        </div>
      </div>
    </nav>

    <section data-toggle="wy-nav-shift" class="wy-nav-content-wrap"><nav class="wy-nav-top" aria-label="Mobile navigation menu" >
          <i data-toggle="wy-nav-top" class="fa fa-bars"></i>
          <a href="../../index.html">NsightCompute</a>
      </nav>

      <div class="wy-nav-content">
        <div class="rst-content">
          <div role="navigation" aria-label="Page navigation">
  <ul class="wy-breadcrumbs">


<li><a href="../../index.html" class="icon icon-home"></a> &raquo;</li>
<li>Updates in 2021.3</li>

      <li class="wy-breadcrumbs-aside">
      </li>
<li class="wy-breadcrumbs-aside">


  <span>v2024.1.1 |</span>



  <a href="https://developer.nvidia.com/nsight-compute-history" class="reference external">Archive</a>


  <span>&nbsp;</span>
</li>

  </ul>
  <hr/>
</div>
          <div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
           <div itemprop="articleBody">
             
  <section id="updates-in-2021-3">
<h1>Updates in 2021.3<a class="headerlink" href="#updates-in-2021-3" title="Permalink to this headline"></a></h1>
<p><strong>General</strong></p>
<ul class="simple">
<li><p>Added support for the CUDA toolkit 11.5.</p></li>
<li><p>Added a new rule for detecting inefficient memory access patterns in the L1TEX cache and L2 cache.</p></li>
<li><p>Added a new rule for detecting high usage of system or peer memory.</p></li>
<li><p>Added a new <code class="docutils literal notranslate"><span class="pre">IAction::sass_by_pc</span></code> function to the <a class="reference external" href="../NvRulesAPI/index.html#abstract">NvRules API</a>.</p></li>
<li><p>The <a class="reference external" href="../CustomizationGuide/index.html#python-report-interface">Python-based report interface</a> is now also available on Windows and macOS hosts.</p></li>
<li><p>Added Hierarchical Roofline section files in a new “roofline” section set.</p></li>
<li><p>Added support for collecting CPU call stack information.</p></li>
</ul>
<p><strong>NVIDIA Nsight Compute</strong></p>
<ul class="simple">
<li><p>Added support for new remote profiling <a class="reference external" href="../NsightCompute/index.html#remote-connections">SSH connection and authentication options</a> as well as local SSH configuration files.</p></li>
<li><p>Added an <a class="reference external" href="../NsightCompute/index.html#occupancy-calculator">Occupancy Calculator</a> which can be opened directly from a profile report or as a new activity. It offers feature parity with the CUDA Occupancy Calculator <a class="reference external" href="http://docs.nvidia.com/cuda/cuda-occupancy-calculator/index.html">spreadsheet</a>.</p></li>
<li><p>Added a new <a class="reference external" href="../NsightCompute/index.html#tool-window-baselines">Baselines tool window</a> to manage (hide, update, re-order, save/load) baseline selections.</p></li>
<li><p>The Source page views now support multi-line/cell selection and copy/paste. Different colors are used for highlighting selections and correlated lines.</p></li>
<li><p>The search edit on the Source page now supports <em>Shift+Enter</em> to search in the reverse direction.</p></li>
<li><p>The <a class="reference external" href="../ProfilingGuide/index.html#memory-chart">Memory Workload Analysis Chart</a> can be configured to show throughput values instead of transferred bytes.</p></li>
<li><p>The <em>Profile</em> activity now supports the <code class="docutils literal notranslate"><span class="pre">--devices</span></code> option.</p></li>
<li><p>The <em>NVLink Topology</em> diagram displays per-NVLink metrics.</p></li>
<li><p>Added a new tool window showing the CPU call stack at the location where the current thread was suspended during interactive profiling activities.</p></li>
<li><p>If enabled, the <em>Call Stack / NVTX</em> page of the profile report shows the captured CPU call stack for the selected kernel launch.</p></li>
</ul>
<p><strong>NVIDIA Nsight Compute CLI</strong></p>
<ul class="simple">
<li><p>Added support for printing source/metric content with the new <code class="docutils literal notranslate"><span class="pre">--page</span> <span class="pre">source</span></code> and <code class="docutils literal notranslate"><span class="pre">--print-source</span></code> <a class="reference external" href="../NsightComputeCli/index.html#command-line-options-console-output">command line options</a>.</p></li>
<li><p>Added a new option <code class="docutils literal notranslate"><span class="pre">--call-stack</span></code> to enable collecting the CPU call stack for every profiled kernel launch.</p></li>
</ul>
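<p>As an illustration of how the new CLI options listed above might be combined, the following Python sketch assembles two <code class="docutils literal notranslate"><span class="pre">ncu</span></code> command lines: one that profiles an application with CPU call stack collection, and one that prints the source page of a saved report. The helper names, the application path and the report name are hypothetical. The <code class="docutils literal notranslate"><span class="pre">sass</span></code> value for <code class="docutils literal notranslate"><span class="pre">--print-source</span></code> and the use of <code class="docutils literal notranslate"><span class="pre">-o</span></code> and <code class="docutils literal notranslate"><span class="pre">--import</span></code> are assumptions to check against the command line reference for your version.</p>

```python
import shlex
from typing import List, Optional


def build_profile_command(app: List[str],
                          report: Optional[str] = None,
                          call_stack: bool = False) -> List[str]:
    """Assemble an ncu profiling command line (hypothetical helper)."""
    cmd = ["ncu"]
    if call_stack:
        # Collect the CPU call stack for every profiled kernel launch.
        cmd.append("--call-stack")
    if report:
        # Assumption: -o names the report file to write.
        cmd += ["-o", report]
    return cmd + app


def build_source_print_command(report_file: str,
                               view: str = "sass") -> List[str]:
    """Assemble an ncu command that prints the source page of a report.

    Assumption: "sass" is an accepted value for --print-source.
    """
    return ["ncu", "--import", report_file,
            "--page", "source", "--print-source", view]


if __name__ == "__main__":
    # "./my_app" and "my_report" are placeholder names.
    profile = build_profile_command(["./my_app"], report="my_report",
                                    call_stack=True)
    print(shlex.join(profile))
    print(shlex.join(build_source_print_command("my_report.ncu-rep")))
```

<p>The helpers only build argument lists and do not run anything; a list can be passed to <code class="docutils literal notranslate"><span class="pre">subprocess.run</span></code> on a system where <code class="docutils literal notranslate"><span class="pre">ncu</span></code> is installed.</p>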
<p><strong>Resolved Issues</strong></p>
<ul class="simple">
<li><p>Fixed that <code class="docutils literal notranslate"><span class="pre">memory_*</span></code> metrics could not be collected with the <code class="docutils literal notranslate"><span class="pre">--metrics</span></code> option.</p></li>
<li><p>Fixed that selection and copy/paste were not supported for section header tables on the Details page.</p></li>
<li><p>Fixed issues with the Source page when collapsing the content.</p></li>
<li><p>Fixed that the UI could crash when applying rules to a new profile result.</p></li>
<li><p>Fixed that PC Sampling metrics were not available for <em>Profile Series</em>.</p></li>
<li><p>Fixed that local profiling did not work if no non-loopback address was configured for the system.</p></li>
<li><p>Fixed termination of remote-launched applications. On QNX, terminating an application profiled via <em>Remote Launch</em> is now supported. Canceling remote-launched <em>Profile</em> activities is now supported.</p></li>
</ul>
</section>


           </div>
          </div>
          <footer>

  <hr/>

  <div role="contentinfo">
    <p>&#169; Copyright 2018-2024, NVIDIA Corporation &amp; Affiliates. All rights reserved.
      <span class="lastupdated">Last updated on Mar 06, 2024.
      </span></p>
  </div>

   

</footer>
        </div>
      </div>
    </section>
  </div>
  <script>
      jQuery(function () {
          SphinxRtdTheme.Navigation.enable(true);
      });
  </script>
 



</body>
</html>