File: updates-2020-1.html

<!DOCTYPE html>
<html class="writer-html5" lang="en" >
<head>
  <meta charset="utf-8" /><meta name="generator" content="Docutils 0.17.1: http://docutils.sourceforge.net/" />

  <meta name="viewport" content="width=device-width, initial-scale=1.0" />
  <title>Updates in 2020.1 &mdash; NsightCompute 12.4 documentation</title>
      <link rel="stylesheet" href="../../_static/pygments.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/css/theme.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/design-style.b7bb847fb20b106c3d81b95245e65545.min.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/omni-style.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/api-styles.css" type="text/css" />
    <link rel="shortcut icon" href="../../_static/nsight-compute.ico"/>
  <!--[if lt IE 9]>
    <script src="../../_static/js/html5shiv.min.js"></script>
  <![endif]-->
  
        <script data-url_root="../../" id="documentation_options" src="../../_static/documentation_options.js"></script>
        <script src="../../_static/jquery.js"></script>
        <script src="../../_static/underscore.js"></script>
        <script src="../../_static/doctools.js"></script>
        <script src="../../_static/mermaid-init.js"></script>
        <script src="../../_static/design-tabs.js"></script>
        <script src="../../_static/version.js"></script>
        <script src="../../_static/social-media.js"></script>
    <script src="../../_static/js/theme.js"></script>
    <link rel="index" title="Index" href="../../genindex.html" />
    <link rel="search" title="Search" href="../../search.html" />
 


</head>

<body class="wy-body-for-nav"> 
  <div class="wy-grid-for-nav">
    <nav data-toggle="wy-nav-shift" class="wy-nav-side">
      <div class="wy-side-scroll">
        <div class="wy-side-nav-search" >


  <a href="../../index.html">
  <img src="../../_static/nsight-compute.png" class="logo" alt="Logo"/>
</a>

<div role="search">
  <form id="rtd-search-form" class="wy-form" action="../../search.html" method="get">
    <input type="text" name="q" placeholder="Search docs" />
    <input type="hidden" name="check_keywords" value="yes" />
    <input type="hidden" name="area" value="default" />
  </form>
</div>
        </div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
              <p class="caption" role="heading"><span class="caption-text">Nsight Compute</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../index.html">1. Release Notes</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../ProfilingGuide/index.html">2. Kernel Profiling Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightCompute/index.html">3. Nsight Compute</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightComputeCli/index.html">4. Nsight Compute CLI</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Developer Interfaces</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CustomizationGuide/index.html">1. Customization Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NvRulesAPI/index.html">2. NvRules API</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Training</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Training/index.html">Training</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Release Information</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Archives/index.html">Archives</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Copyright and Licenses</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CopyrightAndLicenses/index.html">Copyright and Licenses</a></li>
</ul>

        </div>
      </div>
    </nav>

    <section data-toggle="wy-nav-shift" class="wy-nav-content-wrap"><nav class="wy-nav-top" aria-label="Mobile navigation menu" >
          <i data-toggle="wy-nav-top" class="fa fa-bars"></i>
          <a href="../../index.html">NsightCompute</a>
      </nav>

      <div class="wy-nav-content">
        <div class="rst-content">
          <div role="navigation" aria-label="Page navigation">
  <ul class="wy-breadcrumbs">


<li class="wy-breadcrumbs-aside">


  <span>v2024.1.1 |</span>



  <a href="https://developer.nvidia.com/nsight-compute-history" class="reference external">Archive</a>


  <span>&nbsp;</span>
</li>

  </ul>
  <hr/>
</div>
          <div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
           <div itemprop="articleBody">
             
  <section id="updates-in-2020-1">
<h1>Updates in 2020.1<a class="headerlink" href="#updates-in-2020-1" title="Permalink to this headline"></a></h1>
<p><strong>General</strong></p>
<ul class="simple">
<li><p>Added support for the NVIDIA GA100/SM 8.x GPU architecture</p></li>
<li><p>Removed support for the Pascal SM 6.x GPU architecture</p></li>
<li><p>Windows 7 is no longer a supported host or target platform</p></li>
<li><p>Added a rule for reporting uncoalesced memory accesses as part of the <em>Source Counters</em> section</p></li>
<li><p>Added support for report name placeholders %p, %q, %i and %h</p></li>
<li><p>The <a class="reference external" href="../../ProfilingGuide/index.html#abstract">Kernel Profiling Guide</a> was added to the documentation</p></li>
</ul>
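<p>The report name placeholders listed above can be illustrated with a short sketch. It assumes, based on our reading of the CLI documentation for the output option, that <code class="docutils literal notranslate"><span class="pre">%p</span></code> expands to the process ID, <code class="docutils literal notranslate"><span class="pre">%h</span></code> to the hostname, <code class="docutils literal notranslate"><span class="pre">%q{VAR}</span></code> to the value of the environment variable <code class="docutils literal notranslate"><span class="pre">VAR</span></code>, and <code class="docutils literal notranslate"><span class="pre">%i</span></code> to the lowest positive integer that yields an unused file name. The function <code class="docutils literal notranslate"><span class="pre">expand_report_name</span></code> and its parameters are hypothetical and exist only for illustration; this is not the tool's implementation.</p>

```python
import os
import re
import socket


def expand_report_name(template, pid=None, hostname=None, env=None,
                       exists=os.path.exists):
    """Hypothetical helper: expand %p, %h, %q{VAR} and %i in a report name.

    The placeholder meanings are an assumption taken from the CLI
    documentation; the expansion logic itself is only a sketch.
    """
    pid = os.getpid() if pid is None else pid
    hostname = socket.gethostname() if hostname is None else hostname
    env = os.environ if env is None else env

    def substitute(match):
        token = match.group(0)
        if token == "%p":
            return str(pid)
        if token == "%h":
            return hostname
        # Remaining case is %q{VAR}; unset variables expand to an empty string.
        return env.get(match.group(1), "")

    name = re.sub(r"%q\{(\w+)\}|%[ph]", substitute, template)
    if "%i" not in name:
        return name

    # %i: try 1, 2, 3, ... until the resulting name is not in use yet.
    counter = 1
    while exists(name.replace("%i", str(counter))):
        counter += 1
    return name.replace("%i", str(counter))
```

<p>Passing <code class="docutils literal notranslate"><span class="pre">pid</span></code>, <code class="docutils literal notranslate"><span class="pre">hostname</span></code>, <code class="docutils literal notranslate"><span class="pre">env</span></code> and <code class="docutils literal notranslate"><span class="pre">exists</span></code> explicitly keeps the sketch deterministic, which is convenient when checking how a template such as <code class="docutils literal notranslate"><span class="pre">report_%h_%p_%i</span></code> would expand.</p>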
<p><strong>NVIDIA Nsight Compute</strong></p>
<ul class="simple">
<li><p>The UI command was renamed from <code class="docutils literal notranslate"><span class="pre">nv-nsight-cu</span></code> to <code class="docutils literal notranslate"><span class="pre">ncu-ui</span></code>. The old name remains available for backwards compatibility.</p></li>
<li><p>Added support for roofline analysis charts</p></li>
<li><p>Added linked hot spot tables in section bodies to indicate performance problems in the source code</p></li>
<li><p>Added section navigation links in rule results to quickly jump to the referenced section</p></li>
<li><p>Added a new option to select how kernel names are shown in the UI</p></li>
<li><p>Added new memory tables for the L1/TEX cache and the L2 cache. The old tables are still available for backwards compatibility and have been moved to a new section containing deprecated UI elements.</p></li>
<li><p>Memory tables now show the metric name as a tooltip</p></li>
<li><p>Source resolution now takes into account file properties when selecting a file from disk</p></li>
<li><p>Results in the profile report can now be filtered by NVTX range</p></li>
<li><p>The Source page now supports collapsing views even for single files</p></li>
<li><p>The UI now shows profiler error messages as dismissible banners for increased visibility</p></li>
<li><p>Improved the baseline name control in the profiler report header</p></li>
</ul>
<p><strong>NVIDIA Nsight Compute CLI</strong></p>
<ul class="simple">
<li><p>The CLI command was renamed from <code class="docutils literal notranslate"><span class="pre">nv-nsight-cu-cli</span></code> to <code class="docutils literal notranslate"><span class="pre">ncu</span></code>. The old name remains available for backwards compatibility.</p></li>
<li><p>Queried metrics on GV100 and newer chips are now sorted alphabetically</p></li>
<li><p>Multiple instances of NVIDIA Nsight Compute CLI can now run concurrently on the same system, e.g. for profiling individual MPI ranks. Profiled kernels are serialized across all processes using a system-wide file lock.</p></li>
</ul>
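<p>Serializing work across processes with a system-wide file lock, as described in the last item above, can be sketched as follows. This is a minimal illustration that assumes a POSIX system and uses Python's <code class="docutils literal notranslate"><span class="pre">fcntl.flock</span></code>; the lock file path and the names <code class="docutils literal notranslate"><span class="pre">system_wide_lock</span></code> and <code class="docutils literal notranslate"><span class="pre">profile_kernel</span></code> are hypothetical, and the mechanism NVIDIA Nsight Compute CLI actually uses may differ.</p>

```python
import contextlib
import fcntl
import os
import tempfile

# Hypothetical lock file location; not the path used by the real tool.
LOCK_PATH = os.path.join(tempfile.gettempdir(), "example-profiler.lock")


@contextlib.contextmanager
def system_wide_lock(path=LOCK_PATH):
    """Hold an exclusive advisory lock on `path` for the duration of the block."""
    fd = os.open(path, os.O_CREAT | os.O_RDWR, 0o666)
    try:
        # Blocks until no other process (or open file description) holds the lock.
        fcntl.flock(fd, fcntl.LOCK_EX)
        try:
            yield
        finally:
            fcntl.flock(fd, fcntl.LOCK_UN)
    finally:
        os.close(fd)


def profile_kernel(name, log):
    """Stand-in for profiling one kernel; only one holder runs this at a time."""
    with system_wide_lock():
        log.append(name)
```

<p>Every process that wraps its critical section in <code class="docutils literal notranslate"><span class="pre">system_wide_lock()</span></code> with the same path takes turns, so independently launched processes, such as individual MPI ranks, never execute the protected section at the same time.</p>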
<p><strong>Resolved Issues</strong></p>
<ul class="simple">
<li><p>More C++ kernel names can be properly demangled</p></li>
<li><p>Fixed a <code class="docutils literal notranslate"><span class="pre">free():</span> <span class="pre">invalid</span> <span class="pre">pointer</span></code> error when profiling applications using PyTorch &gt; 19.07</p></li>
<li><p>Fixed profiling IBM Spectrum MPI applications that require PAMI GPU hooks (<code class="docutils literal notranslate"><span class="pre">--smpiargs=&quot;-gpu&quot;</span></code>)</p></li>
<li><p>Fixed an issue where the first kernel instruction was missed when computing <code class="docutils literal notranslate"><span class="pre">sass__inst_executed_per_opcode</span></code></p></li>
<li><p>Reduced surplus DRAM write traffic created from flushing caches during kernel replay</p></li>
<li><p>The <em>Compute Workload Analysis</em> section now shows the IMMA pipeline on GV11B GPUs</p></li>
<li><p>Profile reports now scroll properly on macOS when using a trackpad</p></li>
<li><p>Relative output filenames for the Profile activity now use the document directory instead of the current working directory</p></li>
<li><p>Fixed path expansion of <code class="docutils literal notranslate"><span class="pre">~</span></code> on Windows</p></li>
<li><p>Memory access information is now shown properly for RED assembly instructions on the Source page</p></li>
<li><p>Fixed an issue where the user's <code class="docutils literal notranslate"><span class="pre">PYTHONHOME</span></code> and <code class="docutils literal notranslate"><span class="pre">PYTHONPATH</span></code> environment variables were picked up by NVIDIA Nsight Compute, resulting in locale encoding issues.</p></li>
</ul>
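<p>The last item above concerns user-level Python settings leaking into a tool that embeds its own interpreter. One general way to avoid this kind of leak when launching a child process is to hand it an environment from which those variables are removed. The sketch below shows that idea only; the helper name <code class="docutils literal notranslate"><span class="pre">scrubbed_environment</span></code> is hypothetical, and this is not a description of how the issue was fixed in NVIDIA Nsight Compute.</p>

```python
import os

# Variables that redirect where a Python interpreter looks for its
# standard library and modules.
PYTHON_ENV_VARS = ("PYTHONHOME", "PYTHONPATH")


def scrubbed_environment(env=None):
    """Return a copy of `env` (default: os.environ) without PYTHON_ENV_VARS.

    The input mapping is left unmodified.
    """
    source = os.environ if env is None else env
    return {key: value for key, value in source.items()
            if key not in PYTHON_ENV_VARS}
```

<p>The returned dictionary can be passed as the <code class="docutils literal notranslate"><span class="pre">env</span></code> argument of <code class="docutils literal notranslate"><span class="pre">subprocess.run</span></code>, so the child process starts without the user's interpreter overrides while the parent environment stays untouched.</p>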
</section>


           </div>
          </div>
          <footer>

  <hr/>

  <div role="contentinfo">
    <p>&#169; Copyright 2018-2024, NVIDIA Corporation &amp; Affiliates. All rights reserved.
      <span class="lastupdated">Last updated on Mar 06, 2024.
      </span></p>
  </div>

   

</footer>
        </div>
      </div>
    </section>
  </div>
  <script>
      jQuery(function () {
          SphinxRtdTheme.Navigation.enable(true);
      });
  </script>
 



</body>
</html>