File: known-issues.html

package info (click to toggle)
nvidia-cuda-toolkit 12.4.1-2
  • links: PTS, VCS
  • area: non-free
  • in suites: trixie
  • size: 18,505,836 kB
  • sloc: ansic: 203,477; cpp: 64,769; python: 34,699; javascript: 22,006; xml: 13,410; makefile: 3,085; sh: 2,343; perl: 352
file content (230 lines) | stat: -rw-r--r-- 19,871 bytes parent folder | download | duplicates (6)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
<!DOCTYPE html>
<html class="writer-html5" lang="en" >
<head>
  <meta charset="utf-8" /><meta name="generator" content="Docutils 0.17.1: http://docutils.sourceforge.net/" />

  <meta name="viewport" content="width=device-width, initial-scale=1.0" />
  <title>Known Issues &mdash; NsightCompute 12.4 documentation</title>
      <link rel="stylesheet" href="../../_static/pygments.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/css/theme.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/design-style.b7bb847fb20b106c3d81b95245e65545.min.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/omni-style.css" type="text/css" />
      <link rel="stylesheet" href="../../_static/api-styles.css" type="text/css" />
    <link rel="shortcut icon" href="../../_static/nsight-compute.ico"/>
  <!--[if lt IE 9]>
    <script src="../../_static/js/html5shiv.min.js"></script>
  <![endif]-->
  
        <script data-url_root="../../" id="documentation_options" src="../../_static/documentation_options.js"></script>
        <script src="../../_static/jquery.js"></script>
        <script src="../../_static/underscore.js"></script>
        <script src="../../_static/doctools.js"></script>
        <script src="../../_static/mermaid-init.js"></script>
        <script src="../../_static/design-tabs.js"></script>
        <script src="../../_static/version.js"></script>
        <script src="../../_static/social-media.js"></script>
    <script src="../../_static/js/theme.js"></script>
    <link rel="index" title="Index" href="../../genindex.html" />
    <link rel="search" title="Search" href="../../search.html" />
 


</head>

<body class="wy-body-for-nav"> 
  <div class="wy-grid-for-nav">
    <nav data-toggle="wy-nav-shift" class="wy-nav-side">
      <div class="wy-side-scroll">
        <div class="wy-side-nav-search" >


  <a href="../../index.html">
  <img src="../../_static/nsight-compute.png" class="logo" alt="Logo"/>
</a>

<div role="search">
  <form id="rtd-search-form" class="wy-form" action="../../search.html" method="get">
    <input type="text" name="q" placeholder="Search docs" />
    <input type="hidden" name="check_keywords" value="yes" />
    <input type="hidden" name="area" value="default" />
  </form>
</div>
        </div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
              <p class="caption" role="heading"><span class="caption-text">Nsight Compute</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../index.html">1. Release Notes</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../ProfilingGuide/index.html">2. Kernel Profiling Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightCompute/index.html">3. Nsight Compute</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NsightComputeCli/index.html">4. Nsight Compute CLI</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Developer Interfaces</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CustomizationGuide/index.html">1. Customization Guide</a></li>
<li class="toctree-l1"><a class="reference internal" href="../../NvRulesAPI/index.html">2. NvRules API</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Training</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Training/index.html">Training</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Release Information</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../Archives/index.html">Archives</a></li>
</ul>
<p class="caption" role="heading"><span class="caption-text">Copyright and Licenses</span></p>
<ul>
<li class="toctree-l1"><a class="reference internal" href="../../CopyrightAndLicenses/index.html">Copyright and Licenses</a></li>
</ul>

        </div>
      </div>
    </nav>

    <section data-toggle="wy-nav-shift" class="wy-nav-content-wrap"><nav class="wy-nav-top" aria-label="Mobile navigation menu" >
          <i data-toggle="wy-nav-top" class="fa fa-bars"></i>
          <a href="../../index.html">NsightCompute</a>
      </nav>

      <div class="wy-nav-content">
        <div class="rst-content">
          <div role="navigation" aria-label="Page navigation">
  <ul class="wy-breadcrumbs">


<li><a href="../../index.html" class="icon icon-home"></a> &raquo;</li>
<li>Known Issues</li>

      <li class="wy-breadcrumbs-aside">
      </li>
<li class="wy-breadcrumbs-aside">


  <span>v2024.1.1 |</span>



  <a href="https://developer.nvidia.com/nsight-compute-history" class="reference external">Archive</a>


  <span>&nbsp;</span>
</li>

  </ul>
  <hr/>
</div>
          <div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
           <div itemprop="articleBody">
             
  <section id="known-issues">
<h1>Known Issues<a class="headerlink" href="#known-issues" title="Permalink to this headline"></a></h1>
<p><strong>Installation</strong></p>
<ul>
<li><p>The installer might not show all patch-level version numbers during installation.</p></li>
<li><p>Some command line options listed in the help of a <em>.run</em> installer of NVIDIA Nsight Compute are affecting only the archive extraction, but not the installation stage. To pass command line options to the embedded installer script, specify those options after <code class="docutils literal notranslate"><span class="pre">--</span></code> in the form of <code class="docutils literal notranslate"><span class="pre">--</span> <span class="pre">-&lt;option&gt;</span></code>. The available options for the installer script are:</p>
<div class="highlight-text notranslate"><div class="highlight"><pre><span></span>-help               : Print help message
-targetpath=&lt;PATH&gt;  : Specify install path
-noprompt           : No prompts. Implies acceptance of the EULA
</pre></div>
</div>
<p>For example, specifying only option <code class="docutils literal notranslate"><span class="pre">--quiet</span></code> extracts the installer archive without any output to the console, but still prompts for user interaction during the installation. To install NVIDIA Nsight Compute without any console output nor any user interaction, please specify <code class="docutils literal notranslate"><span class="pre">--quiet</span> <span class="pre">--</span> <span class="pre">-noprompt</span></code>.</p>
</li>
<li><p>After using the SDK Manager to install the NVIDIA Nsight Compute tools, their binary path needs to be manually added to your <code class="docutils literal notranslate"><span class="pre">PATH</span></code> environment variable.</p></li>
<li><p>See also the <a class="reference external" href="index.html#system-requirements">System Requirements</a> for more installation instructions.</p></li>
</ul>
<p><strong>Launch and Connection</strong></p>
<ul class="simple">
<li><p>Launching applications on remote targets/platforms is not supported for several combinations. See <a class="reference external" href="index.html#platform-support">Platform Support</a> for details. Manually launch the application using command line <code class="docutils literal notranslate"><span class="pre">ncu</span> <span class="pre">--mode=launch</span></code> on the remote system and connect using the UI or CLI afterwards.</p></li>
<li><p>In the NVIDIA Nsight Compute connection dialog, a remote system can only be specified for one target platform. Remove a connection from its current target platform in order to be able to add it to another.</p></li>
<li><p>Loading of CUDA sources via SSH requires that the remote connection is configured, and that the hostname/IP address of the connection matches the target (as seen in the report session details). For example, prefer my-machine.my-domain.com, instead of my-machine, even though the latter resolves to the same.</p></li>
<li><p>Other issues concerning remote connections are discussed in the documentation for <a class="reference external" href="../NsightCompute/index.html#remote-connections">remote connections</a>.</p></li>
<li><p>Local connections between NVIDIA Nsight Compute and the launched target application might not work on some ppc64le or aarch64 (sbsa) systems configured to only support IPv6. On these platforms, the <a class="reference external" href="../NsightComputeCli/index.html#environment-variables">NV_COMPUTE_PROFILER_LOCAL_CONNECTION_OVERRIDE=uds</a> environment variable can be set to use <em>Unix Domain Sockets</em> instead of <em>TCP</em> for local connections to workaround the problem. On x86_64 Linux, Unix Domain Sockets are used by default, but local TCP connections can be forced using <a class="reference external" href="../NsightComputeCli/index.html#environment-variables">NV_COMPUTE_PROFILER_LOCAL_CONNECTION_OVERRIDE=tcp</a>.</p></li>
</ul>
<p><strong>Profiling and Metrics</strong></p>
<ul class="simple">
<li><p>Profiling of 32-bit processes is not supported.</p></li>
<li><p>Profiling kernels executed on a device that is part of an SLI group is not supported. An “Unsupported GPU” error is shown in this case.</p></li>
<li><p>Profiling a kernel while other contexts are active on the same device (e.g. X server, or secondary CUDA or graphics application) can result in varying metric values for L2/FB (Device Memory) related metrics. Specifically, L2/FB traffic from non-profiled contexts cannot be excluded from the metric results. To completely avoid this issue, profile the application on a GPU without secondary contexts accessing the same device (e.g. no X server on Linux).</p></li>
<li><p>In the current release, profiling a kernel while any other GPU work is executing on the same MIG compute instance can result in varying metric values for all units. NVIDIA Nsight Compute enforces serialization of the CUDA launches within the target application to ensure those kernels do not influence each other. See <a class="reference external" href="../ProfilingGuide/index.html#serialization">Serialization</a> for more details. However, GPU work issued through other APIs in the target process or workloads created by non-target processes running simultaneously in the same MIG compute instance will influence the collected metrics. Note that it is acceptable to run CUDA processes in other MIG compute instances as they will not influence the profiled MIG compute instance.</p></li>
<li><p>On Linux kernels settings <code class="docutils literal notranslate"><span class="pre">fs.protected_regular=1</span></code> (e.g. some Ubuntu 20.04 cloud service provider instances), root users may not be able to access the <a class="reference external" href="../ProfilingGuide/index.html#serialization">inter-process lock file</a>. See the <a class="reference external" href="../ProfilingGuide/index.html#faq">FAQ</a> for workarounds.</p></li>
<li><p>Profiling only supports up to 32 device instances, including instances of MIG partitions. Profiling the 33rd or higher device instance will result in indeterminate data.</p></li>
<li><p>Enabling certain metrics can cause GPU kernels to run longer than the driver’s watchdog time-out limit. In these cases the driver will terminate the GPU kernel resulting in an application error and profiling data will not be available. Please disable the driver watchdog time out before profiling such long running CUDA kernels.</p>
<ul>
<li><p>On Linux, setting the X Config option Interactive to false is recommended.</p></li>
<li><p>For Windows, detailed information on disabling the Windows TDR is available at <a class="reference external" href="https://docs.microsoft.com/en-us/windows-hardware/drivers/display/timeout-detection-and-recovery">https://docs.microsoft.com/en-us/windows-hardware/drivers/display/timeout-detection-and-recovery</a></p></li>
</ul>
</li>
<li><p>Collecting device-level metrics, such as the NVLink metrics (<code class="docutils literal notranslate"><span class="pre">nvl*</span></code>), is not supported on <a class="reference external" href="https://www.nvidia.com/en-us/data-center/virtual-gpu-technology/">NVIDIA virtual GPUs</a> (vGPUs).</p></li>
<li><p>As of CUDA 11.4 and R470 TRD1 driver release, NVIDIA Nsight Compute is supported in a vGPU environment which requires a vGPU license. If the license is not obtained after 20 minutes, the reported performance metrics data from the GPU will be inaccurate. This is because of a feature in vGPU environment which reduces performance but retains functionality as specified <a class="reference external" href="https://docs.nvidia.com/grid/latest/grid-licensing-user-guide/index.html#software-enforcement-grid-licensing">here</a>.</p></li>
<li><p>Profiling on <a class="reference external" href="https://www.nvidia.com/en-us/data-center/virtualization/virtual-gpu-migration/">NVIDIA live-migrated virtual machines</a> is not supported and can result in undefined behavior.</p></li>
<li><p>Profiling with enabled multi-process service (MPS) can result in undefined behavior.</p></li>
<li><p>When Profiling using <em>Range Replay</em> or <em>Application Range Replay</em> with multiple CUDA Green Contexts active which belong to the same device context, the range result will contain counter values aggregated on all Green Contexts</p></li>
<li><p>The NVLink Topology section is not supported for a configuration using NVSwitch.</p></li>
<li><p>NVIDIA Nsight Compute does not support per-NVLink metrics.</p></li>
<li><p>NVIDIA Nsight Compute does not support the <em>Logical NVLink Throughput</em> table.</p></li>
<li><p>Setting a reduced NvLink Bandwidth mode does not impact the reported peak values for NvLink metrics. All peak values and corresponding percentages are calculated off the non-reduced NvLink bandwidth. Reconfiguring the NvLink Bandwidth mode using <cite>nvidia-smi`</cite> while profiling may lead to undefined tools’ behavior.</p></li>
</ul>
<ul class="simple">
<li><p>Profiling CUDA graph kernel nodes that can launch device graphs or are part of device-launchable graphs is not supported. Use <a class="reference external" href="../NsightComputeCli/index.html#command-line-options-profile">Graph Profiling</a> mode instead.</p></li>
<li><p>Profiling in <a class="reference external" href="../NsightComputeCli/index.html#command-line-options-profile">Graph Profiling</a> mode is performed on the context that is specified by the stream handle for the graph launch. Consequently, only memory allocated on this context is saved off and restored during graph replay and only kernel nodes executed on this context are profiled.</p></li>
</ul>
<ul class="simple">
<li><p>On CUDA drivers older than 530.x, profiling on Windows Subsystem for Linux (WSL) is not supported if the system has multiple physical NVIDIA GPUs. This is not affected by setting <code class="docutils literal notranslate"><span class="pre">CUDA_VISIBLE_DEVICES</span></code>.</p></li>
<li><p>Collecting software counters through PerfWorks currently forces all functions in the module of the profiled kernel to be loaded. This increases the host and device memory footprint of the target application for the remainder of the process lifetime.</p></li>
<li><p>PM Sampling is not supported when collecting a Profile Series.</p></li>
</ul>
<p><strong>Compatibility</strong></p>
<ul class="simple">
<li><p>Applications calling blocking functions on std input/output streams can result in the profiler to stop, until the blocking function call is resolved.</p></li>
<li><p>NVIDIA Nsight Compute can hang on applications using RAPIDS in versions 0.6 and 0.7, due to an issue in cuDF.</p></li>
<li><p>Profiling child processes launched via <code class="docutils literal notranslate"><span class="pre">clone()</span></code> is not supported.</p></li>
<li><p>Profiling child processes launched from Python using <code class="docutils literal notranslate"><span class="pre">os.system()</span></code> is not supported on ppc64le.</p></li>
<li><p>Profiling of Cooperative Groups kernels launched with <code class="docutils literal notranslate"><span class="pre">cuLaunchCooperativeKernelMultiDevice</span></code> is not yet supported.</p></li>
<li><p>On Linux systems, when profiling <em>bsd-csh</em> scripts, the original application output will not be printed. As a workaround, use a different C-shell, e.g. <em>tcsh</em>.</p></li>
<li><p>Attempting to use the <code class="docutils literal notranslate"><span class="pre">--clock-control</span></code> option to set the GPU clocks will fail when profiling on a GPU partition. Please use <code class="docutils literal notranslate"><span class="pre">nvidia-smi</span></code> (installed with NVIDIA display driver) to control the clocks for the entire GPU. This will require administrative privileges when the GPU is partitioned.</p></li>
<li><p>On Linux aarch64, NVIDIA Nsight Compute does not work if the <em>HOME</em> environment variable is not set.</p></li>
<li><p>NVIDIA Nsight Compute versions 2020.1.0 to 2020.2.1 are not compatible with CUDA driver version 460+ if the application launches Cooperative Groups kernels. Profiling will fail with error “UnknownError”.</p></li>
<li><p>Collecting CPU call stack information on Windows Server 2016 can hang NVIDIA Nsight Compute in some cases. Currently, the only workaround is to skip CPU call stack collection on such systems by not specifying the option <code class="docutils literal notranslate"><span class="pre">--call-stack</span></code>.</p></li>
<li><p>When profiling a script, <code class="docutils literal notranslate"><span class="pre">--target-processes</span> <span class="pre">all</span></code> may target utility executables such as <em>xargs</em>, <em>uname</em> or <em>ls</em>. To avoid profiling these, use the <code class="docutils literal notranslate"><span class="pre">--target-processes-filter</span></code> option accordingly.</p></li>
<li><p>On mobile platforms, <code class="docutils literal notranslate"><span class="pre">--kill</span></code> option is not supported with application replay mode.</p></li>
<li><p>NVIDIA Nsight Compute might show invalid characters for Unicode names and paths on Windows 10. As a workaround, use a third-party terminal emulator, e.g. Git bash.</p></li>
</ul>
<p><strong>User Interface</strong></p>
<ul class="simple">
<li><p>The API Statistics filter in NVIDIA Nsight Compute does not support units.</p></li>
<li><p>File size is the only property considered when resolving source files. Timestamps are currently ignored.</p></li>
<li><p>Terminating or disconnecting an application in the <em>Interactive Profiling</em> activity while the API Stream View is updated can lead to a crash.</p></li>
<li><p>See the <a class="reference external" href="index.html#library-support-optix">OptiX library support section</a> for limitations concerning the <a class="reference external" href="../NsightCompute/index.html#as-viewer">Acceleration Structure Viewer</a>.</p></li>
<li><p>After updating from a previous version of NVIDIA Nsight Compute on Linux, the file load dialog may not allow column resizing and sorting. As a workaround, the <em>~/.config/QtProject.conf</em> file can be edited to remove the <em>treeViewHeader</em> entry from the <em>[FileDialog]</em> section.</p></li>
</ul>
</section>


           </div>
          </div>
          <footer>

  <hr/>

  <div role="contentinfo">
    <p>&#169; Copyright 2018-2024, NVIDIA Corporation &amp; Affiliates. All rights reserved.
      <span class="lastupdated">Last updated on Mar 06, 2024.
      </span></p>
  </div>

   

</footer>
        </div>
      </div>
    </section>
  </div>
  <script>
      jQuery(function () {
          SphinxRtdTheme.Navigation.enable(true);
      });
  </script>
 



</body>
</html>