File: logging.rst

package info (click to toggle)
rocsolver 6.4.3-2
  • links: PTS, VCS
  • area: main
  • in suites: sid
  • size: 17,876 kB
  • sloc: cpp: 151,850; python: 2,275; sh: 875; objc: 642; ansic: 402; makefile: 71; xml: 26
file content (176 lines) | stat: -rw-r--r-- 7,613 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
.. meta::
  :description: rocSOLVER documentation and API reference library
  :keywords: rocSOLVER, ROCm, API, documentation

.. _logging-label:

******************************
rocSOLVER multi-level logging
******************************

rocSOLVER provides logging facilities that can be used to output information on rocSOLVER function calls, 
similar to `Logging in rocBLAS <https://rocm.docs.amd.com/projects/rocBLAS/en/latest/reference/logging.html>`_. 
Three modes of logging are supported: trace logging, bench logging, and profile logging.

.. note::
   Performance will degrade when logging is enabled.

Logging modes
================================================

Trace logging
--------------

Trace logging outputs a line each time an internal rocSOLVER or rocBLAS routine is called,
outputting the function name and the values of its arguments (excluding stride arguments). The
maximum depth of nested function calls that can appear in the log is specified by the user.

Bench logging
----------------

Bench logging outputs a line each time a public rocSOLVER routine is called (excluding
auxiliary library functions), outputting a line that can be used with the executable
``rocsolver-bench`` to call the function with the same size arguments.

.. _log_profile:

Profile logging
-------------------

Profile logging, upon calling ``rocsolver_log_write_profile`` or ``rocsolver_log_flush_profile``,
or terminating the logging session using ``rocsolver_log_end``, will output statistics on each
called internal rocSOLVER and rocBLAS routine. These include the number of times each function
was called, the total program runtime occupied by the function, and the total program runtime
occupied by its nested function calls. As with trace logging, the maximum depth of nested output
is specified by the user. Note that, when profile logging is enabled, the stream will be synchronized
after every internal function call.


Initialization and set-up
================================================

In order to use rocSOLVER's logging facilities, the user must first call ``rocsolver_log_begin``
in order to allocate the internal data structures used for logging and begin the logging session.
The user may then specify a layer mode and max level depth, either programmatically using
``rocsolver_log_set_layer_mode``, ``rocsolver_log_set_max_levels``, or by setting the corresponding
environment variables.

The layer mode specifies which logging type(s) are activated, and can be ``rocblas_layer_mode_none``,
``rocblas_layer_mode_log_trace``, ``rocblas_layer_mode_log_bench``, ``rocblas_layer_mode_log_profile``,
or a bitwise combination of these. The max level depth specifies the default maximum depth of nested
function calls that may appear in the trace and profile logging.

Both the default layer mode and max level depth can be specified using environment variables.

* ``ROCSOLVER_LAYER``
* ``ROCSOLVER_LEVELS``

If these variables are not set, the layer mode will default to ``rocblas_layer_mode_none`` and the
max level depth will default to 1. These defaults can be restored by calling the function
``rocsolver_log_restore_defaults``.

``ROCSOLVER_LAYER`` is a bitwise OR of zero or more bit masks as follows:

*  If ``ROCSOLVER_LAYER`` is not set, then there is no logging
*  If ``(ROCSOLVER_LAYER & 1) != 0``, then there is trace logging
*  If ``(ROCSOLVER_LAYER & 2) != 0``, then there is bench logging
*  If ``(ROCSOLVER_LAYER & 4) != 0``, then there is profile logging

Three environment variables can set the full path name for a log file:

* ``ROCSOLVER_LOG_TRACE_PATH`` sets the full path name for trace logging
* ``ROCSOLVER_LOG_BENCH_PATH`` sets the full path name for bench logging
* ``ROCSOLVER_LOG_PROFILE_PATH`` sets the full path name for profile logging

If one of these environment variables is not set, then ``ROCSOLVER_LOG_PATH`` sets the full path
for the corresponding logging, if it is set. If neither the above nor ``ROCSOLVER_LOG_PATH`` are
set, then the corresponding logging output is streamed to standard error.

The results of profile logging, if enabled, can be printed using ``rocsolver_log_write_profile``
or ``rocsolver_log_flush_profile``. Once logging facilities are no longer required (e.g. at
program termination), the user must call ``rocsolver_log_end`` to free the data structures used
for logging. If the profile log has not been flushed beforehand, then ``rocsolver_log_end``
will also output the results of profile logging.

For more details on the mentioned logging functions, see the :ref:`rocSOLVER Logging functions <api_logging>`
reference section.


Example code
================================================

Code examples that illustrate the use of rocSOLVER's multi-level logging facilities can be found
in this section or in the ``example_logging.cpp`` file in the ``clients/samples`` directory.

The following example shows some basic use: enabling trace and profile logging, and setting the
max depth for their output.

.. code-block:: cpp

   // initialization
   rocblas_handle handle;
   rocblas_create_handle(&handle);
   rocsolver_log_begin();

   // begin trace logging and profile logging (max depth = 5)
   rocsolver_log_set_layer_mode(rocblas_layer_mode_log_trace | rocblas_layer_mode_log_profile);
   rocsolver_log_set_max_levels(5);

   // call rocSOLVER functions...

   // terminate logging and print profile results
   rocsolver_log_flush_profile();
   rocsolver_log_end();
   rocblas_destroy_handle(handle);

Alternatively, users may control which logging modes are enabled by using environment variables.
The benefit of this approach is that the program does not need to be recompiled if a different
logging environment is desired. This requires that ``rocsolver_log_set_layer_mode`` and
``rocsolver_log_set_max_levels`` are not called in the code, e.g.

.. code-block:: cpp

   // initialization
   rocblas_handle handle;
   rocblas_create_handle(&handle);
   rocsolver_log_begin();

   // call rocSOLVER functions...

   // termination
   rocsolver_log_end();
   rocblas_destroy_handle(handle);

The user may then set the desired logging modes and max depth on the command line as follows:

.. code-block:: bash

   export ROCSOLVER_LAYER=5
   export ROCSOLVER_LEVELS=5


Kernel logging
================================================

Kernel launches from within rocSOLVER can be added to the trace and profile logs using an
additional layer mode flag. The flag ``rocblas_layer_mode_ex_log_kernel`` can be combined with
``rocblas_layer_mode`` flags and passed to ``rocsolver_log_set_layer_mode`` in order to enable
kernel logging. Alternatively, the environment variable ``ROCSOLVER_LAYER`` can be set such that
``(ROCSOLVER_LAYER & 16) != 0``:

*  If ``(ROCSOLVER_LAYER & 17) != 0``, then kernel calls will be added to the trace log
*  If ``(ROCSOLVER_LAYER & 20) != 0``, then kernel calls will be added to the profile log


Multiple host threads
================================================

The logging facilities for rocSOLVER assume that each ``rocblas_handle`` is associated with at
most one host thread. When using rocSOLVER's multi-level logging setup, it is recommended to
create a separate ``rocblas_handle`` for each host thread.

The rocsolver_log_* functions are not thread-safe. Calling a log function while any rocSOLVER
routine is executing on another host thread will result in undefined behaviour. Once enabled,
logging data collection is thread-safe. However, note that trace logging will likely result in
garbled trace trees if rocSOLVER routines are called from multiple host threads.