.. meta::
  :description: rocPRIM documentation and API reference library
  :keywords: rocPRIM, ROCm, API, documentation

.. _dev-config:

********************************************************************
 Configuring the Kernels
********************************************************************

A kernel config selects the grid/block dimensions as well as how the
data is fetched and stored (the algorithms used for ``load`` and
``store``) by the operations that use it (such as ``select``).
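
As a rough illustration of what such a config carries, the following standalone sketch
uses a hypothetical ``sketch::kernel_config`` type. It does not include ``rocPRIM``;
the member names and the ``blocks_needed`` helper are assumptions made for this example,
not the library's definitions.

.. code-block:: cpp

   #include <cstddef>

   namespace sketch
   {

   // Hypothetical stand-in for a kernel config: it fixes, at compile time,
   // the number of threads per block and the items each thread processes.
   template<unsigned int BlockSize, unsigned int ItemsPerThread>
   struct kernel_config
   {
       static constexpr unsigned int block_size       = BlockSize;
       static constexpr unsigned int items_per_thread = ItemsPerThread;
   };

   // Number of blocks needed to cover `size` items with a given config
   // (ceiling division by the number of items handled per block).
   template<class Config>
   constexpr std::size_t blocks_needed(std::size_t size)
   {
       return (size + Config::block_size * Config::items_per_thread - 1)
              / (Config::block_size * Config::items_per_thread);
   }

   using config_256x4 = kernel_config<256, 4>;

   static_assert(blocks_needed<config_256x4>(1024) == 1, "one block covers 1024 items");
   static_assert(blocks_needed<config_256x4>(1025) == 2, "a second block handles the remainder");

   } // namespace sketch

In this sketch, changing the two template arguments changes both the block dimension and,
through the ceiling division, the grid dimension for a given input size.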

.. doxygenstruct:: rocprim::kernel_config

Setting the configuration lets you tune the kernel to a given GPU model.
``rocPRIM`` uses a placeholder type to let the macros select the default configuration for
the GPU model.
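
The placeholder idea can be sketched in plain C++ as follows. This is a minimal
illustration, not ``rocPRIM``'s mechanism: it swaps the library's macros for
``std::conditional``, and every name in the ``placeholder_sketch`` namespace, as well as
the tuned values, is hypothetical.

.. code-block:: cpp

   #include <type_traits>

   namespace placeholder_sketch
   {

   // Hypothetical explicit config carrying its parameters as constants.
   template<unsigned int BlockSize, unsigned int ItemsPerThread>
   struct kernel_config
   {
       static constexpr unsigned int block_size       = BlockSize;
       static constexpr unsigned int items_per_thread = ItemsPerThread;
   };

   // Empty placeholder type meaning "let the library choose".
   struct default_config
   {};

   // Illustrative tuned default; the values are made up for this sketch.
   using tuned_default = kernel_config<256, 8>;

   // Resolve the placeholder to the tuned default and keep any
   // explicitly provided config unchanged.
   template<class Config>
   using resolve_config =
       typename std::conditional<std::is_same<Config, default_config>::value,
                                 tuned_default,
                                 Config>::type;

   static_assert(resolve_config<default_config>::block_size == 256,
                 "placeholder resolves to the tuned default");
   static_assert(resolve_config<kernel_config<128, 4>>::block_size == 128,
                 "explicit config is kept as-is");

   } // namespace placeholder_sketch

An algorithm written against ``resolve_config<Config>`` then behaves the same whether the
caller passes the placeholder or an explicit config.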

.. doxygenstruct:: rocprim::default_config

.. warning::

   To provide information about the GPU you're targeting, you have to
   set ``ROCPRIM_TARGET_ARCH``.

   If the target is not supported by ``rocPRIM``, the templates will
   use the configuration for the model ``900``.

   If ``ROCPRIM_TARGET_ARCH`` is not defined, it defaults to ``0``,
   which is not supported by ``rocPRIM``, so the configurations
   for the model ``900`` are used.
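
The fallback rule described in the warning can be sketched as below. This is an
illustration under stated assumptions: the ``arch_sketch`` namespace, its helper
functions, and the list of supported targets are hypothetical, and the actual supported
set depends on the ``rocPRIM`` version. In a real build the macro would typically be
passed on the compiler command line, for example ``-DROCPRIM_TARGET_ARCH=900``.

.. code-block:: cpp

   // Mirrors the documented default: an undefined target becomes 0.
   #ifndef ROCPRIM_TARGET_ARCH
       #define ROCPRIM_TARGET_ARCH 0
   #endif

   namespace arch_sketch
   {

   // Hypothetical list of targets that have tuned configs; the real set
   // is not reproduced here.
   constexpr bool is_supported(unsigned int arch)
   {
       return arch == 803 || arch == 900;
   }

   // Unsupported targets (including the default 0) fall back to model 900.
   constexpr unsigned int effective_arch(unsigned int arch)
   {
       return is_supported(arch) ? arch : 900;
   }

   // The target whose configurations would be used for this translation unit.
   constexpr unsigned int selected_arch = effective_arch(ROCPRIM_TARGET_ARCH);

   static_assert(effective_arch(0) == 900, "undefined target falls back to 900");
   static_assert(effective_arch(803) == 803, "supported target is kept");

   } // namespace arch_sketch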