File: README.TXT

package info (click to toggle)
nvidia-cuda-toolkit 12.4.1-2
  • links: PTS, VCS
  • area: non-free
  • in suites: trixie
  • size: 18,505,836 kB
  • sloc: ansic: 203,477; cpp: 64,769; python: 34,699; javascript: 22,006; xml: 13,410; makefile: 3,085; sh: 2,343; perl: 352
file content (47 lines) | stat: -rw-r--r-- 2,433 bytes parent folder | download | duplicates (6)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
== Sample CUDA application for instruction mix ==
Applies a Sobel filter to a image in global memory
and generates an output image in global memory.

Compiling the code:
==================
nvcc -lineinfo -gencode arch=compute_70,code=sm_70 -gencode arch=compute_72,code=sm_72 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_87,code=sm_87  -gencode arch=compute_89,code=sm_89 instructionMix.cu -o instructionMix

Command line arguments (all are optional):
==========================================
1) <kernel option> Version of Sobel filter kernel to use
                     1 : double version of kernel
                     2 : float version of kernel
                   Default value: 1
2) <Image width> Image width should be greater than or equal to block size (BLOCK_SIZE - defined in source file "instructionMix.cu")
                         and must be an integral multiple of block size. 
                     Default value: DEFAULT_IMAGE_SIZE (defined in source file "instructionMix.cu")

3) <Image height> Image height should be greater than or equal to block size (BLOCK_SIZE - defined in source file "instructionMix.cu")
                         and must be an integral multiple of block size. 
                     Default value: Equal to image width

Sample usage:
============
- Use double precision floating point version of Sobel filter kernel with default image size
  > ./instructionMix

- Use single precision floating point version of Sobel filter kernel with default image size
  > ./instructionMix 2

- Use double precision floating point version of Sobel filter kernel with image size 1024 x 1024
  > ./instructionMix 1 1024

- Use double precision floating point version of Sobel filter kernel with image size 1024 x 256
  > ./instructionMix 1 1024 256


Profiling the sample using Nsight Compute command line
======================================================
- Profile the double precision floating point version of Sobel filter kernel
  > ncu --set full --import-source on  -o sobelDouble.ncu-rep ./instructionMix 1

- Profile the single precision floating point version of Sobel filter kernel
  > ncu --set full --import-source on  -o sobelFloat.ncu-rep ./instructionMix 2

The profiler report files for the sample are also provided and they can be opened in the 
Nsight Compute UI using the "File->Open" menu option.