File: verbose-tensile-source-kernel-build.patch

From: Cordell Bloor <cgmb@slerp.xyz>
Date: Fri, 28 Jul 2023 14:04:58 -0600
Subject: verbose tensile source kernel build

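Building the Tensile source kernels takes a long time, so the build may
time out on slower machines if it produces no output for too long. The
verbose flag (-v) should add some output at the start of the build for
each offload architecture, which should help prevent the timeout. The
compiler is also run with subprocess.check_call instead of
subprocess.check_output, so that its output is printed as it is produced
rather than captured.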
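
The difference between capturing and streaming the compiler output can
be sketched as follows. This is a minimal illustration, not code from
Tensile: CHILD is a hypothetical stand-in for the compiler invocation,
and run_captured and run_streamed are names made up for this example.

```python
import subprocess
import sys

# Hypothetical stand-in for the compiler invocation: a child process that
# writes one line to stderr (where verbose compiler output typically goes)
# and one line to stdout.
CHILD = [
    sys.executable,
    "-c",
    "import sys; print('verbose line', file=sys.stderr); print('done')",
]


def run_captured(args):
    """Collect the child's stdout and stderr; nothing is shown while it runs."""
    return subprocess.check_output(args, stderr=subprocess.STDOUT)


def run_streamed(args):
    """Let the child inherit the parent's stdout and stderr; output appears live."""
    return subprocess.check_call(args)


if __name__ == "__main__":
    captured = run_captured(CHILD)
    print(f"captured {len(captured)} bytes, available only after the child exited")
    run_streamed(CHILD)
```

With the captured variant, a build watchdog sees no output until the
child exits. With the streamed variant, each line reaches the build log
as soon as the child writes it.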

Forwarded: not-needed
Last-Update: 2025-07-04
---
 tensile/Tensile/TensileCreateLibrary.py | 16 ++++++++++------
 1 file changed, 10 insertions(+), 6 deletions(-)

diff --git a/tensile/Tensile/TensileCreateLibrary.py b/tensile/Tensile/TensileCreateLibrary.py
index 527e14b..5240056 100644
--- a/tensile/Tensile/TensileCreateLibrary.py
+++ b/tensile/Tensile/TensileCreateLibrary.py
@@ -255,6 +255,8 @@ def buildSourceCodeObjectFile(CxxCompiler, outputPath, kernelFile, removeTempora
         # if CxxCompiler == "amdclang++":
         # hipFlags += ["-mllvm", "-amdgpu-early-inline-all=true", "-mllvm", "-amdgpu-function-calls=false"]
         hipFlags += ["-I", outputPath]
+        # Debian
+        hipFlags += ["-v"]
 
         # Add build-id for builds with rocm 5.3+
         compilerVer = globalParameters["HipClangVersion"].split(".")[:2]
@@ -292,12 +294,14 @@ def buildSourceCodeObjectFile(CxxCompiler, outputPath, kernelFile, removeTempora
 
         tPrint(2, f"Build object file command: {compileArgs}")
         # change to use  check_output to force windows cmd block util command finish
-        try:
-            out = subprocess.check_output(compileArgs, stderr=subprocess.STDOUT)
-            tPrint(3, out)
-        except subprocess.CalledProcessError as err:
-            print(err.output)
-            raise
+        #try:
+        #    out = subprocess.check_output(compileArgs, stderr=subprocess.STDOUT)
+        #    tPrint(3, out)
+        #except subprocess.CalledProcessError as err:
+        #    print(err.output)
+        #    raise
+        # Debian
+        subprocess.check_call(compileArgs)
 
         # get hipcc version due to compatiblity reasons
         # If we aren't using hipcc what happens?