File: 1_introduction.md

package info (click to toggle)
intel-graphics-compiler 1.0.12504.6-1%2Bdeb12u1
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 83,912 kB
  • sloc: cpp: 910,147; lisp: 202,655; ansic: 15,197; python: 4,025; yacc: 2,241; lex: 1,570; pascal: 244; sh: 104; makefile: 25
file content (129 lines) | stat: -rw-r--r-- 5,718 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
<!---======================= begin_copyright_notice ============================

Copyright (C) 2020-2021 Intel Corporation

SPDX-License-Identifier: MIT

============================= end_copyright_notice ==========================-->

# Introduction

Purpose / Scope
===============

This document provides a specification of the common instruction set
architecture (ISA) for Intel Graphics Media Accelerators (GEN)
architecture. The virtual ISA specification defines a portable and
secure byte code intermediate representation (IR) for compilers and
assembly developers that wish to target Gen. High-level compilers for
languages such as C++ and CM compile the source kernel program into
virtual ISA byte code. At runtime, the just-in-time (JIT) virtual ISA
finalizer translates the virtual ISA byte code into native GEN
instructions for GPU execution.

The virtual ISA has the following goals:

-   Provide a portable ISA that spans multiple GEN platforms.
-   Incur low JIT-translation overhead when translating from virtual ISA
    to GEN binary.
-   Achieve performance that is comparable to that of the hand-written
    GEN assembly kernels.
-   Expose most of GEN hardware's capabilities, specifically its shared
    function units such as sampler and video motion estimation.
-   Provide a code distribution format for kernels executing on Gen.

Virtual ISA and Gen
===================

The virtual ISA is designed to be executed on GEN starting from the
Broadwell generation. As such, it makes many assumptions about the
availability of hardware features such as SIMD instructions, regions,
and data port messages. In particular, the virtual ISA adopts the
fundamental data types used in Gen, and most virtual ISA instructions
have a corresponding GEN instruction. Notable differences between a
virtual ISA and a GEN instruction are:

1.  The virtual ISA instructions are variable-length unlike the GEN
    instructions.
2.  The virtual ISA instructions use virtual variables as their
    operands. The general-purpose and architectural registers are
    replaced by variables that are grouped into different categories
    based on their functionality (General, Address, Predicate, etc.),
    and there are generally no limtis on the number of variables.
    Certain architecture registers such as the accumulator are also not
    exposed.
3.  Instead of a single send instruction, virtual ISA provides
    higher-level instructions represent accesses to different functional
    units (load/store, sampler, barrier, etc.). A special raw_send
    instruction can be used to directly access shared function
    capabilities that are not available otherwise.
4.  The virtual ISA relaxes the GEN register region restrictions, in
    particular abstracting away most of the platform specific region
    alignment rules.
5.  Each virtual ISA instruction operand is not associated with a type
    but will instead obtain their type from the variable declarations.

While the virtual ISA specification strives to be platform independent,
not all of its features will be supported on every platform due to
hardware limitations. Such exceptions are marked with the notation
"{platform}" in the specification, where "platform" is the list of GEN
platforms that support the feature. When a virtual ISA program exercises
a feature that is not supported by the target platform, the vISA
finalizer will return an error.

Document Structure
==================

The organization of this document is as follows:

-   Section [1](2_datatypes.md) describes the data types and
    type conversion rules.
-   Section [2](3_execution_model.md) describes the execution model for a virtual
    ISA object.
-   Section [3](4_visa_header.md) describes the virtual ISA object format as
    well as the format of various global symbol tables.
-   Section [4](5_operands.md) describes the format of instruction
    operands.
-   Section [5](6_instructions.md) describes the instruction set of the virtual
-   Section [6](7_appendix_debug_information.md) describes the virtual ISA debug information.
    ISA.
-   Section [7](8_appendix_visa_assembly_syntax.md) describes the virtual ISA assembly syntax.

Definitions, Acronyms, and Abbreviation
=======================================

|     |     |
| --- | --- |
| SIMD | Single Instruction Multiple Data |
| GPU | Graphics Processing Unit |
| GenX | Graphics core generations for Intel Graphics Media Accelerators \[1\] |
| Host | IA-32 and Intel 64 architecture processors |
| Device | Intel GenX GPU |
| Kernel | A program that can be executed on GenX hardware |
| Thread | An instance of a kernel program that is executed on a GenX hardware |
| LSB | Least Significant Bit |
| GRF | General Register File, a set of general-purpose registers available in GenX. |
| DWORD | Double-word, represents 4 bytes for GenX |
| VME | Video Motion Estimation |
| BDW | Intel Broadwell processor microarchitecture |
| SKL | Intel Skylake processor microarchitecture |
| ICLLP | Intel IceLake processor microarchitecture (low power) |
| TGLLP | Intel TigerLake processor microarchitecture (low power) |



References and Related Information
==================================

\[1\] Intel Graphics Media Accelerator Developer's Guide,
<http://software.intel.com/en-us/articles/intel-graphics-media-accelerator-developers-guide/>.

\[2\] United States Patent 7257695, 2007.

\[3\] Intel C++ Compiler User and Reference Guides,
<http://www.intel.com/cd/software/products/asmo-na/eng/347618.htm>.

\[4\] Intel Media SDK Reference Manual, Supplement A: Frame VME
Emulation Library for Intel SNB Graphics, April 8, 2009 (Version 1.21s).

\[5\] CM(C for Metal) Runtime API Specification, Version 3.0.