File: 3D_SAMPLE4.md

<!---======================= begin_copyright_notice ============================

Copyright (C) 2020-2021 Intel Corporation

SPDX-License-Identifier: MIT

============================= end_copyright_notice ==========================-->

 

## Opcode

  3D_SAMPLE4 = 0x6f

## Format


### CONDITION

- Op.op=gather4


| | | | | | | |
| --- | --- | --- | --- | --- | --- | --- |
| 0x6f(3D_SAMPLE4) | Op          | Exec_size | Pred      | Src_channel | Aoffimmi | Sampler |
|                  | Surface     | Dst       | NumParams | U           | V        | R       |
|                  | Ai\*        |           |           |             |          |         |


### CONDITION

- Op.op=gather4_c


| | | | | | | |
| --- | --- | --- | --- | --- | --- | --- |
| 0x6f(3D_SAMPLE4) | Op            | Exec_size | Pred      | Src_channel | Aoffimmi | Sampler |
|                  | Surface       | Dst       | NumParams | Ref\*       | U        | V       |
|                  | R             | Ai\*      |           |             |          |         |


### CONDITION

- Op.op=gather4_po


| | | | | | | |
| --- | --- | --- | --- | --- | --- | --- |
| 0x6f(3D_SAMPLE4) | Op             | Exec_size | Pred      | Src_channel | Aoffimmi | Sampler |
|                  | Surface        | Dst       | NumParams | U           | V        | Offu\*  |
|                  | Offv\*         | R         |           |             |          |         |


### CONDITION

- Op.op=gather4_po_c


| | | | | | | |
| --- | --- | --- | --- | --- | --- | --- |
| 0x6f(3D_SAMPLE4) | Op               | Exec_size | Pred      | Src_channel | Aoffimmi | Sampler |
|                  | Surface          | Dst       | NumParams | Ref\*       | U        | V       |
|                  | Offu\*           | Offv\*    | R         |             |          |         |


## Semantics




    Sample the surface using bilinear filtering, and return four samples.

## Description


    Sample <surface> using bilinear filtering, and return four samples for each pixel in <dst>.

- **Op(ub):** 
 
  - Bit[4..0]: encodes the sampler operation
 
    - 0b01000:  gather4 
    - 0b10000:  gather4_c 
    - 0b10001:  gather4_po 
    - 0b10010:  gather4_po_c 
  - Bit[5]: pixel null mask enable. Specifies whether the writeback message will include an extra phase indicating the pixel null mask.

- **Exec_size(ub):** Execution size
 
  - Bit[2..0]: size of the region for source and destination operands
 
    - 0b011:  8 elements 
    - 0b100:  16 elements 
  - Bit[7..4]: execution mask (explicit control over the enabled channels)
 
    - 0b0000:  M1 
    - 0b0001:  M2 
    - 0b0010:  M3 
    - 0b0011:  M4 
    - 0b0100:  M5 
    - 0b0101:  M6 
    - 0b0110:  M7 
    - 0b0111:  M8 
    - 0b1000:  M1_NM 
    - 0b1001:  M2_NM 
    - 0b1010:  M3_NM 
    - 0b1011:  M4_NM 
    - 0b1100:  M5_NM 
    - 0b1101:  M6_NM 
    - 0b1110:  M7_NM 
    - 0b1111:  M8_NM
- **Pred(uw):** Predication control

- **Src_channel(ub):** 
 
  - Bit[1..0]: controls the source channel to be sampled
 
    - 0b00:  R 
    - 0b01:  G 
    - 0b10:  B 
    - 0b11:  A
- **Aoffimmi(scalar):** A UW representing the _aoffimmi modifier, with the following format:

  - Bit[3..0]: stores the R offset. Valid values are in the range -8 to 7

  - Bit[7..4]: stores the V offset. Valid values are in the range -8 to 7

  - Bit[11..8]: stores the U offset. Valid values are in the range -8 to 7

  - Bit[15..12]: reserved. Must be zero

- **Sampler(ub):** Index of the sampler variable

- **Surface(ub):** Index of the surface variable

- **Dst(raw_operand):** The result of the sample. Must have type HF, F, W, UW, D, or UD. The four samples map to the RGBA channels as follows:


        +-------------------------+--------------------------+
        | upper left sample = A   | upper right sample = B   |
        +-------------------------+--------------------------+
        | lower left sample = R   | lower right sample = G   |
        +-------------------------+--------------------------+

- **NumParams(ub):** Number of additional parameters for this instruction. Valid values are [1-15]

- **Ref(raw_operand):** The first <exec_size> elements contain the reference value that is compared against the red channel of the sampled surface

- **U(raw_operand):** The first <exec_size> elements contain the X pixel address

- **V(raw_operand):** The first <exec_size> elements contain:

  - 1D_array surfaces: the unnormalized array index
  - Other surfaces: the normalized y coordinate

- **Offu(raw_operand):** The first <exec_size> elements contain the pixel offset from U. Must have type D

- **Offv(raw_operand):** The first <exec_size> elements contain the pixel offset from V. Must have type D

- **R(raw_operand):** The first <exec_size> elements contain:

  - 2D_array: the unnormalized array index
  - 3D and cube: the normalized z coordinate

- **Ai(raw_operand):** The first <exec_size> elements contain the array index for a cube surface
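
The bit layouts of the Op, Exec_size, Src_channel, and Aoffimmi fields described above can be sketched as a small decoder. This is an illustration only: the function and table names are hypothetical and not part of any IGC API, and the 4-bit offsets are assumed to be two's complement (consistent with the stated range of -8 to 7, but not spelled out in this document).

```python
# Illustrative sketch; names are hypothetical, layouts follow the field list above.
SAMPLE4_OPS = {
    0b01000: "gather4",
    0b10000: "gather4_c",
    0b10001: "gather4_po",
    0b10010: "gather4_po_c",
}
EXEC_SIZES = {0b011: 8, 0b100: 16}
# Execution mask encodings 0b0000..0b1111: M1..M8, then M1_NM..M8_NM.
EMASKS = [f"M{i}" for i in range(1, 9)] + [f"M{i}_NM" for i in range(1, 9)]
CHANNELS = "RGBA"


def decode_op(op_byte):
    """Return (operation name, pixel_null_mask enabled) from the Op byte."""
    return SAMPLE4_OPS[op_byte & 0x1F], bool((op_byte >> 5) & 1)


def decode_exec_size(exec_byte):
    """Return (number of elements, execution mask name) from the Exec_size byte."""
    return EXEC_SIZES[exec_byte & 0x7], EMASKS[(exec_byte >> 4) & 0xF]


def decode_src_channel(chan_byte):
    """Return the source channel letter from Bit[1..0]."""
    return CHANNELS[chan_byte & 0x3]


def _signed4(nibble):
    # ASSUMPTION: offsets are 4-bit two's complement values.
    return nibble - 16 if nibble & 0x8 else nibble


def decode_aoffimmi(word):
    """Return the U, V, R immediate offsets from the Aoffimmi word."""
    if word >> 12:
        raise ValueError("Bit[15..12] is reserved and must be zero")
    return {
        "u": _signed4((word >> 8) & 0xF),
        "v": _signed4((word >> 4) & 0xF),
        "r": _signed4(word & 0xF),
    }
```

For example, under these assumptions `decode_aoffimmi(0x0F81)` yields a U offset of -1, a V offset of -8, and an R offset of 1.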

#### Properties


## Text
```
    

[(<P>)] SAMPLE4[.pixel_null_mask].<Src_channel> (Exec_size) <Aoffimmi> <Sampler> <Surface> <Dst> <u> <v> <r> <ai>

[(<P>)] SAMPLE4_C[.pixel_null_mask].<Src_channel> (Exec_size) <Aoffimmi> <Sampler> <Surface> <Dst> <ref> <u> <v> <r> <ai>

[(<P>)] SAMPLE4_PO[.pixel_null_mask].<Src_channel> (Exec_size) <Aoffimmi> <Sampler> <Surface> <Dst> <u> <v> <offu> <offv> <r>

[(<P>)] SAMPLE4_PO_C[.pixel_null_mask].<Src_channel> (Exec_size) <Aoffimmi> <Sampler> <Surface> <Dst> <ref> <u> <v> <offu> <offv> <r>

// instruction specific parameters may vary
```



## Notes




For each enabled channel, <exec_size> elements are returned in RGBA order; disabled channels are skipped in the results. Only the enabled pixels are returned in <dst>. Each channel's return data starts in the next GRF; if <exec_size> * sizeof(dst_type) is smaller than the register size, the remaining portion of the register has undefined values.
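
One way to read the layout rule above is sketched below: each enabled channel starts on a fresh GRF, and any bytes left over in a channel's last register are undefined. The helper name `dst_layout` is hypothetical, and the default register size of 32 bytes is an assumption that varies by platform, so it is exposed as a parameter.

```python
# Illustrative sketch; not taken from the IGC sources.
DST_TYPE_BYTES = {"HF": 2, "F": 4, "W": 2, "UW": 2, "D": 4, "UD": 4}


def dst_layout(exec_size, dst_type, enabled="RGBA", grf_bytes=32):
    """Return (starting GRF per channel, total GRFs, undefined bytes per channel).

    ASSUMPTION: grf_bytes=32 is only a placeholder default.
    """
    used = exec_size * DST_TYPE_BYTES[dst_type]
    grfs_per_channel = -(-used // grf_bytes)  # ceiling division
    undefined_tail = grfs_per_channel * grf_bytes - used
    start = {}
    next_grf = 0
    for ch in "RGBA":  # results are laid out in RGBA order
        if ch in enabled:  # disabled channels take no space
            start[ch] = next_grf
            next_grf += grfs_per_channel
    return start, next_grf, undefined_tail
```

With these assumptions, 16 elements of type F occupy two registers per channel, while 8 elements of type HF use half of one register and leave the other 16 bytes undefined.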

For all operations, if <pixel_null_mask> is set, an additional GRF is returned after the sampler data, with <exec_size> bits in the first DWord containing the pixel null mask values. In this field, the bit for each pixel is set to 1, except for pixels where a null page was the source for at least one texel.
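
As a minimal sketch of how the pixel null mask might be interpreted, the helper below lists the pixels whose bit is 0, meaning at least one of their texels came from a null page. The function name is hypothetical, and the mapping of bit i to pixel i (least significant bit first) is an assumption not stated in this document.

```python
def null_page_pixels(mask_dword, exec_size):
    """Return indices of pixels that sourced at least one texel from a null page.

    ASSUMPTION: bit i of the first DWord corresponds to pixel i.
    """
    return [i for i in range(exec_size) if not (mask_dword >> i) & 1]
```

For instance, with an execution size of 16 and a mask of `0xFFFB`, only pixel 2 is reported.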


Extra parameters (after NumParams) for this instruction are required only for certain operations and surface types.

  - **{pre-ICLLP}** All operands must have type F.
  - **{ICLLP+}** All operands must have the same type, which can be either HF or F.
  - It is permitted to skip the trailing parameters; the missing parameters will have the value of 0.

The table below summarizes the additional arguments for each of the sample4 operations.

        +------------------+---------------------------------------------+
        | Operation        | Parameters                                  |
        |                  +------+-----+--------+--------+--------+-----+
        |                  | 0    | 1   | 2      | 3      | 4      | 5   |
        +------------------+------+-----+--------+--------+--------+-----+
        | gather4          | u    | v   | r      | ai     |        |     |
        +------------------+------+-----+--------+--------+--------+-----+
        | gather4_c        | ref  | u   | v      | r      | ai     |     |
        +------------------+------+-----+--------+--------+--------+-----+
        | gather4_po       | u    | v   | offu   | offv   | r      |     |
        +------------------+------+-----+--------+--------+--------+-----+
        | gather4_po_c     | ref  | u   | v      | offu   | offv   | r   |
        +------------------+------+-----+--------+--------+--------+-----+
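
The parameter table and the rule that skipped trailing parameters default to 0 can be sketched as a lookup plus padding step. The names `SAMPLE4_PARAMS` and `expand_params` are hypothetical, and operands are represented by plain Python values purely for illustration.

```python
# Parameter order per operation, transcribed from the table above.
SAMPLE4_PARAMS = {
    "gather4": ("u", "v", "r", "ai"),
    "gather4_c": ("ref", "u", "v", "r", "ai"),
    "gather4_po": ("u", "v", "offu", "offv", "r"),
    "gather4_po_c": ("ref", "u", "v", "offu", "offv", "r"),
}


def expand_params(op, provided):
    """Map supplied operands to parameter names, filling skipped trailing ones with 0."""
    names = SAMPLE4_PARAMS[op]
    if not 1 <= len(provided) <= len(names):
        raise ValueError(f"{op} takes between 1 and {len(names)} parameters")
    padded = list(provided) + [0] * (len(names) - len(provided))
    return dict(zip(names, padded))
```

Calling `expand_params("gather4_c", ["ref0", "u0", "v0"])` would leave `r` and `ai` at 0, matching the trailing-parameter rule.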