/*========================== begin_copyright_notice ============================
Copyright (C) 2022 Intel Corporation
SPDX-License-Identifier: MIT
============================= end_copyright_notice ===========================*/
#include "Compiler/Optimizer/RuntimeValueVectorExtractPass.h"
#include "Compiler/IGCPassSupport.h"
#include "GenISAIntrinsics/GenIntrinsicInst.h"
#include "common/LLVMWarningsPush.hpp"
#include <llvmWrapper/IR/DerivedTypes.h>
#include "common/LLVMWarningsPop.hpp"
using namespace llvm;
using namespace IGC;
#define PASS_FLAG "igc-runtimevalue-vector-extract-pass"
#define PASS_DESCRIPTION "Shader extract element from vector of constants optimization"
#define PASS_CFG_ONLY false
#define PASS_ANALYSIS false
IGC_INITIALIZE_PASS_BEGIN(RuntimeValueVectorExtractPass, PASS_FLAG, PASS_DESCRIPTION, PASS_CFG_ONLY, PASS_ANALYSIS)
IGC_INITIALIZE_PASS_END(RuntimeValueVectorExtractPass, PASS_FLAG, PASS_DESCRIPTION, PASS_CFG_ONLY, PASS_ANALYSIS)
namespace IGC
{
char RuntimeValueVectorExtractPass::ID = 0;
////////////////////////////////////////////////////////////////////////////
RuntimeValueVectorExtractPass::RuntimeValueVectorExtractPass() :
llvm::FunctionPass(ID),
changed(false)
{
initializeRuntimeValueVectorExtractPassPass(*llvm::PassRegistry::getPassRegistry());
}
////////////////////////////////////////////////////////////////////////////
void RuntimeValueVectorExtractPass::getAnalysisUsage(llvm::AnalysisUsage& AU) const
{
AU.setPreservesCFG();
}
////////////////////////////////////////////////////////////////////////////
bool RuntimeValueVectorExtractPass::runOnFunction(llvm::Function& F)
{
changed = false;
visit(F);
return changed;
}
////////////////////////////////////////////////////////////////////////////
// @brief Converts extractelement instructions that read from a vector
// RuntimeValue call into scalar RuntimeValue calls that return the
// requested element directly.
// Only extracts with constant indices from RuntimeValue calls with
// constant offsets are converted.
// Only 32-bit and 64-bit RuntimeValue elements are supported at the moment.
//
// Replace:
// %0 = call <8 x i32> @llvm.genx.GenISA.RuntimeValue.v8i32(i32 4)
// %scalar = extractelement <8 x i32> %0, i32 0
// %scalar1 = extractelement <8 x i32> %0, i32 1
// with:
// %scalar = call i32 @llvm.genx.GenISA.RuntimeValue.i32(i32 4)
// %scalar1 = call i32 @llvm.genx.GenISA.RuntimeValue.i32(i32 5)
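//
// Worked example for the 64-bit case. This is a sketch: the .v4i64 and .i64
// intrinsic name suffixes are assumed by analogy with the 32-bit names above.
// The offset computation below treats offsets as counted in 32-bit units, so
// each 64-bit element advances the offset by 2 (here 4 + 1 * 2 = 6):
// %0 = call <4 x i64> @llvm.genx.GenISA.RuntimeValue.v4i64(i32 4)
// %scalar = extractelement <4 x i64> %0, i32 1
// would become:
// %scalar = call i64 @llvm.genx.GenISA.RuntimeValue.i64(i32 6)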
void RuntimeValueVectorExtractPass::visitExtractElementInst(llvm::ExtractElementInst& I)
{
// The optimization applies only to extracts with a constant index
if (isa<ConstantInt>(I.getIndexOperand()))
{
GenIntrinsicInst* GII = dyn_cast<GenIntrinsicInst>(I.getVectorOperand());
if (GII &&
GII->getIntrinsicID() == GenISAIntrinsic::GenISA_RuntimeValue &&
isa<ConstantInt>(GII->getOperand(0)) &&
isa<IGCLLVM::FixedVectorType>(GII->getType()))
{
IGCLLVM::FixedVectorType* giiVectorType = cast<IGCLLVM::FixedVectorType>(GII->getType());
// Only 32-bit and 64-bit values are supported at the moment
const auto elementSizeInBits = giiVectorType->getElementType()->getPrimitiveSizeInBits();
if (elementSizeInBits == 32 ||
elementSizeInBits == 64)
{
const bool is64bit = elementSizeInBits == 64;
IRBuilder<> Builder(&I);
Function* runtimeValueFunc = GenISAIntrinsic::getDeclaration(I.getModule(),
GenISAIntrinsic::GenISA_RuntimeValue,
giiVectorType->getElementType());
const uint32_t eeiIndex = int_cast<uint32_t>(cast<ConstantInt>(I.getIndexOperand())->getZExtValue());
const uint32_t giiOffset = int_cast<uint32_t>(cast<ConstantInt>(GII->getOperand(0))->getZExtValue());
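// Calculate the new offset. Offsets are counted in 32-bit units here,
// so each 64-bit element advances the offset by two.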
const uint32_t offset = giiOffset + (is64bit ? eeiIndex * 2 : eeiIndex);
Value* CI = Builder.CreateCall(runtimeValueFunc, Builder.getInt32(offset));
I.replaceAllUsesWith(CI);
I.eraseFromParent();
changed = true;
}
else
{
IGC_ASSERT_MESSAGE(0, "Only 32-bit and 64-bit values are supported at the moment");
}
}
}
}
}