This is an archive of the discontinued LLVM Phabricator instance.

Differential D143578

[VP] Add vp.powi and a pass for expanding vp.powi before DAG.
Needs Review · Public

Authored by fakepaper56 on Feb 8 2023, 6:09 AM.

Details

Reviewers

craig.topper
reames
frasercrmck
rogfer01
simoll

Summary

The patch expands vp.powi differently from powi.
Vector powi is unrolled into multiple powi() library calls in SelectionDAG, but that
method does not work for scalable vectors.
To support scalable vectors, the patch expands vp.powi at the IR level. The
expansion of vp.powi is based on compiler-rt/__powidf2.
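
For reference, the scalar square-and-multiply loop that the summary refers to can be sketched as below. This is a minimal sketch modeled on the loop quoted in the patch's own source comments; the name `powi_sketch` is hypothetical and the actual compiler-rt routine may differ in detail.

```cpp
// Hypothetical name; a scalar model of the loop described for __powidf2.
double powi_sketch(double a, int b) {
  const bool recip = b < 0;  // remember the sign of the exponent
  double r = 1.0;
  while (true) {
    if (b & 1)               // current low bit set: fold the base into r
      r *= a;
    b /= 2;                  // signed division truncates toward zero
    if (b == 0)
      break;
    a *= a;                  // square the base for the next bit
  }
  return recip ? 1.0 / r : r;
}
```

Because `b /= 2` truncates toward zero, negative exponents also reach zero and the loop terminates for them.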

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

fakepaper56 created this revision. Feb 8 2023, 6:09 AM

Herald added a project: Restricted Project. Feb 8 2023, 6:09 AM

Herald added a subscriber: hiraditya.

fakepaper56 requested review of this revision. Feb 8 2023, 6:09 AM

Herald added a project: Restricted Project. Feb 8 2023, 6:09 AM

Herald added subscribers: llvm-commits, alextsao1999, jdoerfert.

Maybe I missed the rationale, but why not use the ExpandVectorPredicationPass for this?

Harbormaster completed remote builds in B212590: Diff 495819. Feb 8 2023, 7:21 AM

Maybe I missed the rationale, but why not use the ExpandVectorPredicationPass for this?

LLVM cannot currently expand powi on scalable vector types. So this pass is not only for vp.powi; in the future it will also expand powi on scalable vector types.

craig.topper added inline comments. Feb 9 2023, 7:32 PM

llvm/lib/CodeGen/ExpandPowi.cpp
35	expansion*
70	CreatePHI returns a PHINode*, can we use that to avoid casts?
124	support*
157	Why does this require AA?

Address Craig's comment and add missing test case.

Harbormaster completed remote builds in B212951: Diff 496323. Feb 9 2023, 9:36 PM

fakepaper56 marked 3 inline comments as done. Feb 10 2023, 1:17 AM

fakepaper56 added inline comments.

llvm/lib/CodeGen/ExpandPowi.cpp
157	Sorry, that was my mistake.

craig.topper added inline comments. Feb 10 2023, 10:09 PM

llvm/lib/CodeGen/ExpandPowi.cpp
17	GlobalsModRef is probably unneeded?

craig.topper added inline comments. Feb 10 2023, 10:10 PM

llvm/lib/CodeGen/ExpandPowi.cpp
83	What's preventing using vp.icmp?

Cleanup headers.

fakepaper56 marked 2 inline comments as done. Feb 11 2023, 4:09 AM

fakepaper56 added inline comments.

llvm/lib/CodeGen/ExpandPowi.cpp
83	It is only that I don't know how to construct vp.icmp/vp.fcmp instructions.

fakepaper56 added inline comments. Feb 11 2023, 4:18 AM

llvm/lib/CodeGen/ExpandPowi.cpp
83	I don't understand how to turn the predicate into a pointer to `Value`.

Harbormaster completed remote builds in B213209: Diff 496678. Feb 11 2023, 5:09 AM

craig.topper added inline comments. Feb 11 2023, 11:58 AM

llvm/lib/CodeGen/ExpandPowi.cpp

Should be something like this code from IRBuilder with the assert removed.

Value *getConstrainedFPPredicate(CmpInst::Predicate Predicate) {
  assert(CmpInst::isFPPredicate(Predicate) &&
         Predicate != CmpInst::FCMP_FALSE &&
         Predicate != CmpInst::FCMP_TRUE &&
         "Invalid constrained FP comparison predicate!");

  StringRef PredicateStr = CmpInst::getPredicateName(Predicate);
  auto *PredicateMDS = MDString::get(Context, PredicateStr);

  return MetadataAsValue::get(Context, PredicateMDS);
}

Use vp.icmp instead of icmp.

fakepaper56 marked an inline comment as done. Feb 11 2023, 11:50 PM

fakepaper56 added inline comments.

llvm/lib/CodeGen/ExpandPowi.cpp
83	Thank you for the recommendation.

Harbormaster completed remote builds in B213265: Diff 496743. Feb 12 2023, 1:18 AM

Rebase and ping.

craig.topper added inline comments. Feb 21 2023, 12:13 AM

llvm/lib/CodeGen/ExpandPowi.cpp
114	old fixme?
132	Drop curly braces.

craig.topper added inline comments. Feb 21 2023, 12:14 AM

llvm/lib/CodeGen/ExpandPowi.cpp
58	why "forward"?

Address Craig's comment.

fakepaper56 marked 3 inline comments as done. Feb 21 2023, 12:28 AM

fakepaper56 added inline comments.

llvm/lib/CodeGen/ExpandPowi.cpp
58	Sorry, it didn't make sense. I changed it to powi-expansion-loop.

Harbormaster completed remote builds in B214939: Diff 499057. Feb 21 2023, 1:32 AM

In D143578#4113149, @fakepaper56 wrote:

Maybe I missed the rationale, but why not use the ExpandVectorPredicationPass for this?

LLVM cannot currently expand powi on scalable vector types. So this pass is not only for vp.powi; in the future it will also expand powi on scalable vector types.

Apologies, I was away on holiday.

Thanks - I missed that the plan was also to support llvm.powi. I guess I just find ExpandPowi and ExpandVectorPredicationPass to be doing two very similar things (in this patch) with regards to vp.powi: expanding it into an equivalent set of operations; that seems unfortunate.

I get that scalable-vector llvm.powi is different, but so would many other scalable-vector intrinsics if the target doesn't support that operation: llvm.sin, llvm.cos, etc. So would we have passes for each intrinsic? If not, ExpandPowi seems too restrictive in its scope.

If we're supporting intrinsics, what about plain scalable-vector add on a target without scalable vectors, like x86?

I'd basically like to know how this fits in with some longer-term strategy about what we want to support for illegal scalable-vector operations, rather than this specific powi use-case. If we start to open the door to specific intrinsics, I think it'd help to have a well-defined rationale and plan in mind.

In D143578#4140944, @frasercrmck wrote:

I get that scalable-vector llvm.powi is different, but so would many other scalable-vector intrinsics if the target doesn't support that operation: llvm.sin, llvm.cos, etc. So would we have passes for each intrinsic? If not, ExpandPowi seems too restrictive in its scope.

If we're supporting intrinsics, what about plain scalable-vector add on a target without scalable vectors, like x86?

I agree that expanding only powi is too restrictive. I think we should at least expand all the math functions in one pass. But I have no idea whether we should expand scalable operations for targets without scalable vectors.

In D143578#4140944, @frasercrmck wrote:

In D143578#4113149, @fakepaper56 wrote:

Maybe I missed the rationale, but why not use the ExpandVectorPredicationPass for this?

LLVM cannot currently expand powi on scalable vector types. So this pass is not only for vp.powi; in the future it will also expand powi on scalable vector types.

Apologies, I was away on holiday.

Thanks - I missed that the plan was also to support llvm.powi. I guess I just find ExpandPowi and ExpandVectorPredicationPass to be doing two very similar things (in this patch) with regards to vp.powi: expanding it into an equivalent set of operations; that seems unfortunate.

I get that scalable-vector llvm.powi is different, but so would many other scalable-vector intrinsics if the target doesn't support that operation: llvm.sin, llvm.cos, etc. So would we have passes for each intrinsic? If not, ExpandPowi seems too restrictive in its scope.

If we're supporting intrinsics, what about plain scalable-vector add on a target without scalable vectors, like x86?

I'd basically like to know how this fits in with some longer-term strategy about what we want to support for illegal scalable-vector operations, rather than this specific powi use-case. If we start to open the door to specific intrinsics, I think it'd help to have a well-defined rationale and plan in mind.

Note that this pass doesn't scalarize

In D143578#4141678, @fakepaper56 wrote:

In D143578#4140944, @frasercrmck wrote:

I get that scalable-vector llvm.powi is different, but so would many other scalable-vector intrinsics if the target doesn't support that operation: llvm.sin, llvm.cos, etc. So would we have passes for each intrinsic? If not, ExpandPowi seems too restrictive in its scope.

If we're supporting intrinsics, what about plain scalable-vector add on a target without scalable vectors, like x86?

I agree that expanding only powi is too restrictive. I think we should at least expand all the math functions in one pass. But I have no idea whether we should expand scalable operations for targets without scalable vectors.

How would we expand the other math functions? Many of them are large and probably difficult to keep in vector form. We could scalarize them with a loop and use scalar libcalls. But that makes it very different than what we're doing for powi here.

How do you envision sharing this code for llvm.powi? A lot of this code creates VP intrinsics. Do you have an abstraction plan?

In D143578#4142322, @craig.topper wrote:

How do you envision sharing this code for llvm.powi? A lot of this code creates VP intrinsics. Do you have an abstraction plan?

My plan is to use the same expansion function, but with an all-true mask as its mask and the element count as its EVL.
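
The idea of reusing the predicated expansion for the unpredicated intrinsic can be illustrated with a small model in plain C++ rather than the LLVM API. This is only a sketch: `vp_fmul_model` and `fmul_model` are hypothetical names, and disabled lanes are set to 0.0 as a stand-in for lanes whose value is undefined.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical model of a predicated multiply: a lane is computed only if
// its index is below evl and its mask bit is set.
std::vector<double> vp_fmul_model(const std::vector<double> &a,
                                  const std::vector<double> &b,
                                  const std::vector<bool> &mask,
                                  std::size_t evl) {
  assert(a.size() == b.size() && a.size() == mask.size());
  std::vector<double> out(a.size(), 0.0);  // 0.0 marks lanes left undefined
  for (std::size_t i = 0; i < a.size() && i < evl; ++i)
    if (mask[i])
      out[i] = a[i] * b[i];
  return out;
}

// The unpredicated operation expressed through the predicated one:
// an all-true mask and an EVL equal to the element count.
std::vector<double> fmul_model(const std::vector<double> &a,
                               const std::vector<double> &b) {
  return vp_fmul_model(a, b, std::vector<bool>(a.size(), true), a.size());
}
```

With an all-true mask and a full EVL, every lane is active, so the predicated path computes exactly what the plain operation would.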

Also expand llvm.powi.

Harbormaster completed remote builds in B216934: Diff 501809. Mar 2 2023, 3:11 AM

No test for RISC-V?

llvm/test/CodeGen/Generic/expand-powi.ll
3	This needs a `REQUIRES: x86-registered-target` or it needs to be moved into the X86 directory.

craig.topper added inline comments. Mar 7 2023, 9:54 PM

llvm/lib/CodeGen/ExpandPowi.cpp
128	I think we should do a vp_icmp followed by a mask vp_reduce_or.
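
One possible reading of this suggestion, modeled in plain C++ rather than with the LLVM API: compare each exponent lane against zero first, then OR-reduce the resulting boolean lanes under the mask and EVL, so that lanes which are masked off cannot keep the loop running. The function name and the use of `std::vector` are assumptions made purely for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical model of a lane-wise "not equal to zero" compare followed by
// a masked OR reduction: true while any active lane has a nonzero exponent.
bool any_active_lane_nonzero(const std::vector<int> &exp,
                             const std::vector<bool> &mask,
                             std::size_t evl) {
  assert(exp.size() == mask.size());
  bool any = false;  // start value of the OR reduction
  for (std::size_t i = 0; i < exp.size() && i < evl; ++i) {
    const bool lane_ne_zero = exp[i] != 0;  // per-lane compare
    if (mask[i])                            // inactive lanes do not contribute
      any = any || lane_ne_zero;
  }
  return any;
}
```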

All the existing tests for llvm.powi use a scalar exponent even when the result is a vector. Should vp.powi only accept scalar exponent?

I think we should follow the rules of llvm.powi first.
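
With a scalar exponent, every lane shares the same loop trip count, so the square-and-multiply loop needs no per-lane exit test. A minimal sketch of that shape in plain C++ follows; the name `vector_powi_scalar_exp` is hypothetical, and the mask and EVL are left out for brevity.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical model: vector base, one scalar exponent shared by all lanes.
std::vector<double> vector_powi_scalar_exp(std::vector<double> base, int exp) {
  const bool recip = exp < 0;
  std::vector<double> res(base.size(), 1.0);
  while (true) {
    if (exp & 1) {
      // Low bit set: multiply every lane of the result by the current base.
      for (std::size_t i = 0; i < base.size(); ++i)
        res[i] *= base[i];
    }
    exp /= 2;  // signed division, truncates toward zero
    if (exp == 0)
      break;
    for (double &b : base)  // square every lane of the base
      b *= b;
  }
  if (recip) {
    for (double &r : res)
      r = 1.0 / r;
  }
  return res;
}
```

The loop control depends only on the scalar `exp`, which is why no reduction over lanes is required in this form.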

This update does the following:

Make vp.powi follow llvm.powi and accept only a scalar exponent.
Add tests for RISC-V.
Update test cases.

However, one IR unit test still fails. I don't know how to debug it; I cannot
even use gdb to trace it.
The command below reproduces the failure.

$ LLVM_SYMBOLIZER_PATH=./build/bin/llvm-symbolizer ./build/unittests/IR/./IRTests
...
[ RUN      ] VPIntrinsicTest.VPIntrinsicDeclarationForParams
IRTests: /home/yeting/x86-riscv-llvm/llvm/include/llvm/ADT/ArrayRef.h:255: const T& llvm::ArrayRef<T>::operator[](size_t) const [with T = llvm::Type*; size_t = long unsigned int]: Assertion `Index < Length && "Invalid index!"' failed.
 #0 0x00005620c5f909ee llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) /home/yeting/x86-riscv-llvm/llvm/lib/Support/Unix/Signals.inc:567:22
 #1 0x00005620c5f90dc0 PrintStackTraceSignalHandler(void*) /home/yeting/x86-riscv-llvm/llvm/lib/Support/Unix/Signals.inc:641:1
 #2 0x00005620c5f8e4fe llvm::sys::RunSignalHandlers() /home/yeting/x86-riscv-llvm/llvm/lib/Support/Signals.cpp:104:20
 #3 0x00005620c5f9033f SignalHandler(int) /home/yeting/x86-riscv-llvm/llvm/lib/Support/Unix/Signals.inc:412:1
 #4 0x00007f3c933e0980 __restore_rt (/lib/x86_64-linux-gnu/libpthread.so.0+0x12980)
 #5 0x00007f3c91d92e87 raise /build/glibc-uZu3wS/glibc-2.27/signal/../sysdeps/unix/sysv/linux/raise.c:51:0
 #6 0x00007f3c91d947f1 abort /build/glibc-uZu3wS/glibc-2.27/stdlib/abort.c:81:0
 #7 0x00007f3c91d843fa __assert_fail_base /build/glibc-uZu3wS/glibc-2.27/assert/assert.c:89:0
 #8 0x00007f3c91d84472 (/lib/x86_64-linux-gnu/libc.so.6+0x30472)
 #9 0x00005620c5b9b1c6 llvm::ArrayRef<llvm::Type*>::operator[](unsigned long) const /home/yeting/x86-riscv-llvm/llvm/include/llvm/ADT/ArrayRef.h:256:14
#10 0x00005620c5cde47b DecodeFixedType(llvm::ArrayRef<llvm::Intrinsic::IITDescriptor>&, llvm::ArrayRef<llvm::Type*>, llvm::LLVMContext&) /home/yeting/x86-riscv-llvm/llvm/lib/IR/Function.cpp:1401:37
#11 0x00005620c5cdeae6 llvm::Intrinsic::getType(llvm::LLVMContext&, unsigned int, llvm::ArrayRef<llvm::Type*>) /home/yeting/x86-riscv-llvm/llvm/lib/IR/Function.cpp:1480:21
#12 0x00005620c5cf70c8 llvm::Intrinsic::getDeclaration(llvm::Module*, unsigned int, llvm::ArrayRef<llvm::Type*>) /home/yeting/x86-riscv-llvm/llvm/lib/IR/Function.cpp:1505:21
#13 0x00005620c5d49863 llvm::VPIntrinsic::getDeclarationForParams(llvm::Module*, unsigned int, llvm::Type*, llvm::ArrayRef<llvm::Value*>) /home/yeting/x86-riscv-llvm/llvm/lib/IR/IntrinsicInst.cpp:594:39
#14 0x00005620c56b2283 (anonymous namespace)::VPIntrinsicTest_VPIntrinsicDeclarationForParams_Test::TestBody() /home/yeting/x86-riscv-llvm/llvm/unittests/IR/VPIntrinsicTest.cpp:367:72

Herald added subscribers: luke, kosarev, pcwang-thead and 24 others. Mar 15 2023, 7:50 AM

craig.topper added inline comments. Mar 15 2023, 8:31 AM

llvm/docs/LangRef.rst
19934 ↗	(On Diff #505490)	`Predicated version of raising a vector of floating-point values to an integer power.`

Fixed crash by adding special case in llvm::VPIntrinsic::getDeclarationForParams

Harbormaster completed remote builds in B219650: Diff 505511. Mar 15 2023, 9:56 AM

In D143578#4142322, @craig.topper wrote:

How would we expand the other math functions? Many of them are large and probably difficult to keep in vector form. We could scalarize them with a loop and use scalar libcalls.

I want to second this point. I think doing the fancy expansion here is a bad idea at this time. We can come back to that, but an initial implementation should scalarize via a loop. The lowering works for all of the lane-wise math routines. Only once we have correct lowering for the majority of the routines should we bother optimizing any of them.

Even then, I'm not convinced that inlining this loop is profitable over generating a runtime call to a new routine.

llvm/lib/CodeGen/ExpandPowi.cpp
36	This appears to correspond to the recently introduced IRBuilder::CreateElementCount.
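
The loop-based scalarization proposed in the comment above can be modeled in plain C++ as follows. This is a sketch under assumptions, not the lowering itself: `scalarize_lanes` is a hypothetical name, the scalar routine is passed as a function pointer to stand in for a scalar libcall, and inactive lanes are set to 0.0 as a placeholder for undefined values.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: apply a scalar routine to each active lane in a loop.
std::vector<double> scalarize_lanes(const std::vector<double> &in,
                                    const std::vector<bool> &mask,
                                    std::size_t evl,
                                    double (*scalar_fn)(double)) {
  assert(in.size() == mask.size());
  std::vector<double> out(in.size(), 0.0);  // 0.0 stands in for undefined lanes
  for (std::size_t i = 0; i < in.size() && i < evl; ++i)
    if (mask[i])
      out[i] = scalar_fn(in[i]);  // one scalar call per active lane
  return out;
}
```

The same loop shape serves any lane-wise routine, since only `scalar_fn` changes.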

In D143578#4197721, @reames wrote:

In D143578#4142322, @craig.topper wrote:

How would we expand the other math functions? Many of them are large and probably difficult to keep in vector form. We could scalarize them with a loop and use scalar libcalls.

I want to second this point. I think doing the fancy expansion here is a bad idea at this time. We can come back to that, but an initial implementation should scalarize via a loop. The lowering works for all of the lane-wise math routines. Only once we have correct lowering for the majority of the routines should we bother optimizing any of them.

Even then, I'm not convinced that inlining this loop is profitable over generating a runtime call to a new routine.

I want to mention that powi is weird and does not correspond to a real math routine. It's a fast math optimization for pow with an integer argument. The scalar version of powi is provided in libgcc/compiler-rt while pow itself is in libm. This almost makes it a compiler implementation detail. Should a vector math library provide this function?

In D143578#4197800, @craig.topper wrote:

Even then, I'm not convinced that inlining this loop is profitable over generating a runtime call to a new routine.

I want to mention that powi is weird and does not correspond to a real math routine. It's a fast math optimization for pow with an integer argument. The scalar version of powi is provided in libgcc/compiler-rt while pow itself is in libm. This almost makes it a compiler implementation detail. Should a vector math library provide this function?

One of the options which was mentioned in the recent compiler-rt thread on discourse was to have a weak definition defined in each object file so that the linker could pick one (including the runtime libs if available). I'd lean towards something like that.
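
As a rough illustration of the weak-definition idea, assuming a GCC- or Clang-compatible toolchain: each object file could carry a weak definition like the one below, and the linker would keep a single copy or prefer a strong definition if a runtime library supplies one. The symbol name `powi_fallback_sketch` is hypothetical and is not a name used by compiler-rt or libgcc.

```cpp
// Hypothetical fallback symbol. The weak attribute (a GCC/Clang extension)
// allows multiple object files to define it without a duplicate-symbol error.
extern "C" __attribute__((weak)) double powi_fallback_sketch(double a, int b) {
  const bool recip = b < 0;
  double r = 1.0;
  while (true) {
    if (b & 1)
      r *= a;
    b /= 2;  // truncates toward zero, so negative exponents terminate
    if (b == 0)
      break;
    a *= a;
  }
  return recip ? 1.0 / r : r;
}
```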

Use CreateElementCount and fix typos in LangRef.rst.

Harbormaster completed remote builds in B219806: Diff 505725. Mar 16 2023, 3:09 AM

In D143578#4197817, @reames wrote:

One of the options which was mentioned in the recent compiler-rt thread on discourse was to have a weak definition defined in each object file so that the linker could pick one (including the runtime libs if available). I'd lean towards something like that.

Could you provide a link to the Discourse thread you mentioned?

Revision Contents

Path	Size
llvm/include/llvm/CodeGen/MachinePassRegistry.def	1 line
llvm/include/llvm/CodeGen/Passes.h	3 lines
llvm/include/llvm/IR/Intrinsics.td	5 lines
llvm/include/llvm/IR/VPIntrinsics.def	4 lines
llvm/include/llvm/InitializePasses.h	1 line
llvm/lib/CodeGen/CMakeLists.txt	1 line
llvm/lib/CodeGen/ExpandPowi.cpp	168 lines
llvm/lib/CodeGen/TargetPassConfig.cpp	1 line
llvm/test/CodeGen/Generic/expand-powi.ll	30 lines
llvm/tools/opt/opt.cpp	2 lines

Diff 499051

llvm/include/llvm/CodeGen/MachinePassRegistry.def

 FUNCTION_PASS("unreachableblockelim", UnreachableBlockElimPass, ())
 FUNCTION_PASS("consthoist", ConstantHoistingPass, ())
 FUNCTION_PASS("replace-with-veclib", ReplaceWithVeclib, ())
 FUNCTION_PASS("partially-inline-libcalls", PartiallyInlineLibCallsPass, ())
 FUNCTION_PASS("ee-instrument", EntryExitInstrumenterPass, (false))
 FUNCTION_PASS("post-inline-ee-instrument", EntryExitInstrumenterPass, (true))
 FUNCTION_PASS("expand-large-div-rem", ExpandLargeDivRemPass, ())
 FUNCTION_PASS("expand-large-fp-convert", ExpandLargeFpConvertPass, ())
+FUNCTION_PASS("expand-powi", ExpandPowiPass, ())
 FUNCTION_PASS("expand-reductions", ExpandReductionsPass, ())
 FUNCTION_PASS("expandvp", ExpandVectorPredicationPass, ())
 FUNCTION_PASS("lowerinvoke", LowerInvokePass, ())
 FUNCTION_PASS("scalarize-masked-mem-intrin", ScalarizeMaskedMemIntrinPass, ())
 FUNCTION_PASS("tlshoist", TLSVariableHoistPass, ())
 FUNCTION_PASS("verify", VerifierPass, ())
 #undef FUNCTION_PASS

llvm/include/llvm/CodeGen/Passes.h

@@ namespace llvm {
   FunctionPass *createExpandVectorPredicationPass();

   // Expands large div/rem instructions.
   FunctionPass *createExpandLargeDivRemPass();

   // Expands large div/rem instructions.
   FunctionPass *createExpandLargeFpConvertPass();

+  // Expands powi instructions.
+  FunctionPass *createExpandPowiPass();
+
   // This pass expands memcmp() to load/stores.
   FunctionPass *createExpandMemCmpPass();

   /// Creates Break False Dependencies pass. \see BreakFalseDeps.cpp
   FunctionPass *createBreakFalseDeps();

   // This pass expands indirectbr instructions.
   FunctionPass *createIndirectBrExpandPass();

llvm/include/llvm/IR/Intrinsics.td

@@ let IntrProperties = [IntrNoMem, IntrNoSync, IntrWillReturn] in {
   def int_vp_rint : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
                                [ LLVMMatchType<0>,
                                  LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
                                  llvm_i32_ty]>;
   def int_vp_nearbyint : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
                                [ LLVMMatchType<0>,
                                  LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
                                  llvm_i32_ty]>;
+  def int_vp_powi : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
+                               [ LLVMMatchType<0>,
+                                 llvm_anyvector_ty,
+                                 LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
+                                 llvm_i32_ty]>;

   // Casts
   def int_vp_trunc : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
                                [ llvm_anyvector_ty,
                                  LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
                                  llvm_i32_ty]>;
   def int_vp_zext : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
                                [ llvm_anyvector_ty,

llvm/include/llvm/IR/VPIntrinsics.def

 // llvm.vp.rint(x,mask,vlen)
 BEGIN_REGISTER_VP(vp_rint, 1, 2, VP_FRINT, -1)
 END_REGISTER_VP(vp_rint, VP_FRINT)

 // llvm.vp.nearbyint(x,mask,vlen)
 BEGIN_REGISTER_VP(vp_nearbyint, 1, 2, VP_FNEARBYINT, -1)
 END_REGISTER_VP(vp_nearbyint, VP_FNEARBYINT)

+// llvm.vp.powi(x, y, mask,vlen)
+BEGIN_REGISTER_VP_INTRINSIC(vp_powi, 2, 3)
+VP_PROPERTY_BINARYOP
+END_REGISTER_VP_INTRINSIC(vp_powi)
 ///// } Floating-Point Arithmetic

 ///// Type Casts {
 // Specialized helper macro for type conversions.
 // <operation>(%x, %mask, %evl).
 #ifdef HELPER_REGISTER_FP_CAST_VP
 #error \
   "The internal helper macro HELPER_REGISTER_FP_CAST_VP is already defined!"

llvm/include/llvm/InitializePasses.h

 void initializeEarlyTailDuplicatePass(PassRegistry&);
 void initializeEdgeBundlesPass(PassRegistry&);
 void initializeEHContGuardCatchretPass(PassRegistry &);
 void initializeEliminateAvailableExternallyLegacyPassPass(PassRegistry&);
 void initializeExpandLargeFpConvertLegacyPassPass(PassRegistry&);
 void initializeExpandLargeDivRemLegacyPassPass(PassRegistry&);
 void initializeExpandMemCmpPassPass(PassRegistry&);
 void initializeExpandPostRAPass(PassRegistry&);
+void initializeExpandPowiLegacyPassPass(PassRegistry &);
 void initializeExpandReductionsPass(PassRegistry&);
 void initializeExpandVectorPredicationPass(PassRegistry &);
 void initializeMakeGuardsExplicitLegacyPassPass(PassRegistry&);
 void initializeExternalAAWrapperPassPass(PassRegistry&);
 void initializeFEntryInserterPass(PassRegistry&);
 void initializeFinalizeISelPass(PassRegistry&);
 void initializeFinalizeMachineBundlesPass(PassRegistry&);
 void initializeFixIrreduciblePass(PassRegistry &);

llvm/lib/CodeGen/CMakeLists.txt

@@ add_llvm_component_library(LLVMCodeGen
   EarlyIfConversion.cpp
   EdgeBundles.cpp
   EHContGuardCatchret.cpp
   ExecutionDomainFix.cpp
   ExpandLargeDivRem.cpp
   ExpandLargeFpConvert.cpp
   ExpandMemCmp.cpp
   ExpandPostRAPseudos.cpp
+  ExpandPowi.cpp
   ExpandReductions.cpp
   ExpandVectorPredication.cpp
   FaultMaps.cpp
   FEntryInserter.cpp
   FinalizeISel.cpp
   FixupStatepointCallerSaved.cpp
   FuncletLayout.cpp
   GCMetadata.cpp

llvm/lib/CodeGen/ExpandPowi.cpp

This file was added.

				//===--- ExpandPowi.cpp - Expand Powi intrinsics ---------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This pass implements IR expansion for powi/vp.powi. The expansion is based on
				// compiler-rt/__powidf2.c.
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/ADT/SmallVector.h"
				#include "llvm/CodeGen/Passes.h"
				#include "llvm/CodeGen/TargetLowering.h"
				#include "llvm/IR/IRBuilder.h"
				craig.topperUnsubmitted Done Reply Inline Actions GlobalsModRef Probably uneeded? craig.topper: GlobalsModRef Probably uneeded?
				#include "llvm/IR/InstIterator.h"
				#include "llvm/IR/Intrinsics.h"
				#include "llvm/IR/PassManager.h"
				#include "llvm/InitializePasses.h"
				#include "llvm/Pass.h"

				#define DEBUG_TYPE "expand-powi"

				using namespace llvm;

				// Helper function to generate Value for CmpInst::Predicate.
				// FIXME: Support createVPCmp in IRBuilderBase.
				static Value *getPredicateValue(LLVMContext &Context,
				CmpInst::Predicate Predicate) {
				StringRef PredicateStr = CmpInst::getPredicateName(Predicate);
				auto *PredicateMDS = MDString::get(Context, PredicateStr);
				return MetadataAsValue::get(Context, PredicateMDS);
				}
				craig.topperUnsubmitted Done Reply Inline Actions expansion* craig.topper: expansion*

				reamesUnsubmitted Not Done Reply Inline Actions This appears to correspond to the recently introduced IRBuilder::CreateElementCount. reames: This appears to correspond to the recently introduced IRBuilder::CreateElementCount.
				// The expansion is based on the c code of compiler-rt/__powidf2.c,
				// const int recip = b < 0;
				// double r = 1;
				// while (1) {
				// if (b & 1)
				// r *= a;
				// b /= 2;
				// if (b == 0)
				// break;
				// a *= a;
				// }
				// return recip ? 1 / r : r;
				static void expandPowi(IntrinsicInst *II) {
				Value *OrigBase = II->getOperand(0);
				Value *OrigExp = II->getOperand(1);
				Value *Mask = II->getOperand(2);
				Value *EVL = II->getOperand(3);

				BasicBlock *PreLoopBB = II->getParent();
				BasicBlock *PostLoopBB = PreLoopBB->splitBasicBlock(II, "powi-post-loop");
				BasicBlock *LoopBody =
				BasicBlock::Create(PreLoopBB->getContext(), "powi-forward-loop",
				craig.topperUnsubmitted Done Reply Inline Actions why "forward"? craig.topper: why "forward"?
				fakepaper56AuthorUnsubmitted Done Reply Inline Actions Sorry, it didn't make sense. I changed it to powi-expansion-loop. fakepaper56: Sorry, it didn't make sense. I changed it to powi-expansion-loop.
				PreLoopBB->getParent(), PostLoopBB);

				IRBuilder<> Builder(PreLoopBB->getTerminator());
				Builder.CreateBr(LoopBody);
				PreLoopBB->getTerminator()->eraseFromParent();

				Type *BaseTy = OrigBase->getType();
				Type *ExpTy = OrigExp->getType();
				Type *CondTy = ExpTy->getWithNewBitWidth(1);
				Value *True = ConstantInt::get(CondTy, 1);
				LLVMContext &C = II->getContext();

				craig.topperUnsubmitted Done Reply Inline Actions CreatePHI returns a PHINode, can we use that to avoid casts? craig.topper:* CreatePHI returns a PHINode*, can we use that to avoid casts?
				Builder.SetInsertPoint(LoopBody);
				// Create phi of base.
				PHINode *Base = Builder.CreatePHI(BaseTy, 2, "base");
				Base->addIncoming(OrigBase, PreLoopBB);
				// Create phi of exponent.
				PHINode *Exp = Builder.CreatePHI(ExpTy, 2, "exp");
				Exp->addIncoming(OrigExp, PreLoopBB);
				// Create phi of res.
				PHINode *Res = Builder.CreatePHI(BaseTy, 2, "res");
				Res->addIncoming(ConstantFP::get(BaseTy, 1.), PreLoopBB);
				// Res *= Base if Exp is odd.
				Value *Tmp = Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_fmul,
				{Res, Base, True, EVL});
				craig.topperUnsubmitted Not Done Reply Inline Actions What's preventing using vp.icmp? craig.topper: What's preventing using vp.icmp?
				fakepaper56AuthorUnsubmitted Not Done Reply Inline Actions It is only that I don't know how to construct vp.icmp/vp.fcmp instructions. fakepaper56: It is only that I don't know how to construct vp.icmp/vp.fcmp instructions.
				fakepaper56AuthorUnsubmitted Done Reply Inline Actions I don't understand how make predicate to a pointer of `Value`. fakepaper56: I don't understand how make predicate to a pointer of `Value`.
				craig.topperUnsubmitted Done Reply Inline Actions Should be something like this code from IRBuilder with the assert removed. Value getConstrainedFPPredicate(CmpInst::Predicate Predicate) { assert(CmpInst::isFPPredicate(Predicate) && Predicate != CmpInst::FCMP_FALSE && Predicate != CmpInst::FCMP_TRUE && "Invalid constrained FP comparison predicate!"); StringRef PredicateStr = CmpInst::getPredicateName(Predicate); auto PredicateMDS = MDString::get(Context, PredicateStr); return MetadataAsValue::get(Context, PredicateMDS); } craig.topper: Should be something like this code from IRBuilder with the assert removed. ``` Value…
				fakepaper56AuthorUnsubmitted Done Reply Inline Actions Thank you for the recommendation. fakepaper56: Thank you for the recommendation.
				Value *And1 = Builder.CreateIntrinsic(
				ExpTy, Intrinsic::vp_and, {Exp, ConstantInt::get(ExpTy, 1), True, EVL});
				Value *PredicateNE = getPredicateValue(C, CmpInst::ICMP_NE);
				Value *IsOdd = Builder.CreateIntrinsic(
				CondTy, Intrinsic::vp_icmp,
				{And1, ConstantInt::get(ExpTy, 0), PredicateNE, True, EVL});
				Value *NewRes = Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_select,
				{IsOdd, Tmp, Res, EVL});
				Res->addIncoming(NewRes, LoopBody);
				// Update Exp.
				Value *NewExp = Builder.CreateIntrinsic(
				ExpTy, Intrinsic::vp_lshr, {Exp, ConstantInt::get(ExpTy, 1), True, EVL});
				Exp->addIncoming(NewExp, LoopBody);
				// Update Base.
				Value *NewBase = Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_fmul,
				{Base, Base, True, EVL});
				Base->addIncoming(NewBase, LoopBody);
				// Check whether the elements of Exp are all zeros.
				Type *ExpScalarTy = ExpTy->getScalarType();
				Value *ScalarZero = ConstantInt::get(ExpScalarTy, 0);
				Value *OrSum = Builder.CreateIntrinsic(ExpScalarTy, Intrinsic::vp_reduce_or,
				{ScalarZero, NewExp, Mask, EVL});
				Builder.CreateCondBr(Builder.CreateICmpEQ(OrSum, ScalarZero), PostLoopBB,
				LoopBody);

				Builder.SetInsertPoint(&PostLoopBB->front());
				// Use reciprocal if power is negative.
				Value *Recip =
				Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_fdiv,
				{ConstantFP::get(BaseTy, 1.), NewRes, Mask, EVL});
				// FIXME: Use vp.icmp.
				craig.topper: old fixme?
				Value *PredicateSLT = getPredicateValue(C, CmpInst::ICMP_SLT);
				Value *IsNegative = Builder.CreateIntrinsic(
				CondTy, Intrinsic::vp_icmp,
				{OrigExp, ConstantInt::get(ExpTy, 0), PredicateSLT, True, EVL});
				Value *Powi = Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_select,
				{IsNegative, Recip, NewRes, EVL});

				II->replaceAllUsesWith(Powi);
				II->eraseFromParent();
				}
				craig.topper: support*

				// TODO: Add cost model to skip small fixed vectors powi.
				static bool runImpl(Function &F) {
				SmallVector<IntrinsicInst *, 4> Replace;
				craig.topper: I think we should do a vp_icmp followed by a masked vp_reduce_or.
				for (auto &I : instructions(F)) {
				if (auto *II = dyn_cast<IntrinsicInst>(&I)) {
				// TODO: Also support llvm.powi.
				if (II->getIntrinsicID() == Intrinsic::vp_powi) {
				craig.topper: Drop curly braces.
				Replace.push_back(II);
				}
				}
				}

				if (Replace.empty())
				return false;

				for (IntrinsicInst *II : Replace)
				expandPowi(II);

				return true;
				}

				namespace {
				class ExpandPowiLegacyPass : public FunctionPass {
				public:
				static char ID;

				ExpandPowiLegacyPass() : FunctionPass(ID) {
				initializeExpandPowiLegacyPassPass(*PassRegistry::getPassRegistry());
				}

				bool runOnFunction(Function &F) override { return runImpl(F); }
				};
				craig.topper: Why does this require AA?
				fakepaper56 (author): Sorry, that was a misuse on my part.
				} // namespace

				char ExpandPowiLegacyPass::ID = 0;
				INITIALIZE_PASS_BEGIN(ExpandPowiLegacyPass, "expand-powi",
				"Expand powi functions", false, false)
				INITIALIZE_PASS_END(ExpandPowiLegacyPass, "expand-powi",
				"Expand powi functions", false, false)

				FunctionPass *llvm::createExpandPowiPass() {
				return new ExpandPowiLegacyPass();
				}
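
For readers following the expansion above, the scalar algorithm it vectorizes can be sketched as below. This is a minimal standalone illustration modeled on the shape of compiler-rt's `__powidf2`, which the summary names as the basis of the expansion. The function name `powiSketch` is hypothetical and not part of the patch. The sketch halves the exponent with signed division, which truncates toward zero, so a negative exponent steps through its magnitude before the reciprocal is taken.

```cpp
// Hypothetical scalar model of exponentiation by squaring, in the style of
// compiler-rt's __powidf2. The name powiSketch is an assumption of this
// example and does not appear in the patch.
double powiSketch(double Base, int Exp) {
  const bool IsNegative = Exp < 0;
  double Res = 1.0;
  while (true) {
    // Res *= Base if Exp is odd.
    if (Exp & 1)
      Res *= Base;
    // Halve the exponent; signed division truncates toward zero.
    Exp /= 2;
    if (Exp == 0)
      break;
    // Square the base for the next exponent bit.
    Base *= Base;
  }
  // Use the reciprocal if the power is negative.
  return IsNegative ? 1.0 / Res : Res;
}
```

Each loop iteration consumes one bit of the exponent, so the loop runs at most as many times as the exponent has bits.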

llvm/lib/CodeGen/TargetPassConfig.cpp

	bool TargetPassConfig::addISelPasses() {
	  if (TM->useEmulatedTLS())
	    addPass(createLowerEmuTLSPass());

	  addPass(createPreISelIntrinsicLoweringPass());
	  PM->add(createTargetTransformInfoWrapperPass(TM->getTargetIRAnalysis()));
	  addPass(createExpandLargeDivRemPass());
	  addPass(createExpandLargeFpConvertPass());
	  addPass(createExpandPowiPass());
	  addIRPasses();
	  addCodeGenPrepare();
	  addPassesToHandleExceptions();
	  addISelPrepare();

	  return addCoreISelPasses();
	}

llvm/test/CodeGen/Generic/expand-powi.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt -mtriple=x86_64-unknown-linux-gnu -expand-powi -S < %s | FileCheck %s
				declare <vscale x 1 x float> @llvm.vp.powi.nxv1f32.nxv1i32(<vscale x 1 x float>, <vscale x 1 x i32>, <vscale x 1 x i1>, i32)
				craig.topper: This needs a `REQUIRES: x86-registered-target` or it needs to be moved into the X86 directory.
				define <vscale x 1 x float> @foo(<vscale x 1 x float> %a, <vscale x 1 x i32> %b, <vscale x 1 x i1> %m, i32 %evl) {
				; CHECK-LABEL: @foo(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: br label [[POWI_FORWARD_LOOP:%.*]]
				; CHECK: powi-forward-loop:
				; CHECK-NEXT: [[BASE:%.*]] = phi <vscale x 1 x float> [ [[A:%.*]], [[ENTRY:%.*]] ], [ [[TMP5:%.*]], [[POWI_FORWARD_LOOP]] ]
				; CHECK-NEXT: [[EXP:%.*]] = phi <vscale x 1 x i32> [ [[B:%.*]], [[ENTRY]] ], [ [[TMP4:%.*]], [[POWI_FORWARD_LOOP]] ]
				; CHECK-NEXT: [[RES:%.*]] = phi <vscale x 1 x float> [ shufflevector (<vscale x 1 x float> insertelement (<vscale x 1 x float> poison, float 1.000000e+00, i64 0), <vscale x 1 x float> poison, <vscale x 1 x i32> zeroinitializer), [[ENTRY]] ], [ [[TMP3:%.*]], [[POWI_FORWARD_LOOP]] ]
				; CHECK-NEXT: [[TMP0:%.*]] = call <vscale x 1 x float> @llvm.vp.fmul.nxv1f32(<vscale x 1 x float> [[RES]], <vscale x 1 x float> [[BASE]], <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL:%.*]])
				; CHECK-NEXT: [[TMP1:%.*]] = call <vscale x 1 x i32> @llvm.vp.and.nxv1i32(<vscale x 1 x i32> [[EXP]], <vscale x 1 x i32> shufflevector (<vscale x 1 x i32> insertelement (<vscale x 1 x i32> poison, i32 1, i64 0), <vscale x 1 x i32> poison, <vscale x 1 x i32> zeroinitializer), <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL]])
				; CHECK-NEXT: [[TMP2:%.*]] = call <vscale x 1 x i1> @llvm.vp.icmp.nxv1i32(<vscale x 1 x i32> [[TMP1]], <vscale x 1 x i32> zeroinitializer, metadata !"ne", <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL]])
				; CHECK-NEXT: [[TMP3]] = call <vscale x 1 x float> @llvm.vp.select.nxv1f32(<vscale x 1 x i1> [[TMP2]], <vscale x 1 x float> [[TMP0]], <vscale x 1 x float> [[RES]], i32 [[EVL]])
				; CHECK-NEXT: [[TMP4]] = call <vscale x 1 x i32> @llvm.vp.lshr.nxv1i32(<vscale x 1 x i32> [[EXP]], <vscale x 1 x i32> shufflevector (<vscale x 1 x i32> insertelement (<vscale x 1 x i32> poison, i32 1, i64 0), <vscale x 1 x i32> poison, <vscale x 1 x i32> zeroinitializer), <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL]])
				; CHECK-NEXT: [[TMP5]] = call <vscale x 1 x float> @llvm.vp.fmul.nxv1f32(<vscale x 1 x float> [[BASE]], <vscale x 1 x float> [[BASE]], <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL]])
				; CHECK-NEXT: [[TMP6:%.*]] = call i32 @llvm.vp.reduce.or.nxv1i32(i32 0, <vscale x 1 x i32> [[TMP4]], <vscale x 1 x i1> [[M:%.*]], i32 [[EVL]])
				; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[TMP6]], 0
				; CHECK-NEXT: br i1 [[TMP7]], label [[POWI_POST_LOOP:%.*]], label [[POWI_FORWARD_LOOP]]
				; CHECK: powi-post-loop:
				; CHECK-NEXT: [[TMP8:%.*]] = call <vscale x 1 x float> @llvm.vp.fdiv.nxv1f32(<vscale x 1 x float> shufflevector (<vscale x 1 x float> insertelement (<vscale x 1 x float> poison, float 1.000000e+00, i64 0), <vscale x 1 x float> poison, <vscale x 1 x i32> zeroinitializer), <vscale x 1 x float> [[TMP3]], <vscale x 1 x i1> [[M]], i32 [[EVL]])
				; CHECK-NEXT: [[TMP9:%.*]] = call <vscale x 1 x i1> @llvm.vp.icmp.nxv1i32(<vscale x 1 x i32> [[B]], <vscale x 1 x i32> zeroinitializer, metadata !"slt", <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL]])
				; CHECK-NEXT: [[TMP10:%.*]] = call <vscale x 1 x float> @llvm.vp.select.nxv1f32(<vscale x 1 x i1> [[TMP9]], <vscale x 1 x float> [[TMP8]], <vscale x 1 x float> [[TMP3]], i32 [[EVL]])
				; CHECK-NEXT: ret <vscale x 1 x float> [[TMP10]]
				;
				entry:
				%0 = call <vscale x 1 x float> @llvm.vp.powi.nxv1f32.nxv1i32(<vscale x 1 x float> %a, <vscale x 1 x i32> %b, <vscale x 1 x i1> %m, i32 %evl)
				ret <vscale x 1 x float> %0
				}
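
The control flow that the CHECK lines describe can be modeled lane by lane in plain C++. The following is a rough sketch, not LLVM API. The name `vpPowiModel` and the use of `std::vector` for vector values are assumptions of this example. The loop body runs on every lane below the explicit vector length, the exit test ORs together only the exponents of lanes enabled by the mask (as `vp.reduce.or` does), and the reciprocal is applied afterwards to enabled lanes with a negative exponent. The sketch also assumes the loop iterates over the exponent's magnitude, whereas the CHECK lines feed the original exponent to `vp.lshr`.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical lane-wise model of the expanded vp.powi loop. Results in lanes
// that are masked off or at index >= EVL are unspecified in this model.
std::vector<float> vpPowiModel(std::vector<float> Base,
                               const std::vector<int32_t> &OrigExp,
                               const std::vector<bool> &Mask,
                               std::size_t EVL) {
  const std::size_t N = Base.size();
  std::vector<uint32_t> Exp(N);
  for (std::size_t I = 0; I < N; ++I) {
    // Assumption of this sketch: work on the magnitude of the exponent.
    const uint32_t U = static_cast<uint32_t>(OrigExp[I]);
    Exp[I] = OrigExp[I] < 0 ? 0u - U : U;
  }
  std::vector<float> Res(N, 1.0f);
  uint32_t OrSum;
  do {
    OrSum = 0;
    for (std::size_t I = 0; I < N && I < EVL; ++I) {
      if (Exp[I] & 1u)
        Res[I] *= Base[I]; // vp.fmul followed by vp.select on the odd bit
      Exp[I] >>= 1;        // vp.lshr
      Base[I] *= Base[I];  // vp.fmul
      if (Mask[I])
        OrSum |= Exp[I];   // vp.reduce.or only sees enabled lanes
    }
  } while (OrSum != 0);    // leave the loop once all enabled exponents are 0
  for (std::size_t I = 0; I < N && I < EVL; ++I)
    if (Mask[I] && OrigExp[I] < 0)
      Res[I] = 1.0f / Res[I]; // vp.fdiv followed by vp.select on the sign
  return Res;
}
```

Because a masked-off lane does not contribute to the reduction, a large exponent in such a lane does not prolong the loop.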

llvm/tools/opt/opt.cpp

  std::vector<StringRef> PassNameExact = {
      "view-regions",
      "view-regions-only",
      "select-optimize",
      "expand-large-div-rem",
      "structurizecfg",
      "fix-irreducible",
      "expand-large-fp-convert",
      "callbrprepare",
      "expand-powi",
  };
  for (const auto &P : PassNamePrefix)
    if (Pass.startswith(P))
      return true;
  for (const auto &P : PassNameContain)
    if (Pass.contains(P))
      return true;
  return llvm::is_contained(PassNameExact, Pass);

int main(int argc, char **argv) {
  initializeTransformUtils(Registry);
  initializeInstCombine(Registry);
  initializeTarget(Registry);
  // For codegen passes, only passes that do IR to IR transformation are
  // supported.
  initializeExpandLargeDivRemLegacyPassPass(Registry);
  initializeExpandLargeFpConvertLegacyPassPass(Registry);
  initializeExpandMemCmpPassPass(Registry);
  initializeExpandPowiLegacyPassPass(Registry);
  initializeScalarizeMaskedMemIntrinLegacyPassPass(Registry);
  initializeSelectOptimizePass(Registry);
  initializeCallBrPreparePass(Registry);
  initializeCodeGenPreparePass(Registry);
  initializeAtomicExpandPass(Registry);
  initializeRewriteSymbolsLegacyPassPass(Registry);
  initializeWinEHPreparePass(Registry);
  initializeDwarfEHPrepareLegacyPassPass(Registry);

Revision Contents

Diff 499051

llvm/include/llvm/CodeGen/MachinePassRegistry.def

llvm/include/llvm/CodeGen/Passes.h

llvm/include/llvm/IR/Intrinsics.td

llvm/include/llvm/IR/VPIntrinsics.def

llvm/include/llvm/InitializePasses.h

llvm/lib/CodeGen/CMakeLists.txt

llvm/lib/CodeGen/ExpandPowi.cpp

llvm/lib/CodeGen/TargetPassConfig.cpp

llvm/test/CodeGen/Generic/expand-powi.ll

llvm/tools/opt/opt.cpp
