This is an archive of the discontinued LLVM Phabricator instance.

Differential D143578

[VP] Add vp.powi and a pass for expanding vp.powi before DAG.
Needs Review · Public

Authored by fakepaper56 on Feb 8 2023, 6:09 AM.

Details

Reviewers

craig.topper
reames
frasercrmck
rogfer01
simoll

Summary

This patch expands vp.powi differently from powi.
SelectionDAG unrolls a vector powi into multiple powi() library calls, but that
approach does not work for scalable vectors.
To support scalable vectors, the patch expands vp.powi at the IR level. The
expansion is based on compiler-rt's __powidf2.
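
For reference, the scalar loop that the expansion follows can be sketched in C++ as below. This is a minimal sketch modeled on the __powidf2 loop quoted in the patch's own file comment; the name powiScalar is hypothetical and not part of the patch.

```cpp
#include <cassert>

// Scalar square-and-multiply in the style of compiler-rt's __powidf2.
// powiScalar is a hypothetical name used only for this illustration.
double powiScalar(double a, int b) {
  const bool recip = b < 0; // negative exponent: invert the result at the end
  double r = 1.0;
  while (true) {
    if (b & 1) // lowest bit set: multiply the current power of a into r
      r *= a;
    b /= 2; // signed division truncates toward zero, so b reaches 0
    if (b == 0)
      break;
    a *= a; // square the base for the next exponent bit
  }
  return recip ? 1.0 / r : r;
}
```

For example, powiScalar(2.0, 10) walks the bits of 10 and returns 1024.0, and powiScalar(2.0, -2) returns 0.25.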

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

	Time	Test
	0 ms	x64 debian > LLVM-Unit.IR/_/IRTests/VPIntrinsicTest::VPModuleComplete
	110 ms	x64 debian > LLVM.CodeGen/AArch64::O0-pipeline.ll
	100 ms	x64 debian > LLVM.CodeGen/AArch64::O3-pipeline.ll
	160 ms	x64 debian > LLVM.CodeGen/AMDGPU::llc-pipeline.ll
	120 ms	x64 debian > LLVM.CodeGen/ARM::O3-pipeline.ll
		View Full Test Results (13 Failed)

Event Timeline

fakepaper56 created this revision. Feb 8 2023, 6:09 AM

Herald added a project: Restricted Project. Feb 8 2023, 6:09 AM

Herald added a subscriber: hiraditya.

fakepaper56 requested review of this revision. Feb 8 2023, 6:09 AM

Herald added a project: Restricted Project. Feb 8 2023, 6:09 AM

Herald added subscribers: llvm-commits, alextsao1999, jdoerfert.

Maybe I missed the rationale, but why not use the ExpandVectorPredicationPass for this?

Harbormaster completed remote builds in B212590: Diff 495819. Feb 8 2023, 7:21 AM

Maybe I missed the rationale, but why not use the ExpandVectorPredicationPass for this?

LLVM cannot currently expand powi on scalable vector types. So this pass is not only for vp.powi; it will also expand scalable-vector powi in the future.

craig.topper added inline comments. Feb 9 2023, 7:32 PM

llvm/lib/CodeGen/ExpandPowi.cpp
35	expansion*
70	CreatePHI returns a PHINode*, can we use that to avoid casts?
124	support*
157	Why does this require AA?

Address Craig's comment and add missing test case.

Harbormaster completed remote builds in B212951: Diff 496323. Feb 9 2023, 9:36 PM

fakepaper56 marked 3 inline comments as done. Feb 10 2023, 1:17 AM

fakepaper56 added inline comments.

llvm/lib/CodeGen/ExpandPowi.cpp
157	Sorry, that was a misuse on my part.

craig.topper added inline comments. Feb 10 2023, 10:09 PM

llvm/lib/CodeGen/ExpandPowi.cpp
17	GlobalsModRef is probably unneeded?

craig.topper added inline comments. Feb 10 2023, 10:10 PM

llvm/lib/CodeGen/ExpandPowi.cpp
83	What's preventing using vp.icmp?

Cleanup headers.

fakepaper56 marked 2 inline comments as done. Feb 11 2023, 4:09 AM

fakepaper56 added inline comments.

llvm/lib/CodeGen/ExpandPowi.cpp
83	It is only that I don't know how to construct vp.icmp/vp.fcmp instructions.

fakepaper56 added inline comments. Feb 11 2023, 4:18 AM

llvm/lib/CodeGen/ExpandPowi.cpp
83	I don't understand how to turn the predicate into a `Value` pointer.

Harbormaster completed remote builds in B213209: Diff 496678. Feb 11 2023, 5:09 AM

craig.topper added inline comments. Feb 11 2023, 11:58 AM

llvm/lib/CodeGen/ExpandPowi.cpp

Should be something like this code from IRBuilder with the assert removed.

Value *getConstrainedFPPredicate(CmpInst::Predicate Predicate) {               
  assert(CmpInst::isFPPredicate(Predicate) &&                                  
         Predicate != CmpInst::FCMP_FALSE &&                                   
         Predicate != CmpInst::FCMP_TRUE &&                                    
         "Invalid constrained FP comparison predicate!");                      
                                                                               
  StringRef PredicateStr = CmpInst::getPredicateName(Predicate);               
  auto *PredicateMDS = MDString::get(Context, PredicateStr);                   
                                                                               
  return MetadataAsValue::get(Context, PredicateMDS);                          
}

Use vp.icmp instead of icmp.

fakepaper56 marked an inline comment as done. Feb 11 2023, 11:50 PM

fakepaper56 added inline comments.

llvm/lib/CodeGen/ExpandPowi.cpp
83	Thank you for the recommendation.

Harbormaster completed remote builds in B213265: Diff 496743. Feb 12 2023, 1:18 AM

Rebase and ping.

craig.topper added inline comments. Feb 21 2023, 12:13 AM

llvm/lib/CodeGen/ExpandPowi.cpp
115	old fixme?
133	Drop curly braces.

craig.topper added inline comments. Feb 21 2023, 12:14 AM

llvm/lib/CodeGen/ExpandPowi.cpp
59	why "forward"?

Address Craig's comment.

fakepaper56 marked 3 inline comments as done. Feb 21 2023, 12:28 AM

fakepaper56 added inline comments.

llvm/lib/CodeGen/ExpandPowi.cpp
59	Sorry, it didn't make sense. I changed it to powi-expansion-loop.

Harbormaster completed remote builds in B214939: Diff 499057. Feb 21 2023, 1:32 AM

In D143578#4113149, @fakepaper56 wrote:

Maybe I missed the rationale, but why not use the ExpandVectorPredicationPass for this?

LLVM cannot currently expand powi on scalable vector types. So this pass is not only for vp.powi; it will also expand scalable-vector powi in the future.

Apologies, I was away on holiday.

Thanks - I missed that the plan was also to support llvm.powi. I guess I just find ExpandPowi and ExpandVectorPredicationPass to be doing two very similar things (in this patch) with regards to vp.powi: expanding it into an equivalent set of operations; that seems unfortunate.

I get that scalable-vector llvm.powi is different, but so would many other scalable-vector intrinsics if the target doesn't support that operation: llvm.sin, llvm.cos, etc. So would we have passes for each intrinsic? If not, ExpandPowi seems too restrictive in its scope.

If we're supporting intrinsics, what about plain scalable-vector add on a target without scalable vectors, like x86?

I'd basically like to know how this fits in with some longer-term strategy about what we want to support for illegal scalable-vector operations, rather than this specific powi use-case. If we start to open the door to specific intrinsics, I think it'd help to have a well-defined rationale and plan in mind.

In D143578#4140944, @frasercrmck wrote:

I get that scalable-vector llvm.powi is different, but so would many other scalable-vector intrinsics if the target doesn't support that operation: llvm.sin, llvm.cos, etc. So would we have passes for each intrinsic? If not, ExpandPowi seems too restrictive in its scope.

If we're supporting intrinsics, what about plain scalable-vector add on a target without scalable vectors, like x86?

I agree with you that expanding only powi is too restrictive. I think we should at least expand all the math functions in one pass. But I have no idea whether we should expand scalable operations for targets without scalable vectors.

In D143578#4140944, @frasercrmck wrote:

In D143578#4113149, @fakepaper56 wrote:

Maybe I missed the rationale, but why not use the ExpandVectorPredicationPass for this?

LLVM cannot currently expand powi on scalable vector types. So this pass is not only for vp.powi; it will also expand scalable-vector powi in the future.

Apologies, I was away on holiday.

Thanks - I missed that the plan was also to support llvm.powi. I guess I just find ExpandPowi and ExpandVectorPredicationPass to be doing two very similar things (in this patch) with regards to vp.powi: expanding it into an equivalent set of operations; that seems unfortunate.

I get that scalable-vector llvm.powi is different, but so would many other scalable-vector intrinsics if the target doesn't support that operation: llvm.sin, llvm.cos, etc. So would we have passes for each intrinsic? If not, ExpandPowi seems too restrictive in its scope.

If we're supporting intrinsics, what about plain scalable-vector add on a target without scalable vectors, like x86?

I'd basically like to know how this fits in with some longer-term strategy about what we want to support for illegal scalable-vector operations, rather than this specific powi use-case. If we start to open the door to specific intrinsics, I think it'd help to have a well-defined rationale and plan in mind.

Note that this pass doesn't scalarize

In D143578#4141678, @fakepaper56 wrote:

In D143578#4140944, @frasercrmck wrote:

I get that scalable-vector llvm.powi is different, but so would many other scalable-vector intrinsics if the target doesn't support that operation: llvm.sin, llvm.cos, etc. So would we have passes for each intrinsic? If not, ExpandPowi seems too restrictive in its scope.

If we're supporting intrinsics, what about plain scalable-vector add on a target without scalable vectors, like x86?

I agree with you that expanding only powi is too restrictive. I think we should at least expand all the math functions in one pass. But I have no idea whether we should expand scalable operations for targets without scalable vectors.

How would we expand the other math functions? Many of them are large and probably difficult to keep in vector form. We could scalarize them with a loop and use scalar libcalls. But that makes it very different than what we're doing for powi here.

How do you envision sharing this code with llvm.powi? A lot of this code creates VP intrinsics. Do you have an abstraction plan?

In D143578#4142322, @craig.topper wrote:

How do you envision sharing this code with llvm.powi? A lot of this code creates VP intrinsics. Do you have an abstraction plan?

My plan is to use the same expansion function, but with an all-true mask as its mask and the element count as its EVL.
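
This plan can be modeled outside LLVM as follows. It is a hedged, lane-wise sketch in plain C++, not the pass itself: vpPowiModel, powiModel and lanePowi are hypothetical names, the exponent is a single scalar for brevity, and inactive lanes are filled with a 0.0 placeholder purely as a choice of this model.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical per-lane helper: plain repeated multiplication.
static double lanePowi(double a, int b) {
  long long n = b < 0 ? -static_cast<long long>(b) : b;
  double r = 1.0;
  for (long long i = 0; i < n; ++i)
    r *= a;
  return b < 0 ? 1.0 / r : r;
}

// Model of the predicated form: lane i is computed only if mask[i] is set
// and i < evl. Inactive lanes get a 0.0 placeholder (a choice of this model).
std::vector<double> vpPowiModel(const std::vector<double> &base, int exp,
                                const std::vector<bool> &mask,
                                std::size_t evl) {
  assert(mask.size() == base.size() && evl <= base.size());
  std::vector<double> out(base.size(), 0.0);
  for (std::size_t i = 0; i < base.size(); ++i)
    if (i < evl && mask[i])
      out[i] = lanePowi(base[i], exp);
  return out;
}

// Model of the unpredicated form: reuse the same expansion with an
// all-true mask and the element count as the explicit vector length.
std::vector<double> powiModel(const std::vector<double> &base, int exp) {
  const std::vector<bool> allTrue(base.size(), true);
  return vpPowiModel(base, exp, allTrue, base.size());
}
```

The point of the design is that powiModel adds no logic of its own; it only chooses the mask and the vector length before delegating.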

Also expanding llvm.powi.

Harbormaster completed remote builds in B216934: Diff 501809. Mar 2 2023, 3:11 AM

No test for RISC-V?

llvm/test/CodeGen/Generic/expand-powi.ll
3	This needs a `REQUIRES: x86-registered-target` or it needs to be moved into the X86 directory.

craig.topper added inline comments. Mar 7 2023, 9:54 PM

llvm/lib/CodeGen/ExpandPowi.cpp
128	I think we should do a vp_icmp followed by a mask vp_reduce_or.
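
One reading of this suggestion, sketched lane-wise in plain C++ rather than with the real intrinsics: compare every exponent lane against zero, then OR-reduce the resulting i1 lanes over the active lanes only. The function name and the vector-of-exponents signature are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Returns true if the expansion loop should run again, i.e. some active
// lane still has a nonzero exponent. Hypothetical helper, not patch code.
bool anyActiveExpNonZero(const std::vector<int> &exp,
                         const std::vector<bool> &mask, std::size_t evl) {
  assert(mask.size() == exp.size() && evl <= exp.size());
  // Step 1 (models a vp.icmp "ne"): per-lane comparison against zero.
  std::vector<bool> nonZero(exp.size());
  for (std::size_t i = 0; i < exp.size(); ++i)
    nonZero[i] = exp[i] != 0;
  // Step 2 (models a vp.reduce.or on i1 lanes with start value false):
  // only lanes with mask[i] set and i < evl contribute.
  bool any = false;
  for (std::size_t i = 0; i < exp.size(); ++i)
    if (i < evl && mask[i])
      any = any || nonZero[i];
  return any;
}
```

In this model a masked-off lane that still holds a nonzero exponent does not keep the loop running.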

All the existing tests for llvm.powi use a scalar exponent even when the result is a vector. Should vp.powi only accept scalar exponent?

I think we should follow rule of llvm.powi first.

This update does the following:

Make vp.powi follow llvm.powi and accept only a scalar exponent.
Add tests for RISC-V.
Update test cases.

But one IR unit test still fails. I don't know how to debug it; I cannot even
use gdb to trace it.
The command below reproduces the failing test.

$ LLVM_SYMBOLIZER_PATH=./build/bin/llvm-symbolizer ./build/unittests/IR/./IRTests
...
[ RUN      ] VPIntrinsicTest.VPIntrinsicDeclarationForParams
IRTests: /home/yeting/x86-riscv-llvm/llvm/include/llvm/ADT/ArrayRef.h:255: const T& llvm::ArrayRef<T>::operator[](size_t) const [with T = llvm::Type*; size_t = long unsigned int]: Assertion `Index < Length && "Invalid index!"' failed.
 #0 0x00005620c5f909ee llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) /home/yeting/x86-riscv-llvm/llvm/lib/Support/Unix/Signals.inc:567:22
 #1 0x00005620c5f90dc0 PrintStackTraceSignalHandler(void*) /home/yeting/x86-riscv-llvm/llvm/lib/Support/Unix/Signals.inc:641:1
 #2 0x00005620c5f8e4fe llvm::sys::RunSignalHandlers() /home/yeting/x86-riscv-llvm/llvm/lib/Support/Signals.cpp:104:20
 #3 0x00005620c5f9033f SignalHandler(int) /home/yeting/x86-riscv-llvm/llvm/lib/Support/Unix/Signals.inc:412:1
 #4 0x00007f3c933e0980 __restore_rt (/lib/x86_64-linux-gnu/libpthread.so.0+0x12980)
 #5 0x00007f3c91d92e87 raise /build/glibc-uZu3wS/glibc-2.27/signal/../sysdeps/unix/sysv/linux/raise.c:51:0
 #6 0x00007f3c91d947f1 abort /build/glibc-uZu3wS/glibc-2.27/stdlib/abort.c:81:0
 #7 0x00007f3c91d843fa __assert_fail_base /build/glibc-uZu3wS/glibc-2.27/assert/assert.c:89:0
 #8 0x00007f3c91d84472 (/lib/x86_64-linux-gnu/libc.so.6+0x30472)
 #9 0x00005620c5b9b1c6 llvm::ArrayRef<llvm::Type*>::operator[](unsigned long) const /home/yeting/x86-riscv-llvm/llvm/include/llvm/ADT/ArrayRef.h:256:14
#10 0x00005620c5cde47b DecodeFixedType(llvm::ArrayRef<llvm::Intrinsic::IITDescriptor>&, llvm::ArrayRef<llvm::Type*>, llvm::LLVMContext&) /home/yeting/x86-riscv-llvm/llvm/lib/IR/Function.cpp:1401:37
#11 0x00005620c5cdeae6 llvm::Intrinsic::getType(llvm::LLVMContext&, unsigned int, llvm::ArrayRef<llvm::Type*>) /home/yeting/x86-riscv-llvm/llvm/lib/IR/Function.cpp:1480:21
#12 0x00005620c5cf70c8 llvm::Intrinsic::getDeclaration(llvm::Module*, unsigned int, llvm::ArrayRef<llvm::Type*>) /home/yeting/x86-riscv-llvm/llvm/lib/IR/Function.cpp:1505:21
#13 0x00005620c5d49863 llvm::VPIntrinsic::getDeclarationForParams(llvm::Module*, unsigned int, llvm::Type*, llvm::ArrayRef<llvm::Value*>) /home/yeting/x86-riscv-llvm/llvm/lib/IR/IntrinsicInst.cpp:594:39
#14 0x00005620c56b2283 (anonymous namespace)::VPIntrinsicTest_VPIntrinsicDeclarationForParams_Test::TestBody() /home/yeting/x86-riscv-llvm/llvm/unittests/IR/VPIntrinsicTest.cpp:367:72

Herald added subscribers: luke, kosarev, pcwang-thead and 24 others. Mar 15 2023, 7:50 AM

craig.topper added inline comments. Mar 15 2023, 8:31 AM

llvm/docs/LangRef.rst
19934 ↗	(On Diff #505490)	`Predicated version of raising a vector of floating-point values to an integer power.`

Fixed crash by adding special case in llvm::VPIntrinsic::getDeclarationForParams

Harbormaster completed remote builds in B219650: Diff 505511. Mar 15 2023, 9:56 AM

In D143578#4142322, @craig.topper wrote:

How would we expand the other math functions? Many of them are large and probably difficult to keep in vector form. We could scalarize them with a loop and use scalar libcalls.

I want to second this point. I think doing the fancy expansion here is a bad idea at this time. We can come back to that, but an initial implementation should scalarize via a loop. The lowering works for all of the lane-wise math routines. Only once we have correct lowering for the majority of the routines should we bother optimizing any of them.

Even then, I'm not convinced that inlining this loop is profitable over generating a runtime call to a new routine.
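
The scalarize-via-a-loop lowering can be pictured with the following plain C++ model. It is only a sketch under stated assumptions: scalarizeLanewise and sinLanes are hypothetical names, the std::vector stands in for a vector whose length is known only at run time, and the callable stands in for a scalar libcall.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Applies a scalar routine to each lane in turn. The trip count is read at
// run time, which is the property a scalable-vector lowering would need.
template <typename ScalarFn>
std::vector<double> scalarizeLanewise(const std::vector<double> &v,
                                      ScalarFn scalarFn) {
  std::vector<double> out(v.size());
  for (std::size_t lane = 0; lane < v.size(); ++lane)
    out[lane] = scalarFn(v[lane]); // extract lane, call, insert result
  return out;
}

// The same loop works for any lane-wise routine, for example sin.
std::vector<double> sinLanes(const std::vector<double> &v) {
  return scalarizeLanewise(v, [](double x) { return std::sin(x); });
}
```

Because the loop body is just a call, the same skeleton covers every lane-wise routine; only the callee changes.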

llvm/lib/CodeGen/ExpandPowi.cpp
36	This appears to correspond to the recently introduced IRBuilder::CreateElementCount.

In D143578#4197721, @reames wrote:

In D143578#4142322, @craig.topper wrote:

How would we expand the other math functions? Many of them are large and probably difficult to keep in vector form. We could scalarize them with a loop and use scalar libcalls.

I want to second this point. I think doing the fancy expansion here is a bad idea at this time. We can come back to that, but an initial implementation should scalarize via a loop. The lowering works for all of the lane-wise math routines. Only once we have correct lowering for the majority of the routines should we bother optimizing any of them.

Even then, I'm not convinced that inlining this loop is profitable over generating a runtime call to a new routine.

I want to mention that powi is weird and does not correspond to a real math routine. It's a fast math optimization for pow with an integer argument. The scalar version of powi is provided in libgcc/compiler-rt while pow itself is in libm. This almost makes it a compiler implementation detail. Should a vector math library provide this function?

In D143578#4197800, @craig.topper wrote:

Even then, I'm not convinced that inlining this loop is profitable over generating a runtime call to a new routine.

I want to mention that powi is weird and does not correspond to a real math routine. It's a fast math optimization for pow with an integer argument. The scalar version of powi is provided in libgcc/compiler-rt while pow itself is in libm. This almost makes it a compiler implementation detail. Should a vector math library provide this function?

One of the options which was mentioned in the recent compiler-rt thread on discourse was to have a weak definition defined in each object file so that the linker could pick one (including the runtime libs if available). I'd lean towards something like that.

Use CreateElementCount and fix typos in LangRef.rst.

Harbormaster completed remote builds in B219806: Diff 505725. Mar 16 2023, 3:09 AM

In D143578#4197817, @reames wrote:

One of the options which was mentioned in the recent compiler-rt thread on discourse was to have a weak definition defined in each object file so that the linker could pick one (including the runtime libs if available). I'd lean towards something like that.

Could you provide a link to the Discourse thread you mentioned?

Revision Contents

Path	Size
llvm/include/llvm/CodeGen/MachinePassRegistry.def	1 line
llvm/include/llvm/CodeGen/Passes.h	3 lines
llvm/include/llvm/IR/Intrinsics.td	5 lines
llvm/include/llvm/IR/VPIntrinsics.def	4 lines
llvm/include/llvm/InitializePasses.h	1 line
llvm/lib/CodeGen/CMakeLists.txt	1 line
llvm/lib/CodeGen/ExpandPowi.cpp	154 lines
llvm/lib/CodeGen/TargetPassConfig.cpp	1 line
llvm/test/CodeGen/Generic/expand-powi.ll	30 lines
llvm/tools/opt/opt.cpp	2 lines

Diff 496678

llvm/include/llvm/CodeGen/MachinePassRegistry.def

  FUNCTION_PASS("unreachableblockelim", UnreachableBlockElimPass, ())
  FUNCTION_PASS("consthoist", ConstantHoistingPass, ())
  FUNCTION_PASS("replace-with-veclib", ReplaceWithVeclib, ())
  FUNCTION_PASS("partially-inline-libcalls", PartiallyInlineLibCallsPass, ())
  FUNCTION_PASS("ee-instrument", EntryExitInstrumenterPass, (false))
  FUNCTION_PASS("post-inline-ee-instrument", EntryExitInstrumenterPass, (true))
  FUNCTION_PASS("expand-large-div-rem", ExpandLargeDivRemPass, ())
  FUNCTION_PASS("expand-large-fp-convert", ExpandLargeFpConvertPass, ())
+ FUNCTION_PASS("expand-powi", ExpandPowiPass, ())
  FUNCTION_PASS("expand-reductions", ExpandReductionsPass, ())
  FUNCTION_PASS("expandvp", ExpandVectorPredicationPass, ())
  FUNCTION_PASS("lowerinvoke", LowerInvokePass, ())
  FUNCTION_PASS("scalarize-masked-mem-intrin", ScalarizeMaskedMemIntrinPass, ())
  FUNCTION_PASS("tlshoist", TLSVariableHoistPass, ())
  FUNCTION_PASS("verify", VerifierPass, ())
  #undef FUNCTION_PASS

llvm/include/llvm/CodeGen/Passes.h

@@ namespace llvm {
  FunctionPass *createExpandVectorPredicationPass();

  // Expands large div/rem instructions.
  FunctionPass *createExpandLargeDivRemPass();

  // Expands large div/rem instructions.
  FunctionPass *createExpandLargeFpConvertPass();

+ // Expands powi instructions.
+ FunctionPass *createExpandPowiPass();
+
  // This pass expands memcmp() to load/stores.
  FunctionPass *createExpandMemCmpPass();

  /// Creates Break False Dependencies pass. \see BreakFalseDeps.cpp
  FunctionPass *createBreakFalseDeps();

  // This pass expands indirectbr instructions.
  FunctionPass *createIndirectBrExpandPass();

llvm/include/llvm/IR/Intrinsics.td

@@ let IntrProperties = [IntrNoMem, IntrNoSync, IntrWillReturn] in {
  def int_vp_rint : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
      [ LLVMMatchType<0>,
        LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
        llvm_i32_ty]>;
  def int_vp_nearbyint : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
      [ LLVMMatchType<0>,
        LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
        llvm_i32_ty]>;
+ def int_vp_powi : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
+     [ LLVMMatchType<0>,
+       llvm_anyvector_ty,
+       LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
+       llvm_i32_ty]>;

  // Casts
  def int_vp_trunc : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
      [ llvm_anyvector_ty,
        LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
        llvm_i32_ty]>;
  def int_vp_zext : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
      [ llvm_anyvector_ty,

llvm/include/llvm/IR/VPIntrinsics.def

  // llvm.vp.rint(x,mask,vlen)
  BEGIN_REGISTER_VP(vp_rint, 1, 2, VP_FRINT, -1)
  END_REGISTER_VP(vp_rint, VP_FRINT)

  // llvm.vp.nearbyint(x,mask,vlen)
  BEGIN_REGISTER_VP(vp_nearbyint, 1, 2, VP_FNEARBYINT, -1)
  END_REGISTER_VP(vp_nearbyint, VP_FNEARBYINT)

+ // llvm.vp.powi(x, y, mask,vlen)
+ BEGIN_REGISTER_VP_INTRINSIC(vp_powi, 2, 3)
+ VP_PROPERTY_BINARYOP
+ END_REGISTER_VP_INTRINSIC(vp_powi)
  ///// } Floating-Point Arithmetic

  ///// Type Casts {
  // Specialized helper macro for type conversions.
  // <operation>(%x, %mask, %evl).
  #ifdef HELPER_REGISTER_FP_CAST_VP
  #error \
  "The internal helper macro HELPER_REGISTER_FP_CAST_VP is already defined!"

llvm/include/llvm/InitializePasses.h

  void initializeEarlyTailDuplicatePass(PassRegistry&);
  void initializeEdgeBundlesPass(PassRegistry&);
  void initializeEHContGuardCatchretPass(PassRegistry &);
  void initializeEliminateAvailableExternallyLegacyPassPass(PassRegistry&);
  void initializeExpandLargeFpConvertLegacyPassPass(PassRegistry&);
  void initializeExpandLargeDivRemLegacyPassPass(PassRegistry&);
  void initializeExpandMemCmpPassPass(PassRegistry&);
  void initializeExpandPostRAPass(PassRegistry&);
+ void initializeExpandPowiLegacyPassPass(PassRegistry &);
  void initializeExpandReductionsPass(PassRegistry&);
  void initializeExpandVectorPredicationPass(PassRegistry &);
  void initializeMakeGuardsExplicitLegacyPassPass(PassRegistry&);
  void initializeExternalAAWrapperPassPass(PassRegistry&);
  void initializeFEntryInserterPass(PassRegistry&);
  void initializeFinalizeISelPass(PassRegistry&);
  void initializeFinalizeMachineBundlesPass(PassRegistry&);
  void initializeFixIrreduciblePass(PassRegistry &);

llvm/lib/CodeGen/CMakeLists.txt

@@ add_llvm_component_library(LLVMCodeGen
  EarlyIfConversion.cpp
  EdgeBundles.cpp
  EHContGuardCatchret.cpp
  ExecutionDomainFix.cpp
  ExpandLargeDivRem.cpp
  ExpandLargeFpConvert.cpp
  ExpandMemCmp.cpp
  ExpandPostRAPseudos.cpp
+ ExpandPowi.cpp
  ExpandReductions.cpp
  ExpandVectorPredication.cpp
  FaultMaps.cpp
  FEntryInserter.cpp
  FinalizeISel.cpp
  FixupStatepointCallerSaved.cpp
  FuncletLayout.cpp
  GCMetadata.cpp

llvm/lib/CodeGen/ExpandPowi.cpp

This file was added.

				//===--- ExpandPowi.cpp - Expand Powi intrinsics ---------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This pass implements IR expansion for powi/vp.powi. The expansion is based on
				// compiler-rt/__powidf2.c.
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/ADT/SmallVector.h"
				#include "llvm/CodeGen/Passes.h"
				#include "llvm/CodeGen/TargetLowering.h"
				#include "llvm/IR/IRBuilder.h"
				#include "llvm/IR/InstIterator.h"
				#include "llvm/IR/Intrinsics.h"
				#include "llvm/IR/PassManager.h"
				#include "llvm/InitializePasses.h"
				#include "llvm/Pass.h"

				#define DEBUG_TYPE "expand-powi"

				using namespace llvm;

				// The expansion is based on the c code of compiler-rt/__powidf2.c,
				// const int recip = b < 0;
				// double r = 1;
				// while (1) {
				// if (b & 1)
				// r *= a;
				// b /= 2;
				// if (b == 0)
				// break;
				// a *= a;
				// }
				// return recip ? 1 / r : r;
				static void expandPowi(IntrinsicInst *II) {
				Value *OrigBase = II->getOperand(0);
				Value *OrigExp = II->getOperand(1);
				Value *Mask = II->getOperand(2);
				Value *EVL = II->getOperand(3);

				BasicBlock *PreLoopBB = II->getParent();
				BasicBlock *PostLoopBB = PreLoopBB->splitBasicBlock(II, "powi-post-loop");
				BasicBlock *LoopBody =
				BasicBlock::Create(PreLoopBB->getContext(), "powi-forward-loop",
				PreLoopBB->getParent(), PostLoopBB);

				IRBuilder<> Builder(PreLoopBB->getTerminator());
				Builder.CreateBr(LoopBody);
				PreLoopBB->getTerminator()->eraseFromParent();

				Type *BaseTy = OrigBase->getType();
				Type *ExpTy = OrigExp->getType();
				Type *CondTy = ExpTy->getWithNewBitWidth(1);
				Value *True = ConstantInt::get(CondTy, 1);

				Builder.SetInsertPoint(LoopBody);
				// Create phi of base.
				PHINode *Base = Builder.CreatePHI(BaseTy, 2, "base");
				Base->addIncoming(OrigBase, PreLoopBB);
				// Create phi of exponent.
				PHINode *Exp = Builder.CreatePHI(ExpTy, 2, "exp");
				Exp->addIncoming(OrigExp, PreLoopBB);
				// Create phi of res.
				PHINode *Res = Builder.CreatePHI(BaseTy, 2, "res");
				Res->addIncoming(ConstantFP::get(BaseTy, 1.), PreLoopBB);
				// Res *= Base if Exp is odd.
				Value *Tmp = Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_fmul,
				{Res, Base, True, EVL});
				Value *And1 = Builder.CreateIntrinsic(
				ExpTy, Intrinsic::vp_and, {Exp, ConstantInt::get(ExpTy, 1), True, EVL});
				// FIXME: Use vp.icmp.
				Value *IsOdd = Builder.CreateICmpNE(And1, ConstantInt::get(ExpTy, 0));
				Value *NewRes = Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_select,
				{IsOdd, Tmp, Res, EVL});
				Res->addIncoming(NewRes, LoopBody);
				// Update Exp.
				Value *NewExp = Builder.CreateIntrinsic(
				ExpTy, Intrinsic::vp_lshr, {Exp, ConstantInt::get(ExpTy, 1), True, EVL});
				Exp->addIncoming(NewExp, LoopBody);
				// Update Base.
				Value *NewBase = Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_fmul,
				{Base, Base, True, EVL});
				Base->addIncoming(NewBase, LoopBody);
				// Check whether the elements of Exp are all zeros.
				Type *ExpScalarTy = ExpTy->getScalarType();
				Value *ScalarZero = ConstantInt::get(ExpScalarTy, 0);
				Value *OrSum = Builder.CreateIntrinsic(ExpScalarTy, Intrinsic::vp_reduce_or,
				{ScalarZero, NewExp, Mask, EVL});
				Builder.CreateCondBr(Builder.CreateICmpEQ(OrSum, ScalarZero), PostLoopBB,
				LoopBody);

				Builder.SetInsertPoint(&PostLoopBB->front());
				// Use reciprocal if power is negative.
				Value *Recip =
				Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_fdiv,
				{ConstantFP::get(BaseTy, 1.), NewRes, Mask, EVL});
				// FIXME: Use vp.icmp.
				Value *IsNegative =
				Builder.CreateICmpSLT(OrigExp, ConstantInt::get(ExpTy, 0));
				Value *Powi = Builder.CreateIntrinsic(BaseTy, Intrinsic::vp_select,
				{IsNegative, Recip, NewRes, EVL});

				II->replaceAllUsesWith(Powi);
				II->eraseFromParent();
				}

				// TODO: Add cost model to skip small fixed vectors powi.
				static bool runImpl(Function &F) {
				SmallVector<IntrinsicInst *, 4> Replace;
				for (auto &I : instructions(F)) {
				craig.topper (Done): old fixme?
				if (auto *II = dyn_cast<IntrinsicInst>(&I)) {
				// TODO: Also support llvm.powi.
				if (II->getIntrinsicID() == Intrinsic::vp_powi) {
				Replace.push_back(II);
				}
				}
				}

				if (Replace.empty())
				craig.topper (Done): support*
				return false;

				for (IntrinsicInst *II : Replace)
				expandPowi(II);
				craig.topper (Not Done): I think we should do a vp_icmp followed by a masked vp_reduce_or.

				return true;
				}

				namespace {
				craig.topper (Done): Drop curly braces.
				class ExpandPowiLegacyPass : public FunctionPass {
				public:
				static char ID;

				ExpandPowiLegacyPass() : FunctionPass(ID) {
				initializeExpandPowiLegacyPassPass(*PassRegistry::getPassRegistry());
				}

				bool runOnFunction(Function &F) override { return runImpl(F); }
				};
				} // namespace

				char ExpandPowiLegacyPass::ID = 0;
				INITIALIZE_PASS_BEGIN(ExpandPowiLegacyPass, "expand-powi",
				"Expand powi functions", false, false)
				INITIALIZE_PASS_END(ExpandPowiLegacyPass, "expand-powi",
				"Expand powi functions", false, false)

				FunctionPass *llvm::createExpandPowiPass() {
				return new ExpandPowiLegacyPass();
				}
				craig.topper (Done): Why does this require AA?
				fakepaper56 (Author, Done): Sorry, that was my misuse.
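
Note on the expansion above: the loop in `expandPowi` is exponentiation by squaring, applied per lane, with a reciprocal taken at the end for negative exponents. The following is a minimal, self-contained sketch of that idea in plain C++, not the patch's code. The names `powiScalar` and `powiLanes` are hypothetical, the mask and EVL operands are ignored, and the exponent is converted to its unsigned magnitude before shifting, which is an assumption of this sketch rather than something visible in the diff.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical scalar reference: exponentiation by squaring, then a
// reciprocal when the original exponent was negative.
double powiScalar(double Base, int Exp) {
  const bool IsNegative = Exp < 0;
  // Assumption of this sketch: work on the unsigned magnitude so that a
  // logical shift right is well defined (also covers INT_MIN).
  std::uint32_t Mag = IsNegative ? 0u - static_cast<std::uint32_t>(Exp)
                                 : static_cast<std::uint32_t>(Exp);
  double Res = 1.0;
  while (Mag != 0) {
    if (Mag & 1u)
      Res *= Base; // Odd bit set: fold the current power into the result.
    Mag >>= 1;
    Base *= Base;  // Square the base for the next bit.
  }
  return IsNegative ? 1.0 / Res : Res;
}

// Hypothetical lane-wise version mirroring the loop shape in the diff:
// every lane runs the same number of iterations, and the loop exits once
// the OR over all remaining exponents is zero. Lanes that finish early
// keep their result because their low bit is never set again.
// Assumes Base.size() == Exp.size().
std::vector<double> powiLanes(std::vector<double> Base,
                              const std::vector<int> &Exp) {
  const std::size_t NumLanes = Base.size();
  std::vector<std::uint32_t> Mag(NumLanes);
  std::vector<double> Res(NumLanes, 1.0);
  for (std::size_t I = 0; I < NumLanes; ++I)
    Mag[I] = Exp[I] < 0 ? 0u - static_cast<std::uint32_t>(Exp[I])
                        : static_cast<std::uint32_t>(Exp[I]);

  std::uint32_t OrSum = 0;
  do {
    OrSum = 0;
    for (std::size_t I = 0; I < NumLanes; ++I) {
      if (Mag[I] & 1u)
        Res[I] *= Base[I]; // Plays the role of select(IsOdd, Res * Base, Res).
      Mag[I] >>= 1;        // Plays the role of the logical shift right.
      Base[I] *= Base[I];  // Plays the role of the Base * Base update.
      OrSum |= Mag[I];     // Plays the role of the OR reduction.
    }
  } while (OrSum != 0);

  // Post-loop: take the reciprocal in lanes whose exponent was negative.
  for (std::size_t I = 0; I < NumLanes; ++I)
    if (Exp[I] < 0)
      Res[I] = 1.0 / Res[I];
  return Res;
}
```

For example, `powiLanes({2.0, 3.0, 0.5}, {3, 0, -2})` runs two iterations because the largest magnitude is 3, and the lane with exponent 0 keeps its initial result of 1.0 throughout.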

llvm/lib/CodeGen/TargetPassConfig.cpp

	bool TargetPassConfig::addISelPasses() {			bool TargetPassConfig::addISelPasses() {
	if (TM->useEmulatedTLS())			if (TM->useEmulatedTLS())
	addPass(createLowerEmuTLSPass());			addPass(createLowerEmuTLSPass());

	addPass(createPreISelIntrinsicLoweringPass());			addPass(createPreISelIntrinsicLoweringPass());
	PM->add(createTargetTransformInfoWrapperPass(TM->getTargetIRAnalysis()));			PM->add(createTargetTransformInfoWrapperPass(TM->getTargetIRAnalysis()));
	addPass(createExpandLargeDivRemPass());			addPass(createExpandLargeDivRemPass());
	addPass(createExpandLargeFpConvertPass());			addPass(createExpandLargeFpConvertPass());
				addPass(createExpandPowiPass());
	addIRPasses();			addIRPasses();
	addCodeGenPrepare();			addCodeGenPrepare();
	addPassesToHandleExceptions();			addPassesToHandleExceptions();
	addISelPrepare();			addISelPrepare();

	return addCoreISelPasses();			return addCoreISelPasses();
	}			}


llvm/test/CodeGen/Generic/expand-powi.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt -mtriple=x86_64-unknown-linux-gnu -expand-powi -S < %s | FileCheck %s
				declare <vscale x 1 x float> @llvm.vp.powi.nxv1f32.nxv1i32(<vscale x 1 x float>, <vscale x 1 x i32>, <vscale x 1 x i1>, i32)
				craig.topper (Not Done): This needs a `REQUIRES: x86-registered-target` or it needs to be moved into the X86 directory.
				define <vscale x 1 x float> @foo(<vscale x 1 x float> %a, <vscale x 1 x i32> %b, <vscale x 1 x i1> %m, i32 %evl) {
				; CHECK-LABEL: @foo(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: br label [[POWI_FORWARD_LOOP:%.*]]
				; CHECK: powi-forward-loop:
				; CHECK-NEXT: [[BASE:%.*]] = phi <vscale x 1 x float> [ [[A:%.*]], [[ENTRY:%.*]] ], [ [[TMP5:%.*]], [[POWI_FORWARD_LOOP]] ]
				; CHECK-NEXT: [[EXP:%.*]] = phi <vscale x 1 x i32> [ [[B:%.*]], [[ENTRY]] ], [ [[TMP4:%.*]], [[POWI_FORWARD_LOOP]] ]
				; CHECK-NEXT: [[RES:%.*]] = phi <vscale x 1 x float> [ shufflevector (<vscale x 1 x float> insertelement (<vscale x 1 x float> poison, float 1.000000e+00, i64 0), <vscale x 1 x float> poison, <vscale x 1 x i32> zeroinitializer), [[ENTRY]] ], [ [[TMP3:%.*]], [[POWI_FORWARD_LOOP]] ]
				; CHECK-NEXT: [[TMP0:%.*]] = call <vscale x 1 x float> @llvm.vp.fmul.nxv1f32(<vscale x 1 x float> [[RES]], <vscale x 1 x float> [[BASE]], <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL:%.*]])
				; CHECK-NEXT: [[TMP1:%.*]] = call <vscale x 1 x i32> @llvm.vp.and.nxv1i32(<vscale x 1 x i32> [[EXP]], <vscale x 1 x i32> shufflevector (<vscale x 1 x i32> insertelement (<vscale x 1 x i32> poison, i32 1, i64 0), <vscale x 1 x i32> poison, <vscale x 1 x i32> zeroinitializer), <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL]])
				; CHECK-NEXT: [[TMP2:%.*]] = icmp ne <vscale x 1 x i32> [[TMP1]], zeroinitializer
				; CHECK-NEXT: [[TMP3]] = call <vscale x 1 x float> @llvm.vp.select.nxv1f32(<vscale x 1 x i1> [[TMP2]], <vscale x 1 x float> [[TMP0]], <vscale x 1 x float> [[RES]], i32 [[EVL]])
				; CHECK-NEXT: [[TMP4]] = call <vscale x 1 x i32> @llvm.vp.lshr.nxv1i32(<vscale x 1 x i32> [[EXP]], <vscale x 1 x i32> shufflevector (<vscale x 1 x i32> insertelement (<vscale x 1 x i32> poison, i32 1, i64 0), <vscale x 1 x i32> poison, <vscale x 1 x i32> zeroinitializer), <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL]])
				; CHECK-NEXT: [[TMP5]] = call <vscale x 1 x float> @llvm.vp.fmul.nxv1f32(<vscale x 1 x float> [[BASE]], <vscale x 1 x float> [[BASE]], <vscale x 1 x i1> shufflevector (<vscale x 1 x i1> insertelement (<vscale x 1 x i1> poison, i1 true, i64 0), <vscale x 1 x i1> poison, <vscale x 1 x i32> zeroinitializer), i32 [[EVL]])
				; CHECK-NEXT: [[TMP6:%.*]] = call i32 @llvm.vp.reduce.or.nxv1i32(i32 0, <vscale x 1 x i32> [[TMP4]], <vscale x 1 x i1> [[M:%.*]], i32 [[EVL]])
				; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[TMP6]], 0
				; CHECK-NEXT: br i1 [[TMP7]], label [[POWI_POST_LOOP:%.*]], label [[POWI_FORWARD_LOOP]]
				; CHECK: powi-post-loop:
				; CHECK-NEXT: [[TMP8:%.*]] = call <vscale x 1 x float> @llvm.vp.fdiv.nxv1f32(<vscale x 1 x float> shufflevector (<vscale x 1 x float> insertelement (<vscale x 1 x float> poison, float 1.000000e+00, i64 0), <vscale x 1 x float> poison, <vscale x 1 x i32> zeroinitializer), <vscale x 1 x float> [[TMP3]], <vscale x 1 x i1> [[M]], i32 [[EVL]])
				; CHECK-NEXT: [[TMP9:%.*]] = icmp slt <vscale x 1 x i32> [[B]], zeroinitializer
				; CHECK-NEXT: [[TMP10:%.*]] = call <vscale x 1 x float> @llvm.vp.select.nxv1f32(<vscale x 1 x i1> [[TMP9]], <vscale x 1 x float> [[TMP8]], <vscale x 1 x float> [[TMP3]], i32 [[EVL]])
				; CHECK-NEXT: ret <vscale x 1 x float> [[TMP10]]
				;
				entry:
				%0 = call <vscale x 1 x float> @llvm.vp.powi.nxv1f32.nxv1i32(<vscale x 1 x float> %a, <vscale x 1 x i32> %b, <vscale x 1 x i1> %m, i32 %evl)
				ret <vscale x 1 x float> %0
				}

llvm/tools/opt/opt.cpp

//===- opt.cpp - The LLVM Modular Optimizer -------------------------------===//		//===- opt.cpp - The LLVM Modular Optimizer -------------------------------===//
		Lint: clang-format suggested style edits found.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// Optimizations may be specified an arbitrary number of times on the command		// Optimizations may be specified an arbitrary number of times on the command
[...]	std::vector<StringRef> PassNameExact = {
"dot-regions",		"dot-regions",
"dot-regions-only",		"dot-regions-only",
"view-regions",		"view-regions",
"view-regions-only",		"view-regions-only",
"select-optimize",		"select-optimize",
"expand-large-div-rem",		"expand-large-div-rem",
"structurizecfg",		"structurizecfg",
"fix-irreducible",		"fix-irreducible",
		"expand-powi",
"expand-large-fp-convert"		"expand-large-fp-convert"
};		};
for (const auto &P : PassNamePrefix)		for (const auto &P : PassNamePrefix)
if (Pass.startswith(P))		if (Pass.startswith(P))
return true;		return true;
for (const auto &P : PassNameContain)		for (const auto &P : PassNameContain)
if (Pass.contains(P))		if (Pass.contains(P))
return true;		return true;
[...]	int main(int argc, char **argv) {
initializeTransformUtils(Registry);		initializeTransformUtils(Registry);
initializeInstCombine(Registry);		initializeInstCombine(Registry);
initializeTarget(Registry);		initializeTarget(Registry);
// For codegen passes, only passes that do IR to IR transformation are		// For codegen passes, only passes that do IR to IR transformation are
// supported.		// supported.
initializeExpandLargeDivRemLegacyPassPass(Registry);		initializeExpandLargeDivRemLegacyPassPass(Registry);
initializeExpandLargeFpConvertLegacyPassPass(Registry);		initializeExpandLargeFpConvertLegacyPassPass(Registry);
initializeExpandMemCmpPassPass(Registry);		initializeExpandMemCmpPassPass(Registry);
		initializeExpandPowiLegacyPassPass(Registry);
initializeScalarizeMaskedMemIntrinLegacyPassPass(Registry);		initializeScalarizeMaskedMemIntrinLegacyPassPass(Registry);
initializeSelectOptimizePass(Registry);		initializeSelectOptimizePass(Registry);
initializeCodeGenPreparePass(Registry);		initializeCodeGenPreparePass(Registry);
initializeAtomicExpandPass(Registry);		initializeAtomicExpandPass(Registry);
initializeRewriteSymbolsLegacyPassPass(Registry);		initializeRewriteSymbolsLegacyPassPass(Registry);
initializeWinEHPreparePass(Registry);		initializeWinEHPreparePass(Registry);
initializeDwarfEHPrepareLegacyPassPass(Registry);		initializeDwarfEHPrepareLegacyPassPass(Registry);
initializeSafeStackLegacyPassPass(Registry);		initializeSafeStackLegacyPassPass(Registry);

Revision Contents

Diff 496678
