This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Conversion/
-
mlir/
-
Conversion/
-
Passes.td
-
lib/Conversion/MathToFuncs/
-
Conversion/
-
MathToFuncs/
-
CMakeLists.txt
7/13
MathToFuncs.cpp
-
test/Conversion/MathToFuncs/
-
Conversion/
-
MathToFuncs/
-
ctlz.mlir
-
ctlz_runner.mlir
-
utils/bazel/llvm-project-overlay/mlir/
-
bazel/
-
llvm-project-overlay/
-
mlir/
-
BUILD.bazel

Differential D146261

Add support for math.ctlz in convert-math-to-funcs
ClosedPublic

Authored by j2kun on Mar 16 2023, 2:32 PM.

Download Raw Diff

Details

Reviewers

vzakhari
Mogball
nicolasvasilache

Commits

rGbfbccfa17c97: Add support for math.ctlz in convert-math-to-funcs

Summary

This change adds a software implementation of the math.ctlz operation
and includes it in --convert-math-to-funcs.

This is my first change to MLIR, so please bear with me as I'm still learning
the idioms of the codebase.

The context for this change is that I have some larger scale project in which
I'd like to lower from a mix of MLIR dialects to CIRCT, but many of the CIRCT
passes don't support the math dialect.

I noticed the content of convert-math-to-funcs was limited entirely to
the pow functions, but otherwise provided the needed structure to implement
this feature with minimal changes.

Highlight of the changes:

Add a dependence on the SCF dialect for this lower. I could have lowered directly to cf, following the pow lowerings in the same pass, but I felt it was not necessary given the existing support for lowering scf to cf.
Generalize the DenseMap storing op implementations: modify the callback function hashmap to be keyed by both OperationType (for me this effectively means the name of the op being implemented in software) and the type signature of the resulting function.
Implement the ctlz function as a loop. I had researched a variety of implementations that claimed to be more efficient (such as those based on a de Bruijn sequence), but it seems to me that the simplest approach would make it easier for later compiler optimizations to do a better (platform-aware) job optimizing this than I could do by hand.

Questions I had for the reviewer:

[edit: found mlir-cpu-runner and added two tests] What would I add to the filecheck invocation to actually run the resulting MLIR on a value and assert the output is correct? I have done this manually with the C implementation but I'm not confident my port to MLIR is correct.
Should I add a test for a vectorized version of this lowering? I followed suit with the VecOpToScalarOp but I admit I don't fully understand what it's doing.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

j2kun created this revision.Mar 16 2023, 2:32 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 16 2023, 2:32 PM

Herald added subscribers: Moerafaat, zero9178, bzcheeseman and 21 others. · View Herald Transcript

j2kun edited the summary of this revision. (Show Details)Mar 16 2023, 2:46 PM

Harbormaster completed remote builds in B219957: Diff 505929.Mar 16 2023, 3:17 PM

j2kun published this revision for review.Mar 17 2023, 9:47 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 17 2023, 9:47 AM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

j2kun added reviewers: vzakhari, Mogball.Mar 17 2023, 9:48 AM

Added mlir-cpu-runner tests for ctlz

j2kun edited the summary of this revision. (Show Details)Mar 17 2023, 10:20 AM

Harbormaster completed remote builds in B220099: Diff 506128.Mar 17 2023, 11:03 AM

Hi @j2kun, welcome to MLIR code base :)

The story behind only the power operations handled in this converter is that we would prefer to convert them to MLIR LLVM dialect (and then into LLVM IR), but there is no support for ipowi and not all flavors of fpowi are supported by LLVM IR. So I introduced this converter to transform these math operations into inline implementations.

I am not sure that extending it for math::CountLeadingZerosOp is the right direction. For example, in Flang pipeline this operation is converted into LLVM::CountLeadingZerosOp by MathToLLVM converter, and I think we want to keep it this way so that the LLVM backend handles it in a target-dependent fashion. I would prefer to keep Flang flow untouched, so at the very least the ctlz conversion has to be optional.

I am not aware about CIRCT specifics. Are there other options to resolve the problem, e.g. convert it to LLVM::CountLeadingZerosOp?

The backend we're ultimately planning to support does not have a ctlz builtin, so we have no choice but to emulate it in software. This is why lowering to llvm's builtin doesn't help us (and IIUC CIRCT also doesn't support llvm's ctlz)

Since this pass is aimed at software implementations of math functions in general, I would expect the fact that it is used in flang shouldn't have a blocking influence on supporting new ops (the pass is, after all, in MLIR and not just the Flang subproject). Perhaps a good middle ground would be to add a configuration option that turns off the lowering of specific ops, with all conversions enabled by default? I could have ctlz turned off by default in this diff and, in a later change, flang could turn it off explicitly and then the default could be changed to enable it. Or if everyone agrees that ctlz is an unusual case due to most backends having it built in, it could stay off by default forever.

Does that plan sound agreeable?

Yes, I think the option to control ctlz conversion will do. The default is up to you.

Added default-disabled convert-ctlz option

Harbormaster completed remote builds in B220534: Diff 506702.Mar 20 2023, 3:00 PM

Thank you for the update!

mlir/lib/Conversion/MathToFuncs/MathToFuncs.cpp
612	What would be the result for `i1 0` input?
613	I believe the way you encoded it in MLIR matches `continue` rather than `break`: `if (x < 0) continue` - is this intentional?
644	Please guard with `LLVM_DEBUG` and use `llvm::dbgs()` stream.
646	I would rather use `llvm_unreachable`.
733	nit: no need for braces.
866	nit: the braces are not needed.

Address reviewer comments

for input zero, output integer bit width
update/augment tests for above
use dbgs/unreachable

Sorry for the delay, I was in Korea at a conference.

mlir/lib/Conversion/MathToFuncs/MathToFuncs.cpp
612	It is zero in this case, but I'm reading now that most architectures define this to be the bitwidth of the underlying type, so updated it to reflect that. Added an extra if branch for the check, and an extra mlir-cpu-runner test to ensure this case is evaluated properly.
613	I think they are semantically the same, but I did this because I was unable to find the equivalent of an scf.break operation, just scf.yield. Would you recommend I use cf.br instead here? I wasn't sure if it made more sense to keep the output entirely scf, do a mix of scf and cf, and/or let later optimization passes recognize this branch is effectively a break.

thenBuilder -> elseBuilder

Fixed a typo/bad variable name choice from the last update

Harbormaster completed remote builds in B222803: Diff 509755.Mar 30 2023, 12:18 PM

Thank you for the update. Looks good to me in general.

mlir/lib/Conversion/MathToFuncs/MathToFuncs.cpp
612	Thank you.
613	I think just changing `break` to `continue` in the header comment is enough.
719	Please change `break` to `continue` here as well.

break -> continue, plus move mlir-cpu-runner

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptApr 4 2023, 1:46 PM

Harbormaster completed remote builds in B223664: Diff 510917.Apr 4 2023, 4:16 PM

LGTM, but the windows pre-merge check failure looks suspicious.

This revision is now accepted and ready to land.Apr 4 2023, 5:38 PM

Try syncing with head

Herald added a subscriber: bviyer. · View Herald TranscriptApr 6 2023, 10:14 PM

Harbormaster completed remote builds in B224160: Diff 511611.Apr 6 2023, 10:41 PM

This time only Bazel failed, so ok to merge :)

(btw, I don't have commit access, so someone else will need to commit it for me)

In D146261#4250663, @j2kun wrote:

(btw, I don't have commit access, so someone else will need to commit it for me)

I can merge it on Monday. Please ping me then.

@vzakhari ping :) thanks for your review and advice!

In D146261#4255634, @j2kun wrote:

@vzakhari ping :) thanks for your review and advice!

Thanks for the ping! I will merge it today.

This revision was landed with ongoing or failed builds.Apr 10 2023, 10:02 AM

Closed by commit rGbfbccfa17c97: Add support for math.ctlz in convert-math-to-funcs (authored by j2kun, committed by vzakhari). · Explain Why

This revision was automatically updated to reflect the committed changes.

vzakhari added a commit: rGbfbccfa17c97: Add support for math.ctlz in convert-math-to-funcs.

@j2kun, this update is causing problems. Here's a short program:

  integer :: i1, i2
  i2 = leadz(i1)
end

When I compile this program, I get the following output from the compiler:

error: loc("/local/home/psteinfeld/up/install/x.f90":2:3): failed to legalize operation 'math.ctlz' that was explicitly marked illegal
mlir did not succeed

I'm not able to get to my computer til Monday, but perhaps your issue is
that this feature is disabled by default (see comments above), and you need
to explicitly enable it. The test files in this patch show the invocation.

mehdi_amini added inline comments.Apr 15 2023, 12:12 PM

mlir/lib/Conversion/MathToFuncs/MathToFuncs.cpp
874–876	The issue is actually here. This line should be guarded by the option.

PeteSteinfeld added inline comments.Apr 16 2023, 10:40 AM

mlir/lib/Conversion/MathToFuncs/MathToFuncs.cpp

874–876

Thanks, @mehdi_amini. I tried making this change, and I did get different results. But I still get errors. Here's the output from invoking the compiler on the test program I previously provided:

error: loc("/local/home/psteinfeld/up/install/x.f90":0:0): failed to legalize operation 'scf.if'
error: Lowering to LLVM IR failed
error: loc("/local/home/psteinfeld/up/install/x.f90":1:3): cannot be converted to LLVM IR: missing `LLVMTranslationDialectInterface` registration for dialect for op: func.func
error: failed to create the LLVM module
PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace.
Stack dump:
0.      Program arguments: /local/home/psteinfeld/up/install/bin/flang-new -fc1 -triple x86_64-unknown-linux-gnu -emit-obj -mrelocation-model pic -pic-level 2 -pic-is-pie -target-cpu x86-64 -o /tmp/x-f64cb2.o -x f95-cpp-input x.f90
 #0 0x00005565430284eb llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/local/home/psteinfeld/up/install/bin/flang-new+0x1a0d4eb)
 #1 0x0000556543025c44 SignalHandler(int) Signals.cpp:0:0
 #2 0x000014ece2f5bc20 __restore_rt sigaction.c:0:0
 #3 0x000055654377d6ed Fortran::frontend::CodeGenAction::executeAction() (/local/home/psteinfeld/up/install/bin/flang-new+0x21626ed)
 #4 0x000055654305ff8d Fortran::frontend::FrontendAction::execute() (/local/home/psteinfeld/up/install/bin/flang-new+0x1a44f8d)
 #5 0x000055654304f1bb Fortran::frontend::CompilerInstance::executeAction(Fortran::frontend::FrontendAction&) (/local/home/psteinfeld/up/install/bin/flang-new+0x1a341bb)
 #6 0x00005565430653de Fortran::frontend::executeCompilerInvocation(Fortran::frontend::CompilerInstance*) (/local/home/psteinfeld/up/install/bin/flang-new+0x1a4a3de)
 #7 0x0000556542b90058 fc1_main(llvm::ArrayRef<char const*>, char const*) (/local/home/psteinfeld/up/install/bin/flang-new+0x1575058)
 #8 0x0000556542ae05bc main (/local/home/psteinfeld/up/install/bin/flang-new+0x14c55bc)
 #9 0x000014ece1b7a493 __libc_start_main (/lib64/libc.so.6+0x23493)
#10 0x0000556542b8d70e _start (/local/home/psteinfeld/up/install/bin/flang-new+0x157270e)
flang-new: error: unable to execute command: Segmentation fault (core dumped)
flang-new: error: flang frontend command failed due to signal (use -v to see invocation)
flang-new version 17.0.0 (https://github.com/llvm/llvm-project.git 282d114c21bfe5292db659579a944403bec345a2)
Target: x86_64-unknown-linux-gnu
Thread model: posix

failed to legalize operation 'scf.if'
error: Lowering to LLVM IR failed

I don’t think this is because of this patch, likely another problem. But the change I mentioned seems clearly necessary: @j2kun are you on it?

In D146261#4272324, @mehdi_amini wrote:
failed to legalize operation 'scf.if'
error: Lowering to LLVM IR failed
I don’t think this is because of this patch, likely another problem. But the change I mentioned seems clearly necessary: @j2kun are you on it?

@vzakhari : may want to revert in the meantime.

I will try to fix it or revert soon.

I can fix it Monday morning. A rollback is fine if it's urgent.

vzakhari mentioned this in D148477: [mlir][math] Properly disable ctlz conversion in MathToFuncs..Apr 16 2023, 1:15 PM

vzakhari mentioned this in rGce47090d00cb: [mlir][math] Properly disable ctlz conversion in MathToFuncs..Apr 16 2023, 3:54 PM

Monday's soon enough for me!

Revision Contents

Path

Size

mlir/

include/

mlir/

Conversion/

Passes.td

14 lines

lib/

Conversion/

MathToFuncs/

CMakeLists.txt

1 line

MathToFuncs.cpp

256 lines

test/

Conversion/

MathToFuncs/

ctlz.mlir

90 lines

ctlz_runner.mlir

48 lines

utils/

bazel/

llvm-project-overlay/

mlir/

BUILD.bazel

1 line

Diff 509749

mlir/include/mlir/Conversion/Passes.td

Show First 20 Lines • Show All 612 Lines • ▼ Show 20 Lines	let description = [{
This pass converts supported Math ops to calls of compiler generated		This pass converts supported Math ops to calls of compiler generated
functions implementing these operations in software.		functions implementing these operations in software.
The LLVM dialect is used for LinkonceODR linkage of the generated functions.		The LLVM dialect is used for LinkonceODR linkage of the generated functions.
}];		}];
let dependentDialects = [		let dependentDialects = [
"arith::ArithDialect",		"arith::ArithDialect",
"cf::ControlFlowDialect",		"cf::ControlFlowDialect",
"func::FuncDialect",		"func::FuncDialect",
		"scf::SCFDialect",
"vector::VectorDialect",		"vector::VectorDialect",
"LLVM::LLVMDialect",		"LLVM::LLVMDialect",
];		];
let options = [		let options = [
Option<"minWidthOfFPowIExponent", "min-width-of-fpowi-exponent", "unsigned",		Option<"minWidthOfFPowIExponent", "min-width-of-fpowi-exponent", "unsigned",
/default=/"1",		/default=/"1",
"Convert FPowI only if the width of its exponent's integer type "		"Convert FPowI only if the width of its exponent's integer type "
"is greater than or equal to this value">		"is greater than or equal to this value">,
		// Most backend targets support a native ctlz operation, so by default
		// ctrlz conversion is disabled.
		Option<"convertCtlz", "convert-ctlz", "bool", /default=/"false",
		"Convert math.ctlz to a software implementation. Enable "
		"for targets that do not natively support ctlz.">,
];		];
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// MemRefToLLVM		// MemRefToLLVM
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def FinalizeMemRefToLLVMConversionPass :		def FinalizeMemRefToLLVMConversionPass :
Pass<"finalize-memref-to-llvm", "ModuleOp"> {		Pass<"finalize-memref-to-llvm", "ModuleOp"> {
let summary = "Finalize MemRef dialect to LLVM dialect conversion";		let summary = "Finalize MemRef dialect to LLVM dialect conversion";
let description = [{		let description = [{
Finalize the conversion of the operations from the MemRef		Finalize the conversion of the operations from the MemRef
dialect to the LLVM dialect.		dialect to the LLVM dialect.
This conversion will not convert some complex MemRef		This conversion will not convert some complex MemRef
operations. Make sure to run `expand-strided-metadata`		operations. Make sure to run `expand-strided-metadata`
beforehand for these.		beforehand for these.
}];		}];
let dependentDialects = ["LLVM::LLVMDialect"];		let dependentDialects = ["LLVM::LLVMDialect"];
let options = [		let options = [
Option<"useAlignedAlloc", "use-aligned-alloc", "bool", /default=/"false",		Option<"useAlignedAlloc", "use-aligned-alloc", "bool", /default=/"false",
"Use aligned_alloc in place of malloc for heap allocations">,		"Use aligned_alloc in place of malloc for heap allocations">,
Option<"indexBitwidth", "index-bitwidth", "unsigned",		Option<"indexBitwidth", "index-bitwidth", "unsigned",
/default=kDeriveIndexBitwidthFromDataLayout/"0",		/default=kDeriveIndexBitwidthFromDataLayout/"0",
▲ Show 20 Lines • Show All 463 Lines • Show Last 20 Lines

mlir/lib/Conversion/MathToFuncs/CMakeLists.txt

Show All 11 Lines	add_mlir_conversion_library(MLIRMathToFuncs

LINK_LIBS PUBLIC		LINK_LIBS PUBLIC
MLIRArithDialect		MLIRArithDialect
MLIRControlFlowDialect		MLIRControlFlowDialect
MLIRFuncDialect		MLIRFuncDialect
MLIRLLVMDialect		MLIRLLVMDialect
MLIRMathDialect		MLIRMathDialect
MLIRPass		MLIRPass
		MLIRSCFDialect
MLIRTransforms		MLIRTransforms
MLIRVectorDialect		MLIRVectorDialect
MLIRVectorUtils		MLIRVectorUtils
)		)

mlir/lib/Conversion/MathToFuncs/MathToFuncs.cpp

//===- MathToFuncs.cpp - Math to outlined implementation conversion -------===//		//===- MathToFuncs.cpp - Math to outlined implementation conversion -------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/Conversion/MathToFuncs/MathToFuncs.h"		#include "mlir/Conversion/MathToFuncs/MathToFuncs.h"

#include "mlir/Dialect/Arith/IR/Arith.h"		#include "mlir/Dialect/Arith/IR/Arith.h"
#include "mlir/Dialect/ControlFlow/IR/ControlFlowOps.h"		#include "mlir/Dialect/ControlFlow/IR/ControlFlowOps.h"
#include "mlir/Dialect/Func/IR/FuncOps.h"		#include "mlir/Dialect/Func/IR/FuncOps.h"
#include "mlir/Dialect/LLVMIR/LLVMDialect.h"		#include "mlir/Dialect/LLVMIR/LLVMDialect.h"
#include "mlir/Dialect/Math/IR/Math.h"		#include "mlir/Dialect/Math/IR/Math.h"
		#include "mlir/Dialect/SCF/IR/SCF.h"
#include "mlir/Dialect/Utils/IndexingUtils.h"		#include "mlir/Dialect/Utils/IndexingUtils.h"
#include "mlir/Dialect/Vector/IR/VectorOps.h"		#include "mlir/Dialect/Vector/IR/VectorOps.h"
#include "mlir/Dialect/Vector/Utils/VectorUtils.h"		#include "mlir/Dialect/Vector/Utils/VectorUtils.h"
#include "mlir/IR/ImplicitLocOpBuilder.h"		#include "mlir/IR/ImplicitLocOpBuilder.h"
#include "mlir/IR/TypeUtilities.h"		#include "mlir/IR/TypeUtilities.h"
#include "mlir/Pass/Pass.h"		#include "mlir/Pass/Pass.h"
#include "mlir/Transforms/DialectConversion.h"		#include "mlir/Transforms/DialectConversion.h"
#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/TypeSwitch.h"		#include "llvm/ADT/TypeSwitch.h"
		#include "llvm/Support/Debug.h"

namespace mlir {		namespace mlir {
#define GEN_PASS_DEF_CONVERTMATHTOFUNCS		#define GEN_PASS_DEF_CONVERTMATHTOFUNCS
#include "mlir/Conversion/Passes.h.inc"		#include "mlir/Conversion/Passes.h.inc"
} // namespace mlir		} // namespace mlir

using namespace mlir;		using namespace mlir;

		#define DEBUG_TYPE "math-to-funcs"
		#define DBGS() (llvm::dbgs() << "[" DEBUG_TYPE "]: ")

namespace {		namespace {
// Pattern to convert vector operations to scalar operations.		// Pattern to convert vector operations to scalar operations.
template <typename Op>		template <typename Op>
struct VecOpToScalarOp : public OpRewritePattern<Op> {		struct VecOpToScalarOp : public OpRewritePattern<Op> {
public:		public:
using OpRewritePattern<Op>::OpRewritePattern;		using OpRewritePattern<Op>::OpRewritePattern;

LogicalResult matchAndRewrite(Op op, PatternRewriter &rewriter) const final;		LogicalResult matchAndRewrite(Op op, PatternRewriter &rewriter) const final;
};		};

// Callback type for getting pre-generated FuncOp implementing		// Callback type for getting pre-generated FuncOp implementing
// a power operation of the given type.		// an operation of the given type.
using GetPowerFuncCallbackTy = function_ref<func::FuncOp(Type)>;		using GetFuncCallbackTy = function_ref<func::FuncOp(Operation *, Type)>;

// Pattern to convert scalar IPowIOp into a call of outlined		// Pattern to convert scalar IPowIOp into a call of outlined
// software implementation.		// software implementation.
class IPowIOpLowering : public OpRewritePattern<math::IPowIOp> {		class IPowIOpLowering : public OpRewritePattern<math::IPowIOp> {
public:		public:
IPowIOpLowering(MLIRContext *context, GetPowerFuncCallbackTy cb)		IPowIOpLowering(MLIRContext *context, GetFuncCallbackTy cb)
: OpRewritePattern<math::IPowIOp>(context), getFuncOpCallback(cb) {}		: OpRewritePattern<math::IPowIOp>(context), getFuncOpCallback(cb) {}

/// Convert IPowI into a call to a local function implementing		/// Convert IPowI into a call to a local function implementing
/// the power operation. The local function computes a scalar result,		/// the power operation. The local function computes a scalar result,
/// so vector forms of IPowI are linearized.		/// so vector forms of IPowI are linearized.
LogicalResult matchAndRewrite(math::IPowIOp op,		LogicalResult matchAndRewrite(math::IPowIOp op,
PatternRewriter &rewriter) const final;		PatternRewriter &rewriter) const final;

private:		private:
GetPowerFuncCallbackTy getFuncOpCallback;		GetFuncCallbackTy getFuncOpCallback;
};		};

// Pattern to convert scalar FPowIOp into a call of outlined		// Pattern to convert scalar FPowIOp into a call of outlined
// software implementation.		// software implementation.
class FPowIOpLowering : public OpRewritePattern<math::FPowIOp> {		class FPowIOpLowering : public OpRewritePattern<math::FPowIOp> {
public:		public:
FPowIOpLowering(MLIRContext *context, GetPowerFuncCallbackTy cb)		FPowIOpLowering(MLIRContext *context, GetFuncCallbackTy cb)
: OpRewritePattern<math::FPowIOp>(context), getFuncOpCallback(cb) {}		: OpRewritePattern<math::FPowIOp>(context), getFuncOpCallback(cb) {}

/// Convert FPowI into a call to a local function implementing		/// Convert FPowI into a call to a local function implementing
/// the power operation. The local function computes a scalar result,		/// the power operation. The local function computes a scalar result,
/// so vector forms of FPowI are linearized.		/// so vector forms of FPowI are linearized.
LogicalResult matchAndRewrite(math::FPowIOp op,		LogicalResult matchAndRewrite(math::FPowIOp op,
PatternRewriter &rewriter) const final;		PatternRewriter &rewriter) const final;

private:		private:
GetPowerFuncCallbackTy getFuncOpCallback;		GetFuncCallbackTy getFuncOpCallback;
		};

		// Pattern to convert scalar ctlz into a call of outlined software
		// implementation.
		class CtlzOpLowering : public OpRewritePattern<math::CountLeadingZerosOp> {
		public:
		CtlzOpLowering(MLIRContext *context, GetFuncCallbackTy cb)
		: OpRewritePattern<math::CountLeadingZerosOp>(context),
		getFuncOpCallback(cb) {}

		/// Convert ctlz into a call to a local function implementing
		/// the count leading zeros operation.
		LogicalResult matchAndRewrite(math::CountLeadingZerosOp op,
		PatternRewriter &rewriter) const final;

		private:
		GetFuncCallbackTy getFuncOpCallback;
};		};
} // namespace		} // namespace

template <typename Op>		template <typename Op>
LogicalResult		LogicalResult
VecOpToScalarOp<Op>::matchAndRewrite(Op op, PatternRewriter &rewriter) const {		VecOpToScalarOp<Op>::matchAndRewrite(Op op, PatternRewriter &rewriter) const {
Type opType = op.getType();		Type opType = op.getType();
Location loc = op.getLoc();		Location loc = op.getLoc();
▲ Show 20 Lines • Show All 254 Lines • ▼ Show 20 Lines	IPowIOpLowering::matchAndRewrite(math::IPowIOp op,
PatternRewriter &rewriter) const {		PatternRewriter &rewriter) const {
auto baseType = op.getOperands()[0].getType().dyn_cast<IntegerType>();		auto baseType = op.getOperands()[0].getType().dyn_cast<IntegerType>();

if (!baseType)		if (!baseType)
return rewriter.notifyMatchFailure(op, "non-integer base operand");		return rewriter.notifyMatchFailure(op, "non-integer base operand");

// The outlined software implementation must have been already		// The outlined software implementation must have been already
// generated.		// generated.
func::FuncOp elementFunc = getFuncOpCallback(baseType);		func::FuncOp elementFunc = getFuncOpCallback(op, baseType);
if (!elementFunc)		if (!elementFunc)
return rewriter.notifyMatchFailure(op, "missing software implementation");		return rewriter.notifyMatchFailure(op, "missing software implementation");

rewriter.replaceOpWithNewOp<func::CallOp>(op, elementFunc, op.getOperands());		rewriter.replaceOpWithNewOp<func::CallOp>(op, elementFunc, op.getOperands());
return success();		return success();
}		}

/// Create linkonce_odr function to implement the power function with		/// Create linkonce_odr function to implement the power function with
▲ Show 20 Lines • Show All 208 Lines • ▼ Show 20 Lines	FPowIOpLowering::matchAndRewrite(math::FPowIOp op,
PatternRewriter &rewriter) const {		PatternRewriter &rewriter) const {
if (op.getType().template dyn_cast<VectorType>())		if (op.getType().template dyn_cast<VectorType>())
return rewriter.notifyMatchFailure(op, "non-scalar operation");		return rewriter.notifyMatchFailure(op, "non-scalar operation");

FunctionType funcType = getElementalFuncTypeForOp(op);		FunctionType funcType = getElementalFuncTypeForOp(op);

// The outlined software implementation must have been already		// The outlined software implementation must have been already
// generated.		// generated.
func::FuncOp elementFunc = getFuncOpCallback(funcType);		func::FuncOp elementFunc = getFuncOpCallback(op, funcType);
if (!elementFunc)		if (!elementFunc)
return rewriter.notifyMatchFailure(op, "missing software implementation");		return rewriter.notifyMatchFailure(op, "missing software implementation");

rewriter.replaceOpWithNewOp<func::CallOp>(op, elementFunc, op.getOperands());		rewriter.replaceOpWithNewOp<func::CallOp>(op, elementFunc, op.getOperands());
return success();		return success();
}		}

		/// Create function to implement the ctlz function the given \p elementType type
		/// inside \p module. The \p elementType must be IntegerType, an the created
		/// function has 'IntegerType (*)(IntegerType)' function type.
		///
		/// template <typename T>
		/// T __mlir_math_ctlz_*(T x) {
		/// bits = sizeof(x) * 8;
		/// if (x == 0)
		/// return bits;
		vzakhariUnsubmitted Done Reply Inline Actions What would be the result for `i1 0` input? vzakhari: What would be the result for `i1 0` input?
		j2kunAuthorUnsubmitted Done Reply Inline Actions It is zero in this case, but I'm reading now that most architectures define this to be the bitwidth of the underlying type, so updated it to reflect that. Added an extra if branch for the check, and an extra mlir-cpu-runner test to ensure this case is evaluated properly. j2kun: It is zero in this case, but I'm reading now that most architectures define this to be the…
		vzakhariUnsubmitted Not Done Reply Inline Actions Thank you. vzakhari: Thank you.
		///
		vzakhariUnsubmitted Not Done Reply Inline Actions I believe the way you encoded it in MLIR matches `continue` rather than `break`: `if (x < 0) continue` - is this intentional? vzakhari: I believe the way you encoded it in MLIR matches `continue` rather than `break`: `if (x < 0)…
		j2kunAuthorUnsubmitted Not Done Reply Inline Actions I think they are semantically the same, but I did this because I was unable to find the equivalent of an scf.break operation, just scf.yield. Would you recommend I use cf.br instead here? I wasn't sure if it made more sense to keep the output entirely scf, do a mix of scf and cf, and/or let later optimization passes recognize this branch is effectively a break. j2kun: I think they are semantically the same, but I did this because I was unable to find the…
		vzakhariUnsubmitted Not Done Reply Inline Actions I think just changing `break` to `continue` in the header comment is enough. vzakhari: I think just changing `break` to `continue` in the header comment is enough.
		/// uint32_t n = 0;
		/// for (int i = 1; i < bits; ++i) {
		/// if (x < 0) break;
		/// n++;
		/// x <<= 1;
		/// }
		/// return n;
		/// }
		///
		/// Converts to (for i32):
		///
		/// func.func private @__mlir_math_ctlz_i32(%arg: i32) -> i32 {
		/// %c_32 = arith.constant 32 : index
		/// %c_0 = arith.constant 0 : i32
		/// %arg_eq_zero = arith.cmpi eq, %arg, %c_0 : i1
		/// %out = scf.if %arg_eq_zero {
		/// scf.yield %c_32 : i32
		/// } else {
		/// %c_1index = arith.constant 1 : index
		/// %c_1i32 = arith.constant 1 : i32
		/// %n = arith.constant 0 : i32
		/// %arg_out, %n_out = scf.for %i = %c_1index to %c_32 step %c_1index
		/// iter_args(%arg_iter = %arg, %n_iter = %n) -> (i32, i32) {
		/// %cond = arith.cmpi slt, %arg_iter, %c_0 : i32
		/// %yield_val = scf.if %cond {
		/// scf.yield %arg_iter, %n_iter : i32, i32
		/// } else {
		/// %arg_next = arith.shli %arg_iter, %c_1i32 : i32
		/// %n_next = arith.addi %n_iter, %c_1i32 : i32
		/// scf.yield %arg_next, %n_next : i32, i32
		/// }
		vzakhariUnsubmitted Done Reply Inline Actions Please guard with `LLVM_DEBUG` and use `llvm::dbgs()` stream. vzakhari: Please guard with `LLVM_DEBUG` and use `llvm::dbgs()` stream.
		/// scf.yield %yield_val: i32, i32
		/// }
		vzakhariUnsubmitted Done Reply Inline Actions I would rather use `llvm_unreachable`. vzakhari: I would rather use `llvm_unreachable`.
		/// scf.yield %n_out : i32
		/// }
		/// return %out: i32
		/// }
		static func::FuncOp createCtlzFunc(ModuleOp *module, Type elementType) {
		if (!elementType.isa<IntegerType>()) {
		LLVM_DEBUG({
		DBGS() << "non-integer element type for CtlzFunc; type was: ";
		elementType.print(llvm::dbgs());
		});
		llvm_unreachable("non-integer element type");
		}
		int64_t bitWidth = elementType.getIntOrFloatBitWidth();

		Location loc = module->getLoc();
		ImplicitLocOpBuilder builder =
		ImplicitLocOpBuilder::atBlockEnd(loc, module->getBody());

		std::string funcName("__mlir_math_ctlz");
		llvm::raw_string_ostream nameOS(funcName);
		nameOS << '_' << elementType;
		FunctionType funcType =
		FunctionType::get(builder.getContext(), {elementType}, elementType);
		auto funcOp = builder.create<func::FuncOp>(funcName, funcType);

		// LinkonceODR ensures that there is only one implementation of this function
		// across all math.ctlz functions that are lowered in this way.
		LLVM::linkage::Linkage inlineLinkage = LLVM::linkage::Linkage::LinkonceODR;
		Attribute linkage =
		LLVM::LinkageAttr::get(builder.getContext(), inlineLinkage);
		funcOp->setAttr("llvm.linkage", linkage);
		funcOp.setPrivate();

		// set the insertion point to the start of the function
		Block *funcBody = funcOp.addEntryBlock();
		builder.setInsertionPointToStart(funcBody);

		Value arg = funcOp.getArgument(0);
		Type indexType = builder.getIndexType();
		Value bitWidthValue = builder.create<arith::ConstantOp>(
		elementType, builder.getIntegerAttr(elementType, bitWidth));
		Value zeroValue = builder.create<arith::ConstantOp>(
		elementType, builder.getIntegerAttr(elementType, 0));

		Value inputEqZero =
		builder.create<arith::CmpIOp>(arith::CmpIPredicate::eq, arg, zeroValue);

		// if input == 0, return bit width, else enter loop.
		scf::IfOp ifOp = builder.create<scf::IfOp>(
		elementType, inputEqZero, /addThenBlock=/true, /addElseBlock=/true);
		ifOp.getThenBodyBuilder().create<scf::YieldOp>(loc, bitWidthValue);

		auto thenBuilder =
		ImplicitLocOpBuilder::atBlockEnd(loc, &ifOp.getElseRegion().front());

		Value oneIndex = thenBuilder.create<arith::ConstantOp>(
		indexType, thenBuilder.getIndexAttr(1));
		Value oneValue = thenBuilder.create<arith::ConstantOp>(
		elementType, thenBuilder.getIntegerAttr(elementType, 1));
		Value bitWidthIndex = thenBuilder.create<arith::ConstantOp>(
		indexType, thenBuilder.getIndexAttr(bitWidth));
		Value nValue = thenBuilder.create<arith::ConstantOp>(
		elementType, thenBuilder.getIntegerAttr(elementType, 0));

		auto loop = thenBuilder.create<scf::ForOp>(
		oneIndex, bitWidthIndex, oneIndex,
		// Initial values for two loop induction variables, the arg which is being
		// shifted left in each iteration, and the n value which tracks the count
		// of leading zeros.
		ValueRange{arg, nValue},
		// Callback to build the body of the for loop
		// if (arg < 0) {
		// break;
		vzakhariUnsubmitted Done Reply Inline Actions Please change `break` to `continue` here as well. vzakhari: Please change `break` to `continue` here as well.
		// } else {
		// n++;
		// arg <<= 1;
		// }
		[&](OpBuilder &b, Location loc, Value iv, ValueRange args) {
		Value argIter = args[0];
		Value nIter = args[1];

		Value argIsNonNegative = b.create<arith::CmpIOp>(
		loc, arith::CmpIPredicate::slt, argIter, zeroValue);
		scf::IfOp ifOp = b.create<scf::IfOp>(
		loc, argIsNonNegative,
		[&](OpBuilder &b, Location loc) {
		// If arg is negative, break out of the loop.
		vzakhariUnsubmitted Done Reply Inline Actions nit: no need for braces. vzakhari: nit: no need for braces.
		b.create<scf::YieldOp>(loc, ValueRange{argIter, nIter});
		},
		[&](OpBuilder &b, Location loc) {
		// Otherwise, increment n and shift arg left.
		Value nNext = b.create<arith::AddIOp>(loc, nIter, oneValue);
		Value argNext = b.create<arith::ShLIOp>(loc, argIter, oneValue);
		b.create<scf::YieldOp>(loc, ValueRange{argNext, nNext});
		});
		b.create<scf::YieldOp>(loc, ifOp.getResults());
		});
		thenBuilder.create<scf::YieldOp>(loop.getResult(1));

		builder.create<func::ReturnOp>(ifOp.getResult(0));
		return funcOp;
		}

		/// Convert ctlz into a call to a local function implementing the ctlz
		/// operation.
		LogicalResult CtlzOpLowering::matchAndRewrite(math::CountLeadingZerosOp op,
		PatternRewriter &rewriter) const {
		if (op.getType().template dyn_cast<VectorType>())
		return rewriter.notifyMatchFailure(op, "non-scalar operation");

		Type type = getElementTypeOrSelf(op.getResult().getType());
		func::FuncOp elementFunc = getFuncOpCallback(op, type);
		if (!elementFunc)
		return rewriter.notifyMatchFailure(op, [&](::mlir::Diagnostic &diag) {
		diag << "Missing software implementation for op " << op->getName()
		<< " and type " << type;
		});

		rewriter.replaceOpWithNewOp<func::CallOp>(op, elementFunc, op.getOperand());
		return success();
		}

namespace {		namespace {
struct ConvertMathToFuncsPass		struct ConvertMathToFuncsPass
: public impl::ConvertMathToFuncsBase<ConvertMathToFuncsPass> {		: public impl::ConvertMathToFuncsBase<ConvertMathToFuncsPass> {
ConvertMathToFuncsPass() = default;		ConvertMathToFuncsPass() = default;
ConvertMathToFuncsPass(const ConvertMathToFuncsOptions &options)		ConvertMathToFuncsPass(const ConvertMathToFuncsOptions &options)
: impl::ConvertMathToFuncsBase<ConvertMathToFuncsPass>(options) {}		: impl::ConvertMathToFuncsBase<ConvertMathToFuncsPass>(options) {}

void runOnOperation() override;		void runOnOperation() override;

private:		private:
// Return true, if this FPowI operation must be converted		// Return true, if this FPowI operation must be converted
// because the width of its exponent's type is greater than		// because the width of its exponent's type is greater than
// or equal to minWidthOfFPowIExponent option value.		// or equal to minWidthOfFPowIExponent option value.
bool isFPowIConvertible(math::FPowIOp op);		bool isFPowIConvertible(math::FPowIOp op);

// Generate outlined implementations for power operations		// Generate outlined implementations for power operations
// and store them in powerFuncs map.		// and store them in funcImpls map.
void preprocessPowOperations();		void generateOpImplementations();

// A map between function types deduced from power operations		// A map between pairs of (operation, type) deduced from operations that this
// and the corresponding outlined software implementations		// pass will convert, and the corresponding outlined software implementations
// of these operations.		// of these operations for the given type.
DenseMap<Type, func::FuncOp> powerFuncs;		DenseMap<std::pair<OperationName, Type>, func::FuncOp> funcImpls;
};		};
} // namespace		} // namespace

bool ConvertMathToFuncsPass::isFPowIConvertible(math::FPowIOp op) {		bool ConvertMathToFuncsPass::isFPowIConvertible(math::FPowIOp op) {
auto expTy =		auto expTy =
getElementTypeOrSelf(op.getRhs().getType()).dyn_cast<IntegerType>();		getElementTypeOrSelf(op.getRhs().getType()).dyn_cast<IntegerType>();
return (expTy && expTy.getWidth() >= minWidthOfFPowIExponent);		return (expTy && expTy.getWidth() >= minWidthOfFPowIExponent);
}		}

void ConvertMathToFuncsPass::preprocessPowOperations() {		void ConvertMathToFuncsPass::generateOpImplementations() {
ModuleOp module = getOperation();		ModuleOp module = getOperation();

module.walk([&](Operation *op) {		module.walk([&](Operation *op) {
TypeSwitch<Operation *>(op)		TypeSwitch<Operation *>(op)
		.Case<math::CountLeadingZerosOp>([&](math::CountLeadingZerosOp op) {
		Type resultType = getElementTypeOrSelf(op.getResult().getType());

		// Generate the software implementation of this operation,
		// if it has not been generated yet.
		auto key = std::pair(op->getName(), resultType);
		auto entry = funcImpls.try_emplace(key, func::FuncOp{});
		if (entry.second)
		entry.first->second = createCtlzFunc(&module, resultType);
		})
.Case<math::IPowIOp>([&](math::IPowIOp op) {		.Case<math::IPowIOp>([&](math::IPowIOp op) {
Type resultType = getElementTypeOrSelf(op.getResult().getType());		Type resultType = getElementTypeOrSelf(op.getResult().getType());

// Generate the software implementation of this operation,		// Generate the software implementation of this operation,
// if it has not been generated yet.		// if it has not been generated yet.
auto entry = powerFuncs.try_emplace(resultType, func::FuncOp{});		auto key = std::pair(op->getName(), resultType);
		auto entry = funcImpls.try_emplace(key, func::FuncOp{});
if (entry.second)		if (entry.second)
entry.first->second = createElementIPowIFunc(&module, resultType);		entry.first->second = createElementIPowIFunc(&module, resultType);
})		})
.Case<math::FPowIOp>([&](math::FPowIOp op) {		.Case<math::FPowIOp>([&](math::FPowIOp op) {
if (!isFPowIConvertible(op))		if (!isFPowIConvertible(op))
return;		return;

FunctionType funcType = getElementalFuncTypeForOp(op);		FunctionType funcType = getElementalFuncTypeForOp(op);

// Generate the software implementation of this operation,		// Generate the software implementation of this operation,
// if it has not been generated yet.		// if it has not been generated yet.
// FPowI implementations are mapped via the FunctionType		// FPowI implementations are mapped via the FunctionType
// created from the operation's result and operands.		// created from the operation's result and operands.
auto entry = powerFuncs.try_emplace(funcType, func::FuncOp{});		auto key = std::pair(op->getName(), funcType);
		auto entry = funcImpls.try_emplace(key, func::FuncOp{});
if (entry.second)		if (entry.second)
entry.first->second = createElementFPowIFunc(&module, funcType);		entry.first->second = createElementFPowIFunc(&module, funcType);
});		});
});		});
}		}

void ConvertMathToFuncsPass::runOnOperation() {		void ConvertMathToFuncsPass::runOnOperation() {
ModuleOp module = getOperation();		ModuleOp module = getOperation();

// Create outlined implementations for power operations.		// Create outlined implementations for power operations.
preprocessPowOperations();		generateOpImplementations();

RewritePatternSet patterns(&getContext());		RewritePatternSet patterns(&getContext());
patterns.add<VecOpToScalarOp<math::IPowIOp>, VecOpToScalarOp<math::FPowIOp>>(		patterns.add<VecOpToScalarOp<math::IPowIOp>, VecOpToScalarOp<math::FPowIOp>,
		VecOpToScalarOp<math::CountLeadingZerosOp>>(
patterns.getContext());		patterns.getContext());

// For the given Type Returns FuncOp stored in powerFuncs map.		// For the given Type Returns FuncOp stored in funcImpls map.
auto getPowerFuncOpByType = [&](Type type) -> func::FuncOp {		auto getFuncOpByType = [&](Operation *op, Type type) -> func::FuncOp {
auto it = powerFuncs.find(type);		auto it = funcImpls.find(std::pair(op->getName(), type));
if (it == powerFuncs.end())		if (it == funcImpls.end())
return {};		return {};

return it->second;		return it->second;
};		};
patterns.add<IPowIOpLowering, FPowIOpLowering>(patterns.getContext(),		patterns.add<IPowIOpLowering, FPowIOpLowering>(patterns.getContext(),
getPowerFuncOpByType);		getFuncOpByType);

		if (convertCtlz)
		vzakhariUnsubmitted Done Reply Inline Actions nit: the braces are not needed. vzakhari: nit: the braces are not needed.
		patterns.add<CtlzOpLowering>(patterns.getContext(), getFuncOpByType);

ConversionTarget target(getContext());		ConversionTarget target(getContext());
target.addLegalDialect<arith::ArithDialect, cf::ControlFlowDialect,		target.addLegalDialect<arith::ArithDialect, cf::ControlFlowDialect,
func::FuncDialect, vector::VectorDialect>();		func::FuncDialect, scf::SCFDialect,
		vector::VectorDialect>();

target.addIllegalOp<math::IPowIOp>();		target.addIllegalOp<math::IPowIOp>();
		target.addIllegalOp<math::CountLeadingZerosOp>();
target.addDynamicallyLegalOp<math::FPowIOp>(		target.addDynamicallyLegalOp<math::FPowIOp>(
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions The issue is actually here. This line should be guarded by the option. mehdi_amini: The issue is actually here. This line should be guarded by the option.
		PeteSteinfeldUnsubmitted Not Done Reply Inline Actions Thanks, @mehdi_amini. I tried making this change, and I did get different results. But I still get errors. Here's the output from invoking the compiler on the test program I previously provided: error: loc("/local/home/psteinfeld/up/install/x.f90":0:0): failed to legalize operation 'scf.if' error: Lowering to LLVM IR failed error: loc("/local/home/psteinfeld/up/install/x.f90":1:3): cannot be converted to LLVM IR: missing `LLVMTranslationDialectInterface` registration for dialect for op: func.func error: failed to create the LLVM module PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace. Stack dump: 0. Program arguments: /local/home/psteinfeld/up/install/bin/flang-new -fc1 -triple x86_64-unknown-linux-gnu -emit-obj -mrelocation-model pic -pic-level 2 -pic-is-pie -target-cpu x86-64 -o /tmp/x-f64cb2.o -x f95-cpp-input x.f90 #0 0x00005565430284eb llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/local/home/psteinfeld/up/install/bin/flang-new+0x1a0d4eb) #1 0x0000556543025c44 SignalHandler(int) Signals.cpp:0:0 #2 0x000014ece2f5bc20 __restore_rt sigaction.c:0:0 #3 0x000055654377d6ed Fortran::frontend::CodeGenAction::executeAction() (/local/home/psteinfeld/up/install/bin/flang-new+0x21626ed) #4 0x000055654305ff8d Fortran::frontend::FrontendAction::execute() (/local/home/psteinfeld/up/install/bin/flang-new+0x1a44f8d) #5 0x000055654304f1bb Fortran::frontend::CompilerInstance::executeAction(Fortran::frontend::FrontendAction&) (/local/home/psteinfeld/up/install/bin/flang-new+0x1a341bb) #6 0x00005565430653de Fortran::frontend::executeCompilerInvocation(Fortran::frontend::CompilerInstance) (/local/home/psteinfeld/up/install/bin/flang-new+0x1a4a3de) #7 0x0000556542b90058 fc1_main(llvm::ArrayRef<char const>, char const) (/local/home/psteinfeld/up/install/bin/flang-new+0x1575058) #8 0x0000556542ae05bc main (/local/home/psteinfeld/up/install/bin/flang-new+0x14c55bc) #9 0x000014ece1b7a493 __libc_start_main (/lib64/libc.so.6+0x23493) #10 0x0000556542b8d70e _start (/local/home/psteinfeld/up/install/bin/flang-new+0x157270e) flang-new: error: unable to execute command: Segmentation fault (core dumped) flang-new: error: flang frontend command failed due to signal (use -v to see invocation) flang-new version 17.0.0 (https://github.com/llvm/llvm-project.git 282d114c21bfe5292db659579a944403bec345a2) Target: x86_64-unknown-linux-gnu Thread model: posix PeteSteinfeld:* Thanks, @mehdi_amini. I tried making this change, and I did get different results. But I…
[this](math::FPowIOp op) { return !isFPowIConvertible(op); });		[this](math::FPowIOp op) { return !isFPowIConvertible(op); });
if (failed(applyPartialConversion(module, target, std::move(patterns))))		if (failed(applyPartialConversion(module, target, std::move(patterns))))
signalPassFailure();		signalPassFailure();
}		}

mlir/test/Conversion/MathToFuncs/ctlz.mlir

This file was added.

				// RUN: mlir-opt %s -split-input-file -pass-pipeline="builtin.module(convert-math-to-funcs{convert-ctlz})" \| FileCheck %s

				// Check a golden-path i32 conversion

				// CHECK-LABEL: func.func @main(
				// CHECK-SAME: %[[VAL_0:.*]]: i32
				// CHECK-SAME: ) {
				// CHECK: %[[VAL_1:.*]] = call @__mlir_math_ctlz_i32(%[[VAL_0]]) : (i32) -> i32
				// CHECK: return
				// CHECK: }

				// CHECK-LABEL: func.func private @__mlir_math_ctlz_i32(
				// CHECK-SAME: %[[ARG:.*]]: i32
				// CHECK-SAME: ) -> i32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {
				// CHECK: %[[C_32:.*]] = arith.constant 32 : i32
				// CHECK: %[[C_0:.*]] = arith.constant 0 : i32
				// CHECK: %[[ARGCMP:.*]] = arith.cmpi eq, %[[ARG]], %[[C_0]] : i32
				// CHECK: %[[OUT:.*]] = scf.if %[[ARGCMP]] -> (i32) {
				// CHECK: scf.yield %[[C_32]] : i32
				// CHECK: } else {
				// CHECK: %[[C_1INDEX:.*]] = arith.constant 1 : index
				// CHECK: %[[C_1I32:.*]] = arith.constant 1 : i32
				// CHECK: %[[C_32INDEX:.*]] = arith.constant 32 : index
				// CHECK: %[[N:.*]] = arith.constant 0 : i32
				// CHECK: %[[FOR_RET:.]]:2 = scf.for %[[I:.]] = %[[C_1INDEX]] to %[[C_32INDEX]] step %[[C_1INDEX]]
				// CHECK: iter_args(%[[ARG_ITER:.]] = %[[ARG]], %[[N_ITER:.]] = %[[N]]) -> (i32, i32) {
				// CHECK: %[[COND:.*]] = arith.cmpi slt, %[[ARG_ITER]], %[[C_0]] : i32
				// CHECK: %[[IF_RET:.*]]:2 = scf.if %[[COND]] -> (i32, i32) {
				// CHECK: scf.yield %[[ARG_ITER]], %[[N_ITER]] : i32, i32
				// CHECK: } else {
				// CHECK: %[[N_NEXT:.*]] = arith.addi %[[N_ITER]], %[[C_1I32]] : i32
				// CHECK: %[[ARG_NEXT:.*]] = arith.shli %[[ARG_ITER]], %[[C_1I32]] : i32
				// CHECK: scf.yield %[[ARG_NEXT]], %[[N_NEXT]] : i32, i32
				// CHECK: }
				// CHECK: scf.yield %[[IF_RET]]#0, %[[IF_RET]]#1 : i32, i32
				// CHECK: }
				// CHECK: scf.yield %[[FOR_RET]]#1 : i32
				// CHECK: }
				// CHECK: return %[[OUT]] : i32
				// CHECK: }
				func.func @main(%arg0: i32) {
				%0 = math.ctlz %arg0 : i32
				func.return
				}

				// -----

				// Check that i8 input is preserved

				// CHECK-LABEL: func.func @main(
				// CHECK-SAME: %[[VAL_0:.*]]: i8
				// CHECK-SAME: ) {
				// CHECK: %[[VAL_1:.*]] = call @__mlir_math_ctlz_i8(%[[VAL_0]]) : (i8) -> i8
				// CHECK: return
				// CHECK: }

				// CHECK-LABEL: func.func private @__mlir_math_ctlz_i8(
				// CHECK-SAME: %[[ARG:.*]]: i8
				// CHECK-SAME: ) -> i8 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {
				// CHECK: %[[C_8:.*]] = arith.constant 8 : i8
				// CHECK: %[[C_0:.*]] = arith.constant 0 : i8
				// CHECK: %[[ARGCMP:.*]] = arith.cmpi eq, %[[ARG]], %[[C_0]] : i8
				// CHECK: %[[OUT:.*]] = scf.if %[[ARGCMP]] -> (i8) {
				// CHECK: scf.yield %[[C_8]] : i8
				// CHECK: } else {
				// CHECK: %[[C_1INDEX:.*]] = arith.constant 1 : index
				// CHECK: %[[C_1I32:.*]] = arith.constant 1 : i8
				// CHECK: %[[C_8INDEX:.*]] = arith.constant 8 : index
				// CHECK: %[[N:.*]] = arith.constant 0 : i8
				// CHECK: %[[FOR_RET:.]]:2 = scf.for %[[I:.]] = %[[C_1INDEX]] to %[[C_8INDEX]] step %[[C_1INDEX]]
				// CHECK: iter_args(%[[ARG_ITER:.]] = %[[ARG]], %[[N_ITER:.]] = %[[N]]) -> (i8, i8) {
				// CHECK: %[[COND:.*]] = arith.cmpi slt, %[[ARG_ITER]], %[[C_0]] : i8
				// CHECK: %[[IF_RET:.*]]:2 = scf.if %[[COND]] -> (i8, i8) {
				// CHECK: scf.yield %[[ARG_ITER]], %[[N_ITER]] : i8, i8
				// CHECK: } else {
				// CHECK: %[[N_NEXT:.*]] = arith.addi %[[N_ITER]], %[[C_1I32]] : i8
				// CHECK: %[[ARG_NEXT:.*]] = arith.shli %[[ARG_ITER]], %[[C_1I32]] : i8
				// CHECK: scf.yield %[[ARG_NEXT]], %[[N_NEXT]] : i8, i8
				// CHECK: }
				// CHECK: scf.yield %[[IF_RET]]#0, %[[IF_RET]]#1 : i8, i8
				// CHECK: }
				// CHECK: scf.yield %[[FOR_RET]]#1 : i8
				// CHECK: }
				// CHECK: return %[[OUT]] : i8
				// CHECK: }
				func.func @main(%arg0: i8) {
				%0 = math.ctlz %arg0 : i8
				func.return
				}

mlir/test/Conversion/MathToFuncs/ctlz_runner.mlir

This file was added.

				// RUN: mlir-opt %s \
				// RUN: -pass-pipeline="builtin.module( \
				// RUN: convert-math-to-funcs{convert-ctlz}, \
				// RUN: func.func(convert-scf-to-cf,convert-arith-to-llvm), \
				// RUN: convert-func-to-llvm, \
				// RUN: convert-cf-to-llvm, \
				// RUN: reconcile-unrealized-casts)" \
				// RUN: \| mlir-cpu-runner -e test_7i32_to_29 -entry-point-result=i32 \| FileCheck %s --check-prefix=CHECK_TEST_7i32_TO_29

				func.func @test_7i32_to_29() -> i32 {
				%arg = arith.constant 7 : i32
				%0 = math.ctlz %arg : i32
				func.return %0 : i32
				}
				// CHECK_TEST_7i32_TO_29: 29

				// RUN: mlir-opt %s \
				// RUN: -pass-pipeline="builtin.module( \
				// RUN: convert-math-to-funcs{convert-ctlz}, \
				// RUN: func.func(convert-scf-to-cf,convert-arith-to-llvm), \
				// RUN: convert-func-to-llvm, \
				// RUN: convert-cf-to-llvm, \
				// RUN: reconcile-unrealized-casts)" \
				// RUN: \| mlir-cpu-runner -e test_zero -entry-point-result=i32 \| FileCheck %s --check-prefix=CHECK_TEST_ZERO

				func.func @test_zero() -> i32 {
				%arg = arith.constant 0 : i32
				%0 = math.ctlz %arg : i32
				func.return %0 : i32
				}
				// CHECK_TEST_ZERO: 32

				// Apparently mlir-cpu-runner doesn't support i8 return values, so testing i64 instead
				// RUN: mlir-opt %s \
				// RUN: -pass-pipeline="builtin.module( \
				// RUN: convert-math-to-funcs, \
				// RUN: func.func(convert-scf-to-cf,convert-arith-to-llvm), \
				// RUN: convert-func-to-llvm, \
				// RUN: convert-cf-to-llvm, \
				// RUN: reconcile-unrealized-casts)" \
				// RUN: \| mlir-cpu-runner -e test_7i64_to_61 -entry-point-result=i64 \| FileCheck %s --check-prefix=CHECK_TEST_7i64_TO_61

				func.func @test_7i64_to_61() -> i64 {
				%arg = arith.constant 7 : i64
				%0 = math.ctlz %arg : i64
				func.return %0 : i64
				}
				// CHECK_TEST_7i64_TO_61: 61

utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 6,381 Lines • ▼ Show 20 Lines	deps = [
":ControlFlowDialect",		":ControlFlowDialect",
":ConversionPassIncGen",		":ConversionPassIncGen",
":DialectUtils",		":DialectUtils",
":FuncDialect",		":FuncDialect",
":IR",		":IR",
":LLVMDialect",		":LLVMDialect",
":MathDialect",		":MathDialect",
":Pass",		":Pass",
		":SCFDialect",
":Transforms",		":Transforms",
":VectorDialect",		":VectorDialect",
":VectorUtils",		":VectorUtils",
"//llvm:Support",		"//llvm:Support",
],		],
)		)

gentbl_cc_library(		gentbl_cc_library(
▲ Show 20 Lines • Show All 4,394 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Add support for math.ctlz in convert-math-to-funcsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 509749

mlir/include/mlir/Conversion/Passes.td

mlir/lib/Conversion/MathToFuncs/CMakeLists.txt

mlir/lib/Conversion/MathToFuncs/MathToFuncs.cpp

mlir/test/Conversion/MathToFuncs/ctlz.mlir

mlir/test/Conversion/MathToFuncs/ctlz_runner.mlir

utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

Add support for math.ctlz in convert-math-to-funcs
ClosedPublic