mlir/lib/Bindings/Python/mlir/runtime/memref/np_to_memref.py
27
mlir/test/Bindings/Python/execution_engine.py
144	This will not work with a memref that has strides I believe. You'll need to use something like `np.lib.stride_tricks.as_strided`. I think this should be a utility in the runtime as well `memref_to_numpy_view` or something like that. (If you can add a test with an array and a view with strides that could show this)
183	Debug left-over? (same below)

Clean left over.

Harbormaster completed remote builds in B97613: Diff 335948.Apr 7 2021, 5:10 PM

Harbormaster completed remote builds in B97615: Diff 335950.

Harbormaster completed remote builds in B97617: Diff 335952.Apr 7 2021, 5:18 PM

bondhugula added reviewers: mehdi_amini, stellaraccident, bondhugula.Apr 7 2021, 8:27 PM

I'd really like to see this support! Thanks for implementing this.

mlir/lib/Bindings/Python/mlir/runtime/memref/np_to_memref.py
5	Please add a line on what this file is about.
42	Doc comment.
mlir/test/Bindings/Python/execution_engine.py
136	Terminate all comments with a full stop.
144	Is there a way to restrict this patch to identity layout map? This would enable and unlock many things even if it just handles the default/row major identity layout. Can we check and bail out / assert on non-default strides?
191	Unrelated to this PR: If the wrong number of arguments are provided, will `ExecutionEngine` catch it? I think from a developer standpoint, this is quite useful since folks might sometimes lower the MLIR module to a form where there is a mismatch in the number of function arguments.
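The identity-layout restriction requested for line 144 could be enforced with a check along these lines. This is a minimal sketch, not code from the patch: `has_identity_layout` and `require_identity_layout` are hypothetical helper names, and the check assumes that NumPy's `C_CONTIGUOUS` flag is an acceptable proxy for the default row-major identity layout.

```python
import numpy as np


def has_identity_layout(nparray):
    """Return True if the array is dense row-major (default strides)."""
    # Hypothetical helper: C_CONTIGUOUS is set only when the byte strides
    # match the default row-major strides for the array's shape.
    return bool(nparray.flags["C_CONTIGUOUS"])


def require_identity_layout(nparray):
    """Bail out on strided inputs instead of building a wrong descriptor."""
    if not has_identity_layout(nparray):
        raise ValueError("only identity (row-major contiguous) layouts are supported")
    return nparray


dense = np.zeros((4, 4), dtype=np.float32)
strided = dense[:, ::2]  # every other column: a non-default stride
```

Here `require_identity_layout(dense)` returns the array unchanged, while `require_identity_layout(strided)` raises `ValueError`.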

This revision now requires changes to proceed.Apr 7 2021, 8:35 PM

bondhugula added inline comments.Apr 7 2021, 8:36 PM

mlir/lib/Bindings/Python/mlir/runtime/memref/np_to_memref.py
48	`get_ranked_memref_descriptor`? Here and elsewhere, can we name these things descriptively?

mehdi_amini added inline comments.Apr 7 2021, 8:50 PM

mlir/test/Bindings/Python/execution_engine.py
144	The stride is really easy to add though, maybe easier than bailing out (and adding the tests for all the bail out). But if you prefer to add these tests, I can implement the strides in a follow up.
191	This is a layer of consistency that is important, but we'll have to build above this, these are all quite low-level APIs: the execution engine does not have any knowledge of the APIs present in the Module.

The only comment I have (not covered elsewhere) is a weak preference to unnest by one directory level (i.e. remove the memref directory and put this in the parent). All of the exported APIs already have memref in the name and fewer imports/less repeating for these kind of things is better, imo. Also fine to see how things evolve and rework the namespace once/if there is more there.

Added support for strided memrefs.

nicolasvasilache added inline comments.Apr 9 2021, 1:40 PM

mlir/lib/Bindings/Python/mlir/runtime/memref/np_to_memref.py
69	Note that numpy uses byte quantities to express strides. MLIR OTOH uses the torch abstraction which specifies strides in terms of elements; the ConversionToLLVM takes care of generating the right addresses (which will also require further hooking the data layout better). Bottom line, I experimented with `memref<...xf32>` by hacking the following on top of this:
+    # x.strides = nparray.ctypes.strides
+    strides_ctype_t = ctypes.c_longlong * nparray.ndim
+    x.strides = strides_ctype_t(*[t // 4 for t in nparray.strides])
It seemed to help a bit but I still saw issues that I have not yet debugged. I am not sure whether the ctype lifetime is reasonable in the code I wrote.
mlir/test/Bindings/Python/execution_engine.py
144	Also note the discrepancy between np.strides and memref / torch strides I signal above.
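The byte-versus-element stride discrepancy noted above can be sketched for any dtype as follows. This is an illustration under assumptions rather than the code that landed: `element_strides` is a hypothetical helper name, and it divides by `nparray.itemsize` instead of the hard-coded `4` from the experiment.

```python
import ctypes

import numpy as np


def element_strides(nparray):
    """Return the array's strides in elements as a ctypes int64 array."""
    # NumPy reports strides in bytes; a memref descriptor wants them in
    # elements, so divide by the element size of the array's dtype.
    itemsize = nparray.itemsize
    strides_ctype_t = ctypes.c_longlong * nparray.ndim
    return strides_ctype_t(*[s // itemsize for s in nparray.strides])


a32 = np.zeros((2, 3), dtype=np.float32)  # byte strides (12, 4)
a64 = np.zeros((2, 3), dtype=np.float64)  # byte strides (24, 8)
view = a64[:, ::2]                        # byte strides (24, 16)

print(list(element_strides(a32)))
print(list(element_strides(a64)))
print(list(element_strides(view)))
```

Both dense arrays yield the same element strides regardless of dtype, which is the property a hard-coded divisor cannot provide.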

Remove the memref directory and put files in the parent.

nicolasvasilache added inline comments.Apr 9 2021, 1:47 PM

mlir/lib/Bindings/Python/mlir/runtime/memref/np_to_memref.py
111	nice, you also want to pointwise-multiply by the element size in bytes.
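The reverse direction (element strides in a descriptor back to a NumPy view) is where the pointwise multiplication by the element size comes in. The sketch below is an assumption-laden illustration: `strided_view` is a hypothetical helper, it assumes non-negative strides and a non-empty shape, and it takes the pointer, shape and strides as plain arguments rather than reading them from a real memref descriptor.

```python
import ctypes

import numpy as np


def strided_view(aligned_ptr, shape, elem_strides, offset=0):
    """View the memory behind a ctypes pointer as a NumPy array, without copying."""
    itemsize = ctypes.sizeof(aligned_ptr._type_)
    # Number of elements from the pointer up to the last one the view touches
    # (assumes non-negative strides and no zero-sized dimension).
    extent = offset + sum((n - 1) * s for n, s in zip(shape, elem_strides)) + 1
    flat = np.ctypeslib.as_array(aligned_ptr, shape=(extent,))
    # NumPy wants byte strides: multiply each element stride by the item size.
    byte_strides = tuple(s * itemsize for s in elem_strides)
    return np.lib.stride_tricks.as_strided(
        flat[offset:], shape=tuple(shape), strides=byte_strides)


base = np.arange(12, dtype=np.float64)
ptr = base.ctypes.data_as(ctypes.POINTER(ctypes.c_double))
view = strided_view(ptr, shape=(2, 3), elem_strides=(6, 2))
```

With element strides `(6, 2)`, `view[i, j]` aliases `base[6 * i + 2 * j]`, so writes through the view are visible in `base`.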

Harbormaster completed remote builds in B98064: Diff 336556.Apr 9 2021, 2:17 PM

Harbormaster completed remote builds in B98065: Diff 336557.Apr 9 2021, 2:35 PM

Cleaning up.

pashu123 marked 3 inline comments as done.Apr 11 2021, 1:15 PM

pashu123 marked an inline comment as done.Apr 11 2021, 1:23 PM

Harbormaster completed remote builds in B98169: Diff 336686.Apr 11 2021, 1:43 PM

nicolasvasilache requested changes to this revision.Apr 12 2021, 12:08 AM

nicolasvasilache added inline comments.

mlir/lib/Bindings/Python/mlir/runtime/np_to_memref.py
74 ↗	(On Diff #336686)	The `4` is only valid for 4-byte type like `f32`, you need to get the size in bytes of the type (here and other places you updated).

This revision now requires changes to proceed.Apr 12 2021, 12:08 AM

pashu123 added inline comments.Apr 12 2021, 12:15 AM

mlir/lib/Bindings/Python/mlir/runtime/np_to_memref.py
74 ↗	(On Diff #336686)	I see, thanks for pointing this out.

Fix itemsize.

Formatting.

pashu123 marked an inline comment as done.Apr 12 2021, 12:27 AM

Harbormaster completed remote builds in B98223: Diff 336761.Apr 12 2021, 12:57 AM

Harbormaster completed remote builds in B98224: Diff 336763.Apr 12 2021, 1:03 AM

bondhugula requested changes to this revision.Apr 12 2021, 5:21 AM

bondhugula added inline comments.

mlir/lib/Bindings/Python/mlir/runtime/np_to_memref.py
74 ↗	(On Diff #336686)	This also means that test cases were missing - you don't seem to have f64 test cases.
mlir/test/Bindings/Python/execution_engine.py
159	You are missing test cases on float64 and so this isn't exercising all of the code well.

This revision now requires changes to proceed.Apr 12 2021, 5:21 AM

nicolasvasilache added inline comments.Apr 12 2021, 6:30 AM

mlir/lib/Bindings/Python/mlir/runtime/np_to_memref.py
59 ↗	(On Diff #336763)	This is not the element size but the displacement from the "base pointer address" at which the relevant data lives. You probably picked the value here: https://numpy.org/doc/stable/reference/generated/numpy.ndarray.html ? This shows that, similarly to strides, numpy represents this "offset in bytes" whereas MLIR is in number of elements. I verified that setting this to 0 in my local experiment (here and below) worked in simple cases. In the general case you need to translate between MLIR and NP offsets similarly to what you do for strides.

Herald added a subscriber: shabalin. Apr 12 2021, 6:30 AM

pashu123 added inline comments.Apr 12 2021, 11:30 AM

mlir/lib/Bindings/Python/mlir/runtime/np_to_memref.py
59 ↗	(On Diff #336763)	I tried setting this to multiple different values and it worked. If offset is represented as a number of elements in MLIR, then the offset should be 1. Similarly, it should be 1 * nparray.itemsize while converting back to NumPy from memrefs world.

nicolasvasilache added inline comments.Apr 12 2021, 11:43 AM

mlir/lib/Bindings/Python/mlir/runtime/np_to_memref.py
59 ↗	(On Diff #336763)	I am unclear what "it worked" means in the absence of more context on what you tried? Please note that lowering will do the right thing in the static case here: https://github.com/llvm/llvm-project/blob/main/mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp#L3399. The problem I mention is only exhibited by dynamic offsets which I do not see your tests implementing. For instance, this commit will fail with an offset different than `0` (at runtime, in this case I hardcoded 0 for expediency while waiting for this PR to land): https://github.com/google/iree-llvm-sandbox/commit/87015b29c5f7cbc445bd85e1ce4a5d7597e80361. The underlying reason is that after tiling, the offset becomes dependent on the loop IV (i.e. the memref has a static `?` in its type) and then the dynamic branch of the StandardToLLVM.cpp will kick in. Address computation will compute an offset that is 1 + actual value and will segfault at runtime.

pashu123 added inline comments.Apr 13 2021, 4:45 AM

mlir/lib/Bindings/Python/mlir/runtime/np_to_memref.py
59 ↗	(On Diff #336763)	Thanks for pointing this out. I didn't try on Dynamic memrefs.

Redirecting prints to stderr.

Harbormaster completed remote builds in B98464: Diff 337108.Apr 13 2021, 6:02 AM

LGTM, seems like a good basis to iterate on!

@bondhugula @nicolasvasilache : can you confirm the recent changes address your previous comments?

@mehdi_amini not yet, the author should at the very least set the 2 offsets to 0 as described in the commit message of https://github.com/google/iree-llvm-sandbox/commit/87015b29c5f7cbc445bd85e1ce4a5d7597e80361, so that it passes without crashing.

I'm fine with iterating in-tree once that is fixed.

mehdi_amini added inline comments.Apr 13 2021, 4:04 PM

mlir/lib/Bindings/Python/mlir/runtime/np_to_memref.py
59 ↗	(On Diff #336763)	I don't quite understand this `x.offset = ctypes.c_longlong(4)`. Shouldn't it always be zero? I suspect Numpy is baking the offset into the pointer we get with `nparray.ctypes.data_as`.

@mehdi_amini indeed, in https://numpy.org/doc/stable/reference/generated/numpy.ndarray.html we can see:

>>> np.ndarray((2,), buffer=np.array([1,2,3]),
...            offset=np.int_().itemsize,
...            dtype=int) # offset = 1*itemsize, i.e. skip first element
array([2, 3])

as well as the default = 0 for offset:

class numpy.ndarray(shape, dtype=float, buffer=None, offset=0, strides=None, order=None)[source]¶

0 is the right value to use for now.
When we do fancier numpy / linalg level subviews etc. it will be a different story; but for now anything other than 0 will crash non-trivial cases.

In D100077#2687200, @nicolasvasilache wrote:
@mehdi_amini indeed, in https://numpy.org/doc/stable/reference/generated/numpy.ndarray.html we can see:
>>> np.ndarray((2,), buffer=np.array([1,2,3]),
...            offset=np.int_().itemsize,
...            dtype=int) # offset = 1*itemsize, i.e. skip first element
array([2, 3])

Right, but the ctypes interface does not expose the offset, so I wonder if in such cases the offset that you provide on construction will not just be computed in the "data" field internally (I could check the source code for ndarray...)
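The question of whether the constructor's `offset` ends up folded into the data pointer can be checked directly from Python. The following is a small sketch reusing the `np.ndarray(..., offset=...)` example quoted above; the variable names are illustrative only.

```python
import numpy as np

base = np.array([1, 2, 3])
# Same construction as the quoted NumPy documentation example: skip one element.
shifted = np.ndarray((2,), buffer=base, offset=base.itemsize, dtype=base.dtype)

# Difference between the two data pointers, in bytes.
pointer_delta = shifted.ctypes.data - base.ctypes.data
print(pointer_delta == base.itemsize)
```

If the data pointer already includes the byte offset, a descriptor built from `nparray.ctypes.data` can carry an element offset of 0, which is consistent with the value the review settles on.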

Changing the default value of offset to zero.

In D100077#2687134, @nicolasvasilache wrote:

@mehdi_amini not yet, the author should at the very least set the 2 offsets to 0 as described in the commit message of https://github.com/google/iree-llvm-sandbox/commit/87015b29c5f7cbc445bd85e1ce4a5d7597e80361, so that it passes without crashing.

I'm fine with iterating in-tree once that is fixed.

Done. I have changed it to zero. I had pushed the wrong changes.

In D100077#2687129, @mehdi_amini wrote:

@bondhugula @nicolasvasilache : can you confirm the recent changes address your previous comments?

Is this revision also meant to handle dynamic memrefs? If yes, a test case is missing. If not, we should check and assert (and add a TODO)?

Harbormaster completed remote builds in B98616: Diff 337336.Apr 14 2021, 12:24 AM

In D100077#2687846, @bondhugula wrote:

In D100077#2687129, @mehdi_amini wrote:

@bondhugula @nicolasvasilache : can you confirm the recent changes address your previous comments?

Is this revision also meant to handle dynamic memrefs? If yes, a test case is missing. If not, we should check and assert (and add a TODO)?

It supports a subset of cases (e.g. anything that takes subviews and crosses the NP -> MLIR or MLIR -> NP boundary should be considered broken atm).
+1 on adding a simple mixed static / dynamic 2-D add and compare against NP's.

Adding test case of element wise addition of dynamic and static memrefs.

Great, thanks for your contribution, let's land this ! :)

nicolasvasilache accepted this revision.Apr 15 2021, 12:17 PM

Harbormaster completed remote builds in B98989: Diff 337860.Apr 15 2021, 1:39 PM

This revision was not accepted when it landed; it landed in state Needs Review.Apr 15 2021, 4:41 PM

Closed by commit rG102fd1cb8b40: Add support for numpy arrays to memref conversions. (authored by pashu123, committed by mehdi_amini).

This revision was automatically updated to reflect the committed changes.

mehdi_amini added a commit: rG102fd1cb8b40: Add support for numpy arrays to memref conversions..

Diff 335948

mlir/lib/Bindings/Python/mlir/runtime/__init__.py

This file was added.

This is an empty file.

mlir/lib/Bindings/Python/mlir/runtime/memref/__init__.py

This file was added.

from .np_to_memref import *

mlir/lib/Bindings/Python/mlir/runtime/memref/np_to_memref.py

This file was added.

# Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
# See https://llvm.org/LICENSE.txt for license information.
# SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

import numpy as np

import ctypes

def make_nd_memref_descriptor(rank, dtype):
  class MemRefDescriptor(ctypes.Structure):
    """
    Build an empty descriptor for the given rank/dtype, where rank>0.
    """
    _fields_ = [
      ("allocated", ctypes.c_longlong),
      ("aligned", ctypes.POINTER(dtype)),
      ("offset", ctypes.c_longlong),
      ("shape", ctypes.c_longlong * rank),
      ("strides", ctypes.c_longlong * rank),
    ]

  return MemRefDescriptor

def make_zero_d_memref_descriptor(dtype):
  class MemRefDescriptor(ctypes.Structure):
    """
    Build an empty descriptor for the given rank/dtype, where rank=0.
    """
    _fields_ = [
      ("allocated", ctypes.c_longlong),
      ("aligned", ctypes.POINTER(dtype)),
      ("offset", ctypes.c_longlong),
    ]

  return MemRefDescriptor

mehdi_amini (inline suggestion on the docstring above, Not Done):

-    Build an empty descriptor for the given rank/dtype, where rank=0.
+    Build an empty descriptor for the given dtype, where rank=0.

class UnrankedMemRefDescriptor(ctypes.Structure):
  """ Creates a ctype struct for memref descriptor"""
  _fields_ = [
    ("rank", ctypes.c_longlong),

    ("descriptor", ctypes.c_void_p)
  ]

def to_memref(nparray):
  """

  Return a ranked memref descriptor for the given numpy array.
  """
  if nparray.ndim == 0:
    x = make_zero_d_memref_descriptor(np.ctypeslib.as_ctypes_type(nparray.dtype))()
    x.allocated = nparray.ctypes.data
    x.aligned = nparray.ctypes.data_as(
      ctypes.POINTER(np.ctypeslib.as_ctypes_type(nparray.dtype))
    )
    x.offset = ctypes.c_longlong(nparray.dtype.itemsize)
    return x

  x = make_nd_memref_descriptor(
    nparray.ndim, np.ctypeslib.as_ctypes_type(nparray.dtype)
  )()
  x.allocated = nparray.ctypes.data
  x.aligned = nparray.ctypes.data_as(
    ctypes.POINTER(np.ctypeslib.as_ctypes_type(nparray.dtype))
  )
  x.offset = ctypes.c_longlong(nparray.dtype.itemsize)
  x.shape = nparray.ctypes.shape
  x.strides = nparray.ctypes.strides

  return x

def make_unranked_memref_descriptor(nparray):
  '''
  Return a generic/unranked memref descriptor for the given numpy array.
  '''
  d = UnrankedMemRefDescriptor()
  d.rank = nparray.ndim
  x = to_memref(nparray)
  d.descriptor = ctypes.cast(ctypes.pointer(x), ctypes.c_void_p)
  return d


mlir/test/Bindings/Python/execution_engine.py

# RUN: %PYTHON %s 2>&1 | FileCheck %s

import gc, sys
from mlir.ir import *
from mlir.passmanager import *
from mlir.execution_engine import *
+from mlir.runtime.memref import *

# Log everything to stderr and flush so that we have a unified stream to match
# errors/info emitted by MLIR to stderr.
def log(*args):
  print(*args, file=sys.stderr)
  sys.stderr.flush()

def run(f):
[... 111 lines collapsed in the diff view; context: func private @some_callback_into_python(f32, i32) -> f32 attributes { llvm.emit_c_interface } ...]
    arg0 = c_float_p(42.)
    arg1 = c_int_p(2)
    res = c_float_p(-1.)
    execution_engine.invoke("add", arg0, arg1, res)
    # CHECK: 42.0 + 2 = 44.0
    log("{0} + {1} = {2}".format(arg0[0], arg1[0], res[0]*2))

run(testBasicCallback)

+# Test callback with a memref
+# CHECK-LABEL: TEST: testMemRefCallback
+def testMemRefCallback():
+  # Define a callback function that takes a memref, converts it to a numpy array and prints it.
+  @ctypes.CFUNCTYPE(None, ctypes.POINTER(UnrankedMemRefDescriptor))
+  def callback(a):
+    d = make_nd_memref_descriptor(a[0].rank, ctypes.c_float)
+    x = ctypes.cast(a[0].descriptor, ctypes.POINTER(d))
+    arr = np.ctypeslib.as_array(x[0].aligned, shape=x[0].shape)
+    print("Inside callback: ")
+    print (arr)
+
+  with Context():
+    # The module just forwards to a runtime function known as "some_callback_into_python".
+    module = Module.parse(r"""
+func @callback_memref(%arg0: memref<*xf32>) attributes { llvm.emit_c_interface } {
+  call @some_callback_into_python(%arg0) : (memref<*xf32>) -> ()
+  return
+}
+func private @some_callback_into_python(memref<*xf32>) -> () attributes { llvm.emit_c_interface }
+""")
+    execution_engine = ExecutionEngine(lowerToLLVM(module))
+    execution_engine.register_runtime("some_callback_into_python", callback)
+    inp_arr = np.array([[1.,2.],[3.,4.]], np.float32)
+    # CHECK: Inside callback:
+    # CHECK: [[1. 2.
+    # CHECK: 3. 4.]]
+    execution_engine.invoke("callback_memref", ctypes.pointer(ctypes.pointer(make_unranked_memref_descriptor(inp_arr))))
+
+run(testMemRefCallback)
+
+def testInvokeMemrefAdd():
+  with Context():
+    module = Module.parse(
+      """
+    module {
+      func @main(%arg0: memref<1xf32>, %arg1: memref<f32>, %arg2: memref<1xf32>) attributes { llvm.emit_c_interface } {
+        %0 = constant 0 : index
+        %1 = memref.load %arg0[%0] : memref<1xf32>
+        %2 = memref.load %arg1[] : memref<f32>
+        %3 = addf %1, %2 : f32
+        memref.store %3, %arg2[%0] : memref<1xf32>
+        return
+      }
+    } """
+    )
+    arg1 = np.array([32.5]).astype(np.float32)
+    print(arg1)
+    arg2 = np.array(6).astype(np.float32)
+    print(arg2)
+    res = np.array([0]).astype(np.float32)
+
+    arg1_memref_ptr = ctypes.pointer(ctypes.pointer(to_memref(arg1)))
+    arg2_memref_ptr = ctypes.pointer(ctypes.pointer(to_memref(arg2)))
+    res_memref_ptr = ctypes.pointer(ctypes.pointer(to_memref(res)))
+
+    execution_engine = ExecutionEngine(lowerToLLVM(module))
+    execution_engine.invoke("main", arg1_memref_ptr, arg2_memref_ptr, res_memref_ptr)
+    # CHECK: [32.5] + 6.0 = [38.5]
+    log("{0} + {1} = {2}".format(arg1, arg2, res))
+
+run(testInvokeMemrefAdd)

This is an archive of the discontinued LLVM Phabricator instance.

Add support for numpy arrays to memref conversions.
ClosedPublic
