This is an archive of the discontinued LLVM Phabricator instance.

Differential D93525

[clang-offload-bundler] Add unbundling of archives containing bundled object files into device specific archives
ClosedPublic

Authored by saiislam on Dec 18 2020, 1:51 AM.

Details

Reviewers

grokos
hfinkel
jdoerfert
JonChesterfield
ronlieb
ABataev
mdtoguchi
kbobrovs
sdmitriev
gregrodgers
kkwli0
dreachem
Tyker
jsjodin
yaxunl
t-tye

Commits

rGf7ce532d622d: [clang-offload-bundler] Add unbundling of archives containing bundled object…

Summary

This patch adds support for unbundling an archive file. It takes an
archive file and a set of offload targets as input, and produces a
device-specific archive for each given offload target.
The input archive contains bundled code objects created with
clang-offload-bundler. Each generated device-specific archive contains
a set of device code object files named
<Parent Bundle Name>-<CodeObject-GPUArch>.
Entries in the input archive can be of any binary type supported by
clang-offload-bundler, such as *.bc. The output archives contain
files of the same type.
Example usage:
clang-offload-bundler --unbundle --inputs=lib-generic.a -type=a -targets=openmp-amdgcn-amdhsa--gfx906,openmp-amdgcn-amdhsa--gfx908 -outputs=devicelib-gfx906.a,deviceLib-gfx908.a

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

saiislam created this revision. Dec 18 2020, 1:51 AM

saiislam requested review of this revision. Dec 18 2020, 1:51 AM

Herald added a project: Restricted Project. Dec 18 2020, 1:51 AM

Herald added a subscriber: cfe-commits.

saiislam edited the summary of this revision. Dec 18 2020, 1:54 AM

Refer to D80816 for earlier review of this patch.

Harbormaster completed remote builds in B82922: Diff 312724. Dec 18 2020, 2:29 AM

ABataev added inline comments. Dec 18 2020, 8:49 AM

clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp
186–188	No need for `else` here
1065–1069	No need for `else` here
1072–1074	I think llvm Support lib has all required functions for this.
1112	Do not use `auto` where the type is not obvious.
1160–1161	Just `continue` and make `else if` just `if`

sdmitriev mentioned this in D94005: [clang-offload-bundler] Add support for unbundling archives with fat objects. Jan 4 2021, 5:57 AM

Modified to handle multiple targets/outputs in one run of the tool for archive unbundling. Other minor changes as requested in the review.

saiislam marked 3 inline comments as done. Jan 5 2021, 8:47 AM

saiislam added inline comments.

clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp
1160–1161	That wasn't possible with the current code flow; there is work to be done in the failure case as well.

saiislam edited the summary of this revision. Jan 5 2021, 8:49 AM

saiislam added reviewers: yaxunl, t-tye.

Harbormaster completed remote builds in B84066: Diff 314633. Jan 5 2021, 9:55 AM

Ping.

Can you document this in ClangOffloadBundler.rst? I think we need a clear description of how clang-offload-bundler knows which file in the .a file belongs to which target.

In reply to @yaxunl (D93525#2493024):

How does the .a relate to bundled code objects? Does the .a have a number of bundled code objects? If so wouldn't the identity of code objects be defined by the existing bundled code object ABI already documented? If the .a is a set of non-bundled code objects then defining how they are identified is not part of the clang-offload-bundler documentation as there are no bundled code objects involved. It would seem that the documentation belongs with the OpenMP runtime/compiler that is choosing to use .a files in this manner.

In reply to @t-tye (D93525#2493752):

Bundles (created using clang-offload-bundler) are passed to llvm-ar to create an archive of bundled objects (*.a file). An archive can have bundles for multiple device types. So, yes, the identity of code objects is defined by the existing bundled code object ABI.
This patch reads such an archive and produces a device-specific archive for each of the target devices given as input. Each device-specific archive contains all the code objects for that particular device and is written in the LLVM archive format.

Here is a snippet of relevant lit run lines:

// RUN: %clang -O0 -target %itanium_abi_triple %s -c -o %t.o

// RUN: echo 'Content of device file 1' > %t.tgt1
// RUN: clang-offload-bundler -type=o -targets=host-%itanium_abi_triple,openmp-amdgcn-amd-amdhsa-gfx900 -inputs=%t.o,%t.tgt1 -outputs=%t.abundle1.o
 
// RUN: echo 'Content of device file 2' > %t.tgt2
// RUN: clang-offload-bundler -type=o -targets=host-%itanium_abi_triple,openmp-amdgcn-amd-amdhsa-gfx900 -inputs=%t.o,%t.tgt2 -outputs=%t.abundle2.o
 
// RUN: llvm-ar cr %t.lib.a %t.abundle1.o %t.abundle2.o

This patch ==>
// RUN: clang-offload-bundler -unbundle -type=a -targets=openmp-amdgcn-amd-amdhsa-gfx900 -inputs=%t.lib.a -outputs=%t.devicelib.a

%t.devicelib.a will contain all device objects corresponding to gfx900.

Though my interest originates from the OpenMP side, device-specific archive libraries created like this can be used by other offloading languages such as HIP, CUDA, and OpenCL. Please refer to D81109 for an earlier patch in the series of patches that will enable this.

In reply to @saiislam (D93525#2495374):

The naming of code objects in a bundled code object includes the processor name and the settings for target features (see https://clang.llvm.org/docs/ClangOffloadBundler.html#target-id and https://llvm.org/docs/AMDGPUUsage.html#target-id). The compatibility of code objects considers both target processor matching and target feature compatibility. Target features can have three settings: on, off and any. For two code objects to be compatible, each feature that is on or off must match exactly, whereas any matches either on or off.

So when unbundling an archive, how is the desired code object being requested? How are the target features handled? For example, if code objects compatible with a feature being on are required, then the matching code objects in the archive would be those that have that feature set to either on or any.
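
The matching rule described in this comment can be sketched as a small standalone check. This is only an illustration, not code from the patch or from any runtime: the names `TargetID`, `parseTargetID` and `isCompatible` are hypothetical, the input syntax follows the `gfx908:sramecc+:xnack-` form used in this review, and the sketch assumes that a feature left unspecified means "any" on both sides of the comparison.

```cpp
#include <map>
#include <sstream>
#include <string>

// Hypothetical representation of a target ID such as "gfx908:sramecc+:xnack-".
// A feature that is not listed is treated as "any" (an assumption of this sketch).
struct TargetID {
  std::string Processor;
  std::map<std::string, char> Features; // feature name -> '+' (on) or '-' (off)
};

// Splits "<processor>[:<feature><+|->]..." into its parts.
TargetID parseTargetID(const std::string &Text) {
  TargetID ID;
  std::istringstream In(Text);
  std::getline(In, ID.Processor, ':');
  std::string Part;
  while (std::getline(In, Part, ':')) {
    if (Part.size() < 2)
      continue; // skip malformed pieces in this sketch
    ID.Features[Part.substr(0, Part.size() - 1)] = Part.back();
  }
  return ID;
}

// Processors must match exactly. A feature that both sides specify must have
// the same setting; a feature that either side leaves as "any" matches.
bool isCompatible(const TargetID &A, const TargetID &B) {
  if (A.Processor != B.Processor)
    return false;
  for (const auto &Feature : A.Features) {
    auto It = B.Features.find(Feature.first);
    if (It != B.Features.end() && It->second != Feature.second)
      return false;
  }
  return true;
}
```

With this rule, a request for `gfx908:sramecc+:xnack+` would accept archive entries built for `gfx908` or `gfx908:xnack+`, and reject one built for `gfx908:sramecc-:xnack+`.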

In reply to @t-tye (D93525#2495937):

At the moment this patch defines compatibility as exact string match of bundler entry ID. So, it doesn't fully support the target ID concept. However, the following example works.
Supporting target ID requires little more work and discussion.

// RUN: clang-offload-bundler -type=o -targets=host-%itanium_abi_triple,openmp-amdgcn-amd-amdhsa--gfx908 -inputs=%t.o,%t.tgt1 -outputs=%t.abundle1.o
// RUN: clang-offload-bundler -type=o -targets=host-%itanium_abi_triple,openmp-amdgcn-amd-amdhsa--gfx908:sramecc+:xnack+,openmp-amdgcn-amd-amdhsa--gfx908:sramecc-:xnack+ -inputs=%t.o,%t.tgt1,%t.tgt2 -outputs=%t.targetIDbundle.o
// RUN: llvm-ar cr %t.targetIDlib.a %t.abundle1.o %t.targetIDbundle.o
// RUN: clang-offload-bundler -unbundle -type=a -targets=openmp-amdgcn-amd-amdhsa--gfx908:sramecc+:xnack+ -inputs=%t.targetIDlib.a -outputs=%t.devicelibt-sramecc+.a
// RUN: llvm-ar t %t.devicelibt-sramecc+.a | FileCheck %s -check-prefix=SRAMECCplus
// SRAMECCplus: targetIDbundle.bc
// SRAMECCplus-NOT: abundle1.bc

At the moment this patch defines compatibility as exact string match of bundler entry ID.
[...]
Supporting target ID requires little more work and discussion.

Let's get this in first, then revisit target ID support as we need it.

In reply to @jdoerfert (D93525#2509796):

I do not think this patch should ignore target ID as that is now upstreamed and documented. What is involved in correcting the compatibility test to be correct by the target ID rules? There are examples of doing this in all the runtimes and I can help if that is useful.

This revision now requires changes to proceed. Jan 20 2021, 8:16 AM

In reply to @t-tye (D93525#2509836):

First, there is no reason not to have multiple patches as long as they are self contained and testable. Arguably, smaller patches are better.

That said, target ID is a new feature and, as discussed in the OpenMP call today, there is a chance we have to revisit this to support more involved information. As this discussion is open ended (and hasn't started yet), it seems absolutely sensible to continue with a tested and working patch that provides features we need for sure instead of forcing some support of a feature we don't use right now anyway.

Added support for optional TargetID during unbundling of archives.

saiislam retitled this revision from [OpenMP] Add unbundling of archives containing bundled object files into device specific archives to [clang-offload-bundler] Add unbundling of archives containing bundled object files into device specific archives. Feb 10 2021, 10:00 AM

saiislam edited the summary of this revision.

Harbormaster completed remote builds in B88658: Diff 322724. Feb 10 2021, 10:35 AM

ashi1 added a subscriber: ashi1. Apr 5 2021, 8:24 AM

Can we split this patch now and make progress?

cchen added a subscriber: cchen. May 19 2021, 8:30 AM

Removed TargetID support, to be reviewed in a followup patch.
Added OffloadTargetInfo class to encapsulate handling of bundle entry ID components: OffloadKind, Triple, GPUArch.

Removed unused header.

saiislam edited the summary of this revision. Jun 9 2021, 6:50 AM

Thanks for splitting this. I have only gone over it quickly.

clang/docs/ClangOffloadBundler.rst
137	not needed here.
clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp
1446	By now, a proper conditional seems appropriate.

Removed Triple format example from documentation and simplified conditional calling of bundling/unbundling functions.

Harbormaster completed remote builds in B108416: Diff 350897. Jun 9 2021, 8:19 AM

Ping.

Does this pass internal CI (ePSDB)? I am concerned about the enforcement of the canonical format of the target triple, since this may break backward compatibility.

Updated clang and hip tests to ensure that all 4 components of the triple are mandatorily present in the bundle entry ID.

Harbormaster completed remote builds in B110596: Diff 353922. Jun 23 2021, 4:46 AM

yaxunl added inline comments. Jun 23 2021, 7:04 AM

clang/lib/Driver/ToolChains/Clang.cpp
7632–7641	This is not HIP specific. Other toolchains could use a non-canonical triple too. Also, more components of the triple may be missing. A generic fix would be to use Triple::normalize for all toolchains. The same applies below.

Generalized padding of Triple fields of Bundle Entry ID while generating command for clang-offload-bundler.
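
The padding rule can be sketched outside of Clang as follows. This is a minimal standalone illustration using only the C++ standard library, not the code from the patch: the function name `makeBundleEntryID` and its parameters are hypothetical. It assumes the triple has already been normalized, and it appends hyphens until the arch lands in the 6th hyphen-separated field of the bundle entry ID.

```cpp
#include <algorithm>
#include <string>

// Hypothetical helper: builds "<kind>-<triple>" and, when an arch is given,
// pads with hyphens so that the arch becomes the 6th hyphen-separated field.
std::string makeBundleEntryID(const std::string &Kind,
                              const std::string &NormalizedTriple,
                              const std::string &Arch) {
  std::string ID = Kind + "-" + NormalizedTriple;
  if (Arch.empty())
    return ID; // no arch, so no padding is needed

  // A full four-component triple contains 3 hyphens and needs 1 separator
  // before the arch; each missing component needs one extra hyphen.
  const long Hyphens =
      std::count(NormalizedTriple.begin(), NormalizedTriple.end(), '-');
  for (long I = 4 - Hyphens; I > 0; --I)
    ID += "-";
  ID += Arch;
  return ID;
}
```

For a three-component triple such as `amdgcn-amd-amdhsa`, the helper emits two hyphens before the arch, which gives the `openmp-amdgcn-amd-amdhsa--gfx906` form seen in the test lines of this review.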

Harbormaster completed remote builds in B110788: Diff 354197. Jun 24 2021, 4:26 AM

LGTM. Thanks! Pls make sure it passes internal CI (ePSDB) before committing.

In reply to @yaxunl (D93525#2838535):

Sure, I will take care of it. Thanks!

Any comments @jdoerfert ?

Looks reasonable to me. We can always refine it as we go.

clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp
137
1263	Add a message to the asserts, here and in other places.

This revision was not accepted when it landed; it landed in state Needs Review. Jun 30 2021, 5:26 AM

This revision was landed with ongoing or failed builds.

Closed by commit rGf7ce532d622d: [clang-offload-bundler] Add unbundling of archives containing bundled object… (authored by saiislam).

This revision was automatically updated to reflect the committed changes.

saiislam added a commit: rGf7ce532d622d: [clang-offload-bundler] Add unbundling of archives containing bundled object….

@yaxunl
This patch on its own is failing in our internal CI. I have an internal patch (542569) to integrate it cleanly there.

grokos added inline comments. Jun 30 2021, 5:34 AM

clang/docs/ClangOffloadBundler.rst
128	A bit of wordplay, but it's weird that a triple now has 4 elements...
clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp
147	Leftover? `Components` is already 6 elements long.
1102	`CodeObject` --> `CodeObjectInfo`

In reply to @saiislam (D93525#2849815):

Fine. Thanks.

saiislam added inline comments. Jun 30 2021, 7:25 AM

clang/docs/ClangOffloadBundler.rst
128	I think llvm::Triple is named Triple for historical reasons. Otherwise, it already has these components (including the environment). As the llvm::Triple APIs don't force the presence of all components, this is not an issue in most cases, but in our case of Bundle Entry ID we need a way to differentiate between the 4th component of the Triple and a GPU arch, hence this rule.
clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp
147	Not necessarily. It is possible for a target to have fewer than 6 elements, for example in all bundling/unbundling cases that do not require the GPUArch field, e.g. "openmp-powerpc64le-ibm-linux-gnu".
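
To illustrate why a fixed number of fields is convenient here, the following standalone sketch splits a bundle entry ID into an offload kind, four triple fields and an optional GPU arch. It is not the `OffloadTargetInfo` code from the patch: `BundleEntryID` and `parseBundleEntryID` are hypothetical names, and the choice to stop splitting after the 5th hyphen (so that the last field keeps any hyphen of its own, as in `sramecc-`) is an assumption of this sketch.

```cpp
#include <array>
#include <cstddef>
#include <string>

// Hypothetical record for the pieces of a bundle entry ID.
struct BundleEntryID {
  std::string Kind;                        // e.g. "openmp", "hip", "host"
  std::array<std::string, 4> TripleFields; // arch, vendor, OS, environment
  std::string Arch;                        // GPU arch or target ID, may be empty
};

BundleEntryID parseBundleEntryID(const std::string &Target) {
  std::array<std::string, 6> Fields; // fields that are absent stay empty
  std::size_t Pos = 0;
  for (std::size_t I = 0; I < Fields.size(); ++I) {
    // Split at most 5 times so the last field keeps any hyphen it contains.
    const bool Last = (I + 1 == Fields.size());
    const std::size_t Dash = Last ? std::string::npos : Target.find('-', Pos);
    if (Dash == std::string::npos) {
      Fields[I] = Target.substr(Pos);
      break;
    }
    Fields[I] = Target.substr(Pos, Dash - Pos);
    Pos = Dash + 1;
  }

  BundleEntryID ID;
  ID.Kind = Fields[0];
  for (std::size_t I = 0; I < ID.TripleFields.size(); ++I)
    ID.TripleFields[I] = Fields[I + 1];
  ID.Arch = Fields[5];
  return ID;
}
```

A five-field target such as `openmp-powerpc64le-ibm-linux-gnu` leaves `Arch` empty, while `openmp-amdgcn-amd-amdhsa--gfx908:sramecc-:xnack+` yields an empty environment field and the full target ID as `Arch`.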

grokos added inline comments. Jun 30 2021, 7:45 AM

clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp
147	OK, thanks!

rsmith added a subscriber: rsmith. Jun 30 2021, 2:27 PM

rsmith added inline comments.

clang/test/Driver/clang-offload-bundler.c
390	This test does not depend on `llvm-ar`, and this change causes `check-clang` to fail in the case where `llvm-ar` has not previously been built. Please can you fix? (Might need some changes to the build rules to add a dependency on `llvm-ar`, if you can't avoid depending on it for this test.)

@rsmith,
Thanks for pointing out this issue. I have proposed a fix in D105285. Please review.

saiislam mentioned this in D106809: [clang-offload-bundler] Make Bundle Entry ID backward compatible. Jul 26 2021, 10:09 AM

saiislam mentioned this in D105191: [Clang][OpenMP] Add partial support for Static Device Libraries. Aug 18 2021, 5:06 AM

This is mentioned as broken in the referenced patch and landed with @t-tye still marked as requires changes. Revert warranted?

Testing looks very sparse for something handling archives. Presumably it has the same set of bugs as D108291 e.g. implicit whole archive semantics

In reply to @JonChesterfield (D93525#2952012):

Tony's earlier objection/ask was to include TargetID support along with archive unbundling support in this patch. I had verbal consent from him to split the patch and propose TargetID support for OpenMP in a separate patch. The same was agreed upon in the multi-company meeting as well.
Also, D106870 puts in place the necessary infrastructure to support TargetID in a follow-up patch. Once it lands, the TargetID patch is fairly straightforward.
Whole archive semantics didn't introduce any bug anywhere, though performance is a separate issue. D108291 is required because nvlink silently fails to link cubin files inside an archive.

Any suggestions on how I can improve testing for archives?

saiislam mentioned this in D108291: [clang-nvlink-wrapper] Wrapper around nvlink for archive files. Sep 3 2021, 4:35 AM

saiislam mentioned this in rG98380762c3b7: [clang-offload-bundler] Make Bundle Entry ID backward compatible. Sep 8 2021, 3:37 AM

saiislam mentioned this in D110083: [clang-offload-bundler][docs][NFC] Add archive unbundling documentation. Sep 20 2021, 10:30 AM

saiislam mentioned this in rGee31ad0ab5f7: [clang-offload-bundler][docs][NFC] Add archive unbundling documentation. Sep 21 2021, 6:55 AM

lamb-j added a subscriber: lamb-j. Jun 7 2022, 11:23 PM

Herald added a project: Restricted Project. Jun 7 2022, 11:23 PM

Herald added subscribers: kosarev, MaskRay.

Revision Contents

Path	Size
clang/docs/ClangOffloadBundler.rst	10 lines
clang/lib/Driver/ToolChains/Clang.cpp	30 lines
clang/test/Driver/clang-offload-bundler.c	44 lines
clang/test/Driver/hip-rdc-device-only.hip	8 lines
clang/test/Driver/hip-toolchain-rdc-separate.hip	12 lines
clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp	352 lines

Diff 355514

clang/docs/ClangOffloadBundler.rst

[...] .. table:: Bundled Code Object Offload Kind
 to be loaded by the HIP runtime. The fat binary can be
 loaded directly from a file, or be embedded in the host code
 object as a data section with the name ``.hip_fatbin``.

 openmp Offload code object for the OpenMP language extension.
 ============= ==============================================================

 target-triple
-The target triple of the code object.
+The target triple of the code object:
+
+.. code::
+
+<Architecture>-<Vendor>-<OS>-<Environment>
+
+It is required to have all four components present, if target-id is present.
+Components are hyphen separated. If a component is not specified then the
+empty string must be used in its place.

 target-id
 The canonical target ID of the code object. Present only if the target
 supports a target ID. See :ref:`clang-target-id`.

 Each entry of a bundled code object must have a different bundle entry ID. There
 can be multiple entries for the same processor provided they differ in target
 feature settings. If there is an entry with a target feature specified as Any,
 then all entries must specify that target feature as Any for the same
 processor. There may be additional target specific restrictions.

 .. _clang-target-id:
[...]

clang/lib/Driver/ToolChains/Clang.cpp

[...] if (const auto *OA = dyn_cast<OffloadAction>(CurDep)) {
       CurTC = nullptr;
       OA->doOnEachDependence([&](Action *A, const ToolChain *TC, const char *) {
         assert(CurTC == nullptr && "Expected one dependence!");
         CurKind = A->getOffloadingDeviceKind();
         CurTC = TC;
       });
     }
     Triples += Action::GetOffloadKindName(CurKind);
-    Triples += '-';
-    Triples += CurTC->getTriple().normalize();
-    if (CurKind == Action::OFK_HIP && CurDep->getOffloadingArch()) {
-      Triples += '-';
+    Triples += "-";
+    std::string NormalizedTriple = CurTC->getTriple().normalize();
+    Triples += NormalizedTriple;
+
+    if (CurDep->getOffloadingArch() != nullptr) {
+      // If OffloadArch is present it can only appear as the 6th hypen
+      // sepearated field of Bundle Entry ID. So, pad required number of
+      // hyphens in Triple.
+      for (int i = 4 - StringRef(NormalizedTriple).count("-"); i > 0; i--)
+        Triples += "-";
       Triples += CurDep->getOffloadingArch();
     }
   }
   CmdArgs.push_back(TCArgs.MakeArgString(Triples));

   // Get bundled file command.
   CmdArgs.push_back(
       TCArgs.MakeArgString(Twine("-outputs=") + Output.getFilename()));
[...] void OffloadBundler::ConstructJobMultipleOutputs(
   Triples += "-targets=";
   auto DepInfo = UA.getDependentActionsInfo();
   for (unsigned I = 0; I < DepInfo.size(); ++I) {
     if (I)
       Triples += ',';

     auto &Dep = DepInfo[I];
     Triples += Action::GetOffloadKindName(Dep.DependentOffloadKind);
-    Triples += '-';
-    Triples += Dep.DependentToolChain->getTriple().normalize();
-    if (Dep.DependentOffloadKind == Action::OFK_HIP &&
-        !Dep.DependentBoundArch.empty()) {
-      Triples += '-';
+    Triples += "-";
+    std::string NormalizedTriple =
+        Dep.DependentToolChain->getTriple().normalize();
+    Triples += NormalizedTriple;
+
+    if (!Dep.DependentBoundArch.empty()) {
+      // If OffloadArch is present it can only appear as the 6th hypen
+      // sepearated field of Bundle Entry ID. So, pad required number of
+      // hyphens in Triple.
+      for (int i = 4 - StringRef(NormalizedTriple).count("-"); i > 0; i--)
+        Triples += "-";
       Triples += Dep.DependentBoundArch;
     }
   }

   CmdArgs.push_back(TCArgs.MakeArgString(Triples));

   // Get bundled file command.
   CmdArgs.push_back(
[...]

clang/test/Driver/clang-offload-bundler.c

	Show All 40 Lines
	// CK-HELP: {{.*}}-type=<string> - Type of the files to be bundled/unbundled.			// CK-HELP: {{.*}}-type=<string> - Type of the files to be bundled/unbundled.
	// CK-HELP: {{.*}}Current supported types are:			// CK-HELP: {{.*}}Current supported types are:
	// CK-HELP: {{.}}i {{.}}- cpp-output			// CK-HELP: {{.}}i {{.}}- cpp-output
	// CK-HELP: {{.}}ii {{.}}- c++-cpp-output			// CK-HELP: {{.}}ii {{.}}- c++-cpp-output
	// CK-HELP: {{.}}ll {{.}}- llvm			// CK-HELP: {{.}}ll {{.}}- llvm
	// CK-HELP: {{.}}bc {{.}}- llvm-bc			// CK-HELP: {{.}}bc {{.}}- llvm-bc
	// CK-HELP: {{.}}s {{.}}- assembler			// CK-HELP: {{.}}s {{.}}- assembler
	// CK-HELP: {{.}}o {{.}}- object			// CK-HELP: {{.}}o {{.}}- object
				// CK-HELP: {{.}}a {{.}}- archive of objects
	// CK-HELP: {{.}}gch {{.}}- precompiled-header			// CK-HELP: {{.}}gch {{.}}- precompiled-header
	// CK-HELP: {{.}}ast {{.}}- clang AST file			// CK-HELP: {{.}}ast {{.}}- clang AST file
	// CK-HELP: {{.}}-unbundle {{.}}- Unbundle bundled file into several output files.			// CK-HELP: {{.}}-unbundle {{.}}- Unbundle bundled file into several output files.

	//			//
	// Check errors.			// Check errors.
	//			//
	// RUN: not clang-offload-bundler -type=i -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.i,%t.tgt1,%t.tgt2 -outputs=%t.bundle.i -unbundle 2>&1 \| FileCheck %s --check-prefix CK-ERR1			// RUN: not clang-offload-bundler -type=i -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.i,%t.tgt1,%t.tgt2 -outputs=%t.bundle.i -unbundle 2>&1 \| FileCheck %s --check-prefix CK-ERR1
	// CK-ERR8B: error: invalid target 'xpenmp-x86_xx-pc-linux-gnu', unknown offloading kind 'xpenmp', unknown target triple 'x86_xx-pc-linux-gnu'			// CK-ERR8B: error: invalid target 'xpenmp-x86_xx-pc-linux-gnu', unknown offloading kind 'xpenmp', unknown target triple 'x86_xx-pc-linux-gnu'

	// RUN: not clang-offload-bundler -type=i -targets=openmp-powerpc64le-linux,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.i,%t.tgt1,%t.tgt2 -outputs=%t.bundle.i 2>&1 \| FileCheck %s --check-prefix CK-ERR9A			// RUN: not clang-offload-bundler -type=i -targets=openmp-powerpc64le-linux,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.i,%t.tgt1,%t.tgt2 -outputs=%t.bundle.i 2>&1 \| FileCheck %s --check-prefix CK-ERR9A
	// CK-ERR9A: error: expecting exactly one host target but got 0			// CK-ERR9A: error: expecting exactly one host target but got 0

	// RUN: not clang-offload-bundler -type=i -targets=host-%itanium_abi_triple,host-%itanium_abi_triple,openmp-x86_64-pc-linux-gnu -inputs=%t.i,%t.tgt1,%t.tgt2 -outputs=%t.bundle.i 2>&1 \| FileCheck %s --check-prefix CK-ERR9B			// RUN: not clang-offload-bundler -type=i -targets=host-%itanium_abi_triple,host-%itanium_abi_triple,openmp-x86_64-pc-linux-gnu -inputs=%t.i,%t.tgt1,%t.tgt2 -outputs=%t.bundle.i 2>&1 \| FileCheck %s --check-prefix CK-ERR9B
	// CK-ERR9B: error: Duplicate targets are not allowed			// CK-ERR9B: error: Duplicate targets are not allowed

				// RUN: not clang-offload-bundler -type=a -targets=hxst-powerpcxxle-ibm-linux-gnu,openxp-pxxerpc64le-ibm-linux-gnu,xpenmp-x86_xx-pc-linux-gnu -inputs=%t.i,%t.tgt1,%t.tgt2 -outputs=%t.bundle.i 2>&1 \| FileCheck %s --check-prefix CK-ERR10A
				// CK-ERR10A: error: Archive files are only supported for unbundling

	//			//
	// Check text bundle. This is a readable format, so we check for the format we expect to find.			// Check text bundle. This is a readable format, so we check for the format we expect to find.
	//			//
	// RUN: clang-offload-bundler -type=i -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.i,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.i			// RUN: clang-offload-bundler -type=i -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.i,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.i
	// RUN: clang-offload-bundler -type=ii -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.ii,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.ii			// RUN: clang-offload-bundler -type=ii -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.ii,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.ii
	// RUN: clang-offload-bundler -type=ll -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.ll,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.ll			// RUN: clang-offload-bundler -type=ll -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.ll,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.ll
	// RUN: clang-offload-bundler -type=s -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.s,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.s			// RUN: clang-offload-bundler -type=s -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.s,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.s
	// RUN: clang-offload-bundler -type=s -targets=openmp-powerpc64le-ibm-linux-gnu,host-%itanium_abi_triple,openmp-x86_64-pc-linux-gnu -inputs=%t.tgt1,%t.s,%t.tgt2 -outputs=%t.bundle3.unordered.s			// RUN: clang-offload-bundler -type=s -targets=openmp-powerpc64le-ibm-linux-gnu,host-%itanium_abi_triple,openmp-x86_64-pc-linux-gnu -inputs=%t.tgt1,%t.s,%t.tgt2 -outputs=%t.bundle3.unordered.s
	// RUN: clang-offload-bundler -type=bc -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -outputs=%t.res.bc,%t.res.tgt1,%t.res.tgt2 -inputs=%t.bundle3.bc -unbundle			// RUN: clang-offload-bundler -type=bc -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -outputs=%t.res.bc,%t.res.tgt1,%t.res.tgt2 -inputs=%t.bundle3.bc -unbundle
	// RUN: diff %t.bc %t.res.bc			// RUN: diff %t.bc %t.res.bc
	// RUN: diff %t.tgt1 %t.res.tgt1			// RUN: diff %t.tgt1 %t.res.tgt1
	// RUN: diff %t.tgt2 %t.res.tgt2			// RUN: diff %t.tgt2 %t.res.tgt2

	//			//
	// Check error due to missing bundles			// Check error due to missing bundles
	//			//
	// RUN: clang-offload-bundler -type=bc -targets=host-%itanium_abi_triple,hip-amdgcn-amd-amdhsa-gfx900 -inputs=%t.bc,%t.tgt1 -outputs=%t.hip.bundle.bc			// RUN: clang-offload-bundler -type=bc -targets=host-%itanium_abi_triple,hip-amdgcn-amd-amdhsa--gfx900 -inputs=%t.bc,%t.tgt1 -outputs=%t.hip.bundle.bc
	// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc -unbundle \			// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc -unbundle \
	// RUN: -targets=hip-amdgcn-amd-amdhsa-gfx906 \			// RUN: -targets=hip-amdgcn-amd-amdhsa--gfx906 \
	// RUN: 2>&1 \| FileCheck -check-prefix=MISS1 %s			// RUN: 2>&1 \| FileCheck -check-prefix=MISS1 %s
	// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc,%t.tmp2.bc -unbundle \			// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc,%t.tmp2.bc -unbundle \
	// RUN: -targets=hip-amdgcn-amd-amdhsa-gfx906,hip-amdgcn-amd-amdhsa-gfx900 \			// RUN: -targets=hip-amdgcn-amd-amdhsa--gfx906,hip-amdgcn-amd-amdhsa--gfx900 \
	// RUN: 2>&1 \| FileCheck -check-prefix=MISS1 %s			// RUN: 2>&1 \| FileCheck -check-prefix=MISS1 %s
	// MISS1: error: Can't find bundles for hip-amdgcn-amd-amdhsa-gfx906			// MISS1: error: Can't find bundles for hip-amdgcn-amd-amdhsa--gfx906
	// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc,%t.tmp2.bc -unbundle \			// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc,%t.tmp2.bc -unbundle \
	// RUN: -targets=hip-amdgcn-amd-amdhsa-gfx906,hip-amdgcn-amd-amdhsa-gfx803 \			// RUN: -targets=hip-amdgcn-amd-amdhsa--gfx906,hip-amdgcn-amd-amdhsa--gfx803 \
	// RUN: 2>&1 \| FileCheck -check-prefix=MISS2 %s			// RUN: 2>&1 \| FileCheck -check-prefix=MISS2 %s
	// MISS2: error: Can't find bundles for hip-amdgcn-amd-amdhsa-gfx803 and hip-amdgcn-amd-amdhsa-gfx906			// MISS2: error: Can't find bundles for hip-amdgcn-amd-amdhsa--gfx803 and hip-amdgcn-amd-amdhsa--gfx906
	// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc,%t.tmp2.bc,%t.tmp3.bc -unbundle \			// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc,%t.tmp2.bc,%t.tmp3.bc -unbundle \
	// RUN: -targets=hip-amdgcn-amd-amdhsa-gfx906,hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx1010 \			// RUN: -targets=hip-amdgcn-amd-amdhsa--gfx906,hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx1010 \
	// RUN: 2>&1 \| FileCheck -check-prefix=MISS3 %s			// RUN: 2>&1 \| FileCheck -check-prefix=MISS3 %s
	// MISS3: error: Can't find bundles for hip-amdgcn-amd-amdhsa-gfx1010, hip-amdgcn-amd-amdhsa-gfx803, and hip-amdgcn-amd-amdhsa-gfx906			// MISS3: error: Can't find bundles for hip-amdgcn-amd-amdhsa--gfx1010, hip-amdgcn-amd-amdhsa--gfx803, and hip-amdgcn-amd-amdhsa--gfx906

	//			//
	// Check error due to duplicate targets			// Check error due to duplicate targets
	//			//
	// RUN: not clang-offload-bundler -type=bc -targets=host-%itanium_abi_triple,hip-amdgcn-amd-amdhsa-gfx900,hip-amdgcn-amd-amdhsa-gfx900 \			// RUN: not clang-offload-bundler -type=bc -targets=host-%itanium_abi_triple,hip-amdgcn-amd-amdhsa--gfx900,hip-amdgcn-amd-amdhsa--gfx900 \
	// RUN: -inputs=%t.bc,%t.tgt1,%t.tgt1 -outputs=%t.hip.bundle.bc 2>&1 \| FileCheck -check-prefix=DUP %s			// RUN: -inputs=%t.bc,%t.tgt1,%t.tgt1 -outputs=%t.hip.bundle.bc 2>&1 \| FileCheck -check-prefix=DUP %s
	// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc,%t.tmp2.bc -unbundle \			// RUN: not clang-offload-bundler -type=bc -inputs=%t.hip.bundle.bc -outputs=%t.tmp.bc,%t.tmp2.bc -unbundle \
	// RUN: -targets=hip-amdgcn-amd-amdhsa-gfx906,hip-amdgcn-amd-amdhsa-gfx906 \			// RUN: -targets=hip-amdgcn-amd-amdhsa--gfx906,hip-amdgcn-amd-amdhsa--gfx906 \
	// RUN: 2>&1 \| FileCheck -check-prefix=DUP %s			// RUN: 2>&1 \| FileCheck -check-prefix=DUP %s
	// DUP: error: Duplicate targets are not allowed			// DUP: error: Duplicate targets are not allowed
	//			//
	// Check -list option			// Check -list option
	//			//

	// RUN: clang-offload-bundler -bundle-align=4096 -type=bc -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.bc,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.bc			// RUN: clang-offload-bundler -bundle-align=4096 -type=bc -targets=host-%itanium_abi_triple,openmp-powerpc64le-ibm-linux-gnu,openmp-x86_64-pc-linux-gnu -inputs=%t.bc,%t.tgt1,%t.tgt2 -outputs=%t.bundle3.bc
	// RUN: not clang-offload-bundler -type=bc -inputs=%t.bundle3.bc -unbundle -list 2>&1 \| FileCheck -check-prefix=CKLST-ERR %s			// RUN: not clang-offload-bundler -type=bc -inputs=%t.bundle3.bc -unbundle -list 2>&1 \| FileCheck -check-prefix=CKLST-ERR %s

	// CKLST2-NOT: host-			// CKLST2-NOT: host-
	// CKLST2-NOT: openmp-powerpc64le-ibm-linux-gnu			// CKLST2-NOT: openmp-powerpc64le-ibm-linux-gnu
	// CKLST2-NOT: openmp-x86_64-pc-linux-gnu			// CKLST2-NOT: openmp-x86_64-pc-linux-gnu

	//			//
	// Check bundling without host target is allowed for HIP.			// Check bundling without host target is allowed for HIP.
	//			//
	// RUN: clang-offload-bundler -type=bc -targets=hip-amdgcn-amd-amdhsa-gfx900,hip-amdgcn-amd-amdhsa-gfx906 \			// RUN: clang-offload-bundler -type=bc -targets=hip-amdgcn-amd-amdhsa--gfx900,hip-amdgcn-amd-amdhsa--gfx906 \
	// RUN: -inputs=%t.tgt1,%t.tgt2 -outputs=%t.hip.bundle.bc			// RUN: -inputs=%t.tgt1,%t.tgt2 -outputs=%t.hip.bundle.bc
	// RUN: clang-offload-bundler -type=bc -list -inputs=%t.hip.bundle.bc \| FileCheck -check-prefix=NOHOST %s			// RUN: clang-offload-bundler -type=bc -list -inputs=%t.hip.bundle.bc \| FileCheck -check-prefix=NOHOST %s
	// RUN: clang-offload-bundler -type=bc -targets=hip-amdgcn-amd-amdhsa-gfx900,hip-amdgcn-amd-amdhsa-gfx906 \			// RUN: clang-offload-bundler -type=bc -targets=hip-amdgcn-amd-amdhsa--gfx900,hip-amdgcn-amd-amdhsa--gfx906 \
	// RUN: -outputs=%t.res.tgt1,%t.res.tgt2 -inputs=%t.hip.bundle.bc -unbundle			// RUN: -outputs=%t.res.tgt1,%t.res.tgt2 -inputs=%t.hip.bundle.bc -unbundle
	// RUN: diff %t.tgt1 %t.res.tgt1			// RUN: diff %t.tgt1 %t.res.tgt1
	// RUN: diff %t.tgt2 %t.res.tgt2			// RUN: diff %t.tgt2 %t.res.tgt2
	//			//
	// NOHOST-NOT: host-			// NOHOST-NOT: host-
	// NOHOST-DAG: hip-amdgcn-amd-amdhsa-gfx900			// NOHOST-DAG: hip-amdgcn-amd-amdhsa--gfx900
	// NOHOST-DAG: hip-amdgcn-amd-amdhsa-gfx906			// NOHOST-DAG: hip-amdgcn-amd-amdhsa--gfx906
				// Check archive unbundling
				//
				// Create few code object bundles and archive them to create an input archive
				// RUN: clang-offload-bundler -type=o -targets=host-%itanium_abi_triple,openmp-amdgcn-amd-amdhsa--gfx906,openmp-amdgcn-amd-amdhsa--gfx908 -inputs=%t.o,%t.tgt1,%t.tgt2 -outputs=%t.simple.bundle
				// RUN: clang-offload-bundler -type=o -targets=host-%itanium_abi_triple,openmp-amdgcn-amd-amdhsa--gfx903 -inputs=%t.o,%t.tgt1 -outputs=%t.simple1.bundle
				// RUN: llvm-ar cr %t.input-archive.a %t.simple.bundle %t.simple1.bundle

				// RUN: clang-offload-bundler -unbundle -type=a -targets=openmp-amdgcn-amd-amdhsa--gfx906,openmp-amdgcn-amd-amdhsa--gfx908 -inputs=%t.input-archive.a -outputs=%t-archive-gfx906-simple.a,%t-archive-gfx908-simple.a
				// RUN: llvm-ar t %t-archive-gfx906-simple.a \| FileCheck %s -check-prefix=GFX906
				rsmith (inline comment, not done): This test does not depend on `llvm-ar`, and this change causes `check-clang` to fail in the case where `llvm-ar` has not previously been built. Please can you fix? (Might need some changes to the build rules to add a dependency on `llvm-ar`, if you can't avoid depending on it for this test.)
				// GFX906: simple-openmp-amdgcn-amd-amdhsa--gfx906
				// RUN: llvm-ar t %t-archive-gfx908-simple.a \| FileCheck %s -check-prefix=GFX908
				// GFX908-NOT: {{gfx906}}

	// Some code so that we can create a binary out of this file.			// Some code so that we can create a binary out of this file.
	int A = 0;			int A = 0;
	void test_func(void) {			void test_func(void) {
	++A;			++A;
	}			}

clang/test/Driver/hip-rdc-device-only.hip

	// COMMON-SAME: "-fapply-global-visibility-to-externs"			// COMMON-SAME: "-fapply-global-visibility-to-externs"
	// COMMON-SAME: "-target-cpu" "gfx900"			// COMMON-SAME: "-target-cpu" "gfx900"
	// COMMON-SAME: "-fgpu-rdc"			// COMMON-SAME: "-fgpu-rdc"
	// EMITBC-SAME: {{.}} "-o" {{".a.*bc"}} "-x" "hip"			// EMITBC-SAME: {{.}} "-o" {{".a.*bc"}} "-x" "hip"
	// EMITLL-SAME: {{.}} "-o" {{".a.*ll"}} "-x" "hip"			// EMITLL-SAME: {{.}} "-o" {{".a.*ll"}} "-x" "hip"
	// COMMON-SAME: {{.}} {{".a.cu"}}			// COMMON-SAME: {{.}} {{".a.cu"}}

	// COMMON: "{{.*}}clang-offload-bundler" "-type={{(bc\|ll)}}"			// COMMON: "{{.*}}clang-offload-bundler" "-type={{(bc\|ll)}}"
	// COMMON-SAME: "-targets=hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900"			// COMMON-SAME: "-targets=hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900"
	// COMMON-SAME: "-outputs=a-hip-amdgcn-amd-amdhsa.{{(bc\|ll)}}"			// COMMON-SAME: "-outputs=a-hip-amdgcn-amd-amdhsa.{{(bc\|ll)}}"

	// COMMON: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa"			// COMMON: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa"
	// COMMON-SAME: "-aux-triple" "x86_64-unknown-linux-gnu"			// COMMON-SAME: "-aux-triple" "x86_64-unknown-linux-gnu"
	// EMITBC-SAME: "-emit-llvm-bc"			// EMITBC-SAME: "-emit-llvm-bc"
	// EMITLL-SAME: "-emit-llvm"			// EMITLL-SAME: "-emit-llvm"
	// COMMON-SAME: {{.*}} "-main-file-name" "b.hip"			// COMMON-SAME: {{.*}} "-main-file-name" "b.hip"
	// COMMON-SAME: "-fcuda-is-device" "-fcuda-allow-variadic-functions" "-fvisibility" "hidden"			// COMMON-SAME: "-fcuda-is-device" "-fcuda-allow-variadic-functions" "-fvisibility" "hidden"
	// COMMON-SAME: "-fapply-global-visibility-to-externs"			// COMMON-SAME: "-fapply-global-visibility-to-externs"
	// COMMON-SAME: "-target-cpu" "gfx900"			// COMMON-SAME: "-target-cpu" "gfx900"
	// COMMON-SAME: "-fgpu-rdc"			// COMMON-SAME: "-fgpu-rdc"
	// EMITBC-SAME: {{.}} "-o" {{".b.*bc"}} "-x" "hip"			// EMITBC-SAME: {{.}} "-o" {{".b.*bc"}} "-x" "hip"
	// EMITLL-SAME: {{.}} "-o" {{".b.*ll"}} "-x" "hip"			// EMITLL-SAME: {{.}} "-o" {{".b.*ll"}} "-x" "hip"
	// COMMON-SAME: {{.}} {{".b.hip"}}			// COMMON-SAME: {{.}} {{".b.hip"}}

	// COMMON: "{{.*}}clang-offload-bundler" "-type={{(bc\|ll)}}"			// COMMON: "{{.*}}clang-offload-bundler" "-type={{(bc\|ll)}}"
	// COMMON-SAME: "-targets=hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900"			// COMMON-SAME: "-targets=hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900"
	// COMMON-SAME: "-outputs=b-hip-amdgcn-amd-amdhsa.{{(bc\|ll)}}"			// COMMON-SAME: "-outputs=b-hip-amdgcn-amd-amdhsa.{{(bc\|ll)}}"

	// SAVETEMP: [[CLANG:".clang."]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"			// SAVETEMP: [[CLANG:".clang."]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"
	// SAVETEMP-SAME: "-E"			// SAVETEMP-SAME: "-E"
	// SAVETEMP-SAME: {{.}} "-main-file-name" "a.cu" {{.}} "-target-cpu" "gfx803"			// SAVETEMP-SAME: {{.}} "-main-file-name" "a.cu" {{.}} "-target-cpu" "gfx803"
	// SAVETEMP-SAME: {{.}} "-o" [[A_GFX803_CUI:"a.cui"]] "-x" "hip" {{".*a.cu"}}			// SAVETEMP-SAME: {{.}} "-o" [[A_GFX803_CUI:"a.cui"]] "-x" "hip" {{".*a.cu"}}
	// SAVETEMP-NEXT: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"			// SAVETEMP-NEXT: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"
	// SAVETEMP-SAME: "-emit-llvm-bc"			// SAVETEMP-SAME: "-emit-llvm-bc"
	// SAVETEMP-SAME: {{.}} "-main-file-name" "a.cu" {{.}} "-target-cpu" "gfx900"			// SAVETEMP-SAME: {{.}} "-main-file-name" "a.cu" {{.}} "-target-cpu" "gfx900"
	// SAVETEMP-SAME: {{.}} "-o" [[A_GFX900_TMP_BC:"a.tmp.bc"]] "-x" "hip-cpp-output" [[A_GFX900_CUI]]			// SAVETEMP-SAME: {{.}} "-o" [[A_GFX900_TMP_BC:"a.tmp.bc"]] "-x" "hip-cpp-output" [[A_GFX900_CUI]]
	// SAVETEMP-NEXT: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"			// SAVETEMP-NEXT: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"
	// SAVETEMP-SAME: "-emit-llvm"			// SAVETEMP-SAME: "-emit-llvm"
	// SAVETEMP-SAME: {{.}} "-main-file-name" "a.cu" {{.}} "-target-cpu" "gfx900"			// SAVETEMP-SAME: {{.}} "-main-file-name" "a.cu" {{.}} "-target-cpu" "gfx900"
	// SAVETEMP-SAME: {{.}} "-o" {{"a..ll"}} "-x" "ir" [[A_GFX900_TMP_BC]]			// SAVETEMP-SAME: {{.}} "-o" {{"a..ll"}} "-x" "ir" [[A_GFX900_TMP_BC]]

	// SAVETEMP: "{{.*}}clang-offload-bundler" "-type=ll"			// SAVETEMP: "{{.*}}clang-offload-bundler" "-type=ll"
	// SAVETEMP-SAME: "-targets=hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900"			// SAVETEMP-SAME: "-targets=hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900"
	// SAVETEMP-SAME: "-outputs=a-hip-amdgcn-amd-amdhsa.ll"			// SAVETEMP-SAME: "-outputs=a-hip-amdgcn-amd-amdhsa.ll"

	// SAVETEMP: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"			// SAVETEMP: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"
	// SAVETEMP-SAME: "-E"			// SAVETEMP-SAME: "-E"
	// SAVETEMP-SAME: {{.}} "-main-file-name" "b.hip" {{.}} "-target-cpu" "gfx803"			// SAVETEMP-SAME: {{.}} "-main-file-name" "b.hip" {{.}} "-target-cpu" "gfx803"
	// SAVETEMP-SAME: {{.}} "-o" [[B_GFX803_CUI:"b.cui"]] "-x" "hip" {{".*b.hip"}}			// SAVETEMP-SAME: {{.}} "-o" [[B_GFX803_CUI:"b.cui"]] "-x" "hip" {{".*b.hip"}}
	// SAVETEMP-NEXT: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"			// SAVETEMP-NEXT: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"
	// SAVETEMP-SAME: "-emit-llvm-bc"			// SAVETEMP-SAME: "-emit-llvm-bc"
	// SAVETEMP-SAME: {{.}} "-main-file-name" "b.hip" {{.}} "-target-cpu" "gfx900"			// SAVETEMP-SAME: {{.}} "-main-file-name" "b.hip" {{.}} "-target-cpu" "gfx900"
	// SAVETEMP-SAME: {{.}} "-o" [[B_GFX900_TMP_BC:"b.tmp.bc"]] "-x" "hip-cpp-output" [[B_GFX900_CUI]]			// SAVETEMP-SAME: {{.}} "-o" [[B_GFX900_TMP_BC:"b.tmp.bc"]] "-x" "hip-cpp-output" [[B_GFX900_CUI]]
	// SAVETEMP-NEXT: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"			// SAVETEMP-NEXT: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu"
	// SAVETEMP-SAME: "-emit-llvm"			// SAVETEMP-SAME: "-emit-llvm"
	// SAVETEMP-SAME: {{.}} "-main-file-name" "b.hip" {{.}} "-target-cpu" "gfx900"			// SAVETEMP-SAME: {{.}} "-main-file-name" "b.hip" {{.}} "-target-cpu" "gfx900"
	// SAVETEMP-SAME: {{.}} "-o" {{"b..ll"}} "-x" "ir" [[B_GFX900_TMP_BC]]			// SAVETEMP-SAME: {{.}} "-o" {{"b..ll"}} "-x" "ir" [[B_GFX900_TMP_BC]]

	// SAVETEMP: "{{.*}}clang-offload-bundler" "-type=ll"			// SAVETEMP: "{{.*}}clang-offload-bundler" "-type=ll"
	// SAVETEMP-SAME: "-targets=hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900"			// SAVETEMP-SAME: "-targets=hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900"
	// SAVETEMP-SAME: "-outputs=b-hip-amdgcn-amd-amdhsa.ll"			// SAVETEMP-SAME: "-outputs=b-hip-amdgcn-amd-amdhsa.ll"

	// FAIL: error: cannot specify -o when generating multiple output files			// FAIL: error: cannot specify -o when generating multiple output files

clang/test/Driver/hip-toolchain-rdc-separate.hip

	// CHECK-SAME: "-aux-triple" "amdgcn-amd-amdhsa"			// CHECK-SAME: "-aux-triple" "amdgcn-amd-amdhsa"
	// CHECK-SAME: "-emit-obj"			// CHECK-SAME: "-emit-obj"
	// CHECK-SAME: {{.*}} "-main-file-name" "a.cu"			// CHECK-SAME: {{.*}} "-main-file-name" "a.cu"
	// CHECK-SAME: "-fgpu-rdc"			// CHECK-SAME: "-fgpu-rdc"
	// CHECK-SAME: {{.}} "-o" "[[A_OBJ_HOST:.o]]" "-x" "hip"			// CHECK-SAME: {{.}} "-o" "[[A_OBJ_HOST:.o]]" "-x" "hip"
	// CHECK-SAME: {{.*}} [[A_SRC]]			// CHECK-SAME: {{.*}} [[A_SRC]]

	// CHECK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"			// CHECK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"
	// CHECK-SAME: "-targets=hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900,host-x86_64-unknown-linux-gnu"			// CHECK-SAME: "-targets=hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900,host-x86_64-unknown-linux-gnu"
	// CHECK-SAME: "-outputs=[[A_O:.*a.o]]" "-inputs=[[A_BC1]],[[A_BC2]],[[A_OBJ_HOST]]"			// CHECK-SAME: "-outputs=[[A_O:.*a.o]]" "-inputs=[[A_BC1]],[[A_BC2]],[[A_OBJ_HOST]]"

	// CHECK: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa"			// CHECK: [[CLANG]] "-cc1" "-triple" "amdgcn-amd-amdhsa"
	// CHECK-SAME: "-aux-triple" "x86_64-unknown-linux-gnu"			// CHECK-SAME: "-aux-triple" "x86_64-unknown-linux-gnu"
	// CHECK-SAME: "-emit-llvm-bc"			// CHECK-SAME: "-emit-llvm-bc"
	// CHECK-SAME: {{.*}} "-main-file-name" "b.hip"			// CHECK-SAME: {{.*}} "-main-file-name" "b.hip"
	// CHECK-SAME: "-fcuda-is-device" "-fcuda-allow-variadic-functions" "-fvisibility" "hidden"			// CHECK-SAME: "-fcuda-is-device" "-fcuda-allow-variadic-functions" "-fvisibility" "hidden"
	// CHECK-SAME: "-fapply-global-visibility-to-externs"			// CHECK-SAME: "-fapply-global-visibility-to-externs"
	// CHECK-SAME: "-aux-triple" "amdgcn-amd-amdhsa"			// CHECK-SAME: "-aux-triple" "amdgcn-amd-amdhsa"
	// CHECK-SAME: "-emit-obj"			// CHECK-SAME: "-emit-obj"
	// CHECK-SAME: {{.*}} "-main-file-name" "b.hip"			// CHECK-SAME: {{.*}} "-main-file-name" "b.hip"
	// CHECK-SAME: "-fgpu-rdc"			// CHECK-SAME: "-fgpu-rdc"
	// CHECK-SAME: {{.}} "-o" "[[B_OBJ_HOST:.o]]" "-x" "hip"			// CHECK-SAME: {{.}} "-o" "[[B_OBJ_HOST:.o]]" "-x" "hip"
	// CHECK-SAME: {{.*}} [[B_SRC]]			// CHECK-SAME: {{.*}} [[B_SRC]]

	// CHECK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"			// CHECK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"
	// CHECK-SAME: "-targets=hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900,host-x86_64-unknown-linux-gnu"			// CHECK-SAME: "-targets=hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900,host-x86_64-unknown-linux-gnu"
	// CHECK-SAME: "-outputs=[[B_O:.*b.o]]" "-inputs=[[B_BC1]],[[B_BC2]],[[B_OBJ_HOST]]"			// CHECK-SAME: "-outputs=[[B_O:.*b.o]]" "-inputs=[[B_BC1]],[[B_BC2]],[[B_OBJ_HOST]]"

	// RUN: touch %T/a.o			// RUN: touch %T/a.o
	// RUN: touch %T/b.o			// RUN: touch %T/b.o
	// RUN: %clang --hip-link -### -target x86_64-linux-gnu \			// RUN: %clang --hip-link -### -target x86_64-linux-gnu \
	// RUN: --cuda-gpu-arch=gfx803 --cuda-gpu-arch=gfx900 \			// RUN: --cuda-gpu-arch=gfx803 --cuda-gpu-arch=gfx900 \
	// RUN: -fuse-ld=lld -fgpu-rdc -nogpuinc \			// RUN: -fuse-ld=lld -fgpu-rdc -nogpuinc \
	// RUN: %T/a.o %T/b.o \			// RUN: %T/a.o %T/b.o \
	// RUN: 2>&1 \| FileCheck -check-prefix=LINK %s			// RUN: 2>&1 \| FileCheck -check-prefix=LINK %s

	// LINK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"			// LINK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"
	// LINK-SAME: "-targets=host-x86_64-unknown-linux-gnu,hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900"			// LINK-SAME: "-targets=host-x86_64-unknown-linux-gnu,hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900"
	// LINK-SAME: "-inputs=[[A_O:.a.o]]" "-outputs=[[A_OBJ_HOST:.o]],{{.o}},{{.o}}"			// LINK-SAME: "-inputs=[[A_O:.a.o]]" "-outputs=[[A_OBJ_HOST:.o]],{{.o}},{{.o}}"
	// LINK: "-unbundle" "-allow-missing-bundles"			// LINK: "-unbundle" "-allow-missing-bundles"

	// LINK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"			// LINK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"
	// LINK-SAME: "-targets=host-x86_64-unknown-linux-gnu,hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900"			// LINK-SAME: "-targets=host-x86_64-unknown-linux-gnu,hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900"
	// LINK-SAME: "-inputs=[[B_O:.b.o]]" "-outputs=[[B_OBJ_HOST:.o]],{{.o}},{{.o}}"			// LINK-SAME: "-inputs=[[B_O:.b.o]]" "-outputs=[[B_OBJ_HOST:.o]],{{.o}},{{.o}}"
	// LINK: "-unbundle" "-allow-missing-bundles"			// LINK: "-unbundle" "-allow-missing-bundles"

	// LINK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"			// LINK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"
	// LINK-SAME: "-targets=host-x86_64-unknown-linux-gnu,hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900"			// LINK-SAME: "-targets=host-x86_64-unknown-linux-gnu,hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900"
	// LINK-SAME: "-inputs=[[A_O]]" "-outputs={{.o}},[[A_BC1:.o]],[[A_BC2:.*o]]"			// LINK-SAME: "-inputs=[[A_O]]" "-outputs={{.o}},[[A_BC1:.o]],[[A_BC2:.*o]]"
	// LINK: "-unbundle" "-allow-missing-bundles"			// LINK: "-unbundle" "-allow-missing-bundles"

	// LINK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"			// LINK: [[BUNDLER:".*clang-offload-bundler"]] "-type=o"
	// LINK-SAME: "-targets=host-x86_64-unknown-linux-gnu,hip-amdgcn-amd-amdhsa-gfx803,hip-amdgcn-amd-amdhsa-gfx900"			// LINK-SAME: "-targets=host-x86_64-unknown-linux-gnu,hip-amdgcn-amd-amdhsa--gfx803,hip-amdgcn-amd-amdhsa--gfx900"
	// LINK-SAME: "-inputs=[[B_O]]" "-outputs={{.o}},[[B_BC1:.o]],[[B_BC2:.*o]]"			// LINK-SAME: "-inputs=[[B_O]]" "-outputs={{.o}},[[B_BC1:.o]],[[B_BC2:.*o]]"
	// LINK: "-unbundle" "-allow-missing-bundles"			// LINK: "-unbundle" "-allow-missing-bundles"

	// LINK-NOT: "*.llvm-link"			// LINK-NOT: "*.llvm-link"
	// LINK-NOT: ".*opt"			// LINK-NOT: ".*opt"
	// LINK-NOT: ".*llc"			// LINK-NOT: ".*llc"
	// LINK: {{".lld."}} {{.*}} "-plugin-opt=-amdgpu-internalize-symbols"			// LINK: {{".lld."}} {{.*}} "-plugin-opt=-amdgpu-internalize-symbols"
	// LINK: "-plugin-opt=mcpu=gfx803"			// LINK: "-plugin-opt=mcpu=gfx803"

clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp


#include "clang/Basic/Version.h" #include "clang/Basic/Version.h"

#include "llvm/ADT/ArrayRef.h" #include "llvm/ADT/ArrayRef.h"

#include "llvm/ADT/SmallString.h" #include "llvm/ADT/SmallString.h"

#include "llvm/ADT/SmallVector.h" #include "llvm/ADT/SmallVector.h"

#include "llvm/ADT/StringMap.h" #include "llvm/ADT/StringMap.h"

#include "llvm/ADT/StringRef.h" #include "llvm/ADT/StringRef.h"

#include "llvm/ADT/StringSwitch.h" #include "llvm/ADT/StringSwitch.h"

#include "llvm/ADT/Triple.h" #include "llvm/ADT/Triple.h"

#include "llvm/Object/Archive.h"

#include "llvm/Object/ArchiveWriter.h"

#include "llvm/Object/Binary.h" #include "llvm/Object/Binary.h"

#include "llvm/Object/ObjectFile.h" #include "llvm/Object/ObjectFile.h"

#include "llvm/Support/Casting.h" #include "llvm/Support/Casting.h"

#include "llvm/Support/CommandLine.h" #include "llvm/Support/CommandLine.h"

#include "llvm/Support/Debug.h"

#include "llvm/Support/Errc.h" #include "llvm/Support/Errc.h"

#include "llvm/Support/Error.h" #include "llvm/Support/Error.h"

#include "llvm/Support/ErrorOr.h" #include "llvm/Support/ErrorOr.h"

#include "llvm/Support/FileSystem.h" #include "llvm/Support/FileSystem.h"

#include "llvm/Support/Host.h"

#include "llvm/Support/MemoryBuffer.h" #include "llvm/Support/MemoryBuffer.h"

#include "llvm/Support/Path.h" #include "llvm/Support/Path.h"

#include "llvm/Support/Program.h" #include "llvm/Support/Program.h"

#include "llvm/Support/Signals.h" #include "llvm/Support/Signals.h"

#include "llvm/Support/StringSaver.h" #include "llvm/Support/StringSaver.h"

#include "llvm/Support/WithColor.h" #include "llvm/Support/WithColor.h"

#include "llvm/Support/raw_ostream.h" #include "llvm/Support/raw_ostream.h"

#include <algorithm> #include <algorithm>

FilesType("type", cl::Required,

" i - cpp-output\n" " i - cpp-output\n"

" ii - c++-cpp-output\n" " ii - c++-cpp-output\n"

" cui - cuda/hip-output\n" " cui - cuda/hip-output\n"

" d - dependency\n" " d - dependency\n"

" ll - llvm\n" " ll - llvm\n"

" bc - llvm-bc\n" " bc - llvm-bc\n"

" s - assembler\n" " s - assembler\n"

" o - object\n" " o - object\n"

" a - archive of objects\n"

" gch - precompiled-header\n" " gch - precompiled-header\n"

" ast - clang AST file"), " ast - clang AST file"),

cl::cat(ClangOffloadBundlerCategory)); cl::cat(ClangOffloadBundlerCategory));

static cl::opt<bool> static cl::opt<bool>

Unbundle("unbundle", Unbundle("unbundle",

cl::desc("Unbundle bundled file into several output files.\n"), cl::desc("Unbundle bundled file into several output files.\n"),

cl::init(false), cl::cat(ClangOffloadBundlerCategory)); cl::init(false), cl::cat(ClangOffloadBundlerCategory));


static unsigned HostInputIndex = ~0u; static unsigned HostInputIndex = ~0u;

/// Whether not having host target is allowed. /// Whether not having host target is allowed.

static bool AllowNoHost = false; static bool AllowNoHost = false;

/// Path to the current binary. /// Path to the current binary.

static std::string BundlerExecutable; static std::string BundlerExecutable;

/// Obtain the offload kind and real machine triple out of the target /// Obtain the offload kind, real machine triple, and an optional GPUArch

/// information specified by the user. /// out of the target information specified by the user.

static void getOffloadKindAndTriple(StringRef Target, StringRef &OffloadKind, /// Bundle Entry ID (or, Offload Target String) has following components:

StringRef &Triple) { /// * Offload Kind - Host, OpenMP, or HIP

auto KindTriplePair = Target.split('-'); /// * Triple - Standard LLVM Triple

OffloadKind = KindTriplePair.first; /// * GPUArch (Optional) - Processor name, like gfx906 or sm_30

Triple = KindTriplePair.second; /// In presence of Proc, the Triple should contain separator "-" for all

jdoerfert (inline comment, not done), suggested change:

/// * GPUArch (Optional) - Processor name, like gfx906 or sm_30

- /// In presence of Proc, the Triple should contain separator "-" for all

+ /// The Triple should contain separator "-" for all

/// standard four components, even if they are empty.

} /// standard four components, even if they are empty.

static bool hasHostKind(StringRef Target) { struct OffloadTargetInfo {

StringRef OffloadKind; StringRef OffloadKind;

StringRef Triple; llvm::Triple Triple;

getOffloadKindAndTriple(Target, OffloadKind, Triple); StringRef GPUArch;

return OffloadKind == "host";

OffloadTargetInfo(const StringRef Target) {

SmallVector<StringRef, 6> Components;

Target.split(Components, '-', 5);

Components.resize(6);

grokos (inline comment, not done): Leftover? `Components` is already 6 elements long.

saiislam (author, done): Not necessarily. The target may have fewer than 6 elements, for example in all bundling/unbundling cases that do not require the GPUArch field, e.g. "openmp-powerpc64le-ibm-linux-gnu".

grokos: OK, thanks!

this->OffloadKind = Components[0];

this->Triple = llvm::Triple(Components[1], Components[2], Components[3],

Components[4]);

this->GPUArch = Components[5];

}

bool hasHostKind() const { return this->OffloadKind == "host"; }

bool isOffloadKindValid() const {

return OffloadKind == "host" || OffloadKind == "openmp" ||

OffloadKind == "hip" || OffloadKind == "hipv4";

}

bool isTripleValid() const {

return !Triple.str().empty() && Triple.getArch() != Triple::UnknownArch;

} }

bool operator==(const OffloadTargetInfo &Target) const {

return OffloadKind == Target.OffloadKind &&

Triple.isCompatibleWith(Target.Triple) && GPUArch == Target.GPUArch;

}

std::string str() {

return Twine(OffloadKind + "-" + Triple.str() + "-" + GPUArch).str();

}

};
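
To make the `split` / `resize(6)` discussion above concrete, here is a minimal sketch of the same parsing idea using only the standard library. `TargetParts` and `parseTarget` are hypothetical names introduced for illustration and are not part of the patch, which uses `llvm::StringRef` and `llvm::Triple`. The target string `hip-amdgcn-amd-amdhsa--gfx906` is assumed here as a six-field example; the five-field string is the one quoted in the review thread.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for the patch's OffloadTargetInfo (illustration only).
struct TargetParts {
  std::string Kind;    // e.g. "openmp", "hip", "host"
  std::string Triple;  // the four triple components re-joined with '-'
  std::string GPUArch; // empty when the target string has no sixth field
};

// Split Target on '-' at most 5 times, so that a sixth piece keeps any
// remaining '-' characters, then pad the result to exactly 6 entries.
TargetParts parseTarget(const std::string &Target) {
  std::vector<std::string> Components;
  std::size_t Start = 0;
  while (Components.size() < 5) {
    std::size_t Dash = Target.find('-', Start);
    if (Dash == std::string::npos)
      break;
    Components.push_back(Target.substr(Start, Dash - Start));
    Start = Dash + 1;
  }
  Components.push_back(Target.substr(Start));
  // Targets without a GPUArch yield only 5 pieces; pad with empty strings.
  Components.resize(6);
  assert(Components.size() == 6 && "expected exactly six components");
  return {Components[0],
          Components[1] + "-" + Components[2] + "-" + Components[3] + "-" +
              Components[4],
          Components[5]};
}
```

Under these assumptions, `parseTarget("openmp-powerpc64le-ibm-linux-gnu")` leaves `GPUArch` empty, which is the case saiislam describes; without the padding step, reading the sixth component would be out of bounds.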

  /// Generic file handler interface.
  class FileHandler {
  public:
    struct BundleInfo {
      StringRef BundleID;
    };
    FileHandler() {}
    virtual ~FileHandler() {}
    /// Update the file handler with information from the header of the bundled
    /// file.
    virtual Error ReadHeader(MemoryBuffer &Input) = 0;

ABataev (Done): No need for `else` here

    /// Read the marker of the next bundled to be read in the file. The bundle
    /// name is returned if there is one in the file, or `None` if there are no
    /// more bundles to be read.
    virtual Expected<Optional<StringRef>>
    ReadBundleStart(MemoryBuffer &Input) = 0;
    /// Read the marker that closes the current bundle.
    virtual Error ReadBundleEnd(MemoryBuffer &Input) = 0;
    /// Read the current bundle and write the result into the stream \a OS.
-   virtual Error ReadBundle(raw_fd_ostream &OS, MemoryBuffer &Input) = 0;
+   virtual Error ReadBundle(raw_ostream &OS, MemoryBuffer &Input) = 0;
    /// Write the header of the bundled file to \a OS based on the information
    /// gathered from \a Inputs.
    virtual Error WriteHeader(raw_fd_ostream &OS,
                              ArrayRef<std::unique_ptr<MemoryBuffer>> Inputs) = 0;
    /// Write the marker that initiates a bundle for the triple \a TargetTriple to
    /// \a OS.

[198 lines not shown] Expected<Optional<StringRef>> ReadBundleStart(MemoryBuffer &Input) final {
      return CurBundleInfo->first();
    }
    Error ReadBundleEnd(MemoryBuffer &Input) final {
      assert(CurBundleInfo != BundlesInfo.end() && "Invalid reader info!");
      return Error::success();
    }
-   Error ReadBundle(raw_fd_ostream &OS, MemoryBuffer &Input) final {
+   Error ReadBundle(raw_ostream &OS, MemoryBuffer &Input) final {
      assert(CurBundleInfo != BundlesInfo.end() && "Invalid reader info!");
      StringRef FC = Input.getBuffer();
      OS.write(FC.data() + CurBundleInfo->second.Offset,
               CurBundleInfo->second.Size);
      return Error::success();
    }
    Error WriteHeader(raw_fd_ostream &OS,

[146 lines not shown] while (NextSection != Obj->section_end()) {
        if (*TripleOrErr)
          return **TripleOrErr;
      }
      return None;
    }
    Error ReadBundleEnd(MemoryBuffer &Input) final { return Error::success(); }
-   Error ReadBundle(raw_fd_ostream &OS, MemoryBuffer &Input) final {
+   Error ReadBundle(raw_ostream &OS, MemoryBuffer &Input) final {
      Expected<StringRef> ContentOrErr = CurrentSection->getContents();
      if (!ContentOrErr)
        return ContentOrErr.takeError();
      StringRef Content = *ContentOrErr;
      // Copy fat object contents to the output when extracting host bundle.
      if (Content.size() == 1u && Content.front() == 0)
        Content = StringRef(Input.getBufferStart(), Input.getBufferSize());

[159 lines not shown] Error ReadBundleEnd(MemoryBuffer &Input) final {
      size_t TripleEnd = ReadChars = FC.find("\n", ReadChars + 1);
      if (TripleEnd != FC.npos)
        // Next time we read after the new line.
        ++ReadChars;
      return Error::success();
    }
-   Error ReadBundle(raw_fd_ostream &OS, MemoryBuffer &Input) final {
+   Error ReadBundle(raw_ostream &OS, MemoryBuffer &Input) final {
      StringRef FC = Input.getBuffer();
      size_t BundleStart = ReadChars;
      // Find end of the bundle.
      size_t BundleEnd = ReadChars = FC.find(BundleEndString, ReadChars);
      StringRef Bundle(&FC.data()[BundleStart], BundleEnd - BundleStart);
      OS << Bundle;

[78 lines not shown] CreateFileHandler(MemoryBuffer &FirstInput) {
    if (FilesType == "ll")
      return std::make_unique<TextFileHandler>(/*Comment=*/";");
    if (FilesType == "bc")
      return std::make_unique<BinaryFileHandler>();
    if (FilesType == "s")
      return std::make_unique<TextFileHandler>(/*Comment=*/"#");
    if (FilesType == "o")
      return CreateObjectFileHandler(FirstInput);
+   if (FilesType == "a")
+     return CreateObjectFileHandler(FirstInput);
    if (FilesType == "gch")
      return std::make_unique<BinaryFileHandler>();
    if (FilesType == "ast")
      return std::make_unique<BinaryFileHandler>();
    return createStringError(errc::invalid_argument,
                             "'" + FilesType + "': invalid file type specified");
  }

[128 lines not shown] if (EC)
        return createFileError(Output->second, EC);
      if (Error Err = FH->ReadBundle(OutputFile, Input))
        return Err;
      if (Error Err = FH->ReadBundleEnd(Input))
        return Err;
      Worklist.erase(Output);
      // Record if we found the host bundle.
-     if (hasHostKind(CurTriple))
+     auto OffloadInfo = OffloadTargetInfo(CurTriple);
+     if (OffloadInfo.hasHostKind())
        FoundHostBundle = true;
    }
    if (!AllowMissingBundles && !Worklist.empty()) {
      std::string ErrMsg = "Can't find bundles for";
      std::set<StringRef> Sorted;
      for (auto &E : Worklist)
        Sorted.insert(E.first());

[16 lines not shown] static Error UnbundleFiles() {
    if (Worklist.size() == TargetNames.size()) {
      for (auto &E : Worklist) {
        std::error_code EC;
        raw_fd_ostream OutputFile(E.second, EC, sys::fs::OF_None);
        if (EC)
          return createFileError(E.second, EC);
        // If this entry has a host kind, copy the input file to the output file.
-       if (hasHostKind(E.first()))
+       auto OffloadInfo = OffloadTargetInfo(E.getKey());
+       if (OffloadInfo.hasHostKind())
          OutputFile.write(Input.getBufferStart(), Input.getBufferSize());
      }
      return Error::success();
    }
    // If we found elements, we emit an error if none of those were for the host
    // in case host bundle name was provided in command line.
    if (!FoundHostBundle && HostInputIndex != ~0u)
      return createStringError(inconvertibleErrorCode(),
                               "Can't find bundle for the host target");
    // If we still have any elements in the worklist, create empty files for them.
    for (auto &E : Worklist) {
      std::error_code EC;
      raw_fd_ostream OutputFile(E.second, EC, sys::fs::OF_None);
      if (EC)
        return createFileError(E.second, EC);
    }
    return Error::success();
  }

static Archive::Kind getDefaultArchiveKindForHost() {

return Triple(sys::getDefaultTargetTriple()).isOSDarwin() ? Archive::K_DARWIN

: Archive::K_GNU;

}

/// @brief Checks if a code object \p CodeObjectInfo is compatible with a given

/// target \p TargetInfo.

/// @link https://clang.llvm.org/docs/ClangOffloadBundler.html#bundle-entry-id

bool isCodeObjectCompatible(OffloadTargetInfo &CodeObjectInfo,

OffloadTargetInfo &TargetInfo) {

// Compatible in case of exact match.

if (CodeObjectInfo == TargetInfo) {

DEBUG_WITH_TYPE(

"CodeObjectCompatibility",

dbgs() << "Compatible: Exact match: " << CodeObjectInfo.str() << "\n");

return true;

ABataev (Not Done): No need for `else` here

}

// Incompatible if Kinds or Triples mismatch.

if (CodeObjectInfo.OffloadKind != TargetInfo.OffloadKind ||

!CodeObjectInfo.Triple.isCompatibleWith(TargetInfo.Triple)) {

ABataev (Done): I think llvm Support lib has all required functions for this.

DEBUG_WITH_TYPE(

"CodeObjectCompatibility",

dbgs() << "Incompatible: Kind/Triple mismatch \t[CodeObject: "

<< CodeObjectInfo.str() << "]\t:\t[Target: " << TargetInfo.str()

<< "]\n");

return false;

}

// Incompatible if GPUArch mismatch.

if (CodeObjectInfo.GPUArch != TargetInfo.GPUArch) {

DEBUG_WITH_TYPE("CodeObjectCompatibility",

dbgs() << "Incompatible: GPU Arch mismatch \t[CodeObject: "

<< CodeObjectInfo.str()

<< "]\t:\t[Target: " << TargetInfo.str() << "]\n");

return false;

}

DEBUG_WITH_TYPE(

"CodeObjectCompatibility",

dbgs() << "Compatible: Code Objects are compatible \t[CodeObject: "

<< CodeObjectInfo.str() << "]\t:\t[Target: " << TargetInfo.str()

<< "]\n");

return true;

}

/// @brief Computes a list of targets among all given targets which are

/// compatible with this code object

/// @param [in] Code Object \p CodeObject

grokos (Not Done): `CodeObject` --> `CodeObjectInfo`

/// @param [out] List of all compatible targets \p CompatibleTargets among all

/// given targets

/// @return false, if no compatible target is found.

static bool

getCompatibleOffloadTargets(OffloadTargetInfo &CodeObjectInfo,

SmallVectorImpl<StringRef> &CompatibleTargets) {

if (!CompatibleTargets.empty()) {

DEBUG_WITH_TYPE("CodeObjectCompatibility",

dbgs() << "CompatibleTargets list should be empty\n");

return false;

ABataev (Done): Do not use `auto` where the type is not obvious.

}

for (auto &Target : TargetNames) {

auto TargetInfo = OffloadTargetInfo(Target);

if (isCodeObjectCompatible(CodeObjectInfo, TargetInfo))

CompatibleTargets.push_back(Target);

}

return !CompatibleTargets.empty();

}
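
The two functions above, `isCodeObjectCompatible` and `getCompatibleOffloadTargets`, amount to a filter over the requested targets. A minimal sketch follows, under loudly simplified assumptions: `SimpleTarget`, `isCompatible`, and `compatibleTargets` are hypothetical names, and triples are compared as plain strings, whereas the patch relies on `llvm::Triple::isCompatibleWith`.

```cpp
#include <string>
#include <vector>

// Hypothetical, simplified target description (illustration only).
struct SimpleTarget {
  std::string Name; // the full target string as given on the command line
  std::string Kind, Triple, GPUArch;
};

// Kind and Triple must match first; after that, GPUArch must match too.
// Assumption: string equality stands in for llvm::Triple::isCompatibleWith.
bool isCompatible(const SimpleTarget &CodeObject, const SimpleTarget &Target) {
  if (CodeObject.Kind != Target.Kind || CodeObject.Triple != Target.Triple)
    return false;
  return CodeObject.GPUArch == Target.GPUArch;
}

// Collect the names of all requested targets compatible with CodeObject.
// Mirrors the patch's contract: Out must start empty, and the function
// returns false when no compatible target is found.
bool compatibleTargets(const SimpleTarget &CodeObject,
                       const std::vector<SimpleTarget> &Requested,
                       std::vector<std::string> &Out) {
  if (!Out.empty())
    return false;
  for (const SimpleTarget &T : Requested)
    if (isCompatible(CodeObject, T))
      Out.push_back(T.Name);
  return !Out.empty();
}
```

One consequence of this contract is that a `false` return is ambiguous: it can mean either "no compatible target" or "the output list was not empty on entry". A caller that needs to distinguish the two would have to check the list itself.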

/// UnbundleArchive takes an archive file (".a") as input containing bundled

/// code object files, and a list of offload targets (not host), and extracts

/// the code objects into a new archive file for each offload target. Each

/// resulting archive file contains all code object files corresponding to that

/// particular offload target. The created archive file does not

/// contain an index of the symbols and code object files are named as

/// <<Parent Bundle Name>-<CodeObject's GPUArch>>, with ':' replaced with '_'.

static Error UnbundleArchive() {

std::vector<std::unique_ptr<MemoryBuffer>> ArchiveBuffers;

/// Map of target names with list of object files that will form the device

/// specific archive for that target

StringMap<std::vector<NewArchiveMember>> OutputArchivesMap;

// Map of target names and output archive filenames

StringMap<StringRef> TargetOutputFileNameMap;

auto Output = OutputFileNames.begin();

for (auto &Target : TargetNames) {

TargetOutputFileNameMap[Target] = *Output;

++Output;

}

StringRef IFName = InputFileNames.front();

ErrorOr<std::unique_ptr<MemoryBuffer>> BufOrErr =

MemoryBuffer::getFileOrSTDIN(IFName, -1, false);

if (std::error_code EC = BufOrErr.getError())

return createFileError(InputFileNames.front(), EC);

ArchiveBuffers.push_back(std::move(*BufOrErr));

Expected<std::unique_ptr<llvm::object::Archive>> LibOrErr =

Archive::create(ArchiveBuffers.back()->getMemBufferRef());

if (!LibOrErr)

return LibOrErr.takeError();

auto Archive = std::move(*LibOrErr);

Error ArchiveErr = Error::success();

auto ChildEnd = Archive->child_end();

ABataev (Not Done): Just `continue` and make `else if` just `if`

saiislam (Done): That wasn't possible with the code flow; there is stuff to be processed in case of failure as well.

/// Iterate over all bundled code object files in the input archive.

for (auto ArchiveIter = Archive->child_begin(ArchiveErr);

ArchiveIter != ChildEnd; ++ArchiveIter) {

if (ArchiveErr)

return ArchiveErr;

auto ArchiveChildNameOrErr = (*ArchiveIter).getName();

if (!ArchiveChildNameOrErr)

return ArchiveChildNameOrErr.takeError();

StringRef BundledObjectFile = sys::path::filename(*ArchiveChildNameOrErr);

auto CodeObjectBufferRefOrErr = (*ArchiveIter).getMemoryBufferRef();

if (!CodeObjectBufferRefOrErr)

return CodeObjectBufferRefOrErr.takeError();

auto CodeObjectBuffer =

MemoryBuffer::getMemBuffer(*CodeObjectBufferRefOrErr, false);

Expected<std::unique_ptr<FileHandler>> FileHandlerOrErr =

CreateFileHandler(*CodeObjectBuffer);

if (!FileHandlerOrErr)

return FileHandlerOrErr.takeError();

std::unique_ptr<FileHandler> &FileHandler = *FileHandlerOrErr;

assert(FileHandler &&

"FileHandle creation failed for file in the archive!");

if (Error ReadErr = FileHandler.get()->ReadHeader(*CodeObjectBuffer))

return ReadErr;

Expected<Optional<StringRef>> CurBundleIDOrErr =

FileHandler->ReadBundleStart(*CodeObjectBuffer);

if (!CurBundleIDOrErr)

return CurBundleIDOrErr.takeError();

Optional<StringRef> OptionalCurBundleID = *CurBundleIDOrErr;

// No device code in this child, skip.

if (!OptionalCurBundleID.hasValue())

continue;

StringRef CodeObject = *OptionalCurBundleID;

// Process all bundle entries (CodeObjects) found in this child of input

// archive.

while (!CodeObject.empty()) {

SmallVector<StringRef> CompatibleTargets;

auto CodeObjectInfo = OffloadTargetInfo(CodeObject);

if (CodeObjectInfo.hasHostKind()) {

// Do nothing, we don't extract host code yet.

} else if (getCompatibleOffloadTargets(CodeObjectInfo,

CompatibleTargets)) {

std::string BundleData;

raw_string_ostream DataStream(BundleData);

if (Error Err =

FileHandler.get()->ReadBundle(DataStream, *CodeObjectBuffer))

return Err;

for (auto &CompatibleTarget : CompatibleTargets) {

SmallString<128> BundledObjectFileName;

BundledObjectFileName.assign(BundledObjectFile);

auto OutputBundleName =

Twine(llvm::sys::path::stem(BundledObjectFileName) + "-" +

CodeObject)

.str();

// Replace ':' in optional target feature list with '_' to ensure

// cross-platform validity.

std::replace(OutputBundleName.begin(), OutputBundleName.end(), ':',

'_');

std::unique_ptr<MemoryBuffer> MemBuf = MemoryBuffer::getMemBufferCopy(

DataStream.str(), OutputBundleName);

ArchiveBuffers.push_back(std::move(MemBuf));

llvm::MemoryBufferRef MemBufRef =

MemoryBufferRef(*(ArchiveBuffers.back()));

// For inserting <CompatibleTarget, list<CodeObject>> entry in

// OutputArchivesMap.

if (OutputArchivesMap.find(CompatibleTarget) ==

OutputArchivesMap.end()) {

std::vector<NewArchiveMember> ArchiveMembers;

ArchiveMembers.push_back(NewArchiveMember(MemBufRef));

OutputArchivesMap.insert_or_assign(CompatibleTarget,

std::move(ArchiveMembers));

} else {

OutputArchivesMap[CompatibleTarget].push_back(

NewArchiveMember(MemBufRef));

}

if (Error Err = FileHandler.get()->ReadBundleEnd(*CodeObjectBuffer))

return Err;

Expected<Optional<StringRef>> NextTripleOrErr =

FileHandler->ReadBundleStart(*CodeObjectBuffer);

if (!NextTripleOrErr)

return NextTripleOrErr.takeError();

CodeObject = ((*NextTripleOrErr).hasValue()) ? **NextTripleOrErr : "";

} // End of processing of all bundle entries of this child of input archive.

} // End of while over children of input archive.

jdoerfert (Not Done): Add a message to asserts, here and in other places.

assert(!ArchiveErr && "Error occured while reading archive!");

/// Write out an archive for each target

for (auto &Target : TargetNames) {

StringRef FileName = TargetOutputFileNameMap[Target];

StringMapIterator<std::vector<llvm::NewArchiveMember>> CurArchiveMembers =

OutputArchivesMap.find(Target);

if (CurArchiveMembers != OutputArchivesMap.end()) {

if (Error WriteErr = writeArchive(FileName, CurArchiveMembers->getValue(),

true, getDefaultArchiveKindForHost(),

true, false, nullptr))

return WriteErr;

} else if (!AllowMissingBundles) {

std::string ErrMsg =

Twine("no compatible code object found for the target '" + Target +

"' in heterogenous archive library: " + IFName)

.str();

return createStringError(inconvertibleErrorCode(), ErrMsg);

}

return Error::success();

}
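
Two small pieces of `UnbundleArchive` can be sketched in isolation: how each extracted member is named, and how members are grouped per target. This is a standard-library sketch with hypothetical helper names (`memberName`, `addMember`), not the patch's code. Note that the code as shown appends the whole bundle entry ID (`CodeObject`) to the stem, while the doc comment mentions only the GPUArch; the sketch follows the code. The stem handling is an assumption and may differ from `llvm::sys::path::stem` in corner cases.

```cpp
#include <algorithm>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Build "<stem>-<bundle entry ID>" and replace ':' with '_' so the name is
// valid across platforms (hypothetical helper).
std::string memberName(const std::string &FileName,
                       const std::string &BundleID) {
  std::size_t Dot = FileName.rfind('.');
  // Assumption: strip only the last extension; keep names without one as-is.
  std::string Stem = (Dot == std::string::npos || Dot == 0)
                         ? FileName
                         : FileName.substr(0, Dot);
  std::string Name = Stem + "-" + BundleID;
  std::replace(Name.begin(), Name.end(), ':', '_');
  return Name;
}

// Append Member to the list for Target. operator[] default-constructs the
// vector on first use, so no separate "find, then insert" branch is needed.
void addMember(std::map<std::string, std::vector<std::string>> &Archives,
               const std::string &Target, const std::string &Member) {
  Archives[Target].push_back(Member);
}
```

The `addMember` form suggests that the `find` / `insert_or_assign` / `else` branch in the patch could possibly be collapsed into a single `OutputArchivesMap[CompatibleTarget].push_back(...)`, since `StringMap::operator[]` also default-constructs missing values; that is an observation, not something the reviewers requested.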

  static void PrintVersion(raw_ostream &OS) {
    OS << clang::getClangToolFullVersion("clang-offload-bundler") << '\n';
  }
  int main(int argc, const char **argv) {
    sys::PrintStackTraceOnErrorSignal(argv[0]);
    cl::HideUnrelatedOptions(ClangOffloadBundlerCategory);
[68 lines not shown] if (InputFileNames.size() != 1) {
            "only one input file supported in unbundling mode"));
      }
      if (OutputFileNames.size() != TargetNames.size()) {
        reportError(createStringError(errc::invalid_argument,
                                      "number of output files and targets should "
                                      "match in unbundling mode"));
      }
    } else {
+     if (FilesType == "a") {
+       reportError(createStringError(errc::invalid_argument,
+                                     "Archive files are only supported "
+                                     "for unbundling"));
+     }
      if (OutputFileNames.size() != 1) {
        reportError(createStringError(
            errc::invalid_argument,
            "only one output file supported in bundling mode"));
      }
      if (InputFileNames.size() != TargetNames.size()) {
        reportError(createStringError(
            errc::invalid_argument,

[9 lines not shown] int main(int argc, const char **argv) {
    llvm::DenseSet<StringRef> ParsedTargets;
    for (StringRef Target : TargetNames) {
      if (ParsedTargets.contains(Target)) {
        reportError(createStringError(errc::invalid_argument,
                                      "Duplicate targets are not allowed"));
      }
      ParsedTargets.insert(Target);
-     StringRef Kind;
-     StringRef Triple;
-     getOffloadKindAndTriple(Target, Kind, Triple);
-     bool KindIsValid = !Kind.empty();
-     KindIsValid = KindIsValid && StringSwitch<bool>(Kind)
-                                      .Case("host", true)
-                                      .Case("openmp", true)
-                                      .Case("hip", true)
-                                      .Case("hipv4", true)
-                                      .Default(false);
-     bool TripleIsValid = !Triple.empty();
-     llvm::Triple T(Triple);
-     TripleIsValid &= T.getArch() != Triple::UnknownArch;
+     auto OffloadInfo = OffloadTargetInfo(Target);
+     bool KindIsValid = OffloadInfo.isOffloadKindValid();
+     bool TripleIsValid = OffloadInfo.isTripleValid();
      if (!KindIsValid || !TripleIsValid) {
        SmallVector<char, 128u> Buf;
        raw_svector_ostream Msg(Buf);
        Msg << "invalid target '" << Target << "'";
        if (!KindIsValid)
-         Msg << ", unknown offloading kind '" << Kind << "'";
+         Msg << ", unknown offloading kind '" << OffloadInfo.OffloadKind << "'";
        if (!TripleIsValid)
-         Msg << ", unknown target triple '" << Triple << "'";
+         Msg << ", unknown target triple '" << OffloadInfo.Triple.str() << "'";
        reportError(createStringError(errc::invalid_argument, Msg.str()));
      }
-     if (KindIsValid && Kind == "host") {
+     if (KindIsValid && OffloadInfo.hasHostKind()) {
        ++HostTargetNum;
        // Save the index of the input that refers to the host.
        HostInputIndex = Index;
      }
-     if (Kind != "hip" && Kind != "hipv4")
+     if (OffloadInfo.OffloadKind != "hip" && OffloadInfo.OffloadKind != "hipv4")
        HIPOnly = false;
      ++Index;
    }
    // HIP uses clang-offload-bundler to bundle device-only compilation results
    // for multiple GPU archs, therefore allow no host target if all entries
    // are for HIP.
    AllowNoHost = HIPOnly;
    // Host triple is not really needed for unbundling operation, so do not
    // treat missing host triple as error if we do unbundling.
    if ((Unbundle && HostTargetNum > 1) ||
        (!Unbundle && HostTargetNum != 1 && !AllowNoHost)) {
      reportError(createStringError(errc::invalid_argument,
                                    "expecting exactly one host target but got " +
                                        Twine(HostTargetNum)));
    }
-   doWork([]() { return Unbundle ? UnbundleFiles() : BundleFiles(); });
+   doWork([]() {
+     if (Unbundle) {
+       if (FilesType == "a")
+         return UnbundleArchive();
+       else
+         return UnbundleFiles();
+     } else
+       return BundleFiles();
+   });

jdoerfert (Not Done): By now, a proper conditional seems appropriate.

    return 0;
  }
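
The host-target check near the end of `main` packs two rules into one condition. As a reading aid, the sketch below restates it as a standalone predicate; `hostCountIsInvalid` is a hypothetical name and the function is not part of the patch.

```cpp
// Returns true when the number of "host" targets should be rejected.
// Unbundling: zero or one host target is fine, more than one is an error.
// Bundling: exactly one host target is required, unless AllowNoHost is set
// (which the code above sets when every target is a HIP target).
bool hostCountIsInvalid(bool Unbundle, unsigned HostTargetNum,
                        bool AllowNoHost) {
  if (Unbundle)
    return HostTargetNum > 1;
  return HostTargetNum != 1 && !AllowNoHost;
}
```

Read this way, a HIP-only bundling run is accepted with any number of host targets, because `AllowNoHost` short-circuits the bundling rule entirely; whether that is intended is not stated in the review.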


Diff 355514

clang/docs/ClangOffloadBundler.rst

clang/lib/Driver/ToolChains/Clang.cpp

clang/test/Driver/clang-offload-bundler.c

clang/test/Driver/hip-rdc-device-only.hip

clang/test/Driver/hip-toolchain-rdc-separate.hip

clang/tools/clang-offload-bundler/ClangOffloadBundler.cpp
