This is an archive of the discontinued LLVM Phabricator instance.

[OpenMP] Enable the existing nocudalib flag for OpenMP offloading toolchain.
ClosedPublic

Authored by gtbercea on Sep 15 2017, 11:36 AM.

Download Raw Diff

Details

Reviewers

Hahnfeld
ABataev
carlo.bertolli
caomhin
hfinkel
tra

Commits

rG20789a5f096e: [OpenMP] Enable the existing nocudalib flag for OpenMP offloading toolchain.
rC314164: [OpenMP] Enable the existing nocudalib flag for OpenMP offloading toolchain.
rL314164: [OpenMP] Enable the existing nocudalib flag for OpenMP offloading toolchain.

Summary

Enable the -nocudalib flag for the OpenMP device offloading toolchain as well. Currently it can only be used for the CUDA toolchain.

Diff Detail

Build Status

Buildable 10583
Build 10583: arc lint + arc unit

Event Timeline

gtbercea created this revision.Sep 15 2017, 11:36 AM

Please add a test case.

Add test.

gtbercea added a reviewer: hfinkel.Sep 18 2017, 12:12 PM

gtbercea added a reviewer: tra.Sep 19 2017, 8:46 AM

tra added inline comments.Sep 19 2017, 10:16 AM

lib/Driver/ToolChains/Cuda.cpp
255–257	The purpose of the original assert was to catch a programming error and this change negates that purpose. Perhaps I'm missing something. Could you elaborate on what's the motivation for this particular change? I don't understand why it would be OK to end up with an unknown GPU architecture if -nocudalib is specified. You still do want to pass some specific GPU arch to ptxas and that has nothing to do with whether you happen to have suitable libdevice.

Don't take into account unknown CUDA archs not even for testing purposes.

One small nit. LGTM otherwise.

test/Driver/openmp-offload-gpu.c
133	Please split this RUN line further.

This revision is now accepted and ready to land.Sep 20 2017, 9:37 AM

Split line.

gtbercea closed this revision.Sep 25 2017, 2:58 PM

Revision Contents

Path

Size

lib/

Driver/

ToolChains/

Cuda.cpp

6 lines

test/

Driver/

openmp-offload-gpu.c

10 lines

Commit	Tree	Parents	Author	Summary	Date
0d4d2e980a00	b131afa56381	220f854b87b7	Doru Bercea	Add test break lines.	Sep 25 2017, 2:29 PM
220f854b87b7	852b50c4357a	59e250bab79a 3283b6fcfe3f	Doru Bercea	Merge branch 'unpatched-master' into patch8-8-2	Sep 25 2017, 2:28 PM
59e250bab79a	b1177334d271	49337b204ec1 1b44abc8384c	Doru Bercea	Merge branch 'patch8-8-1' into patch8-8-2	Sep 25 2017, 12:47 PM
1b44abc8384c	7a903441f438	0f5bd43c72a3 0e312680f153	Doru Bercea	Merge branch 'unpatched-master' into patch8-8-1	Sep 25 2017, 12:44 PM
49337b204ec1	9eac0a3bc419	25cb1e9b003e	Doru Bercea	Fix.	Sep 19 2017, 6:03 PM
25cb1e9b003e	a13dc4d5483c	275fc2d74afc 0f5bd43c72a3	Doru Bercea	Merge branch 'patch8-8-1' into patch8-8-2 (Show More…)	Sep 19 2017, 5:57 PM
0f5bd43c72a3	5f17d6cdeeab	542fe8dba2a6	Doru Bercea	Fix.	Sep 19 2017, 5:50 PM
275fc2d74afc	308bc3fa1d9b	7d07f6e37ae5	Doru Bercea	fix	Sep 18 2017, 11:25 AM
7d07f6e37ae5	7bd3cd6ef350	c9d692d84266	Doru Bercea	Fix no cudalib flag passing.	Sep 18 2017, 11:20 AM
542fe8dba2a6	02a0be29d4d6	e0ddabf46c5f	Doru Bercea	Fix file name	Sep 18 2017, 9:13 AM
c9d692d84266	b0a46f02575e	1690d0bd589e	Doru Bercea	Fix file name	Sep 18 2017, 9:12 AM
e0ddabf46c5f	d9acdb8a7e43	cde98b84124f	Doru Bercea	Fix tests.	Sep 15 2017, 2:28 PM
1690d0bd589e	53f65d9e5496	873bdd1aa9a8	Doru Bercea	Fix cubin tests.	Sep 15 2017, 2:26 PM
873bdd1aa9a8	a9a706345cfd	cde98b84124f	Doru Bercea	Enable nocudalib flag.	Sep 15 2017, 11:31 AM
cde98b84124f	f13b4e120102	98e6b05fa4bb	Doru Bercea	Fix.	Sep 15 2017, 10:59 AM
98e6b05fa4bb	653d5388ae31	503a94283df5 a792780cf2a3	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Sep 15 2017, 10:17 AM
503a94283df5	17e3e69ebb83	92a6b9153cbb 1af16988a373	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Sep 15 2017, 10:14 AM
92a6b9153cbb	d07f4ae678b0	ad0a0bda3925	Doru Bercea	Move flag checks inside libdevice check.	Sep 14 2017, 8:17 AM
ad0a0bda3925	f4138e541f54	945325f1f092 8a058429f72e	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Sep 13 2017, 8:30 AM
945325f1f092	201c6792d598	ae35a46f9356	Doru Bercea	Fix path to cubin when -save-temps is not passed.	Aug 21 2017, 4:22 PM
ae35a46f9356	ca15d2ab39fb	fd4fde205b93 987a86cd11ac	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Aug 21 2017, 2:56 PM
fd4fde205b93	1635ba61fbfb	e093f019432b	Doru Bercea	Don't look for cuda lib when compiling with -S and -c.	Aug 16 2017, 1:11 PM
e093f019432b	dec9e1d09db3	03084050974a	Doru Bercea	Make sure nocudalib flag is respected.	Aug 15 2017, 12:28 PM
03084050974a	bac92635e224	bcea2b15f3c0 5280dab80aa7	Doru Bercea	Merge remote-tracking branch 'ibm/unpatched-master' into patch8-8	Aug 15 2017, 10:32 AM
bcea2b15f3c0	a01c1f115c73	09474c5359bb	Doru Bercea	Move flag patch tests to gpu offloading.	Aug 11 2017, 2:09 PM
09474c5359bb	295e17e11c93	2f24ad2cd7a1	Doru Bercea	Fixes.	Aug 11 2017, 2:04 PM
2f24ad2cd7a1	61dded6fac18	1b1e48f006a7 078b7e8f1ede	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Aug 11 2017, 1:20 PM
1b1e48f006a7	5842680d6878	655ed700be6d	Doru Bercea	Save the current hanges	Aug 11 2017, 1:16 PM
655ed700be6d	bdf669f71c70	fadfd5aaf6b6 58f82409fac6	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Aug 11 2017, 8:51 AM
fadfd5aaf6b6	165a9ccd278c	850fa68f795c 69a6da714c0a	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Aug 11 2017, 8:30 AM
850fa68f795c	33ea63300396	026064557684	Doru Bercea	Add GPU offloading tests in a separate file.	Aug 11 2017, 8:08 AM
026064557684	a1b18ab30ed1	a01ff11275a4	Doru Bercea	Enable compute capability search.	Aug 10 2017, 6:12 PM
a01ff11275a4	f3159a6ca56d	9628e540afb2	Gheorghe-Teodor Bercea	LLVM-LIT mangles file names.	Aug 10 2017, 5:41 PM
9628e540afb2	0b79ad6d8096	0ec685dc7d20	Doru Bercea	Enable everything.	Aug 10 2017, 5:06 PM
0ec685dc7d20	b887b3336f5d	576d8a933f77	Doru Bercea	Remove unreachable.	Aug 10 2017, 4:34 PM
576d8a933f77	fff7ba2fe940	2f11b5eb4618	Doru Bercea	Enable offload tests.	Aug 10 2017, 4:20 PM
2f11b5eb4618	b949ab0bc33b	2644ac123b8d f7558e5102a0	Doru Bercea	Merge branch 'unpatched-master' into patch8-8 (Show More…)	Aug 10 2017, 2:08 PM
2644ac123b8d	f485469c4955	5ac7e668c35c	Doru Bercea	Add early exit once no libdevice libs are detected.	Aug 10 2017, 1:38 PM
5ac7e668c35c	615f87130845	798b3c618dbc 15af0ebfc46e	Doru Bercea	Merge branch npatched-master' into patch8-8	Aug 10 2017, 9:48 AM
798b3c618dbc	6ec6efe740d0	836fde0ac478	Doru Bercea	Fix tests.	Aug 10 2017, 9:46 AM
836fde0ac478	f603dbc59dec	184cdf1dd805	Doru Bercea	Fix tests.	Aug 10 2017, 9:39 AM
184cdf1dd805	5d5a4f888a0b	2a9b2712d81b a545c71ca54c	Doru Bercea	Merge branch 'patch8-6' into patch8-8	Aug 10 2017, 7:23 AM
a545c71ca54c	d8172c8a5942	23246d26488a 00186ef1f9bf	Doru Bercea	Merge branch 'unpatched-master' into patch8-6	Aug 10 2017, 7:21 AM
2a9b2712d81b	5f3eb9d31861	1ede13a1a3be 442874b97b7d	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Aug 9 2017, 9:48 PM
1ede13a1a3be	187085a022c9	5d0b1c53170d	Doru Bercea	Fix tests.	Aug 9 2017, 9:46 PM
5d0b1c53170d	2bbd4cd4e732	a6ad815539ea c0edc1569159	Doru Bercea	Merge branch 'unpatched-master' into patch8-8 (Show More…)	Aug 9 2017, 4:38 PM
a6ad815539ea	27c57abdb54b	75921481dd54	Doru Bercea	Fix test.	Aug 9 2017, 4:36 PM
75921481dd54	867e54deb171	3f3304f956a3	Doru Bercea	Fix test.	Aug 9 2017, 1:49 PM
3f3304f956a3	b6282c0528de	3575f137965f 16c706343503	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Aug 9 2017, 12:57 PM
3575f137965f	b6282c0528de	79bc5ac6d544 1ba8f524f713	Doru Bercea	Merge branch 'unpatched-master' into patch8-8	Aug 9 2017, 12:42 PM
79bc5ac6d544	9498ac346922	23246d26488a	Doru Bercea	Find executables in driver directory.	Aug 9 2017, 11:42 AM
23246d26488a	bc100547ad11	b5a366a7d6a8 c06b6e025b2c	Doru Bercea	Merge branch 'unpatched-master' into patch8-6	Aug 9 2017, 11:27 AM
b5a366a7d6a8	bc100547ad11	239e681cbb84	Doru Bercea	Fix test to make it generic enough to run on different archs.	Aug 9 2017, 11:19 AM
239e681cbb84	a8a0113c75cd	2fad13153926	Doru Bercea	Fix test to make it generic enough to run on different archs.	Aug 9 2017, 11:10 AM
2fad13153926	780d96e16551	e2b573043d3e 7984a2104f88	Doru Bercea	Merge branch 'unpatched-master' into patch8-6	Aug 9 2017, 9:01 AM
e2b573043d3e	780d96e16551	e5026e160899	Doru Bercea	Fix test.	Aug 9 2017, 8:47 AM
e5026e160899	de29adf6a089	bf0a5aef6ca0	Doru Bercea	Pass ptx flag to openmp target.	Aug 9 2017, 8:39 AM
bf0a5aef6ca0	982ac3fe41f1	dde254a3dd2f 0420738bf55d	Doru Bercea	Merge branch 'unpatched-master' into patch8-5	Aug 9 2017, 8:31 AM
dde254a3dd2f	c74aad5331b9	eea585bab11e	Doru Bercea	Enables the disabling of relocatable default code gen.	Aug 9 2017, 8:12 AM
eea585bab11e	8e65352367c8	37337b6d7337 0e9a73558bc6	Doru Bercea	Merge branch 'patch8-3' into patch8-4	Aug 9 2017, 8:01 AM
0e9a73558bc6	8e65352367c8	e47fd15ccb11 477550e0ffcf	Doru Bercea	Merge branch 'unpatched-master' into patch8-3	Aug 9 2017, 8:01 AM
37337b6d7337	6b76533c6ebd	140f507a1eb7 82579b76c61e	Doru Bercea	Merge branch 'unpatched-master' into patch8-4	Aug 7 2017, 2:13 PM
140f507a1eb7	6b76533c6ebd	ea35a25b7409 8d6e5d9647a0	Doru Bercea	Merge branch 'unpatched-master' into patch8-4	Aug 7 2017, 2:00 PM
ea35a25b7409	6b76533c6ebd	fe52fbbe8bf5 e47fd15ccb11	Doru Bercea	Merge branch 'patch8-3' into patch8-4	Aug 7 2017, 2:00 PM
e47fd15ccb11	9ae4478017f6	914ef36c9e24 ef2aa4f14509	Doru Bercea	Merge branch 'unpatched-master' into patch8-3	Aug 7 2017, 1:55 PM
914ef36c9e24	35947fbad302	f403533ecfaf 0ba6400cafee	Doru Bercea	Merge branch 'unpatched-master' into patch8-3	Aug 7 2017, 1:33 PM
f403533ecfaf	35947fbad302	cb0d4d7b43c9 35165e08d782	Doru Bercea	Merge branch 'patch8-2' into patch8-3	Aug 7 2017, 1:33 PM
35165e08d782	ef54ef37792a	880dc27d981e d73b9aac8050	Doru Bercea	Merge branch 'unpatched-master' into patch8-2	Aug 7 2017, 1:29 PM
880dc27d981e	3753394a6208	a02ae99a8837	Doru Bercea	Fix test flag.	Aug 7 2017, 1:26 PM
a02ae99a8837	dc990d347c69	cbbe1de541bf d3e3cbe1a74d	Doru Bercea	Merge branch 'unpatched-master' into patch8-2	Aug 7 2017, 1:23 PM
cbbe1de541bf	dc990d347c69	d54f5d8e6434 b911b0595b5b	Doru Bercea	Merge branch 'patch8-1' into patch8-2	Aug 7 2017, 1:22 PM
b911b0595b5b	15526026b780	3db58821c128 e617862d3160	Doru Bercea	Merge branch 'unpatched-master' into patch8-1 (Show More…)	Aug 7 2017, 1:08 PM
fe52fbbe8bf5	f7bad0ddea1e	cb0d4d7b43c9	Doru Bercea	Invalid target error.	Aug 7 2017, 8:18 AM
cb0d4d7b43c9	cc3f7f651ed9	d54f5d8e6434	Doru Bercea	Prevent emission of exception handling code.	Aug 7 2017, 8:14 AM
d54f5d8e6434	3aa38393035b	3db58821c128	Doru Bercea	Make code relocatable.	Aug 7 2017, 8:07 AM
3db58821c128	0baa7cf37a4d	b2eebeaa39ef	Doru Bercea	Add -v flag.	Aug 7 2017, 8:04 AM
b2eebeaa39ef	b5982a2b7ac2	58f17851c627 c133d9a63e6a	Doru Bercea	Merge branch 'patch7-1' into patch8	Aug 7 2017, 7:33 AM
c133d9a63e6a	c0e418a55e06	cd90aa271f44 f9faef8fd4d1	Doru Bercea	Merge branch 'unpatched-master' into patch7-1	Aug 7 2017, 7:33 AM
cd90aa271f44	ac1ce2e87cfd	42f6f1533147	Doru Bercea	Fix tests.	Aug 7 2017, 7:27 AM
58f17851c627	c21b146b1edf	9acfde518e12	Doru Bercea	Fix march flag value.	Aug 6 2017, 2:41 PM
9acfde518e12	cf9a9bcabefe	d53fae72e822 42f6f1533147	Doru Bercea	Merge branch 'patch7-1' into patch8	Aug 6 2017, 2:36 PM
42f6f1533147	cb942a3cba5a	7f39b5465baf	Doru Bercea	Fix march special casing.	Aug 6 2017, 2:36 PM
d53fae72e822	e6b30edf9cb6	e543e1a224b8 7f39b5465baf	Doru Bercea	Merge branch 'patch7-1' into patch8	Aug 6 2017, 2:19 PM
7f39b5465baf	51668b96ee5d	c350f62a2966	Doru Bercea	Fix tests.	Aug 6 2017, 2:18 PM
e543e1a224b8	924c6650c9aa	4da865019785 c350f62a2966	Doru Bercea	Merge branch 'patch7-1' into patch8 (Show More…)	Aug 6 2017, 1:53 PM
c350f62a2966	fa83a36fee0e	0ca2d2e570e0	Doru Bercea	Add tests for the errors.	Aug 6 2017, 1:50 PM
4da865019785	7fc08445a74e	a498ed5f28b1 0ca2d2e570e0	Doru Bercea	Merge branch 'patch7-1' into patch8	Aug 6 2017, 1:05 PM
0ca2d2e570e0	9e00ea84dc65	36aca3e9534c	Doru Bercea	Only pass one march to toolchain.	Aug 6 2017, 1:05 PM
a498ed5f28b1	8cad7f9dd5f2	4ee58dab5380 36aca3e9534c	Doru Bercea	Merge branch 'patch7-1' into patch8	Aug 6 2017, 12:49 PM
36aca3e9534c	3b278c7f5699	f0d9136e264e	Doru Bercea	Redo Arch test.	Aug 6 2017, 12:48 PM
4ee58dab5380	9bee9dd3513a	7095a2a7fcdf f0d9136e264e	Doru Bercea	Merge branch 'patch7-1' into patch8 (Show More…)	Aug 6 2017, 12:37 PM
f0d9136e264e	0e591d1050f1	cd3fdf71b9f7	Doru Bercea	Don't treat march differently.	Aug 6 2017, 12:32 PM
cd3fdf71b9f7	6742fe24e5ba	f189081a9b57	Doru Bercea	Don't exclude flags when host matches offload toolchain.	Aug 5 2017, 5:33 PM
f189081a9b57	48ba54dfd8df	7ba7466c673a	Doru Bercea	New way to handle OpenMP target flags.	Aug 5 2017, 4:36 PM
7095a2a7fcdf	b878efb11173	907f3406a32d	Doru Bercea	Fix OpenMP target specific translation.	Aug 5 2017, 4:14 PM
907f3406a32d	3095c99245ee	0d91e8dc8672	Doru Bercea	Add Hal's suggestions.	Aug 5 2017, 2:10 PM
0d91e8dc8672	7ea6650a3415	8f2d461b3236 7ba7466c673a	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 10 2017, 4:16 PM
7ba7466c673a	1e5ad828ff4f	8c98493a3105 dc3817f04345	Doru Bercea	Merge branch 'unpatched-master' into patch7-1	Jul 10 2017, 4:10 PM
8c98493a3105	c6fe206e5472	e49b628b9b30	Doru Bercea	Pass arch to CUDA toolchain.	Jul 10 2017, 4:08 PM
8f2d461b3236	45788800826c	b913ae765d49	Doru Bercea	Add cubin.	Jul 10 2017, 4:03 PM
b913ae765d49	af77e977f00b	405cf90b667a e49b628b9b30	Doru Bercea	Merge branch 'patch7-1' into patch8 (Show More…)	Jul 10 2017, 4:02 PM
e49b628b9b30	ba03818f910c	a3b9099a3b5c	Doru Bercea	Pass Arch to CUDA toolchain.	Jul 10 2017, 2:43 PM
405cf90b667a	dee3151b91ad	cb66ddde5852 5cb26fe27854	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 10 2017, 2:53 PM
5cb26fe27854	8770d4300847	a3b9099a3b5c	Doru Bercea	Pass Arch to CUDA toolchain.	Jul 10 2017, 2:43 PM
cb66ddde5852	edf4f228c8a3	e274794c6522 b4af2b143c00	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 10 2017, 2:51 PM
b4af2b143c00	5fa2ce7fae11	a3b9099a3b5c	Doru Bercea	Pass Arch to CUDA toolchain.	Jul 10 2017, 2:43 PM
e274794c6522	f2b85441bc56	2f9a8172d387 00b9a2bc6540	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 10 2017, 2:44 PM
00b9a2bc6540	a5e35d48be2d	a3b9099a3b5c	Doru Bercea	Pass Arch to CUDA toolchain.	Jul 10 2017, 2:43 PM
2f9a8172d387	5e7c1993c797	1ac08de45ad0 a3b9099a3b5c	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 10 2017, 8:20 AM
a3b9099a3b5c	f76b50c682a8	94207a494779	Doru Bercea	Pass arch to CUDA toolchain.	Jul 10 2017, 8:20 AM
94207a494779	eb159e4ee27c	2eea82dd52d6 5bf57dfedfb0	Doru Bercea	Merge branch 'unpatched-master' into patch7-1	Jul 6 2017, 10:51 AM
2eea82dd52d6	3eeb96d2a7a2	75ff689d1128 9a973f3ee99d	Doru Bercea	Pass CUDA arch.	Jul 6 2017, 9:28 AM
1ac08de45ad0	7a9b541edb3b	3fd139871462 75ff689d1128	Doru Bercea	Merge branch 'patch7-1' into patch8 (Show More…)	Jul 5 2017, 4:29 PM
75ff689d1128	55de8b2f39e9	3a4ccc40bf09	Doru Bercea	Pass arch to CUDA toolchain.	Jul 5 2017, 4:20 PM
3fd139871462	dc1bae55497b	4fc016450025 205c38602112	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 5 2017, 4:20 PM
205c38602112	656b0c19e0e4	3a4ccc40bf09	Doru Bercea	Pass arch to CUDA toolchain.	Jul 5 2017, 4:20 PM
4fc016450025	3c38310440fe	1b1ee746d823 3a4ccc40bf09	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 5 2017, 3:53 PM
3a4ccc40bf09	1ba719c56c2e	a09448ce7a50	Doru Bercea	Pass arch to CUDA toolchain.	Jul 5 2017, 3:51 PM
1b1ee746d823	a23c209efef4	c99aae0fed67	Doru Bercea	Add cubin.	Jul 5 2017, 3:15 PM
c99aae0fed67	16f02fe23425	6ec7552dccc5 a09448ce7a50	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 5 2017, 2:55 PM
a09448ce7a50	4c8fa30d042d	920d3a6880a8	Doru Bercea	Pass arch to CUDA toolchain.	Jul 5 2017, 2:54 PM
6ec7552dccc5	94f2cecfe259	ea72ea394917 920d3a6880a8	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 5 2017, 1:33 PM
920d3a6880a8	9d628bf51879	4e9493a4164e 2478d528547b	Doru Bercea	Merge branch 'patch5-2' into patch7-1	Jul 5 2017, 1:32 PM
2478d528547b	aa92df1550f8	c0ef8e9536cb	Doru Bercea	Add offloading kind.	Jul 5 2017, 1:29 PM
ea72ea394917	636003a08d46	6c9bcabc6870 4e9493a4164e	Doru Bercea	Merge branch 'patch7-1' into patch8	Jul 5 2017, 1:21 PM
4e9493a4164e	e4cd2f3a87fe	7682c067bfb8 c0ef8e9536cb	Doru Bercea	Merge branch 'patch5-2' into patch7-1 (Show More…)	Jul 5 2017, 1:19 PM
c0ef8e9536cb	cfcdba9f6cef	29b5af2ca767 597eb2dd6152	Doru Bercea	Merge branch 'patch5-1' into patch5-2	Jul 5 2017, 1:17 PM
597eb2dd6152	2cebfbb76064	266779d44de4	Doru Bercea	Add CUDA toolchain selection.	Jul 5 2017, 12:52 PM
266779d44de4	db3018554f3e	dc80f7eceaf0 e300395c3743	Doru Bercea	Merge branch 'unpatched-master' into patch5-1 (Show More…)	Jul 5 2017, 12:51 PM
7682c067bfb8	339346537c1f	4ed04335610d	Doru Bercea	Pass arch to CUDA toolchain.	Jul 5 2017, 8:59 AM
6c9bcabc6870	e8624ff4115e	e77f9b2766b0 4ed04335610d	Doru Bercea	Fix test.	Jun 30 2017, 5:02 PM
4ed04335610d	770cee491d0c	a58ddbcff056	Doru Bercea	Pass OpenMP target options.	Jun 30 2017, 4:52 PM
a58ddbcff056	91bfe5b2677a	dc3e8ad0f014	Doru Bercea	Pass OpenMP target options.	Jun 30 2017, 4:35 PM
dc3e8ad0f014	24092def2489	c7ddab5e4754	Doru Bercea	First attempt at passing target flags.	Jun 30 2017, 1:20 PM
e77f9b2766b0	663f38d28e90	08c7d81a49ea	Doru Bercea	Remove flag.	Jun 30 2017, 7:59 AM
08c7d81a49ea	4c16be2bbe10	771549e0b47f c7ddab5e4754	Doru Bercea	Merge branch 'patch7-1' into patch8 (Show More…)	Jun 30 2017, 7:56 AM
c7ddab5e4754	1525e11c51e9	726d51ecc2de	Doru Bercea	Revert flag changes.	Jun 30 2017, 7:53 AM
771549e0b47f	b0db4a1108dd	8cfc653809cc 726d51ecc2de	Doru Bercea	Merge branch 'patch7-1' into patch8	Jun 30 2017, 7:42 AM
726d51ecc2de	a26231e61c4f	05ecef6b2c46	Doru Bercea	Arch flag: with debug.	Jun 30 2017, 7:42 AM
8cfc653809cc	3367d96eb62a	cf8c68f8667a 05ecef6b2c46	Doru Bercea	Add CUBIN file. (Show More…)	Jun 29 2017, 12:00 PM
05ecef6b2c46	79617b60002e	122577fedcb8 29b5af2ca767	Doru Bercea	Add -fopenmp-target-arch flag.	Jun 29 2017, 11:58 AM
29b5af2ca767	281a97a62419	25a33948d27c dc80f7eceaf0	Doru Bercea	Add offloading kind. (Show More…)	Jun 29 2017, 11:54 AM
dc80f7eceaf0	c45bf39ac34e	ed5a3e34efc6 b7f382cb5d4e	Doru Bercea	CUDA toolchain selection. (Show More…)	Jun 29 2017, 11:53 AM
b7f382cb5d4e	10a4393c401f	1acc7a9260fc da1f3cf54166	Doru Bercea	D29645: Pass -fopenmp-is-device.	Jun 29 2017, 11:39 AM
1acc7a9260fc	483b7e00c9e8	dbbfa76fca6a 71607099bc1e	Doru Bercea	D29645: Pass -fopenmp-is-device. (Show More…)	Jun 29 2017, 8:55 AM
ed5a3e34efc6	e0d9b53b0959	afb661c95427 e157d3d2a7e0	Doru Bercea	CUDA toolchain selection. (Show More…)	Jun 29 2017, 9:36 AM
cf8c68f8667a	f67f6a8947ea	5a115e90c7a9	Doru Bercea	Add CUBIN file.	Jun 28 2017, 4:00 PM
5a115e90c7a9	fd98a6cac4c9	ca821a74d953 122577fedcb8	Doru Bercea	Add CUBIN file. (Show More…)	Jun 28 2017, 3:59 PM
122577fedcb8	9a9c8259d253	7593db543f6c	Doru Bercea	Add -fopenmp-target-arch flag.	Jun 28 2017, 3:58 PM
ca821a74d953	1808a8536904	830e2251c251 7593db543f6c	Doru Bercea	Add CUBIN file. (Show More…)	Jun 28 2017, 3:51 PM
7593db543f6c	004e702c4e32	1dbc7088ac7c	Doru Bercea	Add -fopenmp-target-arch flag.	Jun 28 2017, 3:50 PM
830e2251c251	4c06473c3e29	1d95bea2ee8f	Doru Bercea	Add CUBIN file.	Jun 28 2017, 3:12 PM
1d95bea2ee8f	4a1bee807228	8e9eb4c1297c 1dbc7088ac7c	Doru Bercea	Add CUBIN file.	Jun 28 2017, 3:04 PM
1dbc7088ac7c	7169ecbb8c1a	8440b03b6ad9	Doru Bercea	Add -fopenmp-target-arch flag.	Jun 28 2017, 2:48 PM
8440b03b6ad9	88469f5addc1	0a3729b45fe4	Doru Bercea	Add -fopenmp-target-arch flag.	Jun 28 2017, 2:41 PM
0a3729b45fe4	eec85021f7ae	0587deff7fa4 25a33948d27c	Doru Bercea	Add -fopenmp-target-arch flag.	Jun 28 2017, 2:09 PM
25a33948d27c	f09f3eeca683	afb661c95427	Doru Bercea	Add offloading kind.	Jun 28 2017, 1:43 PM
afb661c95427	4b7d07a22cd9	dbbfa76fca6a	Doru Bercea	CUDA toolchain selection.	Jun 28 2017, 1:16 PM
8e9eb4c1297c	eda76a793052	f8e265ec9396 b1f254b68cbf	Doru Bercea	Cubin file.	Jun 28 2017, 11:00 AM
b1f254b68cbf	635ea08e976a	c9b1a7fe5423 0587deff7fa4	Doru Bercea	CUDA toolchain selection. (Show More…)	Jun 28 2017, 10:56 AM
0587deff7fa4	69f373d46405	715ac9f35055 dbbfa76fca6a	Doru Bercea	Add offloading kind. (Show More…)	Jun 28 2017, 9:36 AM
dbbfa76fca6a	bd07bd27e35b	575efb1c7d80	Doru Bercea	Pass -fopenmp-is-device.	Jun 28 2017, 9:28 AM
f8e265ec9396	faa835d20bee	75589a16499d c9b1a7fe5423	Doru Bercea	Use CUBIN file. (Show More…)	Jun 28 2017, 8:56 AM
c9b1a7fe5423	4e70f84299b0	88688038ef8b 715ac9f35055	Doru Bercea	CUDA Toolcgain selection. (Show More…)	Jun 28 2017, 8:55 AM
88688038ef8b	a0e142a0038b	55de092444d6	Doru Bercea	CUDA toolchain selection.	Jun 28 2017, 8:55 AM
715ac9f35055	00cbb4e0071e	ec6753d5cb52	Doru Bercea	Add offloading kind.	Jun 28 2017, 8:27 AM
ec6753d5cb52	5c1648a6c95f	f95688bd2c75	Doru Bercea	Add offloading kind.	Jun 28 2017, 8:18 AM
75589a16499d	c00ac5d2b10a	06b5d27fa22c 55de092444d6	Doru Bercea	Use CUBIN file. (Show More…)	Jun 28 2017, 7:33 AM
55de092444d6	1b0e494b3f8b	5841ff685d53 2d35cd0fe576	Doru Bercea	CUDA toolchain selection. (Show More…)	Jun 28 2017, 7:32 AM
2d35cd0fe576	1588524fdf96	2268a748015c f95688bd2c75	Doru Bercea	Add LIBRARY_PATH. (Show More…)	Jun 28 2017, 7:31 AM
f95688bd2c75	05222a99abb2	9cb681ef0a4b	Doru Bercea	Add offloading kind.	Jun 28 2017, 7:30 AM
9cb681ef0a4b	4ea9128f6ffd	38dfe38ae4f0	Doru Bercea	Add oflloading kind.	Jun 27 2017, 3:11 PM
06b5d27fa22c	0d867f350b68	5af19735fd3d	Doru Bercea	OpenMP Offloading uses NVLINK and requires a cubin.	Jun 27 2017, 1:00 PM
5af19735fd3d	31ee2b36e332	ab3027866852 5841ff685d53	Doru Bercea	OpenMP Offloading uses NVLINK and requires a cubin. (Show More…)	Jun 27 2017, 10:20 AM
5841ff685d53	1d07c1608d71	fdc1ae6e7577 2268a748015c	Doru Bercea	CUDA tool chain selection. (Show More…)	Jun 27 2017, 10:19 AM
2268a748015c	c58f1728675e	67b372a64616 38dfe38ae4f0	Doru Bercea	Add test for checking if lib folder from LIBRARY_PATH is passed to loader. (Show More…)	Jun 27 2017, 10:18 AM
38dfe38ae4f0	e355ec539247	e8f0d54e6aeb 575efb1c7d80	Doru Bercea	Add oflloading kind. (Show More…)	Jun 27 2017, 10:17 AM
575efb1c7d80	24909c47cd60	a6a6a38d13b1 5104f1c899d7	Doru Bercea	Enable the passing of -fopenmp-is-device. (Show More…)	Jun 27 2017, 10:16 AM
5104f1c899d7	764bf54bfeea	8c687d60a787 a359de1e50ea	Doru Bercea	Pass -v to PTXAS. (Show More…)	Jun 27 2017, 10:15 AM
a359de1e50ea	b04b5bc63053	acd254c2ad9d 0f000a5b31bc	Doru Bercea	Make code relocatable by default by passing -c. (Show More…)	Jun 27 2017, 10:14 AM
0f000a5b31bc	632bf7dfd774	dc9b781c80fa faea3e56d3d2	Doru Bercea	Prevent exception handling code from being emitted for device offloading. (Show More…)	Jun 27 2017, 10:13 AM
faea3e56d3d2	4f25e072be16	01ac9a016c69 5a17e5c7708b	Doru Bercea	Add support for aux-triple flag. (Show More…)	Jun 27 2017, 10:12 AM
ab3027866852	33945235f4be	3d7684509ae0 fdc1ae6e7577	Doru Bercea	OpenMP Offloading uses NVLINK and requires a cubin. (Show More…)	Jun 13 2017, 3:16 PM
fdc1ae6e7577	72f57f6a8c7e	6ca0c5f2bdcf 67b372a64616	Doru Bercea	CUDA tool chain selection. (Show More…)	Jun 13 2017, 2:44 PM
67b372a64616	63fb9cc77a6f	c3a352bea7de e8f0d54e6aeb	Doru Bercea	Add test for checking if lib folder from LIBRARY_PATH is passed to loader. (Show More…)	Jun 13 2017, 2:16 PM
e8f0d54e6aeb	329302c37234	139ba1d04aa0 a6a6a38d13b1	Doru Bercea	Add oflloading kind. (Show More…)	Jun 13 2017, 2:14 PM
a6a6a38d13b1	4be2041e41c0	c49525003257 8c687d60a787	Doru Bercea	Enable the passing of -fopenmp-is-device. (Show More…)	Jun 13 2017, 2:03 PM
8c687d60a787	6f07351f15b8	43618c33d4cb acd254c2ad9d	Doru Bercea	Pass -v to PTXAS. (Show More…)	Jun 13 2017, 1:44 PM
acd254c2ad9d	204e5bf404fe	1775e0f9fc26	Doru Bercea	Make code relocatable by default by passing -c.	Mar 31 2017, 9:30 AM
dc9b781c80fa	6b7db38e81a3	1775e0f9fc26	Doru Bercea	Prevent exception handling code from being emitted for device offloading.	Mar 31 2017, 9:30 AM
1775e0f9fc26	b67079be8d40	01ac9a016c69	Doru Bercea	Prevent the implementation from emitting device exception handling code.	Jan 25 2017, 1:33 PM
01ac9a016c69	89572927ec8b	714941f0a8e5 d725462f1cbc	Doru Bercea	Add support for aux-triple flag. (Show More…)	Jun 13 2017, 11:05 AM
3d7684509ae0	060d57dd4c9b	5e6b9b71d2de 6ca0c5f2bdcf	Doru Bercea	OpenMP Offloading uses NVLINK and requires a cubin. (Show More…)	Jun 13 2017, 10:17 AM
6ca0c5f2bdcf	f7cc5bdc3214	3143cfae7051 c3a352bea7de	Doru Bercea	CUDA tool chain selection. (Show More…)	Jun 13 2017, 10:15 AM
c3a352bea7de	aaa7b3e8c513	4111a472e0b0 139ba1d04aa0	Doru Bercea	Add test for checking if lib folder from LIBRARY_PATH is passed to loader. (Show More…)	Jun 13 2017, 10:13 AM
139ba1d04aa0	d4d6c5e8283d	3f4c339e32d9	Doru Bercea	Add offloading kind argument.	Jun 13 2017, 10:06 AM
5e6b9b71d2de	1b98ecf07691	cf6ea5e0a780 3143cfae7051	Doru Bercea	OpenMP Offloading uses NVLINK and requires a cubin.	Jun 13 2017, 7:57 AM
3143cfae7051	250ac38d9e5e	4276620687ab 4111a472e0b0	Doru Bercea	CUDA tool chain selection. (Show More…)	Jun 13 2017, 7:43 AM
4111a472e0b0	52449054722d	23b7474a2b12 3f4c339e32d9	Doru Bercea	Add test for checking if lib folder from LIBRARY_PATH is passed to loader. (Show More…)	Jun 13 2017, 7:43 AM
3f4c339e32d9	e845e10e9737	3150c459c872 c49525003257	Doru Bercea	Add offloading kind argument. (Show More…)	Jun 13 2017, 7:42 AM
c49525003257	ed9818d9ef6d	12010fc04bc2 43618c33d4cb	Doru Bercea	Enable the passing of -fopenmp-is-device. (Show More…)	Jun 13 2017, 7:41 AM
43618c33d4cb	e78b3af30f49	9349307a5aa9 8a909e99f732	Doru Bercea	Pass -v to PTXAS. (Show More…)	Jun 13 2017, 7:39 AM
8a909e99f732	42d249051093	2e7ba67a3da0	Doru Bercea	Make code relocatable by default by passing -c.	Mar 31 2017, 9:26 AM
2e7ba67a3da0	8fe45ed1d756	0fca5b64d4ff	Doru Bercea	Make OpenMP generated code for the NVIDIA device relocatable by default	Mar 30 2017, 3:48 PM
0fca5b64d4ff	69b0549e410f	24ceb4cdd2fd	Doru Bercea	In OpenMP we need to generate relocatable code.	Mar 30 2017, 10:52 AM
24ceb4cdd2fd	eeaf3464026e	5c26c5e9c239	Doru Bercea	In OpenMP we need to generate relocatable code.	Feb 1 2017, 7:24 AM
5c26c5e9c239	f8e814473dbf	e987fb793243	Doru Bercea	In OpenMP we need to generate relocatable code.	Jan 25 2017, 1:38 PM
e987fb793243	b91d0f689217	e22ce221f71d 714941f0a8e5	Doru Bercea	Prevent exception handling code from being emitted for device offloading. (Show More…)	Jun 13 2017, 7:35 AM
714941f0a8e5	43c39277c448	9233b6321ad6 68584d4a736e	Doru Bercea	Add support for aux-triple flag.	Jun 13 2017, 7:34 AM
cf6ea5e0a780	cded09f30285	1aeaf2495652 4276620687ab	Doru Bercea	OpenMP Offloading uses NVLINK and requires a cubin. (Show More…)	Apr 13 2017, 10:55 AM
4276620687ab	47ed59305342	949e1564950d	Doru Bercea	CUDA tool chain selection.	Apr 13 2017, 10:54 AM
1aeaf2495652	f8e74070251a	f30a21152bf8	Doru Bercea	OpenMP offloading needs linking with NVLINK.	Apr 13 2017, 9:13 AM
f30a21152bf8	9ec4178113c7	21e871a6b5a5	Doru Bercea	OpenMP offloading needs linking with NVLINK.	Apr 11 2017, 2:35 PM
21e871a6b5a5	7e2643fccaad	99645e8a6bcb	Doru Bercea	Use replace extension util.	Apr 6 2017, 2:07 PM
99645e8a6bcb	00b63f73dbc6	088140848634	Doru Bercea	Split function.	Apr 6 2017, 9:21 AM
088140848634	61d4865e9bfb	5c8923200a85	Doru Bercea	Add tool different creation for CUDA and OpenMP.	Apr 6 2017, 9:10 AM
5c8923200a85	cc98176f099f	bf1b99bbe7d7 949e1564950d	Doru Bercea	Embed cubin in host file. (Show More…)	Apr 6 2017, 8:59 AM
949e1564950d	a582257db826	23b7474a2b12	Doru Bercea	CUDA tool chain selection.	Apr 6 2017, 8:58 AM
bf1b99bbe7d7	b665c7d3bd87	e3264b040d0a 23b7474a2b12	Doru Bercea	OpenMP uses nvlink to link cubin files. Embed result in in host binary using… (Show More…)	Mar 31 2017, 10:02 AM
23b7474a2b12	85246aa895ec	678bd452e0c7 3150c459c872	Doru Bercea	Add test for checking if lib folder from LIBRARY_PATH is passed to loader. (Show More…)	Mar 31 2017, 10:00 AM
3150c459c872	f2072a0ffb20	f14767fe1688 12010fc04bc2	Doru Bercea	Add offloading kind argument. (Show More…)	Mar 31 2017, 9:58 AM
12010fc04bc2	506c0d51d53a	08a255b76076 9349307a5aa9	Doru Bercea	Enable the passing of -fopenmp-is-device.	Mar 31 2017, 9:54 AM
9349307a5aa9	41d57d919d93	e655c6f23301	Doru Bercea	Pass -v to PTXAS.	Mar 31 2017, 9:49 AM
e655c6f23301	e76e379fa583	aed538f53c9e 4598dbf13d36	Doru Bercea	Merge branch 'patch3' into patch4	Mar 31 2017, 9:37 AM
4598dbf13d36	631cbb8b6f62	b8b801515ba4 e22ce221f71d	Doru Bercea	Merge branch 'patch2' into patch3	Mar 31 2017, 9:33 AM
e22ce221f71d	d784652821ba	547cb55666cc	Doru Bercea	Prevent exception handling code from being emitted for device offloading.	Mar 31 2017, 9:30 AM
b8b801515ba4	93d86099abc1	e6e425c4e45f	Doru Bercea	Make code relocatable by default by passing -c.	Mar 31 2017, 9:26 AM
e6e425c4e45f	613f5b3b6889	1059acc8f581 547cb55666cc	Doru Bercea	Merge branch 'patch2' into patch3	Mar 31 2017, 9:24 AM
547cb55666cc	fb9589ab311d	55460d95c93c	Doru Bercea	Prevent exception handling code from being emitted for device offloading.	Mar 31 2017, 9:15 AM
55460d95c93c	d19f3c84308f	526a965e6aa2	Doru Bercea	Improve regression test.	Mar 31 2017, 9:09 AM
526a965e6aa2	c280d987afe6	77b5bb642c0f 9233b6321ad6	Doru Bercea	Prevent the implementation from emitting device exception handling code. (Show More…)	Mar 31 2017, 8:00 AM
9233b6321ad6	a2bf56b2abff	41b26c558d77	Doru Bercea	Add support for aux-triple flag.	Mar 31 2017, 7:36 AM
08a255b76076	428c60f33da3	cba92af886d3 aed538f53c9e	Doru Bercea	Enable the passing of -fopenmp-is-device.	Mar 30 2017, 4:01 PM
aed538f53c9e	ade358fe69d0	c9f9ce942175 1059acc8f581	Doru Bercea	Pass -v to PTXAS if it was passed to the driver.	Mar 30 2017, 3:53 PM
1059acc8f581	6a5fdc674fe9	854dee468e0d	Doru Bercea	Make OpenMP generated code for the NVIDIA device relocatable by default	Mar 30 2017, 3:48 PM
cba92af886d3	5101ca9ddd43	b4c74b573df1 c9f9ce942175	Doru Bercea	Enable the passing of -fopenmp-is-device.	Mar 30 2017, 3:46 PM
c9f9ce942175	5f1573b292ae	a78ab514fdbe 854dee468e0d	Doru Bercea	Pass -v to PTXAS if it was passed to the driver.	Mar 30 2017, 3:40 PM
854dee468e0d	1c89d28f56e9	ad4cadef4306 77b5bb642c0f	Doru Bercea	Merge branch 'patch2' into patch3	Mar 30 2017, 3:31 PM
77b5bb642c0f	b3ebd9f8fe89	b156822f6087	Doru Bercea	Prevent the implementation from emitting device exception handling code.	Mar 30 2017, 11:45 AM
b156822f6087	c69ba2906825	25d4e6f2f0cf	Doru Bercea	Prevent the implementation from emitting device exception handling code.	Mar 30 2017, 11:36 AM
a78ab514fdbe	9b1c4a37acd8	2b0fc86ba688	Doru Bercea	Pass -v to PTXAS if it was passed to the driver.	Mar 30 2017, 11:15 AM
2b0fc86ba688	83f5e7026dc8	4c470d513fbf	Doru Bercea	Pass -v to PTXAS if it was passed to the driver.	Feb 1 2017, 7:40 AM
4c470d513fbf	616aea147990	ad4cadef4306	Doru Bercea	In OpenMP we need to generate relocatable code.	Jan 25 2017, 1:38 PM
ad4cadef4306	595e3f1c5a46	4439d9ac3555	Doru Bercea	In OpenMP we need to generate relocatable code.	Mar 30 2017, 10:52 AM
e3264b040d0a	967bcd848e69	27f6ab4a1922	Doru Bercea	OpenMP uses nvlink to link cubin files. Embed result in in host binary using… (Show More…)	Mar 27 2017, 4:00 PM
27f6ab4a1922	3d056633556d	db72cd3188f7 678bd452e0c7	Doru Bercea	Merge branch 'patch7-2' into patch8	Mar 27 2017, 3:41 PM
678bd452e0c7	f4fd6f33e636	ed9d9e338e57	Doru Bercea	Add test for checking if lib folder from LIBRARY_PATH is passed to loader.	Mar 27 2017, 3:32 PM
ed9d9e338e57	9431074045af	d73975008c7c f14767fe1688	Doru Bercea	Merge branch 'patch7-1' into patch7-2 (Show More…)	Mar 27 2017, 3:31 PM
f14767fe1688	58549a2f7164	039dd5597ca5	Doru Bercea	Add offloading kind argument.	Mar 27 2017, 2:58 PM
039dd5597ca5	a22bce16da15	f7eb186ece21	Doru Bercea	Add offloading kind argument.	Mar 27 2017, 2:31 PM
db72cd3188f7	d96af9b44442	b5c8cc68f54a	Doru Bercea	OpenMP uses nvlink to link cubin files. Embed result in in host binary using… (Show More…)	Mar 23 2017, 7:59 AM
b5c8cc68f54a	b5c3b6596f0a	d73975008c7c	Doru Bercea	OpenMP uses nvlink to link cubin files. Embed result in in host binary using… (Show More…)	Feb 1 2017, 3:53 PM
d73975008c7c	309b49e7327b	791117958c14	Doru Bercea	Add test for checking if lib folder from LIBRARY_PATH is passed to loader.	Feb 1 2017, 1:35 PM
791117958c14	d831d6c884a2	b4c74b573df1	Doru Bercea	Report an error for -faltivec on anything other than PowerPC. (Show More…)	Jan 25 2017, 2:03 PM
f7eb186ece21	2eb4864f9fab	b4c74b573df1	Doru Bercea	Report an error for -faltivec on anything other than PowerPC. (Show More…)	Jan 25 2017, 2:03 PM
b4c74b573df1	6baeb85df288	e4119d98fa05	Doru Bercea	Enable the passing of -fopenmp-is-device.	Feb 1 2017, 8:41 AM
e4119d98fa05	56bbffd4d4cf	ce210adf50a4	Doru Bercea	Pass -v to PTXAS if it was passed to the driver.	Jan 25 2017, 1:39 PM
ce210adf50a4	83f5e7026dc8	bacc43f3b67d	Doru Bercea	Pass -v to PTXAS if it was passed to the driver.	Feb 1 2017, 7:40 AM
bacc43f3b67d	236fe5b46f69	4439d9ac3555	Doru Bercea	In OpenMP we need to generate relocatable code.	Jan 25 2017, 1:38 PM
4439d9ac3555	e2cb5ed40ef0	8378300e84fe	Doru Bercea	In OpenMP we need to generate relocatable code.	Feb 1 2017, 7:24 AM
8378300e84fe	51ebb5d44699	25d4e6f2f0cf	Doru Bercea	In OpenMP we need to generate relocatable code.	Jan 25 2017, 1:38 PM
25d4e6f2f0cf	1888421f1497	81eb5270e3be	Doru Bercea	Prevent the implementation from emitting device exception handling code.	Feb 10 2017, 3:19 PM
81eb5270e3be	361e664769b9	a6f244cec239	Doru Bercea	Prevent the implementation from emitting device exception handling code.	Jan 31 2017, 12:05 PM
a6f244cec239	4a01eb229470	41b26c558d77	Doru Bercea	Prevent the implementation from emitting device exception handling code.	Jan 25 2017, 1:33 PM
41b26c558d77	c1442eb431d5	38aaf5ae0a19	Doru Bercea	Add support for aux-triple flag.	Feb 1 2017, 7:15 AM
38aaf5ae0a19	c42cee846753	0b45f6a058ad	Doru Bercea	Add support for aux-triple flag.	Jan 25 2017, 1:30 PM

Diff 116621

lib/Driver/ToolChains/Cuda.cpp

Show First 20 Lines • Show All 246 Lines • ▼ Show 20 Lines	void NVPTX::Assembler::ConstructJob(Compilation &C, const JobAction &JA,
// flag or the default value.		// flag or the default value.
if (JA.isDeviceOffloading(Action::OFK_OpenMP)) {		if (JA.isDeviceOffloading(Action::OFK_OpenMP)) {
GPUArchName = Args.getLastArgValue(options::OPT_march_EQ);		GPUArchName = Args.getLastArgValue(options::OPT_march_EQ);
assert(!GPUArchName.empty() && "Must have an architecture passed in.");		assert(!GPUArchName.empty() && "Must have an architecture passed in.");
} else		} else
GPUArchName = JA.getOffloadingArch();		GPUArchName = JA.getOffloadingArch();

// Obtain architecture from the action.		// Obtain architecture from the action.
CudaArch gpu_arch = StringToCudaArch(GPUArchName);		CudaArch gpu_arch = StringToCudaArch(GPUArchName);
assert(gpu_arch != CudaArch::UNKNOWN &&		assert(gpu_arch != CudaArch::UNKNOWN &&
"Device action expected to have an architecture.");		"Device action expected to have an architecture.");
		traUnsubmitted Not Done Reply Inline Actions The purpose of the original assert was to catch a programming error and this change negates that purpose. Perhaps I'm missing something. Could you elaborate on what's the motivation for this particular change? I don't understand why it would be OK to end up with an unknown GPU architecture if -nocudalib is specified. You still do want to pass some specific GPU arch to ptxas and that has nothing to do with whether you happen to have suitable libdevice. tra: The purpose of the original assert was to catch a programming error and this change negates…

// Check that our installation's ptxas supports gpu_arch.		// Check that our installation's ptxas supports gpu_arch.
if (!Args.hasArg(options::OPT_no_cuda_version_check)) {		if (!Args.hasArg(options::OPT_no_cuda_version_check)) {
TC.CudaInstallation.CheckCudaVersionSupportsArch(gpu_arch);		TC.CudaInstallation.CheckCudaVersionSupportsArch(gpu_arch);
}		}

ArgStringList CmdArgs;		ArgStringList CmdArgs;
CmdArgs.push_back(TC.getTriple().isArch64Bit() ? "-m64" : "-m32");		CmdArgs.push_back(TC.getTriple().isArch64Bit() ? "-m64" : "-m32");
▲ Show 20 Lines • Show All 221 Lines • ▼ Show 20 Lines	if (DeviceOffloadingKind == Action::OFK_Cuda) {

if (DriverArgs.hasFlag(options::OPT_fcuda_flush_denormals_to_zero,		if (DriverArgs.hasFlag(options::OPT_fcuda_flush_denormals_to_zero,
options::OPT_fno_cuda_flush_denormals_to_zero, false))		options::OPT_fno_cuda_flush_denormals_to_zero, false))
CC1Args.push_back("-fcuda-flush-denormals-to-zero");		CC1Args.push_back("-fcuda-flush-denormals-to-zero");

if (DriverArgs.hasFlag(options::OPT_fcuda_approx_transcendentals,		if (DriverArgs.hasFlag(options::OPT_fcuda_approx_transcendentals,
options::OPT_fno_cuda_approx_transcendentals, false))		options::OPT_fno_cuda_approx_transcendentals, false))
CC1Args.push_back("-fcuda-approx-transcendentals");		CC1Args.push_back("-fcuda-approx-transcendentals");
		}

if (DriverArgs.hasArg(options::OPT_nocudalib))		if (DriverArgs.hasArg(options::OPT_nocudalib))
return;		return;
}

std::string LibDeviceFile = CudaInstallation.getLibDeviceFile(GpuArch);		std::string LibDeviceFile = CudaInstallation.getLibDeviceFile(GpuArch);

if (LibDeviceFile.empty()) {		if (LibDeviceFile.empty()) {
getDriver().Diag(diag::err_drv_no_cuda_libdevice) << GpuArch;		getDriver().Diag(diag::err_drv_no_cuda_libdevice) << GpuArch;
return;		return;
}		}

▲ Show 20 Lines • Show All 160 Lines • Show Last 20 Lines

test/Driver/openmp-offload-gpu.c

	Show First 20 Lines • Show All 119 Lines • ▼ Show 20 Lines
	/// ###########################################################################			/// ###########################################################################

	/// PTXAS is passed -c flag by default when offloading to an NVIDIA device using OpenMP			/// PTXAS is passed -c flag by default when offloading to an NVIDIA device using OpenMP
	/// Check that the flag is passed when -fopenmp-relocatable-target is used.			/// Check that the flag is passed when -fopenmp-relocatable-target is used.
	// RUN: %clang -### -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda -fopenmp-relocatable-target -save-temps -no-canonical-prefixes %s 2>&1 \			// RUN: %clang -### -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda -fopenmp-relocatable-target -save-temps -no-canonical-prefixes %s 2>&1 \
	// RUN: \| FileCheck -check-prefix=CHK-PTXAS-RELO %s			// RUN: \| FileCheck -check-prefix=CHK-PTXAS-RELO %s

	// CHK-PTXAS-RELO: ptxas{{.*}}" "-c"			// CHK-PTXAS-RELO: ptxas{{.*}}" "-c"

				/// ###########################################################################

				/// Check that error is not thrown by toolchain when no cuda lib flag is used.
				/// Check that the flag is passed when -fopenmp-relocatable-target is used.
				// RUN: %clang -### -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda -Xopenmp-target -march=sm_60 \
				traUnsubmitted Not Done Reply Inline Actions Please split this RUN line further. tra: Please split this RUN line further.
				// RUN: -nocudalib -fopenmp-relocatable-target -save-temps -no-canonical-prefixes %s 2>&1 \
				// RUN: \| FileCheck -check-prefix=CHK-FLAG-NOLIBDEVICE %s

				// CHK-FLAG-NOLIBDEVICE-NOT: error:{{.*}}sm_60