This is an archive of the discontinued LLVM Phabricator instance.

Differential D141375

[SYCL][OpenMP] Fix compilation errors for unsupported __bf16 intrinsics
ClosedPublic

Authored by eandrews on Jan 10 2023, 4:56 AM.

Details

Reviewers

mikerice
jyu2
bader
jdoerfert
aaron.ballman

Commits

rGf81d529f8955: [Clang] Fix compilation errors for unsupported __bf16 intrinsics

Summary

This patch uses the existing deferred diagnostics framework to emit an error for the unsupported type __bf16 in device code. No error is emitted in host code.
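As a rough illustration of the deferred-diagnostics idea in the summary, here is a toy model, not Clang's implementation. The `DeferredDiags` type, its members, and the function names used as keys are all hypothetical. A diagnostic is recorded against the function it occurs in and is only reported if that function turns out to be reachable from a device entry point.

```cpp
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical toy model; Clang's real data structures differ.
struct DeferredDiags {
  // function name -> diagnostics recorded while analyzing that function
  std::map<std::string, std::vector<std::string>> pending;
  // caller -> callees (a very small call graph)
  std::map<std::string, std::vector<std::string>> calls;

  void defer(const std::string &fn, const std::string &msg) {
    pending[fn].push_back(msg);
  }

  void addCall(const std::string &caller, const std::string &callee) {
    calls[caller].push_back(callee);
  }

  // Collect the diagnostics of every function reachable from `entry`.
  std::vector<std::string> emitFrom(const std::string &entry) const {
    std::vector<std::string> out;
    std::set<std::string> seen;
    std::vector<std::string> work{entry};
    while (!work.empty()) {
      std::string fn = work.back();
      work.pop_back();
      if (!seen.insert(fn).second)
        continue; // already visited
      auto p = pending.find(fn);
      if (p != pending.end())
        out.insert(out.end(), p->second.begin(), p->second.end());
      auto c = calls.find(fn);
      if (c != calls.end())
        work.insert(work.end(), c->second.begin(), c->second.end());
    }
    return out;
  }
};
```

In this model, a diagnostic recorded for a host-only function is never reported when emission starts from a kernel, which loosely corresponds to "error is not emitted in host code".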

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

eandrews created this revision. Jan 10 2023, 4:56 AM

Herald added a project: Restricted Project. Jan 10 2023, 4:56 AM

Herald added subscribers: Naghasan, Anastasia, ebevhan and 2 others.

eandrews requested review of this revision. Jan 10 2023, 4:56 AM

Herald added a subscriber: sstefan1. Jan 10 2023, 4:56 AM

Harbormaster completed remote builds in B206760: Diff 487752. Jan 10 2023, 5:47 AM

LGTM.

I expect this to be a common issue for all single-source offloading programming models (i.e. CUDA and HIP in addition to SYCL and OpenMP offload). Probably we can generalize the code patterns used in this patch for all of them.

In addition to that, there are other built-in data types not supported by either the host or the device, which are handled in a similar way. Right?

This revision is now accepted and ready to land. Jan 10 2023, 1:27 PM

In D141375#4041360, @bader wrote:

LGTM.
I expect this to be a common issue for all single-source offloading programming models (i.e. CUDA and HIP in addition to SYCL and OpenMP offload). Probably we can generalize the code patterns used in this patch for all of them.

Looks like CUDA added support for the type - https://reviews.llvm.org/D136311, https://reviews.llvm.org/rG678d8946ba2ba790c4c52e96e2134ee136e30057.

In addition to that, there are other built-in data types not supported by either the host or the device, which are handled in a similar way. Right?

Yes. The code added here is similar to the code added for other unsupported types like __float128.

LGTM too.

Closed by commit rGf81d529f8955: [Clang] Fix compilation errors for unsupported __bf16 intrinsics (authored by eandrews). Jan 25 2023, 12:49 PM

This revision was automatically updated to reflect the committed changes.

eandrews added a commit: rGf81d529f8955: [Clang] Fix compilation errors for unsupported __bf16 intrinsics.

Herald added a project: Restricted Project. Jan 25 2023, 12:49 PM

Thanks for the reviews!

tra added a subscriber: tra. Sep 7 2023, 2:41 PM

tra added inline comments.

clang/lib/Sema/Sema.cpp
1978–1979	@eandrews Do you recall what was the reason for not issuing the diagnostic on the GPU side? It appears to do the opposite to what the patch description says. We're supposed to `emit error for unsupported type __bf16 in device code`, but instead we specifically ignore it during GPU-side compilation. What am I missing?

Herald added a subscriber: jplehr. Sep 7 2023, 2:41 PM

eandrews added inline comments. Sep 7 2023, 3:44 PM

clang/lib/Sema/Sema.cpp
1978–1979	I don't recall the specifics but I think CUDA had code handling __bf16 differently and this change broke a test with CUDA diagnostics and so I excluded it from the patch. I could try removing this check and seeing what breaks if you'd like.

tra added inline comments. Sep 7 2023, 3:59 PM

clang/lib/Sema/Sema.cpp
1978–1979	It may have been around the time when x86 started exposing bf16 type in the host headers, but NVPTX didn't have any support for the type yet. This change may have just papered over the problem. Oh, well. That would be just one of the places where we currently don't handle the 'unusual' types across the host/GPU boundary. I'm attempting to clean it up, and will take care of this instance there.
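To make the conditions under discussion easier to compare, here is a minimal sketch that restates the two `__bf16` checks from the diff as standalone predicates. `LangOpts` is a hypothetical stand-in struct, and `immediateBF16Error` and `deferredBF16Diag` are illustrative names that do not exist in Clang. The boolean logic is copied from the SemaType.cpp and Sema.cpp changes in this revision.

```cpp
// Hypothetical stand-in for the Clang language options named in the diff.
struct LangOpts {
  bool OpenMP = false;
  bool OpenMPIsDevice = false;
  bool SYCLIsDevice = false;
  bool CUDAIsDevice = false;
};

// Restates the SemaType.cpp condition: an immediate err_type_unsupported
// at the declaration, skipped for OpenMP and SYCL device compilation.
bool immediateBF16Error(bool targetHasBF16, const LangOpts &LO) {
  return !targetHasBF16 && !(LO.OpenMP && LO.OpenMPIsDevice) &&
         !LO.SYCLIsDevice;
}

// Restates the __bf16 clause added in Sema.cpp: the type is flagged for
// a deferred diagnostic unless this is CUDA device compilation.
bool deferredBF16Diag(bool targetHasBF16, const LangOpts &LO) {
  return !targetHasBF16 && !LO.CUDAIsDevice;
}
```

Under these predicates, a SYCL device target without `__bf16` support skips the immediate error and takes the deferred path, while the `!LangOpts.CUDAIsDevice` clause that tra asks about switches the deferred path off for CUDA device compilation.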

Revision Contents

Path                               Size
clang/lib/AST/ASTContext.cpp       5 lines
clang/lib/AST/ItaniumMangle.cpp    6 lines
clang/lib/Sema/Sema.cpp            2 lines
clang/lib/Sema/SemaType.cpp        7 lines
clang/test/SemaSYCL/bf16.cpp       22 lines

Diff 492235

clang/lib/AST/ASTContext.cpp

Context: case Type::Builtin:
     case BuiltinType::SatULongFract:
       Width = Target->getLongFractWidth();
       Align = Target->getLongFractAlign();
       break;
     case BuiltinType::BFloat16:
       if (Target->hasBFloat16Type()) {
         Width = Target->getBFloat16Width();
         Align = Target->getBFloat16Align();
+      } else if ((getLangOpts().SYCLIsDevice ||
+                  (getLangOpts().OpenMP && getLangOpts().OpenMPIsDevice)) &&
+                 AuxTarget->hasBFloat16Type()) {
+        Width = AuxTarget->getBFloat16Width();
+        Align = AuxTarget->getBFloat16Align();
       }
       break;
     case BuiltinType::Float16:
     case BuiltinType::Half:
       if (Target->hasFloat16Type() || !getLangOpts().OpenMP ||
           !getLangOpts().OpenMPIsDevice) {
         Width = Target->getHalfWidth();
         Align = Target->getHalfAlign();

clang/lib/AST/ItaniumMangle.cpp

Context: case BuiltinType::Float128: {
     const TargetInfo *TI = getASTContext().getLangOpts().OpenMP &&
                                    getASTContext().getLangOpts().OpenMPIsDevice
                                ? getASTContext().getAuxTargetInfo()
                                : &getASTContext().getTargetInfo();
     Out << TI->getFloat128Mangling();
     break;
   }
   case BuiltinType::BFloat16: {
-    const TargetInfo *TI = &getASTContext().getTargetInfo();
+    const TargetInfo *TI = ((getASTContext().getLangOpts().OpenMP &&
+                             getASTContext().getLangOpts().OpenMPIsDevice) ||
+                            getASTContext().getLangOpts().SYCLIsDevice)
+                               ? getASTContext().getAuxTargetInfo()
+                               : &getASTContext().getTargetInfo();
     Out << TI->getBFloat16Mangling();
     break;
   }
   case BuiltinType::Ibm128: {
     const TargetInfo *TI = &getASTContext().getTargetInfo();
     Out << TI->getIbm128Mangling();
     break;
   }

clang/lib/Sema/Sema.cpp

Context: if (Ty->isRealFloatingType() && Context.getTypeSize(Ty) == 128) {
         LongDoubleMismatched = true;
     }

     if ((Ty->isFloat16Type() && !Context.getTargetInfo().hasFloat16Type()) ||
         (Ty->isFloat128Type() && !Context.getTargetInfo().hasFloat128Type()) ||
         (Ty->isIbm128Type() && !Context.getTargetInfo().hasIbm128Type()) ||
         (Ty->isIntegerType() && Context.getTypeSize(Ty) == 128 &&
          !Context.getTargetInfo().hasInt128Type()) ||
+        (Ty->isBFloat16Type() && !Context.getTargetInfo().hasBFloat16Type() &&
+         !LangOpts.CUDAIsDevice) ||
         LongDoubleMismatched) {
       PartialDiagnostic PD = PDiag(diag::err_target_unsupported_type);
       if (D)
         PD << D;
       else
         PD << "expression";

       if (targetDiag(Loc, PD, FD)

clang/lib/Sema/SemaType.cpp

Context: case DeclSpec::TST_float16:
     if (!S.Context.getTargetInfo().hasFloat16Type() && !S.getLangOpts().CUDA &&
         !(S.getLangOpts().OpenMP && S.getLangOpts().OpenMPIsDevice))
       S.Diag(DS.getTypeSpecTypeLoc(), diag::err_type_unsupported)
         << "_Float16";
     Result = Context.Float16Ty;
     break;
   case DeclSpec::TST_half: Result = Context.HalfTy; break;
   case DeclSpec::TST_BFloat16:
-    if (!S.Context.getTargetInfo().hasBFloat16Type())
-      S.Diag(DS.getTypeSpecTypeLoc(), diag::err_type_unsupported)
-        << "__bf16";
+    if (!S.Context.getTargetInfo().hasBFloat16Type() &&
+        !(S.getLangOpts().OpenMP && S.getLangOpts().OpenMPIsDevice) &&
+        !S.getLangOpts().SYCLIsDevice)
+      S.Diag(DS.getTypeSpecTypeLoc(), diag::err_type_unsupported) << "__bf16";
     Result = Context.BFloat16Ty;
     break;
   case DeclSpec::TST_float: Result = Context.FloatTy; break;
   case DeclSpec::TST_double:
     if (DS.getTypeSpecWidth() == TypeSpecifierWidth::Long)
       Result = Context.LongDoubleTy;
     else
       Result = Context.DoubleTy;

clang/test/SemaSYCL/bf16.cpp

This file was added.

// RUN: %clang_cc1 -triple spir64 -aux-triple x86_64-unknown-linux-gnu -fsycl-is-device -verify -fsyntax-only %s

template <typename Name, typename Func>
__attribute__((sycl_kernel)) void kernel(Func kernelFunc) {
  kernelFunc(); // expected-note {{called by 'kernel}}
}

void host_ok(void) {
  __bf16 A;
}

int main()
{ host_ok();
  __bf16 var; // expected-note {{'var' defined here}}
  kernel<class variables>([=]() {
    (void)var; // expected-error {{'var' requires 16 bit size '__bf16' type support, but target 'spir64' does not support it}}
    int B = sizeof(__bf16);
  });

  return 0;
}