This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/
-
clang/
-
Basic/
-
CodeGenOptions.def
-
Driver/
-
Options.td
-
lib/
-
CodeGen/
-
CGStmt.cpp
-
CMakeLists.txt
-
CodeGenFunction.h
-
CodeGenFunction.cpp
5/5
MisExpect.h
12/15
MisExpect.cpp
-
Driver/ToolChains/
-
ToolChains/
-
Clang.cpp
-
Frontend/
-
CompilerInvocation.cpp
-
test/Profile/
-
Profile/
-
Inputs/
-
misexpect-branch-nonconst-expect-arg.proftext
-
misexpect-branch.proftext
-
misexpect-switch-default-only.proftext
-
misexpect-switch.proftext
-
misexpect-branch-cold.c
-
misexpect-branch-nonconst-expected-val.c
-
misexpect-branch.c
-
misexpect-no-warning-without-flag.c
-
misexpect-switch-default.c
-
misexpect-switch-nonconst.c
-
misexpect-switch-only-default-case.c
-
misexpect-switch.c

Differential D65300

[clang] [CodeGen] clang-misexpect prototype for compiler warnings
AbandonedPublic

Authored by paulkirth on Jul 25 2019, 2:38 PM.

Download Raw Diff

Details

Reviewers

phosek
leonardchan
jakehehrlich
mcgrathr
lebedev.ri
jfb
rsmith
chandlerc
vsk

Summary

Overview:
This patch contains a prototype of the basic functionality of clang-misexpect in the PGO pipeline. clang-misexpect is a proposed clang-tool that can report potentially incorrect usage of __builtin_expect() by comparing the developer's annotation against a collected PGO profile. A more detailed proposal and discussion appears on the CFE-dev mailing list (http://lists.llvm.org/pipermail/cfe-dev/2019-July/062971.html)

This patch adds the basic checking mechanisms to the compiler in the CodeGen library as a set of warnings usable when compiling with PGO. Once the logic in this patch is verified, it can be used to as the basis for a standalone tool.

This patch adds a new flag -fmisexpect to clang, and adds additional checks for branches and switch statements when branch weights are assigned in clang/lib/CodeGen/CodeGenFunction.cpp & clang/lib/CodeGen/CGStmt.cpp.

The bulk of the checking logic is implemented in clang/lib/CodeGen/MisExpect.cpp. It has some minor changes to some of the logic in CGStmt.cpp to properly map the profiling counters to their concrete values in the case statements for switches.

The patch also provides a number of lit based tests for various usage of __builtin_expect() for branches and switch statements.

Details:

The strategy for MisExpect checks is straightforward: when __builtin_expect() is used, then compare the relevant profile counter against a threshold value, and emit a warning if we've found a mismatch (i.e. the profile counter is outside the acceptable range).

For conditional statements this is simple. We can determine whether the profile count agrees with the annotation via direct comparison w/ the appropriate hot/cold thresholds.

For switch statements the situation is slightly more complex due to the way profiling counters are assigned.

Each case statement in the switch body will get its own counter, except when the cases are empty fall-throughs, in which case they will share a counter.

So, in the case where:

switch(x){
case 1:
case 2:
  break;
case default:
  break;
};

Here, values 1 & 2 will share a single counter, and all other cases will share a counter for the default case.

However, in the following example:

switch(x){
case 1:
 x = x+1;
case 2:
  break;
case default:
  break;
};

In the example above, values 1 & 2 will each have their own profiling counter instead of sharing one, even though case 1 falls through to case 2. The default case stays the same with a shared counter for every value not present in the switch body.

To be align with developer expectations we treat these empty fall-throughs conservatively for the purpose of generating warnings, i.e. if the expected value results in the hottest branch being executed we will not emit the warning, even though the strict semantics of __builtin_expect() are slightly different. We support this by maintaining a mapping from the concrete value in the switch arm back to its relevant profile counter.

In the case where the profile counters are not shared, we always do a direct check using the relevant profile counter.

Right now MisExpect only ensures that the hottest arm of the switch statement is the one the developer chosen, without reference to any particular threshold. It is arguable that we should have an additional check against a threshold, similar to what is done for conditional statements.

Heuristics:
For branch 'hotness' I have switched from using the existing thresholds for determining FFA_Hot/FFA_Cold function annotations to using the same branch probability that LLVM asigns when lowerin the __builtin_expect intrinsic (see LowerExpectIntrinsic.cpp for more details). The weight is roughly 2000:1, which is heavily skewed, and may be a bit too tight for practical use. This however is a reasonable default, and the threshold can be exposed through a compiler option. Feedback on a more appropriate set of threshold values is welcome and desirable.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

paulkirth created this revision.Jul 25 2019, 2:38 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 25 2019, 2:38 PM

Herald added subscribers: cfe-commits, kristof.beyls, javed.absar, mgorny. · View Herald Transcript

lebedev.ri added a reviewer: lebedev.ri.Jul 25 2019, 2:42 PM

xbolva00 added a subscriber: xbolva00.Jul 25 2019, 2:49 PM

Refactors some debug code to be more centralized and cleans up some comments.

Nice work!

It would be nice if we also have “-fsuggest-expect” so we could fix perf issues thanks to PGO counters even for non-PGO builds. What do you think?

In D65300#1601733, @xbolva00 wrote:

Nice work!

Glad to hear you like it.

It would be nice if we also have “-fsuggest-expect” so we could fix perf issues thanks to PGO counters even for non-PGO builds. What do you think?

Supporting suggestions is something we're planning to do in the future. Non-PGO builds are even the main motivation. I wanted to get this logic correct first, and make sure that we're handling all the odd edge cases before making new suggestions. I feel like our approach is really straightforward, but some things in clang are spread out in surprising ways or have interactions I've been surprised to find. That said, if finding problematic usage is done correctly, then reversing the logic to make a suggestions about useful annotations should be pretty easy.

I still need to give some thought to how to express the right balance between execution count and frequency when suggesting new annotations, so that the user won't be overwhelmed with suggestions. I'm not sure that branch weights by themselves are sufficient for that use-case, but we should be able to come up with something that behaves reasonably well without too much trouble.

xbolva00 added reviewers: jfb, rsmith.Jul 25 2019, 3:53 PM

Herald added a subscriber: dexonsmith. · View Herald TranscriptJul 25 2019, 3:53 PM

xbolva00 added a reviewer: chandlerc.Jul 25 2019, 3:58 PM

Add missing test for switch statements when the expected value is not a compile time constant.

Make sure that when the expected value cannot be evaluated, we do not issue any warnings or errors

Update diff to have proper context on Phabricator

phosek added inline comments.Jul 30 2019, 7:41 PM

clang/lib/CodeGen/MisExpect.cpp
30	It seems like `DebugPrintMisExpectSwitchInfo` and `EmitMisExpectWarning` is only used within this file, so I'd move them to anonymous namespace.
75	No space between `:` and `\t` here and below.
83	Are these thresholds defined anywhere as constants?
95	No need for `{` and `}`.
123	No need for { and }, in this case you can probably just use ternary expression.
134	No need for { and }.
clang/lib/CodeGen/MisExpect.h
11	`for validation` or `to validate`?
35	s/__builting_expect/__builtin_expect/
37	extra empty line
41	s/__builting_expect/__builtin_expect/

LGTM but I'd wait a day to see if anyone else has comments they'd like to add before landing.

This revision is now accepted and ready to land.Jul 30 2019, 7:47 PM

Thank you for working on this!

I'm guessing this doesn't have a -Werror= mode?

I still believe this should output a remark.
It will still be visible in the compiler console output,
but won't get buried there but will actually be recorded in the remarks file.

clang/lib/CodeGen/MisExpect.cpp
2	Wrong comment
37	llvm::None
41	`llvm::None`
51	llvm::None
55	return ExpectedVal; should just work?
141–147	This is rather undescriptive. Can you output some more useful info?
clang/lib/CodeGen/MisExpect.h
2	Wrong comment

This revision now requires changes to proceed.Jul 31 2019, 4:42 AM

Fix typo in comments

paulkirth marked 2 inline comments as done.Aug 2 2019, 11:30 AM

paulkirth added inline comments.

clang/lib/CodeGen/MisExpect.cpp
83	These thresholds come from PGOInstrumentation.cpp:1084 I will update with a reference to the code that this comes from, but, as noted in the TODO we need a better heuristic.
141–147	Do you have a suggestion about what feedback would be more useful? My initial thought with the somewhat generic message was to simply point out that this usage looked problematic, and let the developer investigate. I wasn't sure we wanted to expose details of the internal heuristic to the user by reporting the internal thresholds.

xbolva00 added inline comments.Aug 2 2019, 11:42 AM

clang/lib/CodeGen/MisExpect.cpp
141–147	Message is currently confusing a bit. I really miss clear info like “This compiler hint seems to be incorrect according to current PGO counters. Please check the hint if it is still valid and perf-profitable”.

Address feedback from review

Fixed comments in License headers
removed excess {}
Changed conditional assignment to ternary operation
Moved local functions into anonymous namespace
Added reference to the origin of the threshold values in the clang source code
Updated warning to have more useful text
Fixed whitespace
Updated unqualified use of llvm::None
Updated tests with new warning text

paulkirth marked 12 inline comments as done.Aug 2 2019, 7:21 PM

Update documentation and fix typos

paulkirth marked 2 inline comments as done.Aug 2 2019, 7:40 PM

Update threshold values to match those assigned when lowering __builtin_expect intrinsic.

I've modified the branch probability to match the probability assigned in LowerExpectIntrinsics.cpp

This is still debatable, but seems like a reasonable default. My next patch will make the probability threshold assignable through the command line, and extend the probability calculation into switch statements. This way users can determine how hot is hot enough, and otherwise get the same behavior that builtin expect will use anyway.

I also re applied a fix to a typo that I overwrote in an earlier patch.

In D65300#1608147, @lebedev.ri wrote:

Thank you for working on this!

I'm guessing this doesn't have a -Werror= mode?

I still believe this should output a remark.
It will still be visible in the compiler console output,
but won't get buried there but will actually be recorded in the remarks file.

“Clang supports -R flags for enabling remarks. These are diagnostic messages that provide information about the compilation process, but don’t suggest that a problem has been detected.”

I think this patch is okay.

Use existing LLVM code for mapping case literals to their case arms.

This update refactors a great deal of the implementation and test code.

Removes the CaseMap data structure completely. LLVM already maintains a mapping of the constants to their case arm, so there is no reason to duplicate that logic. This also minimizes the changes impacting existing Clang/LLVM components.
Cleans up debug printing. Without the CaseMap printing debug output can be simplified.
Minimizes the code in the end-to-end tests & test profiles.
Improves formatting, white space, and comments.

paulkirth added a child revision: D66324: clang-misexpect: Profile Guided Validation of Performance Annotations in LLVM.Aug 15 2019, 5:07 PM

lebedev.ri added a reviewer: vsk.Aug 15 2019, 11:24 PM

lebedev.ri mentioned this in D66324: clang-misexpect: Profile Guided Validation of Performance Annotations in LLVM.Aug 15 2019, 11:38 PM

Please can you clarify hat's the current layout of these patches?
Is this patch required, or is it superseded by D66324 (and thus should be abandoned)?
I'd like to begin reviewing, but i don't understand where to start.

In D65300#1647746, @lebedev.ri wrote:

Please can you clarify hat's the current layout of these patches?
Is this patch required, or is it superseded by D66324 (and thus should be abandoned)?
I'd like to begin reviewing, but i don't understand where to start.

Sorry for the confusion. D66324 should supersede this patch. It re-implements everything in the backend, and adds support for IR & Sample based profiles, so I think it is safe to ignore this patch for now.

Is there something I should do to mark/change the patches?

Also, thanks for the feedback.

In D65300#1647775, @paulkirth wrote:

In D65300#1647746, @lebedev.ri wrote:

Please can you clarify hat's the current layout of these patches?
Is this patch required, or is it superseded by D66324 (and thus should be abandoned)?
I'd like to begin reviewing, but i don't understand where to start.

Sorry for the confusion. D66324 should supersede this patch.

Okay, i'll be reviewing that one then.

It re-implements everything in the backend, and adds support for IR & Sample based profiles, so I think it is safe to ignore this patch for now.

Is there something I should do to mark/change the patches?

Please abandon this patch to make it obvious (Add Action -> Abandon)

Also, thanks for the feedback.

This is being abandoned in favor of D66324, which reimplements this logic completely in the LLVM backend.

phosek mentioned this in rL371484: clang-misexpect: Profile Guided Validation of Performance Annotations in LLVM.Sep 9 2019, 8:13 PM

phosek mentioned this in rGa10802fd73f9: clang-misexpect: Profile Guided Validation of Performance Annotations in LLVM.

phosek mentioned this in rL371584: clang-misexpect: Profile Guided Validation of Performance Annotations in LLVM.Sep 10 2019, 6:10 PM

phosek mentioned this in rG394a8ed8f1ad: clang-misexpect: Profile Guided Validation of Performance Annotations in LLVM.

phosek mentioned this in rL371635: Reland "clang-misexpect: Profile Guided Validation of Performance Annotations….Sep 11 2019, 9:22 AM

phosek mentioned this in rG7bdad0842941: Reland "clang-misexpect: Profile Guided Validation of Performance Annotations….

Revision Contents

Path

Size

clang/

include/

clang/

Basic/

CodeGenOptions.def

1 line

Driver/

Options.td

5 lines

lib/

CodeGen/

42 lines

1 line

2 lines

5 lines

62 lines

187 lines

Driver/

ToolChains/

Clang.cpp

5 lines

Frontend/

CompilerInvocation.cpp

1 line

test/

Profile/

Inputs/

misexpect-branch-nonconst-expect-arg.proftext

36 lines

misexpect-branch.proftext

36 lines

misexpect-switch-default-only.proftext

40 lines

misexpect-switch.proftext

45 lines

misexpect-branch-cold.c

51 lines

misexpect-branch-nonconst-expected-val.c

51 lines

misexpect-branch.c

50 lines

misexpect-no-warning-without-flag.c

51 lines

misexpect-switch-default.c

68 lines

misexpect-switch-nonconst.c

69 lines

misexpect-switch-only-default-case.c

62 lines

misexpect-switch.c

68 lines

Diff 213379

clang/include/clang/Basic/CodeGenOptions.def

	Show First 20 Lines • Show All 163 Lines • ▼ Show 20 Lines
	/// Choose profile instrumenation kind or no instrumentation.			/// Choose profile instrumenation kind or no instrumentation.
	ENUM_CODEGENOPT(ProfileInstr, ProfileInstrKind, 2, ProfileNone)			ENUM_CODEGENOPT(ProfileInstr, ProfileInstrKind, 2, ProfileNone)
	/// Choose profile kind for PGO use compilation.			/// Choose profile kind for PGO use compilation.
	ENUM_CODEGENOPT(ProfileUse, ProfileInstrKind, 2, ProfileNone)			ENUM_CODEGENOPT(ProfileUse, ProfileInstrKind, 2, ProfileNone)
	CODEGENOPT(CoverageMapping , 1, 0) ///< Generate coverage mapping regions to			CODEGENOPT(CoverageMapping , 1, 0) ///< Generate coverage mapping regions to
	///< enable code coverage analysis.			///< enable code coverage analysis.
	CODEGENOPT(DumpCoverageMapping , 1, 0) ///< Dump the generated coverage mapping			CODEGENOPT(DumpCoverageMapping , 1, 0) ///< Dump the generated coverage mapping
	///< regions.			///< regions.
				CODEGENOPT(MisExpect , 1, 0) ///< Validate __builtin_expect with PGO counters

	/// If -fpcc-struct-return or -freg-struct-return is specified.			/// If -fpcc-struct-return or -freg-struct-return is specified.
	ENUM_CODEGENOPT(StructReturnConvention, StructReturnConventionKind, 2, SRCK_Default)			ENUM_CODEGENOPT(StructReturnConvention, StructReturnConventionKind, 2, SRCK_Default)

	CODEGENOPT(RelaxAll , 1, 0) ///< Relax all machine code instructions.			CODEGENOPT(RelaxAll , 1, 0) ///< Relax all machine code instructions.
	CODEGENOPT(RelaxedAliasing , 1, 0) ///< Set when -fno-strict-aliasing is enabled.			CODEGENOPT(RelaxedAliasing , 1, 0) ///< Set when -fno-strict-aliasing is enabled.
	CODEGENOPT(StructPathTBAA , 1, 0) ///< Whether or not to use struct-path TBAA.			CODEGENOPT(StructPathTBAA , 1, 0) ///< Whether or not to use struct-path TBAA.
	CODEGENOPT(NewStructPathTBAA , 1, 0) ///< Whether or not to use enhanced struct-path TBAA.			CODEGENOPT(NewStructPathTBAA , 1, 0) ///< Whether or not to use enhanced struct-path TBAA.
	▲ Show 20 Lines • Show All 190 Lines • Show Last 20 Lines

clang/include/clang/Driver/Options.td

	Show First 20 Lines • Show All 710 Lines • ▼ Show 20 Lines
	def fno_auto_profile : Flag<["-"], "fno-auto-profile">, Group<f_Group>,			def fno_auto_profile : Flag<["-"], "fno-auto-profile">, Group<f_Group>,
	Alias<fno_profile_sample_use>;			Alias<fno_profile_sample_use>;
	def fauto_profile_EQ : Joined<["-"], "fauto-profile=">,			def fauto_profile_EQ : Joined<["-"], "fauto-profile=">,
	Alias<fprofile_sample_use_EQ>;			Alias<fprofile_sample_use_EQ>;
	def fauto_profile_accurate : Flag<["-"], "fauto-profile-accurate">,			def fauto_profile_accurate : Flag<["-"], "fauto-profile-accurate">,
	Group<f_Group>, Alias<fprofile_sample_accurate>;			Group<f_Group>, Alias<fprofile_sample_accurate>;
	def fno_auto_profile_accurate : Flag<["-"], "fno-auto-profile-accurate">,			def fno_auto_profile_accurate : Flag<["-"], "fno-auto-profile-accurate">,
	Group<f_Group>, Alias<fno_profile_sample_accurate>;			Group<f_Group>, Alias<fno_profile_sample_accurate>;
				def fmisexpect : Flag<["-"], "fmisexpect">,
				Group<f_Group>, Flags<[CC1Option]>,
				HelpText<"Validate use of __builtin_expect with instrumentation data">;
				def fno_misexpect : Flag<["-"], "fno-misexpect">,
				Group<f_Group>, Flags<[CC1Option]>;
	def fdebug_compilation_dir : Separate<["-"], "fdebug-compilation-dir">,			def fdebug_compilation_dir : Separate<["-"], "fdebug-compilation-dir">,
	Group<f_Group>, Flags<[CC1Option, CC1AsOption, CoreOption]>,			Group<f_Group>, Flags<[CC1Option, CC1AsOption, CoreOption]>,
	HelpText<"The compilation directory to embed in the debug info.">;			HelpText<"The compilation directory to embed in the debug info.">;
	def fdebug_info_for_profiling : Flag<["-"], "fdebug-info-for-profiling">,			def fdebug_info_for_profiling : Flag<["-"], "fdebug-info-for-profiling">,
	Group<f_Group>, Flags<[CC1Option]>,			Group<f_Group>, Flags<[CC1Option]>,
	HelpText<"Emit extra debug info to make sample profile more accurate.">;			HelpText<"Emit extra debug info to make sample profile more accurate.">;
	def fno_debug_info_for_profiling : Flag<["-"], "fno-debug-info-for-profiling">,			def fno_debug_info_for_profiling : Flag<["-"], "fno-debug-info-for-profiling">,
	Group<f_Group>, Flags<[DriverOption]>,			Group<f_Group>, Flags<[DriverOption]>,
	▲ Show 20 Lines • Show All 2,503 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGStmt.cpp

//===--- CGStmt.cpp - Emit LLVM Code from Statements ----------------------===//		//===--- CGStmt.cpp - Emit LLVM Code from Statements ----------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This contains code to emit Stmt nodes as LLVM code.		// This contains code to emit Stmt nodes as LLVM code.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "CodeGenFunction.h"		#include "CodeGenFunction.h"
#include "CGDebugInfo.h"		#include "CGDebugInfo.h"
#include "CodeGenModule.h"		#include "CodeGenModule.h"
#include "TargetInfo.h"		#include "TargetInfo.h"
		#include "MisExpect.h"
#include "clang/AST/StmtVisitor.h"		#include "clang/AST/StmtVisitor.h"
#include "clang/Basic/Builtins.h"		#include "clang/Basic/Builtins.h"
#include "clang/Basic/PrettyStackTrace.h"		#include "clang/Basic/PrettyStackTrace.h"
#include "clang/Basic/TargetInfo.h"		#include "clang/Basic/TargetInfo.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/InlineAsm.h"		#include "llvm/IR/InlineAsm.h"
#include "llvm/IR/Intrinsics.h"		#include "llvm/IR/Intrinsics.h"
▲ Show 20 Lines • Show All 1,254 Lines • ▼ Show 20 Lines	if (isObviouslyBranchWithoutCleanups(Block)) {
Builder.ClearInsertionPoint();		Builder.ClearInsertionPoint();
}		}
return;		return;
}		}
}		}

llvm::BasicBlock *CaseDest = createBasicBlock("sw.bb");		llvm::BasicBlock *CaseDest = createBasicBlock("sw.bb");
EmitBlockWithFallThrough(CaseDest, &S);		EmitBlockWithFallThrough(CaseDest, &S);
if (SwitchWeights)		size_t MapIndex = 0;
		if (SwitchWeights) {
SwitchWeights->push_back(getProfileCount(&S));		SwitchWeights->push_back(getProfileCount(&S));
		if (CaseMap) {
		MapIndex = SwitchWeights->size() - 1;
		llvm::ConstantInt *CaseVal =
		Builder.getInt(S.getLHS()->EvaluateKnownConstInt(getContext()));
		(*CaseMap)[CaseVal->getSExtValue()] = MapIndex;
		}
		}
SwitchInsn->addCase(CaseVal, CaseDest);		SwitchInsn->addCase(CaseVal, CaseDest);

// Recursively emitting the statement is acceptable, but is not wonderful for		// Recursively emitting the statement is acceptable, but is not wonderful for
// code where we have many case statements nested together, i.e.:		// code where we have many case statements nested together, i.e.:
// case 1:		// case 1:
// case 2:		// case 2:
// case 3: etc.		// case 3: etc.
// Handling this recursively will create a new block for each case statement		// Handling this recursively will create a new block for each case statement
// that falls through to the next case which is IR intensive. It also causes		// that falls through to the next case which is IR intensive. It also causes
// deep recursion which can run into stack depth limitations. Handle		// deep recursion which can run into stack depth limitations. Handle
// sequential non-range case statements specially.		// sequential non-range case statements specially.
const CaseStmt *CurCase = &S;		const CaseStmt *CurCase = &S;
const CaseStmt *NextCase = dyn_cast<CaseStmt>(S.getSubStmt());		const CaseStmt *NextCase = dyn_cast<CaseStmt>(S.getSubStmt());

// Otherwise, iteratively add consecutive cases to this switch stmt.		// Otherwise, iteratively add consecutive cases to this switch stmt.
while (NextCase && NextCase->getRHS() == nullptr) {		while (NextCase && NextCase->getRHS() == nullptr) {
CurCase = NextCase;		CurCase = NextCase;
llvm::ConstantInt *CaseVal =		llvm::ConstantInt *CaseVal =
Builder.getInt(CurCase->getLHS()->EvaluateKnownConstInt(getContext()));		Builder.getInt(CurCase->getLHS()->EvaluateKnownConstInt(getContext()));

if (SwitchWeights)		if (SwitchWeights) {
SwitchWeights->push_back(getProfileCount(NextCase));		SwitchWeights->push_back(getProfileCount(NextCase));
		if (CaseMap) {
		// only the original statement gets a counter, so map back to it's index
		// in SwitchWeights
		(*CaseMap)[CaseVal->getSExtValue()] = MapIndex;
		}
		}

if (CGM.getCodeGenOpts().hasProfileClangInstr()) {		if (CGM.getCodeGenOpts().hasProfileClangInstr()) {
CaseDest = createBasicBlock("sw.bb");		CaseDest = createBasicBlock("sw.bb");
EmitBlockWithFallThrough(CaseDest, &S);		EmitBlockWithFallThrough(CaseDest, &S);
}		}

SwitchInsn->addCase(CaseVal, CaseDest);		SwitchInsn->addCase(CaseVal, CaseDest);
NextCase = dyn_cast<CaseStmt>(CurCase->getSubStmt());		NextCase = dyn_cast<CaseStmt>(CurCase->getSubStmt());
}		}
▲ Show 20 Lines • Show All 251 Lines • ▼ Show 20 Lines	return CollectStatementsForCase(S.getBody(), Case, FoundCase,
ResultStmts) != CSFC_Failure &&		ResultStmts) != CSFC_Failure &&
FoundCase;		FoundCase;
}		}

void CodeGenFunction::EmitSwitchStmt(const SwitchStmt &S) {		void CodeGenFunction::EmitSwitchStmt(const SwitchStmt &S) {
// Handle nested switch statements.		// Handle nested switch statements.
llvm::SwitchInst *SavedSwitchInsn = SwitchInsn;		llvm::SwitchInst *SavedSwitchInsn = SwitchInsn;
SmallVector<uint64_t, 16> *SavedSwitchWeights = SwitchWeights;		SmallVector<uint64_t, 16> *SavedSwitchWeights = SwitchWeights;
		llvm::DenseMap<int64_t, size_t> *SavedCaseMap = CaseMap;
llvm::BasicBlock *SavedCRBlock = CaseRangeBlock;		llvm::BasicBlock *SavedCRBlock = CaseRangeBlock;

// See if we can constant fold the condition of the switch and therefore only		// See if we can constant fold the condition of the switch and therefore only
// emit the live case statement (if any) of the switch.		// emit the live case statement (if any) of the switch.
llvm::APSInt ConstantCondValue;		llvm::APSInt ConstantCondValue;
if (ConstantFoldsToSimpleInteger(S.getCond(), ConstantCondValue)) {		if (ConstantFoldsToSimpleInteger(S.getCond(), ConstantCondValue)) {
SmallVector<const Stmt*, 4> CaseStmts;		SmallVector<const Stmt*, 4> CaseStmts;
const SwitchCase *Case = nullptr;		const SwitchCase *Case = nullptr;
▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	for (const SwitchCase *Case = S.getSwitchCaseList();
Case;		Case;
Case = Case->getNextSwitchCase()) {		Case = Case->getNextSwitchCase()) {
if (isa<DefaultStmt>(Case))		if (isa<DefaultStmt>(Case))
DefaultCount = getProfileCount(Case);		DefaultCount = getProfileCount(Case);
NumCases += 1;		NumCases += 1;
}		}
SwitchWeights = new SmallVector<uint64_t, 16>();		SwitchWeights = new SmallVector<uint64_t, 16>();
SwitchWeights->reserve(NumCases);		SwitchWeights->reserve(NumCases);
		CaseMap = new llvm::DenseMap<int64_t, size_t>();
		CaseMap->reserve(NumCases);
// The default needs to be first. We store the edge count, so we already		// The default needs to be first. We store the edge count, so we already
// know the right weight.		// know the right weight.
SwitchWeights->push_back(DefaultCount);		SwitchWeights->push_back(DefaultCount);
}		}
CaseRangeBlock = DefaultBlock;		CaseRangeBlock = DefaultBlock;

// Clear the insertion point to indicate we are in unreachable code.		// Clear the insertion point to indicate we are in unreachable code.
Builder.ClearInsertionPoint();		Builder.ClearInsertionPoint();
Show All 36 Lines	void CodeGenFunction::EmitSwitchStmt(const SwitchStmt &S) {
incrementProfileCounter(&S);		incrementProfileCounter(&S);

// If the switch has a condition wrapped by __builtin_unpredictable,		// If the switch has a condition wrapped by __builtin_unpredictable,
// create metadata that specifies that the switch is unpredictable.		// create metadata that specifies that the switch is unpredictable.
// Don't bother if not optimizing because that metadata would not be used.		// Don't bother if not optimizing because that metadata would not be used.
auto *Call = dyn_cast<CallExpr>(S.getCond());		auto *Call = dyn_cast<CallExpr>(S.getCond());
if (Call && CGM.getCodeGenOpts().OptimizationLevel != 0) {		if (Call && CGM.getCodeGenOpts().OptimizationLevel != 0) {
auto *FD = dyn_cast_or_null<FunctionDecl>(Call->getCalleeDecl());		auto *FD = dyn_cast_or_null<FunctionDecl>(Call->getCalleeDecl());
if (FD && FD->getBuiltinID() == Builtin::BI__builtin_unpredictable) {		if (FD) {
		if (FD->getBuiltinID() == Builtin::BI__builtin_unpredictable) {
llvm::MDBuilder MDHelper(getLLVMContext());		llvm::MDBuilder MDHelper(getLLVMContext());
SwitchInsn->setMetadata(llvm::LLVMContext::MD_unpredictable,		SwitchInsn->setMetadata(llvm::LLVMContext::MD_unpredictable,
MDHelper.createUnpredictable());		MDHelper.createUnpredictable());
		} else if (CGM.getCodeGenOpts().MisExpect &&
		FD->getBuiltinID() == Builtin::BI__builtin_expect) {
		MisExpect::CheckMisExpectSwitch(Call, SwitchWeights, CaseMap, CGM);
		}
}		}
}		}

if (SwitchWeights) {		if (SwitchWeights) {
assert(SwitchWeights->size() == 1 + SwitchInsn->getNumCases() &&		assert(SwitchWeights->size() == 1 + SwitchInsn->getNumCases() &&
"switch weights do not match switch cases");		"switch weights do not match switch cases");
// If there's only one jump destination there's no sense weighting it.		// If there's only one jump destination there's no sense weighting it.
if (SwitchWeights->size() > 1)		if (SwitchWeights->size() > 1)
SwitchInsn->setMetadata(llvm::LLVMContext::MD_prof,		SwitchInsn->setMetadata(llvm::LLVMContext::MD_prof,
createProfileWeights(*SwitchWeights));		createProfileWeights(*SwitchWeights));
delete SwitchWeights;		delete SwitchWeights;
}		}

		if (CaseMap) {
		delete CaseMap;
		}

SwitchInsn = SavedSwitchInsn;		SwitchInsn = SavedSwitchInsn;
SwitchWeights = SavedSwitchWeights;		SwitchWeights = SavedSwitchWeights;
		CaseMap = SavedCaseMap;
CaseRangeBlock = SavedCRBlock;		CaseRangeBlock = SavedCRBlock;
}		}

static std::string		static std::string
SimplifyConstraint(const char *Constraint, const TargetInfo &Target,		SimplifyConstraint(const char *Constraint, const TargetInfo &Target,
SmallVectorImpl<TargetInfo::ConstraintInfo> *OutCons=nullptr) {		SmallVectorImpl<TargetInfo::ConstraintInfo> *OutCons=nullptr) {
std::string Result;		std::string Result;

▲ Show 20 Lines • Show All 689 Lines • Show Last 20 Lines

clang/lib/CodeGen/CMakeLists.txt

Show First 20 Lines • Show All 81 Lines • ▼ Show 20 Lines	add_clang_library(clangCodeGen
CodeGenPGO.cpp		CodeGenPGO.cpp
CodeGenTBAA.cpp		CodeGenTBAA.cpp
CodeGenTypes.cpp		CodeGenTypes.cpp
ConstantInitBuilder.cpp		ConstantInitBuilder.cpp
CoverageMappingGen.cpp		CoverageMappingGen.cpp
ItaniumCXXABI.cpp		ItaniumCXXABI.cpp
MacroPPCallbacks.cpp		MacroPPCallbacks.cpp
MicrosoftCXXABI.cpp		MicrosoftCXXABI.cpp
		MisExpect.cpp
ModuleBuilder.cpp		ModuleBuilder.cpp
ObjectFilePCHContainerOperations.cpp		ObjectFilePCHContainerOperations.cpp
PatternInit.cpp		PatternInit.cpp
SanitizerMetadata.cpp		SanitizerMetadata.cpp
SwiftCallingConv.cpp		SwiftCallingConv.cpp
TargetInfo.cpp		TargetInfo.cpp
VarBypassDetector.cpp		VarBypassDetector.cpp

Show All 12 Lines

clang/lib/CodeGen/CodeGenFunction.h

	Show First 20 Lines • Show All 1,364 Lines • ▼ Show 20 Lines

	private:			private:

	/// SwitchInsn - This is nearest current switch instruction. It is null if			/// SwitchInsn - This is nearest current switch instruction. It is null if
	/// current context is not in a switch.			/// current context is not in a switch.
	llvm::SwitchInst *SwitchInsn = nullptr;			llvm::SwitchInst *SwitchInsn = nullptr;
	/// The branch weights of SwitchInsn when doing instrumentation based PGO.			/// The branch weights of SwitchInsn when doing instrumentation based PGO.
	SmallVector<uint64_t, 16> *SwitchWeights = nullptr;			SmallVector<uint64_t, 16> *SwitchWeights = nullptr;
				llvm::DenseMap<int64_t, size_t> *CaseMap = nullptr;


	/// CaseRangeBlock - This block holds if condition check for last case			/// CaseRangeBlock - This block holds if condition check for last case
	/// statement range in current switch instruction.			/// statement range in current switch instruction.
	llvm::BasicBlock *CaseRangeBlock = nullptr;			llvm::BasicBlock *CaseRangeBlock = nullptr;

	/// OpaqueLValues - Keeps track of the current set of opaque value			/// OpaqueLValues - Keeps track of the current set of opaque value
	/// expressions.			/// expressions.
	llvm::DenseMap<const OpaqueValueExpr *, LValue> OpaqueLValues;			llvm::DenseMap<const OpaqueValueExpr *, LValue> OpaqueLValues;
	▲ Show 20 Lines • Show All 3,012 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenFunction.cpp

Show All 14 Lines
#include "CGCleanup.h"		#include "CGCleanup.h"
#include "CGCUDARuntime.h"		#include "CGCUDARuntime.h"
#include "CGCXXABI.h"		#include "CGCXXABI.h"
#include "CGDebugInfo.h"		#include "CGDebugInfo.h"
#include "CGOpenMPRuntime.h"		#include "CGOpenMPRuntime.h"
#include "CodeGenModule.h"		#include "CodeGenModule.h"
#include "CodeGenPGO.h"		#include "CodeGenPGO.h"
#include "TargetInfo.h"		#include "TargetInfo.h"
		#include "MisExpect.h"
#include "clang/AST/ASTContext.h"		#include "clang/AST/ASTContext.h"
#include "clang/AST/ASTLambda.h"		#include "clang/AST/ASTLambda.h"
#include "clang/AST/Decl.h"		#include "clang/AST/Decl.h"
#include "clang/AST/DeclCXX.h"		#include "clang/AST/DeclCXX.h"
#include "clang/AST/StmtCXX.h"		#include "clang/AST/StmtCXX.h"
#include "clang/AST/StmtObjC.h"		#include "clang/AST/StmtObjC.h"
#include "clang/Basic/Builtins.h"		#include "clang/Basic/Builtins.h"
#include "clang/Basic/CodeGenOptions.h"		#include "clang/Basic/CodeGenOptions.h"
▲ Show 20 Lines • Show All 1,310 Lines • ▼ Show 20 Lines	bool CodeGenFunction::ConstantFoldsToSimpleInteger(const Expr *Cond,
llvm::APSInt Int = Result.Val.getInt();		llvm::APSInt Int = Result.Val.getInt();
if (!AllowLabels && CodeGenFunction::ContainsLabel(Cond))		if (!AllowLabels && CodeGenFunction::ContainsLabel(Cond))
return false; // Contains a label.		return false; // Contains a label.

ResultInt = Int;		ResultInt = Int;
return true;		return true;
}		}



/// EmitBranchOnBoolExpr - Emit a branch on a boolean condition (e.g. for an if		/// EmitBranchOnBoolExpr - Emit a branch on a boolean condition (e.g. for an if
/// statement) to the specified blocks. Based on the condition, this might try		/// statement) to the specified blocks. Based on the condition, this might try
/// to simplify the codegen of the conditional based on the branch.		/// to simplify the codegen of the conditional based on the branch.
///		///
void CodeGenFunction::EmitBranchOnBoolExpr(const Expr *Cond,		void CodeGenFunction::EmitBranchOnBoolExpr(const Expr *Cond,
llvm::BasicBlock *TrueBlock,		llvm::BasicBlock *TrueBlock,
llvm::BasicBlock *FalseBlock,		llvm::BasicBlock *FalseBlock,
uint64_t TrueCount) {		uint64_t TrueCount) {
Cond = Cond->IgnoreParens();		Cond = Cond->IgnoreParens();

		MisExpect::CheckMisExpectBranch(Cond, TrueCount, getCurrentProfileCount(), CGM);

if (const BinaryOperator *CondBOp = dyn_cast<BinaryOperator>(Cond)) {		if (const BinaryOperator *CondBOp = dyn_cast<BinaryOperator>(Cond)) {

// Handle X && Y in a condition.		// Handle X && Y in a condition.
if (CondBOp->getOpcode() == BO_LAnd) {		if (CondBOp->getOpcode() == BO_LAnd) {
// If we have "1 && X", simplify the code. "0 && X" would have constant		// If we have "1 && X", simplify the code. "0 && X" would have constant
// folded if the case was simple enough.		// folded if the case was simple enough.
bool ConstantBool = false;		bool ConstantBool = false;
if (ConstantFoldsToSimpleInteger(CondBOp->getLHS(), ConstantBool) &&		if (ConstantFoldsToSimpleInteger(CondBOp->getLHS(), ConstantBool) &&
▲ Show 20 Lines • Show All 1,028 Lines • Show Last 20 Lines

clang/lib/CodeGen/MisExpect.h

This file was added.

				//===--- MisExpect.h - Check Use of __builtin_expect() with PGO data ------===//
				//
				lebedev.riUnsubmitted Done Reply Inline Actions Wrong comment lebedev.ri: Wrong comment
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This contains code to emit warnings for potentially incorrect usage of
				// __builtin_expect(). It uses PGO profiles for validation.
				//
				phosekUnsubmitted Done Reply Inline Actions `for validation` or `to validate`? phosek: `for validation` or `to validate`?
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_LIB_CODEGEN_MISEXPECT_H
				#define LLVM_CLANG_LIB_CODEGEN_MISEXPECT_H

				#include "CodeGenModule.h"
				#include "clang/AST/ASTContext.h"
				#include "clang/AST/Expr.h"
				#include "clang/Basic/LLVM.h"
				#include "llvm/ADT/Optional.h"

				namespace clang {
				namespace CodeGen {
				namespace MisExpect {

				/// getExpectedValue - Returns the value that __builtin_expect() is expecting.
				/// If the second parameter cannot be evaluated at compile-time, returns an
				/// empty Optional. Returns None when Call is not __builtin_expect()
				/// \param Call the call expression to __builtin_expect()
				/// \param Context the current ASTContext
				Optional<int64_t> getExpectedValue(const clang::CallExpr *Call,
				ASTContext &Context);

				/// CheckMisExpectBranch - check if a branch is annotated with
				phosekUnsubmitted Done Reply Inline Actions s/__builting_expect/__builtin_expect/ phosek: s/__builting_expect/__builtin_expect/
				/// __builtin_expect and when using profiling data, verify that the profile
				/// agrees with the use of the annotation
				phosekUnsubmitted Done Reply Inline Actions extra empty line phosek: extra empty line
				/// \param Cond the conditional expression being checked
				/// \param TrueCount the profile counter for this block
				/// \param CurrProfCount the current total profile count
				/// \param CGM a reference to the current CodeGenModule
				phosekUnsubmitted Done Reply Inline Actions s/__builting_expect/__builtin_expect/ phosek: s/__builting_expect/__builtin_expect/
				void CheckMisExpectBranch(const Expr *Cond, uint64_t TrueCount,
				uint64_t CurrProfCount, CodeGenModule &CGM);

				/// CheckMisExpect - check if a branch is annotated with __builtin_expect and
				/// when using profiling data, verify that the profile agrees with the use of
				/// the annotation
				/// \param Call the call expression to __builtin_expect()
				/// \param SwitchWeights pointer to a vector of profile counts for each case arm
				/// \param CaseMap a table mapping the constant value of a case target to its
				/// index in the SwitchWeights vector
				/// \param CGM a reference to the current CodeGenModule
				void CheckMisExpectSwitch(const CallExpr *Call,
				llvm::SmallVector<uint64_t, 16> *SwitchWeights,
				llvm::DenseMap<int64_t, size_t> *CaseMap,
				CodeGenModule &CGM);

				} // namespace MisExpect
				} // namespace CodeGen
				} // namespace clang

				#endif // LLVM_CLANG_LIB_CODEGEN_MISEXPECT_H

clang/lib/CodeGen/MisExpect.cpp

This file was added.

				//===--- MisExpect.cpp - Check Use of __builtin_expect() with PGO data ----===//
				//
				lebedev.riUnsubmitted Done Reply Inline Actions Wrong comment lebedev.ri: Wrong comment
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This contains code to emit warnings for potentially incorrect usage of
				// __builtin_expect(). It uses PGO profiles for validation.
				//
				//===----------------------------------------------------------------------===//

				#include "MisExpect.h"
				#include "CodeGenModule.h"
				#include "clang/Basic/Builtins.h"
				#include "clang/Basic/CodeGenOptions.h"
				#include "clang/Basic/Diagnostic.h"
				#include "llvm/Support/BranchProbability.h"

				#include <algorithm>
				#include <numeric>

				namespace {

				using namespace clang;
				using namespace clang::CodeGen;

				// Emit Warning notifying user that the current PGO counter is a mismatch with
				// the use of __builtin_expect()
				phosekUnsubmitted Done Reply Inline Actions It seems like `DebugPrintMisExpectSwitchInfo` and `EmitMisExpectWarning` is only used within this file, so I'd move them to anonymous namespace. phosek: It seems like `DebugPrintMisExpectSwitchInfo` and `EmitMisExpectWarning` is only used within…
				// \param PercentageCorrect the percentage the expected target of
				// __builtin_expect() was taken during profiling as an integer
				void EmitMisExpectWarning(const clang::CallExpr *Call, CodeGenModule &CGM,
				unsigned PercentageCorrect) {
				SourceLocation ExprLoc = Call->getBeginLoc();
				unsigned DiagID = CGM.getDiags().getCustomDiagID(
				DiagnosticsEngine::Warning,
				lebedev.riUnsubmitted Done Reply Inline Actions llvm::None lebedev.ri: llvm::None
				"Potential performance regression from use of __builtin_expect(): "
				"Annotation was correct on %0%% of profiled executions.");

				CGM.getDiags()
				lebedev.riUnsubmitted Done Reply Inline Actions `llvm::None` lebedev.ri: `llvm::None`
				.Report(ExprLoc, DiagID)
				.AddTaggedVal(PercentageCorrect,
				clang::DiagnosticsEngine::ArgumentKind::ak_uint);
				}

				// Prints some debug diagnostics useful when checking SwitchStmts.
				// Allows for simple comparison of the Case Value mappings to their index in the
				// SwitchWeights data structure in CGStmts.cpp
				void DebugPrintMisExpectSwitchInfo(SmallVector<uint64_t, 16> *SwitchWeights,
				llvm::DenseMap<int64_t, size_t> *CaseMap) {
				lebedev.riUnsubmitted Done Reply Inline Actions llvm::None lebedev.ri: llvm::None
				auto size = SwitchWeights->size();
				for (size_t i = 0; i < size; ++i) {
				llvm::dbgs() << "Index: " << i << "\tProfile Value:\t"
				<< (*SwitchWeights)[i] << "\n";
				lebedev.riUnsubmitted Done Reply Inline Actions return ExpectedVal; should just work? lebedev.ri: return ExpectedVal; should just work?
				}

				llvm::dbgs() << "------------------\n";

				for (auto &item : *CaseMap) {
				llvm::dbgs() << "Case Key: " << item.first << "\tRelated Index:\t"
				<< item.second << "\n";
				}

				llvm::dbgs() << "------------------\n";
				uint64_t CaseTotal =
				std::accumulate(SwitchWeights->begin(), SwitchWeights->end(), 0);
				llvm::dbgs() << "Switch Profile Count:\t" << CaseTotal << "\n";
				}

				} // namespace

				namespace clang {
				namespace CodeGen {
				namespace MisExpect {
				phosekUnsubmitted Done Reply Inline Actions No space between `:` and `\t` here and below. phosek: No space between `:` and `\t` here and below.

				#define DEBUG_TYPE "misexpect"

				Optional<int64_t> getExpectedValue(const clang::CallExpr *Call,
				ASTContext &Context) {
				if (!Call)
				return Optional<int64_t>(llvm::None);

				phosekUnsubmitted Not Done Reply Inline Actions Are these thresholds defined anywhere as constants? phosek: Are these thresholds defined anywhere as constants?
				paulkirthAuthorUnsubmitted Done Reply Inline Actions These thresholds come from PGOInstrumentation.cpp:1084 I will update with a reference to the code that this comes from, but, as noted in the TODO we need a better heuristic. paulkirth: These thresholds come from [[ https://github.com/llvm/llvm…
				auto *FD = dyn_cast_or_null<FunctionDecl>(Call->getCalleeDecl());
				if (!FD \|\| FD->getBuiltinID() != Builtin::BI__builtin_expect) {
				return Optional<int64_t>(llvm::None);
				}

				// Check if we can evaluate the 2nd parameter to __builtin_expect(expr,long)
				// since it may not be able to be evaluated at compile-time
				Expr::EvalResult ExprResult;
				auto Arg = Call->getArg(1);
				Arg->EvaluateAsInt(ExprResult, Context, Expr::SE_AllowSideEffects);

				if (!ExprResult.Val.hasValue())
				phosekUnsubmitted Done Reply Inline Actions No need for `{` and `}`. phosek: No need for `{` and `}`.
				return Optional<int64_t>(llvm::None);

				llvm::APSInt &Into = ExprResult.Val.getInt();
				int64_t ExpectedVal = Into.getExtValue();
				return ExpectedVal;
				}

				void CheckMisExpectBranch(const Expr *Cond, uint64_t TrueCount,
				uint64_t CurrProfCount, CodeGenModule &CGM) {
				if (!CGM.getCodeGenOpts().MisExpect \|\|
				CGM.getCodeGenOpts().getProfileUse() == CodeGenOptions::ProfileNone)
				return;

				auto *Call = dyn_cast<CallExpr>(Cond->IgnoreImpCasts());

				Optional<int64_t> ExpectedValOpt = getExpectedValue(Call, CGM.getContext());

				if (!ExpectedValOpt.hasValue())
				return;

				const long ExpectedVal = ExpectedValOpt.getValue();
				const bool ExpectedTrueBranch = (ExpectedVal != 0);

				LLVM_DEBUG(llvm::dbgs() << "Expected Value:\t" << ExpectedVal << "\n");
				LLVM_DEBUG(llvm::dbgs() << "Current Count:\t" << CurrProfCount << "\n");
				LLVM_DEBUG(llvm::dbgs() << "True Count:\t" << TrueCount << "\n");

				bool IncorrectPerfCounters = false;
				phosekUnsubmitted Done Reply Inline Actions No need for { and }, in this case you can probably just use ternary expression. phosek: No need for { and }, in this case you can probably just use ternary expression.
				uint64_t Scaled;
				unsigned Percentage;

				// TODO: determine better heuristics than hot/cold function thresholds
				// LowerExpectIntrinsics.cpp:49 LikelyBranchWeight = 2000
				// LowerExpectIntrinsics.cpp:52 UnlikelyBranchWeight = 1
				if (ExpectedTrueBranch) {
				const llvm::BranchProbability LikelyThreshold(2000, 1);
				Scaled = LikelyThreshold.scale(CurrProfCount);
				Percentage = (TrueCount / (float)CurrProfCount) * 100;
				if (TrueCount < Scaled)
				phosekUnsubmitted Done Reply Inline Actions No need for { and }. phosek: No need for { and }.
				IncorrectPerfCounters = true;
				} else {
				const llvm::BranchProbability UnlikelyThreshold(1, 2000);
				Scaled = UnlikelyThreshold.scale(CurrProfCount);
				Percentage = ((CurrProfCount - TrueCount) / (float)CurrProfCount) * 100;
				if (TrueCount > Scaled)
				IncorrectPerfCounters = true;
				}
				LLVM_DEBUG(llvm::dbgs() << "Scaled Count:\t" << Scaled << "\n");

				if (IncorrectPerfCounters)
				EmitMisExpectWarning(Call, CGM, Percentage);
				}
				lebedev.riUnsubmitted Done Reply Inline Actions This is rather undescriptive. Can you output some more useful info? lebedev.ri: This is rather undescriptive. Can you output some more useful info?
				paulkirthAuthorUnsubmitted Not Done Reply Inline Actions Do you have a suggestion about what feedback would be more useful? My initial thought with the somewhat generic message was to simply point out that this usage looked problematic, and let the developer investigate. I wasn't sure we wanted to expose details of the internal heuristic to the user by reporting the internal thresholds. paulkirth: Do you have a suggestion about what feedback would be more useful? My initial thought with the…
				xbolva00Unsubmitted Not Done Reply Inline Actions Message is currently confusing a bit. I really miss clear info like “This compiler hint seems to be incorrect according to current PGO counters. Please check the hint if it is still valid and perf-profitable”. xbolva00: Message is currently confusing a bit. I really miss clear info like “This compiler hint seems…

				void CheckMisExpectSwitch(const CallExpr *Call,
				SmallVector<uint64_t, 16> *SwitchWeights,
				llvm::DenseMap<int64_t, size_t> *CaseMap,
				CodeGenModule &CGM) {
				if (!SwitchWeights \|\| !CaseMap)
				return;

				Optional<int64_t> ExpectedValOpt = getExpectedValue(Call, CGM.getContext());

				if (!ExpectedValOpt.hasValue())
				return;

				LLVM_DEBUG(DebugPrintMisExpectSwitchInfo(SwitchWeights, CaseMap));
				const long ExpectedVal = ExpectedValOpt.getValue();

				LLVM_DEBUG(llvm::dbgs() << "Expected Value:\t" << ExpectedVal << "\n");

				uint64_t Max =
				*std::max_element(SwitchWeights->begin(), SwitchWeights->end());

				auto MapIndex = CaseMap->find(ExpectedVal);

				uint64_t Index = (MapIndex != CaseMap->end()) ? MapIndex->second : 0;
				uint64_t TakenCount = (*SwitchWeights)[Index];

				LLVM_DEBUG(llvm::dbgs() << "Taken Count:\t" << TakenCount << "\n");
				LLVM_DEBUG(llvm::dbgs() << "Max Count:\t" << Max << "\n");
				uint64_t CaseTotal =
				std::accumulate(SwitchWeights->begin(), SwitchWeights->end(), 0);
				unsigned Percentage = ((float)TakenCount / (float)CaseTotal) * 100;

				if (TakenCount < Max)
				EmitMisExpectWarning(Call, CGM, Percentage);
				}
				#undef DEBUG_TYPE

				} // namespace MisExpect
				} // namespace CodeGen
				} // namespace clang

clang/lib/Driver/ToolChains/Clang.cpp

Show First 20 Lines • Show All 3,992 Lines • ▼ Show 20 Lines	if (!Args.hasFlag(options::OPT_foptimize_sibling_calls,
CmdArgs.push_back("-mdisable-tail-calls");		CmdArgs.push_back("-mdisable-tail-calls");
if (Args.hasFlag(options::OPT_fno_escaping_block_tail_calls,		if (Args.hasFlag(options::OPT_fno_escaping_block_tail_calls,
options::OPT_fescaping_block_tail_calls, false))		options::OPT_fescaping_block_tail_calls, false))
CmdArgs.push_back("-fno-escaping-block-tail-calls");		CmdArgs.push_back("-fno-escaping-block-tail-calls");

Args.AddLastArg(CmdArgs, options::OPT_ffine_grained_bitfield_accesses,		Args.AddLastArg(CmdArgs, options::OPT_ffine_grained_bitfield_accesses,
options::OPT_fno_fine_grained_bitfield_accesses);		options::OPT_fno_fine_grained_bitfield_accesses);

		if (Args.hasFlag(options::OPT_fmisexpect, options::OPT_fno_misexpect,
		false)) {
		CmdArgs.push_back("-fmisexpect");
		}

// Handle segmented stacks.		// Handle segmented stacks.
if (Args.hasArg(options::OPT_fsplit_stack))		if (Args.hasArg(options::OPT_fsplit_stack))
CmdArgs.push_back("-split-stacks");		CmdArgs.push_back("-split-stacks");

RenderFloatingPointOptions(TC, D, OFastEnabled, Args, CmdArgs);		RenderFloatingPointOptions(TC, D, OFastEnabled, Args, CmdArgs);

if (Arg *A = Args.getLastArg(options::OPT_mlong_double_64,		if (Arg *A = Args.getLastArg(options::OPT_mlong_double_64,
options::OPT_mlong_double_128)) {		options::OPT_mlong_double_128)) {
▲ Show 20 Lines • Show All 2,355 Lines • Show Last 20 Lines

clang/lib/Frontend/CompilerInvocation.cpp

Show First 20 Lines • Show All 803 Lines • ▼ Show 20 Lines	if (!Opts.ProfileInstrumentUsePath.empty())
setPGOUseInstrumentor(Opts, Opts.ProfileInstrumentUsePath);		setPGOUseInstrumentor(Opts, Opts.ProfileInstrumentUsePath);
Opts.ProfileRemappingFile =		Opts.ProfileRemappingFile =
Args.getLastArgValue(OPT_fprofile_remapping_file_EQ);		Args.getLastArgValue(OPT_fprofile_remapping_file_EQ);
if (!Opts.ProfileRemappingFile.empty() && !Opts.ExperimentalNewPassManager) {		if (!Opts.ProfileRemappingFile.empty() && !Opts.ExperimentalNewPassManager) {
Diags.Report(diag::err_drv_argument_only_allowed_with)		Diags.Report(diag::err_drv_argument_only_allowed_with)
<< Args.getLastArg(OPT_fprofile_remapping_file_EQ)->getAsString(Args)		<< Args.getLastArg(OPT_fprofile_remapping_file_EQ)->getAsString(Args)
<< "-fexperimental-new-pass-manager";		<< "-fexperimental-new-pass-manager";
}		}
		Opts.MisExpect = Args.hasFlag(OPT_fmisexpect, OPT_fno_misexpect, false);

Opts.CoverageMapping =		Opts.CoverageMapping =
Args.hasFlag(OPT_fcoverage_mapping, OPT_fno_coverage_mapping, false);		Args.hasFlag(OPT_fcoverage_mapping, OPT_fno_coverage_mapping, false);
Opts.DumpCoverageMapping = Args.hasArg(OPT_dump_coverage_mapping);		Opts.DumpCoverageMapping = Args.hasArg(OPT_dump_coverage_mapping);
Opts.AsmVerbose = Args.hasArg(OPT_masm_verbose);		Opts.AsmVerbose = Args.hasArg(OPT_masm_verbose);
Opts.PreserveAsmComments = !Args.hasArg(OPT_fno_preserve_as_comments);		Opts.PreserveAsmComments = !Args.hasArg(OPT_fno_preserve_as_comments);
Opts.AssumeSaneOperatorNew = !Args.hasArg(OPT_fno_assume_sane_operator_new);		Opts.AssumeSaneOperatorNew = !Args.hasArg(OPT_fno_assume_sane_operator_new);
Opts.ObjCAutoRefCountExceptions = Args.hasArg(OPT_fobjc_arc_exceptions);		Opts.ObjCAutoRefCountExceptions = Args.hasArg(OPT_fobjc_arc_exceptions);
▲ Show 20 Lines • Show All 2,768 Lines • Show Last 20 Lines

clang/test/Profile/Inputs/misexpect-branch-nonconst-expect-arg.proftext

This file was added.

				bar
				# Func Hash:
				11262309464
				# Num Counters:
				2
				# Counter Values:
				200000
				2

				baz
				# Func Hash:
				24
				# Num Counters:
				1
				# Counter Values:
				2

				foo
				# Func Hash:
				635992
				# Num Counters:
				2
				# Counter Values:
				199998
				5096876

				main
				# Func Hash:
				303943193688
				# Num Counters:
				3
				# Counter Values:
				1
				2000
				200000

clang/test/Profile/Inputs/misexpect-branch.proftext

This file was added.

				bar
				# Func Hash:
				45795613684824
				# Num Counters:
				2
				# Counter Values:
				200000
				0

				baz
				# Func Hash:
				24
				# Num Counters:
				1
				# Counter Values:
				0

				foo
				# Func Hash:
				635992
				# Num Counters:
				2
				# Counter Values:
				200000
				5095099

				main
				# Func Hash:
				303943193688
				# Num Counters:
				3
				# Counter Values:
				1
				2000
				200000

clang/test/Profile/Inputs/misexpect-switch-default-only.proftext

This file was added.

				init_arry
				# Func Hash:
				18129
				# Num Counters:
				2
				# Counter Values:
				1
				25

				main
				# Func Hash:
				79676873694057560
				# Num Counters:
				5
				# Counter Values:
				1
				20
				20000
				20000
				20000

				random_sample
				# Func Hash:
				19458874217560
				# Num Counters:
				3
				# Counter Values:
				20000
				500000
				100006

				sum
				# Func Hash:
				1160280
				# Num Counters:
				2
				# Counter Values:
				0
				0

clang/test/Profile/Inputs/misexpect-switch.proftext

This file was added.

				init_arry
				# Func Hash:
				18129
				# Num Counters:
				2
				# Counter Values:
				1
				25

				main
				# Func Hash:
				1965403898329309329
				# Num Counters:
				10
				# Counter Values:
				1
				20
				20000
				20000
				5
				19995
				0
				0
				0
				0

				random_sample
				# Func Hash:
				19458874217560
				# Num Counters:
				3
				# Counter Values:
				19995
				499875
				100042

				sum
				# Func Hash:
				1160280
				# Num Counters:
				2
				# Counter Values:
				5
				125

clang/test/Profile/misexpect-branch-cold.c

This file was added.

				// Test that misexpect detects mis-annotated branches

				// RUN: llvm-profdata merge %S/Inputs/misexpect-branch.proftext -o %t.profdata
				// RUN: %clang_cc1 %s -O2 -o - -disable-llvm-passes -emit-llvm -fprofile-instrument-use-path=%t.profdata -verify -fmisexpect

				// expected-no-diagnostics
				#define likely(x) __builtin_expect(!!(x), 1)
				#define unlikely(x) __builtin_expect(!!(x), 0)

				int foo(int);
				int bar();
				int baz(int);
				int buzz();

				const int inner_loop = 100;
				const int outer_loop = 2000;

				int main() {
				int val = 0;
				for (int i = 0; i < outer_loop; ++i) {
				for (int i = 0; i < inner_loop; ++i) {
				val = bar();
				}
				}
				return 0;
				}

				int bar() {
				int rando = buzz();
				int x = 0;
				if (unlikely(rando % (outer_loop * inner_loop) == 0)) {
				x = baz(rando);
				} else {
				x = foo(50);
				}
				return x;
				}

				int foo(int i) {
				int j = 0;
				while (i < 100) {
				i += buzz() % 5;
				j++;
				}
				return j;
				}

				int baz(int rando) {
				int x = rando;
				return x;
				}

clang/test/Profile/misexpect-branch-nonconst-expected-val.c

This file was added.

				// Test that misexpect detects mis-annotated branches

				// RUN: llvm-profdata merge %S/Inputs/misexpect-branch-nonconst-expect-arg.proftext -o %t.profdata
				// RUN: %clang_cc1 %s -O2 -o - -disable-llvm-passes -emit-llvm -fprofile-instrument-use-path=%t.profdata -verify -fmisexpect

				// expected-no-diagnostics
				#define likely(x) __builtin_expect(!!(x), 1)
				#define unlikely(x) __builtin_expect(!!(x), 0)

				int foo(int);
				int bar();
				int baz(int);
				int buzz();

				const int inner_loop = 100;
				const int outer_loop = 2000;

				int main() {
				int val = 0;
				for (int i = 0; i < outer_loop; ++i) {
				for (int i = 0; i < inner_loop; ++i) {
				val = bar();
				}
				}
				return 0;
				}

				int bar() {
				int rando = buzz();
				int x = 0;
				if (__builtin_expect(rando % (outer_loop * inner_loop) == 0, buzz())) {
				x = baz(rando);
				} else {
				x = foo(50);
				}
				return x;
				}

				int foo(int i) {
				int j = 0;
				while (i < 100) {
				i += buzz() % 5;
				j++;
				}
				return j;
				}

				int baz(int rando) {
				int x = rando;
				return x;
				}

clang/test/Profile/misexpect-branch.c

This file was added.

				// Test that misexpect detects mis-annotated branches

				// RUN: llvm-profdata merge %S/Inputs/misexpect-branch.proftext -o %t.profdata
				// RUN: %clang_cc1 %s -O2 -o - -disable-llvm-passes -emit-llvm -fprofile-instrument-use-path=%t.profdata -verify -fmisexpect

				#define likely(x) __builtin_expect(!!(x), 1)
				#define unlikely(x) __builtin_expect(!!(x), 0)

				int foo(int);
				int bar();
				int baz(int);
				int buzz();

				const int inner_loop = 100;
				const int outer_loop = 2000;

				int main() {
				int val = 0;
				for (int i = 0; i < outer_loop; ++i) {
				for (int i = 0; i < inner_loop; ++i) {
				val = bar();
				}
				}
				return 0;
				}

				int bar() {
				int rando = buzz();
				int x = 0;
				if (likely(rando % (outer_loop * inner_loop) == 0)) { // expected-warning-re {{Potential performance regression from use of __builtin_expect(): Annotation was correct on {{.+}}% of profiled executions.}}
				x = baz(rando);
				} else {
				x = foo(50);
				}
				return x;
				}

				int foo(int i) {
				int j = 0;
				while (i < 100) {
				i += buzz() % 5;
				j++;
				}
				return j;
				}

				int baz(int rando) {
				int x = rando;
				return x;
				}

clang/test/Profile/misexpect-no-warning-without-flag.c

This file was added.

				// Test that misexpect detects mis-annotated branches

				// RUN: llvm-profdata merge %S/Inputs/misexpect-branch.proftext -o %t.profdata
				// RUN: %clang_cc1 %s -O2 -o - -disable-llvm-passes -emit-llvm -fprofile-instrument-use-path=%t.profdata -verify

				// expected-no-diagnostics
				#define likely(x) __builtin_expect(!!(x), 1)
				#define unlikely(x) __builtin_expect(!!(x), 0)

				int foo(int);
				int bar();
				int baz(int);
				int buzz();

				const int inner_loop = 100;
				const int outer_loop = 2000;

				int main() {
				int val = 0;
				for (int i = 0; i < outer_loop; ++i) {
				for (int i = 0; i < inner_loop; ++i) {
				val = bar();
				}
				}
				return 0;
				}

				int bar() {
				int rando = buzz();
				int x = 0;
				if (likely(rando % (outer_loop * inner_loop) == 0)) {
				x = baz(rando);
				} else {
				x = foo(50);
				}
				return x;
				}

				int foo(int i) {
				int j = 0;
				while (i < 100) {
				i += buzz() % 5;
				j++;
				}
				return j;
				}

				int baz(int rando) {
				int x = rando;
				return x;
				}

clang/test/Profile/misexpect-switch-default.c

This file was added.

				// Test that misexpect detects mis-annotated switch statements

				// RUN: llvm-profdata merge %S/Inputs/misexpect-switch.proftext -o %t.profdata
				// RUN: %clang_cc1 %s -O2 -o - -disable-llvm-passes -emit-llvm -fprofile-instrument-use-path=%t.profdata -verify -fmisexpect

				int sum(int *buff, int size);
				int random_sample(int *buff, int size);
				int rand();

				const int inner_loop = 1000;
				const int outer_loop = 20;
				const int arry_size = 25;

				int arry[arry_size] = {0};

				void init_arry() {
				int i;
				for (i = 0; i < arry_size; ++i) {
				arry[i] = rand() % 10;
				}
				}

				int main() {
				init_arry();
				int val = 0;

				int j, k;
				for (j = 0; j < outer_loop; ++j) {
				for (k = 0; k < inner_loop; ++k) {
				unsigned condition = rand() % 5;
				switch (__builtin_expect(condition, 6)) { // expected-warning-re {{Potential performance regression from use of __builtin_expect(): Annotation was correct on {{.+}}% of profiled executions.}}
				case 0:
				val += sum(arry, arry_size);
				break;
				case 1:
				case 2:
				case 3:
				case 4:
				val += random_sample(arry, arry_size);
				break;
				default:
				__builtin_unreachable();
				} // end switch
				} // end inner_loop
				} // end outer_loop

				return 0;
				}

				int sum(int *buff, int size) {
				int total = 0;
				int i = 0;
				for (i = 0; i < size; ++i) {
				total += buff[i];
				}
				return total;
				}

				int random_sample(int *buff, int size) {
				int total = 0;
				int i;
				for (i = 0; i < size; ++i) {
				if (rand() % 5 == 0)
				total += buff[i];
				}

				return total;
				}

clang/test/Profile/misexpect-switch-nonconst.c

This file was added.

				// Test that misexpect detects mis-annotated switch statements

				// RUN: llvm-profdata merge %S/Inputs/misexpect-switch.proftext -o %t.profdata
				// RUN: %clang_cc1 %s -O2 -o - -disable-llvm-passes -emit-llvm -fprofile-instrument-use-path=%t.profdata -verify -fmisexpect

				// expected-no-diagnostics
				int sum(int *buff, int size);
				int random_sample(int *buff, int size);
				int rand();

				const int inner_loop = 1000;
				const int outer_loop = 20;
				const int arry_size = 25;

				int arry[arry_size] = {0};

				void init_arry() {
				int i;
				for (i = 0; i < arry_size; ++i) {
				arry[i] = rand() % 10;
				}
				}

				int main() {
				init_arry();
				int val = 0;

				int j, k;
				for (j = 0; j < outer_loop; ++j) {
				for (k = 0; k < inner_loop; ++k) {
				unsigned condition = rand() % 10000;
				switch (__builtin_expect(condition, rand())) {
				case 0:
				val += sum(arry, arry_size);
				break;
				case 1:
				case 2:
				case 3:
				case 4:
				val += random_sample(arry, arry_size);
				break;
				default:
				__builtin_unreachable();
				} // end switch
				} // end inner_loop
				} // end outer_loop

				return 0;
				}

				int sum(int *buff, int size) {
				int total = 0;
				int i = 0;
				for (i = 0; i < size; ++i) {
				total += buff[i];
				}
				return total;
				}

				int random_sample(int *buff, int size) {
				int total = 0;
				int i;
				for (i = 0; i < size; ++i) {
				if (rand() % 5 == 0)
				total += buff[i];
				}

				return total;
				}

clang/test/Profile/misexpect-switch-only-default-case.c

This file was added.

				// Test that misexpect detects mis-annotated switch statements

				// RUN: llvm-profdata merge %S/Inputs/misexpect-switch-default-only.proftext -o %t.profdata
				// RUN: %clang_cc1 %s -O2 -o - -disable-llvm-passes -emit-llvm -fprofile-instrument-use-path=%t.profdata -verify -fmisexpect

				// expected-no-diagnostics
				int sum(int *buff, int size);
				int random_sample(int *buff, int size);
				int rand();

				const int inner_loop = 1000;
				const int outer_loop = 20;
				const int arry_size = 25;

				int arry[arry_size] = {0};

				void init_arry() {
				int i;
				for (i = 0; i < arry_size; ++i) {
				arry[i] = rand() % 10;
				}
				}

				int main()
				{
				init_arry();
				int val = 0;

				int j, k;
				for (j = 0; j < outer_loop; ++j) {
				for (k = 0; k < inner_loop; ++k) {
				unsigned condition = rand() % 10000;
				switch (__builtin_expect(condition, 0)) {
				default:
				val += random_sample(arry, arry_size);
				break;
				}; // end switch
				} // end inner_loop
				} // end outer_loop

				return 0;
				}

				int sum(int *buff, int size) {
				int total = 0;
				int i = 0;
				for (i = 0; i < size; ++i) {
				total += buff[i];
				}
				return total;
				}

				int random_sample(int *buff, int size) {
				int total = 0;
				int i;
				for (i = 0; i < size; ++i) {
				if (rand() % 5 == 0)
				total += buff[i];
				}

				return total;
				}

clang/test/Profile/misexpect-switch.c

This file was added.

				// Test that misexpect detects mis-annotated switch statements

				// RUN: llvm-profdata merge %S/Inputs/misexpect-switch.proftext -o %t.profdata
				// RUN: %clang_cc1 %s -O2 -o - -disable-llvm-passes -emit-llvm -fprofile-instrument-use-path=%t.profdata -verify -fmisexpect

				int sum(int *buff, int size);
				int random_sample(int *buff, int size);
				int rand();

				const int inner_loop = 1000;
				const int outer_loop = 20;
				const int arry_size = 25;

				int arry[arry_size] = {0};

				void init_arry() {
				int i;
				for (i = 0; i < arry_size; ++i) {
				arry[i] = rand() % 10;
				}
				}

				int main() {
				init_arry();
				int val = 0;

				int j, k;
				for (j = 0; j < outer_loop; ++j) {
				for (k = 0; k < inner_loop; ++k) {
				unsigned condition = rand() % 10000;
				switch (__builtin_expect(condition, 0)) { // expected-warning-re {{Potential performance regression from use of __builtin_expect(): Annotation was correct on {{.+}}% of profiled executions.}}
				case 0:
				val += sum(arry, arry_size);
				break;
				case 1:
				case 2:
				case 3:
				case 4:
				val += random_sample(arry, arry_size);
				break;
				default:
				__builtin_unreachable();
				} // end switch
				} // end inner_loop
				} // end outer_loop

				return 0;
				}

				int sum(int *buff, int size) {
				int total = 0;
				int i = 0;
				for (i = 0; i < size; ++i) {
				total += buff[i];
				}
				return total;
				}

				int random_sample(int *buff, int size) {
				int total = 0;
				int i;
				for (i = 0; i < size; ++i) {
				if (rand() % 5 == 0)
				total += buff[i];
				}

				return total;
				}

This is an archive of the discontinued LLVM Phabricator instance.

[clang] [CodeGen] clang-misexpect prototype for compiler warningsAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 213379

clang/include/clang/Basic/CodeGenOptions.def

clang/include/clang/Driver/Options.td

clang/lib/CodeGen/CGStmt.cpp

clang/lib/CodeGen/CMakeLists.txt

clang/lib/CodeGen/CodeGenFunction.h

clang/lib/CodeGen/CodeGenFunction.cpp

clang/lib/CodeGen/MisExpect.h

clang/lib/CodeGen/MisExpect.cpp

clang/lib/Driver/ToolChains/Clang.cpp

clang/lib/Frontend/CompilerInvocation.cpp

clang/test/Profile/Inputs/misexpect-branch-nonconst-expect-arg.proftext

clang/test/Profile/Inputs/misexpect-branch.proftext

clang/test/Profile/Inputs/misexpect-switch-default-only.proftext

clang/test/Profile/Inputs/misexpect-switch.proftext

clang/test/Profile/misexpect-branch-cold.c

clang/test/Profile/misexpect-branch-nonconst-expected-val.c

clang/test/Profile/misexpect-branch.c

clang/test/Profile/misexpect-no-warning-without-flag.c

clang/test/Profile/misexpect-switch-default.c

clang/test/Profile/misexpect-switch-nonconst.c

clang/test/Profile/misexpect-switch-only-default-case.c

clang/test/Profile/misexpect-switch.c

[clang] [CodeGen] clang-misexpect prototype for compiler warnings
AbandonedPublic