Diff 331707

llvm/lib/Transforms/Instrumentation/DataFlowSanitizer.cpp

Show All 10 Lines

/// analysis.

///

/// Unlike other Sanitizer tools, this tool is not designed to detect a specific

/// class of bugs on its own. Instead, it provides a generic dynamic data flow

/// analysis framework to be used by clients to help detect application-specific

/// issues within their own code.

///

/// The analysis is based on automatic propagation of data flow labels (also

/// known as taint labels) through a program as it performs computation. Each

/// known as taint labels) through a program as it performs computation.

/// byte of application memory is backed by two bytes of shadow memory which

///

/// hold the label. On Linux/x86_64, memory is laid out as follows:

/// There are two possible memory layouts. In the first one, each byte of

/// application memory is backed by a shadow memory byte. The shadow byte can

/// represent up to 8 labels. To enable this you must specify the

/// -dfsan-fast-8-labels flag. On Linux/x86_64, memory is then laid out as

/// follows:

///

/// +--------------------+ 0x800000000000 (top of memory)

/// | application memory |

/// +--------------------+ 0x700000008000 (kAppAddr)

/// | |

/// | unused |

/// | |

/// +--------------------+ 0x300200000000 (kUnusedAddr)

/// | union table |

/// +--------------------+ 0x300000000000 (kUnionTableAddr)

/// | origin |

/// +--------------------+ 0x200000008000 (kOriginAddr)

/// | shadow memory |

/// +--------------------+ 0x100000008000 (kShadowAddr)

/// | unused |

/// +--------------------+ 0x000000010000

/// | reserved by kernel |

/// +--------------------+ 0x000000000000

///

/// In the second memory layout, each byte of application memory is backed by

/// two bytes of shadow memory which hold the label. That means we can represent

/// either 16 labels (with -dfsan-fast-16-labels flag) or 2^16 labels (on the

/// default legacy mode) per byte. On Linux/x86_64, memory is then laid out as

/// follows:

///

stephan.yichao.zhaoUnsubmitted

Done

/// The analysis is based on automatic propagation of data flow labels (also

/// known as taint labels) through a program as it performs computation.

+ ///

/// There are two possible memory layouts. In the first one, each byte of

/// application memory is backed by a shadow memory byte. The shadow byte can

Please add the missing /// at line 20.


/// +--------------------+ 0x800000000000 (top of memory)

/// | application memory |

/// +--------------------+ 0x700000008000 (kAppAddr)

/// | |

/// | unused |

/// | |

/// +--------------------+ 0x300200000000 (kUnusedAddr)

/// | union table |

/// +--------------------+ 0x300000000000 (kUnionTableAddr)

/// | origin |

/// +--------------------+ 0x200000008000 (kOriginAddr)

stephan.yichao.zhaoUnsubmitted

Done

Please update this diagram for 8-bit layout.


/// | shadow memory |

/// +--------------------+ 0x000000010000 (kShadowAddr)

/// | reserved by kernel |

/// +--------------------+ 0x000000000000

///

/// To derive a shadow memory address from an application memory address,

/// bits 44-46 are cleared to bring the address into the range

stephan.yichao.zhaoUnsubmitted

Done

If users read top to bottom, it seems helpful to move this sentence above the first layout.
I suggest we first explain that there are two kinds of shadow layouts. One uses 2 bytes, for fast-16-labels mode with 16 labels and legacy mode with 2^16 labels, and the other uses 1 byte, for fast-8-labels mode with 8 labels. Then we show the two layouts.

gbalatsAuthorUnsubmitted

Done

You're right, the bottom part relates to the 16-shadow-bits layout. I changed the order and added more text to explain it better.

/// [0x000000008000,0x100000000000). Then the address is shifted left by 1 to

/// account for the double byte representation of shadow labels and move the

/// address into the shadow memory range. See the function

/// DataFlowSanitizer::getShadowAddress below.

///

/// For more information, please refer to the design document:

/// http://clang.llvm.org/docs/DataFlowSanitizerDesign.html

//===----------------------------------------------------------------------===//
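As a rough illustration of the address arithmetic described in the header comment, the sketch below maps an application address to a shadow address on Linux/x86_64 for both layouts. The two masks are the x86_64 values that appear in `DataFlowSanitizer::init` in this diff; the helper name `shadowAddr` is hypothetical and is not part of the pass.

```cpp
#include <cassert>
#include <cstdint>

// x86_64 masks as they appear in DataFlowSanitizer::init in this diff.
constexpr uint64_t kMask16 = ~0x700000000000ULL; // clears bits 44-46
constexpr uint64_t kMask8 = ~0x600000000000ULL;  // clears bits 45-46

// Hypothetical helper (not part of the pass): mask the application address,
// then scale by the number of shadow bytes per application byte (2 or 1).
constexpr uint64_t shadowAddr(uint64_t AppAddr, bool Fast8) {
  return Fast8 ? (AppAddr & kMask8) : (AppAddr & kMask16) * 2;
}
```

Under these assumptions, the lowest application address 0x700000008000 maps to 0x000000010000 in the two-byte layout and to 0x100000008000 in the one-byte layout, which matches the kShadowAddr values in the two diagrams.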

#include "llvm/Transforms/Instrumentation/DataFlowSanitizer.h"

#include "llvm/ADT/DenseMap.h"

#include "llvm/ADT/DenseSet.h"

#include "llvm/ADT/DepthFirstIterator.h"

#include "llvm/ADT/None.h"

stephan.yichao.zhaoUnsubmitted

Done

Addresses below 0x000000010000 are reserved for kernel use.
We can mark the range from 0x000000010000 to 0x100000008000 as unused.

#include "llvm/ADT/SmallPtrSet.h"

#include "llvm/ADT/SmallVector.h"

#include "llvm/ADT/StringExtras.h"

#include "llvm/ADT/StringRef.h"

#include "llvm/ADT/Triple.h"

#include "llvm/ADT/iterator.h"

#include "llvm/Analysis/ValueTracking.h"

#include "llvm/IR/Argument.h"

▲ Show 20 Lines • Show All 131 Lines • ▼ Show 20 Lines

// Use a distinct bit for each base label, enabling faster unions with less

// instrumentation. Limits the max number of base labels to 16.

static cl::opt<bool> ClFast16Labels(

"dfsan-fast-16-labels",

cl::desc("Use more efficient instrumentation, limiting the number of "

"labels to 16."),

cl::Hidden, cl::init(false));

// Use a distinct bit for each base label, enabling faster unions with less

// instrumentation. Limits the max number of base labels to 8.

static cl::opt<bool> ClFast8Labels(

"dfsan-fast-8-labels",

cl::desc("Use more efficient instrumentation, limiting the number of "

"labels to 8."),

cl::Hidden, cl::init(false));
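The flag comments above say that each base label gets a distinct bit, which is why a union needs so little instrumentation. A minimal sketch of that idea for the 8-label case follows; the type and function names are hypothetical and only model the bit arithmetic, not the runtime's API.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of a fast-8 label: one bit per base label.
using Fast8Label = uint8_t;

// The base label with index 0..7 occupies bit Index.
constexpr Fast8Label baseLabel(unsigned Index) {
  return static_cast<Fast8Label>(1u << Index);
}

// Union of two labels is a single bitwise OR.
constexpr Fast8Label unionLabels(Fast8Label A, Fast8Label B) {
  return static_cast<Fast8Label>(A | B);
}

// Membership test: does label L include the given base label?
constexpr bool hasLabel(Fast8Label L, Fast8Label Base) {
  return (L & Base) != 0;
}
```

With one byte of shadow there are only eight bits to hand out, which is where the limit of 8 base labels comes from.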

// Controls whether the pass tracks the control flow of select instructions.

static cl::opt<bool> ClTrackSelectControlFlow(

"dfsan-track-select-control-flow",

cl::desc("Propagate labels from condition values of select instructions "

"to results."),

cl::Hidden, cl::init(true));

// TODO: This default value follows MSan. DFSan may use a different value.

▲ Show 20 Lines • Show All 125 Lines • ▼ Show 20 Lines

return AttributeList::get(Ctx, CallSiteAttrs.getFnAttributes(),

llvm::makeArrayRef(ArgumentAttributes));

}

class DataFlowSanitizer {

friend struct DFSanFunction;

friend class DFSanVisitor;

enum {

ShadowWidthBits = 16,

ShadowWidthBytes = ShadowWidthBits / 8,

OriginWidthBits = 32,

OriginWidthBytes = OriginWidthBits / 8

};

/// Which ABI should be used for instrumented functions?

enum InstrumentedABI {

/// Argument and return value labels are passed through additional

/// arguments and by modifying the return type.

Show All 24 Lines

enum WrapperKind {

/// original function or provide its own implementation. This is similar to

/// the IA_Args ABI, except that IA_Args uses a struct return type to

/// pass the return value shadow in a register, while WK_Custom uses an

/// extra pointer argument to return the shadow. This allows the wrapped

/// form of the function type to be expressed in C.

WK_Custom

};

unsigned ShadowWidthBits;

unsigned ShadowWidthBytes;

Module *Mod;

LLVMContext *Ctx;

Type *Int8Ptr;

IntegerType *OriginTy;

PointerType *OriginPtrTy;

ConstantInt *OriginBase;

ConstantInt *ZeroOrigin;

/// The shadow type for all primitive types and vector types.

Show All 20 Lines

class DataFlowSanitizer {

FunctionType *DFSanLoadStoreCallbackFnTy;

FunctionType *DFSanMemTransferCallbackFnTy;

FunctionType *DFSanChainOriginFnTy;

FunctionType *DFSanMemOriginTransferFnTy;

FunctionType *DFSanMaybeStoreOriginFnTy;

FunctionCallee DFSanUnionFn;

FunctionCallee DFSanCheckedUnionFn;

FunctionCallee DFSanUnionLoadFn;

FunctionCallee DFSanUnionLoadFast16LabelsFn;

FunctionCallee DFSanUnionLoadFastLabelsFn;

FunctionCallee DFSanLoadLabelAndOriginFn;

FunctionCallee DFSanUnimplementedFn;

FunctionCallee DFSanSetLabelFn;

FunctionCallee DFSanNonzeroLabelFn;

FunctionCallee DFSanVarargWrapperFn;

FunctionCallee DFSanLoadCallbackFn;

FunctionCallee DFSanStoreCallbackFn;

FunctionCallee DFSanMemTransferCallbackFn;

FunctionCallee DFSanCmpCallbackFn;

FunctionCallee DFSanChainOriginFn;

FunctionCallee DFSanMemOriginTransferFn;

FunctionCallee DFSanMaybeStoreOriginFn;

SmallPtrSet<Value *, 16> DFSanRuntimeFunctions;

MDNode *ColdCallWeights;

MDNode *OriginStoreWeights;

DFSanABIList ABIList;

DenseMap<Value *, Function *> UnwrappedFnMap;

AttrBuilder ReadOnlyNoneAttrs;

bool DFSanRuntimeShadowMask = false;

Value *getShadowOffset(Value *Addr, IRBuilder<> &IRB);

Value *getShadowAddress(Value *Addr, Instruction *Pos);

Value *getShadowAddress(Value *Addr, Instruction *Pos, Value *ShadowOffset);

std::pair<Value *, Value *>

getShadowOriginAddress(Value *Addr, Align InstAlignment, Instruction *Pos);

bool isInstrumented(const Function *F);

bool isInstrumented(const GlobalAlias *GA);

FunctionType *getArgsFunctionType(FunctionType *T);

FunctionType *getTrampolineFunctionType(FunctionType *T);

TransformedFunction getCustomFunctionType(FunctionType *T);

InstrumentedABI getInstrumentedABI();

WrapperKind getWrapperKind(Function *F);

void addGlobalNamePrefix(GlobalValue *GV);

Function *buildWrapperFunction(Function *F, StringRef NewFName,

GlobalValue::LinkageTypes NewFLink,

FunctionType *NewFT);

Constant *getOrBuildTrampolineFunction(FunctionType *FT, StringRef FName);

void initializeCallbackFunctions(Module &M);

void initializeRuntimeFunctions(Module &M);

void injectMetadataGlobals(Module &M);

bool init(Module &M);

/// Returns whether fast8 or fast16 mode has been specified.

bool hasFastLabelsEnabled();

/// Returns whether the pass tracks origins. Support only fast16 mode in TLS

/// ABI mode.

bool shouldTrackOrigins();

/// Returns whether the pass tracks labels for struct fields and array

/// indices. Support only fast16 mode in TLS ABI mode.

bool shouldTrackFieldsAndIndices();

▲ Show 20 Lines • Show All 255 Lines • ▼ Show 20 Lines

private:

void addOriginArguments(Function &F, CallBase &CB, std::vector<Value *> &Args,

IRBuilder<> &IRB);

};

} // end anonymous namespace

DataFlowSanitizer::DataFlowSanitizer(

const std::vector<std::string> &ABIListFiles) {

if (ClFast8Labels && ClFast16Labels) {

stephan.yichao.zhaoUnsubmitted

Done

When we release fast8labels, we will set ClFast8Labels = true by default.
If users then want 16-bit mode via -dfsan-fast-16-labels=true, this assert fires, so they have to pass -dfsan-fast-16-labels=true -dfsan-fast-8-labels=false.
The message can suggest this when both are true. Something like:
"cannot set both -dfsan-fast-8-labels and -dfsan-fast-16-labels. -dfsan-fast-8-labels is true by default.
16 mode will be deprecated. To use 16 mode, set -dfsan-fast-16-labels=true -dfsan-fast-8-labels=false."

Alternatively, when -dfsan-fast-16-labels is true, or when we find -dfsan-fast-8-labels=false, this code can print the deprecation warning.

gbalatsAuthorUnsubmitted

Done

I'm not sure I follow. Did you perhaps mean ClFast16Labels, since ClFast8Labels has just been introduced by this patch? Where is it set? The default value of both flags is false, so the statement "-dfsan-fast-*-labels is true by default" isn't accurate.

I think the deprecation warning should be introduced after fast8 is properly supported by the dfsan runtime.

stephan.yichao.zhaoUnsubmitted

Done

I thought about this again. Since fast-8-labels is not fully supported yet and is set to false by default, we do not have this issue for now.
When we set its default to true, this message can be updated accordingly.

report_fatal_error(

"cannot set both -dfsan-fast-8-labels and -dfsan-fast-16-labels");

}

ShadowWidthBits = ClFast8Labels ? 8 : 16;

ShadowWidthBytes = ShadowWidthBits / 8;

std::vector<std::string> AllABIListFiles(std::move(ABIListFiles));

llvm::append_range(AllABIListFiles, ClABIListFiles);

// FIXME: should we propagate vfs::FileSystem to this constructor?

ABIList.set(

SpecialCaseList::createOrDie(AllABIListFiles, *vfs::getRealFileSystem()));

}
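The constructor logic above can be modeled outside LLVM as a small function: reject the combination of both flags, otherwise pick 8 shadow bits for fast-8 and 16 bits for everything else. This is a sketch only; `ShadowWidth` and `selectShadowWidth` are hypothetical names, and an exception stands in for `report_fatal_error`.

```cpp
#include <cassert>
#include <stdexcept>

// Hypothetical stand-in for the two members set in the constructor.
struct ShadowWidth {
  unsigned Bits;
  unsigned Bytes;
};

// Mirrors the constructor's checks: the two fast modes are mutually
// exclusive, and only fast-8 uses a one-byte shadow.
inline ShadowWidth selectShadowWidth(bool Fast8, bool Fast16) {
  if (Fast8 && Fast16)
    throw std::invalid_argument(
        "cannot set both -dfsan-fast-8-labels and -dfsan-fast-16-labels");
  unsigned Bits = Fast8 ? 8 : 16;
  return {Bits, Bits / 8};
}
```

Note that legacy mode (neither flag set) and fast-16 mode both end up with a 16-bit shadow, so only the fast-8 flag changes the width.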

FunctionType *DataFlowSanitizer::getArgsFunctionType(FunctionType *T) {

▲ Show 20 Lines • Show All 78 Lines • ▼ Show 20 Lines

if (!isa<ArrayType>(T) && !isa<StructType>(T)) {

if (const ConstantInt *CI = dyn_cast<ConstantInt>(V))

return CI->isZero();

return false;

}

return isa<ConstantAggregateZero>(V);

}

bool DataFlowSanitizer::hasFastLabelsEnabled() {

static const bool HasFastLabelsEnabled = ClFast8Labels || ClFast16Labels;

return HasFastLabelsEnabled;

}

bool DataFlowSanitizer::shouldTrackOrigins() {

static const bool ShouldTrackOrigins =

ClTrackOrigins && getInstrumentedABI() == DataFlowSanitizer::IA_TLS &&

ClFast16Labels;

return ShouldTrackOrigins;

}

bool DataFlowSanitizer::shouldTrackFieldsAndIndices() {

return getInstrumentedABI() == DataFlowSanitizer::IA_TLS && ClFast16Labels;

return getInstrumentedABI() == DataFlowSanitizer::IA_TLS &&

hasFastLabelsEnabled();

}

Constant *DataFlowSanitizer::getZeroShadow(Type *OrigTy) {

if (!shouldTrackFieldsAndIndices())

return ZeroPrimitiveShadow;

if (!isa<ArrayType>(OrigTy) && !isa<StructType>(OrigTy))

return ZeroPrimitiveShadow;

▲ Show 20 Lines • Show All 148 Lines • ▼ Show 20 Lines

bool DataFlowSanitizer::init(Module &M) {

IntptrTy = DL.getIntPtrType(*Ctx);

ZeroPrimitiveShadow = ConstantInt::getSigned(PrimitiveShadowTy, 0);

ShadowPtrMul = ConstantInt::getSigned(IntptrTy, ShadowWidthBytes);

OriginBase = ConstantInt::get(IntptrTy, 0x200000000000LL);

ZeroOrigin = ConstantInt::getSigned(OriginTy, 0);

switch (TargetTriple.getArch()) {

case Triple::x86_64:

ShadowPtrMask = ConstantInt::getSigned(IntptrTy, ~0x700000000000LL);

ShadowPtrMask = ClFast8Labels

? ConstantInt::getSigned(IntptrTy, ~0x600000000000LL)

: ConstantInt::getSigned(IntptrTy, ~0x700000000000LL);

break;

case Triple::mips64:

case Triple::mips64el:

ShadowPtrMask = ConstantInt::getSigned(IntptrTy, ~0xF000000000LL);

ShadowPtrMask = ClFast8Labels

? ConstantInt::getSigned(IntptrTy, ~0xE000000000LL)

stephan.yichao.zhaoUnsubmitted

Not Done

As with aarch64, this may not be easy to test. Can we make this a TODO?

gbalatsAuthorUnsubmitted

Done

So what should the behavior be if one runs on MIPS? I think if we don't change the shadow ptr mask, it won't be correct either, so I'm not sure that just keeping the older value is preferable. Should we report a fatal error if ClFast8Labels is used on MIPS? I'm not sure how this would play out with continuous integration and whether it would introduce failures.

: ConstantInt::getSigned(IntptrTy, ~0xF000000000LL);

break;

case Triple::aarch64:

case Triple::aarch64_be:

// AArch64 supports multiple VMAs and the shadow mask is set at runtime.

DFSanRuntimeShadowMask = true;

break;

default:

report_fatal_error("unsupported triple");

▲ Show 20 Lines • Show All 217 Lines • ▼ Show 20 Lines

void DataFlowSanitizer::initializeRuntimeFunctions(Module &M) {

{

AttributeList AL;

AL = AL.addAttribute(M.getContext(), AttributeList::FunctionIndex,

Attribute::NoUnwind);

AL = AL.addAttribute(M.getContext(), AttributeList::FunctionIndex,

Attribute::ReadOnly);

AL = AL.addAttribute(M.getContext(), AttributeList::ReturnIndex,

Attribute::ZExt);

DFSanUnionLoadFast16LabelsFn = Mod->getOrInsertFunction(

DFSanUnionLoadFastLabelsFn = Mod->getOrInsertFunction(

"__dfsan_union_load_fast16labels", DFSanUnionLoadFnTy, AL);

}

{

AttributeList AL;

AL = AL.addAttribute(M.getContext(), AttributeList::FunctionIndex,

Attribute::NoUnwind);

AL = AL.addAttribute(M.getContext(), AttributeList::FunctionIndex,

Attribute::ReadOnly);

Show All 35 Lines

void DataFlowSanitizer::initializeRuntimeFunctions(Module &M) {

}

DFSanRuntimeFunctions.insert(DFSanUnionFn.getCallee()->stripPointerCasts());

DFSanRuntimeFunctions.insert(

DFSanCheckedUnionFn.getCallee()->stripPointerCasts());

DFSanRuntimeFunctions.insert(

DFSanUnionLoadFn.getCallee()->stripPointerCasts());

DFSanRuntimeFunctions.insert(

DFSanUnionLoadFast16LabelsFn.getCallee()->stripPointerCasts());

DFSanUnionLoadFastLabelsFn.getCallee()->stripPointerCasts());

DFSanRuntimeFunctions.insert(

DFSanLoadLabelAndOriginFn.getCallee()->stripPointerCasts());

DFSanRuntimeFunctions.insert(

DFSanUnimplementedFn.getCallee()->stripPointerCasts());

DFSanRuntimeFunctions.insert(

DFSanSetLabelFn.getCallee()->stripPointerCasts());

DFSanRuntimeFunctions.insert(

DFSanNonzeroLabelFn.getCallee()->stripPointerCasts());

▲ Show 20 Lines • Show All 450 Lines • ▼ Show 20 Lines

}

std::pair<Value *, Value *>

DataFlowSanitizer::getShadowOriginAddress(Value *Addr, Align InstAlignment,

Instruction *Pos) {

// Returns ((Addr & shadow_mask) + origin_base) & ~4UL

IRBuilder<> IRB(Pos);

Value *ShadowOffset = getShadowOffset(Addr, IRB);

Value *ShadowPtr = IRB.CreateIntToPtr(

Value *ShadowPtr = getShadowAddress(Addr, Pos, ShadowOffset);

IRB.CreateMul(ShadowOffset, ShadowPtrMul), PrimitiveShadowPtrTy);

Value *OriginPtr = nullptr;

if (shouldTrackOrigins()) {

Value *OriginLong = IRB.CreateAdd(ShadowOffset, OriginBase);

const Align Alignment = llvm::assumeAligned(InstAlignment.value());

// When alignment is >= 4, Addr must be aligned to 4, otherwise it is UB.

// So Mask is unnecessary.

if (Alignment < MinOriginAlignment) {

uint64_t Mask = MinOriginAlignment.value() - 1;

OriginLong = IRB.CreateAnd(OriginLong, ConstantInt::get(IntptrTy, ~Mask));

}

OriginPtr = IRB.CreateIntToPtr(OriginLong, OriginPtrTy);

}

return {ShadowPtr, OriginPtr};

}
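The origin address computation in getShadowOriginAddress can be sketched on plain integers as follows. The mask and origin base are the x86_64 constants from this diff (origin tracking is only enabled in fast-16 mode here, so the 16-bit mask is used). The value 4 for the minimum origin alignment is an assumption, since MinOriginAlignment is not defined in the visible part of the diff, and `originAddr` is a hypothetical name.

```cpp
#include <cassert>
#include <cstdint>

constexpr uint64_t kShadowMask16 = ~0x700000000000ULL; // from init(), x86_64
constexpr uint64_t kOriginBase = 0x200000000000ULL;    // OriginBase in init()
constexpr uint64_t kMinOriginAlign = 4; // assumption: not shown in this diff

// Hypothetical model: offset the masked address by the origin base, and
// round down to the origin alignment only when the access is under-aligned.
inline uint64_t originAddr(uint64_t AppAddr, uint64_t Alignment) {
  uint64_t Origin = (AppAddr & kShadowMask16) + kOriginBase;
  if (Alignment < kMinOriginAlign)
    Origin &= ~(kMinOriginAlign - 1);
  return Origin;
}
```

Under that assumption the mask applied is ~3, so an access at 0x700000008003 with alignment 1 maps to the origin slot at 0x200000008000.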

Value *DataFlowSanitizer::getShadowAddress(Value *Addr, Instruction *Pos,

Value *ShadowOffset) {

IRBuilder<> IRB(Pos);

if (!ShadowPtrMul->isOne())

ShadowOffset = IRB.CreateMul(ShadowOffset, ShadowPtrMul);

return IRB.CreateIntToPtr(ShadowOffset, PrimitiveShadowPtrTy);

}

Value *DataFlowSanitizer::getShadowAddress(Value *Addr, Instruction *Pos) {

// Returns (Addr & shadow_mask) x 2

IRBuilder<> IRB(Pos);

Value *ShadowOffset = getShadowOffset(Addr, IRB);

return IRB.CreateIntToPtr(IRB.CreateMul(ShadowOffset, ShadowPtrMul),

return getShadowAddress(Addr, Pos, ShadowOffset);

PrimitiveShadowPtrTy);

}

Value *DFSanFunction::combineShadowsThenConvert(Type *T, Value *V1, Value *V2,

Instruction *Pos) {

Value *PrimitiveValue = combineShadows(V1, V2, Pos);

return expandFromPrimitiveShadow(T, PrimitiveValue, Pos);

}

Show All 33 Lines

Value *DFSanFunction::combineShadows(Value *V1, Value *V2, Instruction *Pos) {

if (CCS.Block && DT.dominates(CCS.Block, Pos->getParent()))

return CCS.Shadow;

// Converts inputs shadows to shadows with primitive types.

Value *PV1 = collapseToPrimitiveShadow(V1, Pos);

Value *PV2 = collapseToPrimitiveShadow(V2, Pos);

IRBuilder<> IRB(Pos);

if (ClFast16Labels) {

if (DFS.hasFastLabelsEnabled()) {

stephan.yichao.zhaoUnsubmitted

Done

Since this expression is used multiple times, please create a function.


CCS.Block = Pos->getParent();

CCS.Shadow = IRB.CreateOr(PV1, PV2);

} else if (AvoidNewBlocks) {

CallInst *Call = IRB.CreateCall(DFS.DFSanCheckedUnionFn, {PV1, PV2});

Call->addAttribute(AttributeList::ReturnIndex, Attribute::ZExt);

Call->addParamAttr(0, Attribute::ZExt);

Call->addParamAttr(1, Attribute::ZExt);

▲ Show 20 Lines • Show All 131 Lines • ▼ Show 20 Lines

if (Alignment >= MinOriginAlignment &&

Size % (64 / DFS.ShadowWidthBits) == 0)

return false;

return true;

}

std::pair<Value *, Value *> DFSanFunction::loadFast16ShadowFast(

Value *ShadowAddr, Value *OriginAddr, uint64_t Size, Align ShadowAlign,

Align OriginAlign, Value *FirstOrigin, Instruction *Pos) {

// First OR all the WideShadows, then OR individual shadows within the

// combined WideShadow. This is fewer instructions than ORing shadows

// individually.

const bool ShouldTrackOrigins = DFS.shouldTrackOrigins();

stephan.yichao.zhaoUnsubmitted

Done

32-bit is a special case for when Size is small. Please add some comments here to explain when 32-bit is used.
We also ignore cases in loadFast16ShadowFast where ShadowSize is, say, 12 or 20. Please also explain this design choice in the comments.

gbalatsAuthorUnsubmitted

Done

Added comments and assertion.


const uint64_t ShadowSize = Size * DFS.ShadowWidthBytes;

assert(Size >= 4 && "Not large enough load size for fast path!");

// Used for origin tracking.

std::vector<Value *> Shadows;

std::vector<Value *> Origins;

// Load instructions in LLVM can have arbitrary byte sizes (e.g., 3, 12, 20)

stephan.yichao.zhaoUnsubmitted

Done

A comment with more context could be:
"LLVM bitcode supports loading integers with byte sizes such as 1, 12, 20, ...
But loadFast16ShadowFast only optimizes the normal cases of loading i32, i64, i128, etc.,
so it is fine to support only size == 4 or size % 8 == 0."

gbalatsAuthorUnsubmitted

Done

Expanded comment.


// but this function is only used in a subset of cases that make it possible

// to optimize the instrumentation.

// Specifically, when the shadow size in bytes (i.e., loaded bytes x shadow

// per byte) is either:

// - a multiple of 8 (common)

// - equal to 4 (only for load32 in fast-8 mode)

// For the second case, we can fit the wide shadow in a 32-bit integer. In all

// other cases, we use a 64-bit integer to hold the wide shadow.

Type *WideShadowTy =

ShadowSize == 4 ? Type::getInt32Ty(*DFS.Ctx) : Type::getInt64Ty(*DFS.Ctx);

IRBuilder<> IRB(Pos);

Value *WideAddr =

Value *WideAddr = IRB.CreateBitCast(ShadowAddr, WideShadowTy->getPointerTo());

stephan.yichao.zhaoUnsubmitted

Done

const


gbalatsAuthorUnsubmitted

Done

Done. I assume this referred to the ShadowSize variable. Moved to the top.


IRB.CreateBitCast(ShadowAddr, Type::getInt64PtrTy(*DFS.Ctx));

Value *CombinedWideShadow =

IRB.CreateAlignedLoad(IRB.getInt64Ty(), WideAddr, ShadowAlign);

IRB.CreateAlignedLoad(WideShadowTy, WideAddr, ShadowAlign);

stephan.yichao.zhaoUnsubmitted

Done

We call loadFast16ShadowFast only when Size > 2 and either Size % 8 == 0 or Size == 4, so Size is at least 4.
If we assert(ShadowSize >= 4), we avoid having to reason about whether the code still works when ShadowSize < 4.

gbalatsAuthorUnsubmitted

Done

Added assertion and changed <= to == 4.


if (ShouldTrackOrigins) {

Shadows.push_back(CombinedWideShadow);

Origins.push_back(FirstOrigin);

}

for (uint64_t Ofs = 64 / DFS.ShadowWidthBits; Ofs != Size;

Ofs += 64 / DFS.ShadowWidthBits) {

// First OR all the WideShadows (i.e., 64bit or 32bit shadow chunks) linearly;

WideAddr = IRB.CreateGEP(Type::getInt64Ty(*DFS.Ctx), WideAddr,

// then OR individual shadows within the combined WideShadow by binary ORing.

// This is fewer instructions than ORing shadows individually, since it

// needs logN shift/or instructions (N being the bytes of the combined wide

// shadow).

unsigned WideShadowBitWidth = WideShadowTy->getIntegerBitWidth();

const uint64_t BytesPerWideShadow = WideShadowBitWidth / DFS.ShadowWidthBits;

for (uint64_t ByteOfs = BytesPerWideShadow; ByteOfs < Size;

ByteOfs += BytesPerWideShadow) {

WideAddr = IRB.CreateGEP(WideShadowTy, WideAddr,

ConstantInt::get(DFS.IntptrTy, 1));

Value *NextWideShadow =

IRB.CreateAlignedLoad(IRB.getInt64Ty(), WideAddr, ShadowAlign);

IRB.CreateAlignedLoad(WideShadowTy, WideAddr, ShadowAlign);

CombinedWideShadow = IRB.CreateOr(CombinedWideShadow, NextWideShadow);

if (ShouldTrackOrigins) {

Shadows.push_back(NextWideShadow);

OriginAddr = IRB.CreateGEP(DFS.OriginTy, OriginAddr,

ConstantInt::get(DFS.IntptrTy, 1));

Origins.push_back(

IRB.CreateAlignedLoad(DFS.OriginTy, OriginAddr, OriginAlign));

}

for (unsigned Width = 32; Width >= DFS.ShadowWidthBits; Width >>= 1) {

for (unsigned Width = WideShadowBitWidth / 2; Width >= DFS.ShadowWidthBits;

Width >>= 1) {

Value *ShrShadow = IRB.CreateLShr(CombinedWideShadow, Width);

CombinedWideShadow = IRB.CreateOr(CombinedWideShadow, ShrShadow);

}

return {IRB.CreateTrunc(CombinedWideShadow, DFS.PrimitiveShadowTy),

ShouldTrackOrigins

? combineOrigins(Shadows, Origins, Pos,

ConstantInt::getSigned(IRB.getInt64Ty(), 0))

: DFS.ZeroOrigin};

}
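The shift-and-OR folding at the end of loadFast16ShadowFast can be illustrated on plain integers. The sketch below takes an already combined wide shadow and folds it down to a single label by halving the shift width each step, so a 64-bit chunk of 8-bit shadows needs three shift/OR pairs. `foldWideShadow` is a hypothetical name; the final mask plays the role of the `CreateTrunc` in the pass.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of the folding loop: OR the upper half of the wide
// shadow into the lower half until only one shadow-sized slot remains.
inline uint64_t foldWideShadow(uint64_t Wide, unsigned WideBits,
                               unsigned ShadowBits) {
  for (unsigned Width = WideBits / 2; Width >= ShadowBits; Width >>= 1)
    Wide |= Wide >> Width;
  // Keep only the low ShadowBits bits (the truncation step).
  uint64_t Mask = (ShadowBits >= 64) ? ~0ULL : ((1ULL << ShadowBits) - 1);
  return Wide & Mask;
}
```

Because fast labels are bit sets, the low slot of the result is the union of every label in the chunk, regardless of which byte carried which bit.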

Value *DFSanFunction::loadLegacyShadowFast(Value *ShadowAddr, uint64_t Size,

Align ShadowAlign,

Instruction *Pos) {

// Fast path for the common case where each byte has identical shadow: load

// shadow 64 bits at a time, fall out to a __dfsan_union_load call if any

// shadow 64 (or 32) bits at a time, fall out to a __dfsan_union_load call if

stephan.yichao.zhaoUnsubmitted

Done

This comment also needs to be updated to say when 32-bit is used.

gbalatsAuthorUnsubmitted

Done

Added comment.


stephan.yichao.zhaoUnsubmitted

Done

64 -> 64 or 32


// shadow is non-equal.

// any shadow is non-equal.

BasicBlock *FallbackBB = BasicBlock::Create(*DFS.Ctx, "", F);

IRBuilder<> FallbackIRB(FallbackBB);

CallInst *FallbackCall = FallbackIRB.CreateCall(

DFS.DFSanUnionLoadFn, {ShadowAddr, ConstantInt::get(DFS.IntptrTy, Size)});

FallbackCall->addAttribute(AttributeList::ReturnIndex, Attribute::ZExt);

const uint64_t ShadowSize = Size * DFS.ShadowWidthBytes;

stephan.yichao.zhaoUnsubmitted

Done

const


assert(Size >= 4 && "Not large enough load size for fast path!");

// Same as in loadFast16ShadowFast. In the case of load32, we can fit the

// wide shadow in a 32-bit integer instead.

Type *WideShadowTy =

ShadowSize == 4 ? Type::getInt32Ty(*DFS.Ctx) : Type::getInt64Ty(*DFS.Ctx);

// Compare each of the shadows stored in the loaded 64 bits to each other,

// by computing (WideShadow rotl ShadowWidthBits) == WideShadow.

IRBuilder<> IRB(Pos);

Value *WideAddr =

unsigned WideShadowBitWidth = WideShadowTy->getIntegerBitWidth();

IRB.CreateBitCast(ShadowAddr, Type::getInt64PtrTy(*DFS.Ctx));

Value *WideAddr = IRB.CreateBitCast(ShadowAddr, WideShadowTy->getPointerTo());

Value *WideShadow =

IRB.CreateAlignedLoad(IRB.getInt64Ty(), WideAddr, ShadowAlign);

IRB.CreateAlignedLoad(WideShadowTy, WideAddr, ShadowAlign);

Value *TruncShadow = IRB.CreateTrunc(WideShadow, DFS.PrimitiveShadowTy);

Value *ShlShadow = IRB.CreateShl(WideShadow, DFS.ShadowWidthBits);

Value *ShrShadow = IRB.CreateLShr(WideShadow, 64 - DFS.ShadowWidthBits);

Value *ShrShadow =

IRB.CreateLShr(WideShadow, WideShadowBitWidth - DFS.ShadowWidthBits);

Value *RotShadow = IRB.CreateOr(ShlShadow, ShrShadow);

Value *ShadowsEq = IRB.CreateICmpEQ(WideShadow, RotShadow);

BasicBlock *Head = Pos->getParent();

BasicBlock *Tail = Head->splitBasicBlock(Pos->getIterator());

if (DomTreeNode *OldNode = DT.getNode(Head)) {

std::vector<DomTreeNode *> Children(OldNode->begin(), OldNode->end());

DomTreeNode *NewNode = DT.addNewBlock(Tail, Head);

for (auto *Child : Children)

DT.changeImmediateDominator(Child, NewNode);

}

// In the following code LastBr will refer to the previous basic block's

// conditional branch instruction, whose true successor is fixed up to point

// to the next block during the loop below or to the tail after the final

// iteration.

BranchInst *LastBr = BranchInst::Create(FallbackBB, FallbackBB, ShadowsEq);

ReplaceInstWithInst(Head->getTerminator(), LastBr);

DT.addNewBlock(FallbackBB, Head);

for (uint64_t Ofs = 64 / DFS.ShadowWidthBits; Ofs != Size;

const uint64_t BytesPerWideShadow = WideShadowBitWidth / DFS.ShadowWidthBits;

Ofs += 64 / DFS.ShadowWidthBits) {

for (uint64_t ByteOfs = BytesPerWideShadow; ByteOfs < Size;

ByteOfs += BytesPerWideShadow) {

BasicBlock *NextBB = BasicBlock::Create(*DFS.Ctx, "", F);

DT.addNewBlock(NextBB, LastBr->getParent());

IRBuilder<> NextIRB(NextBB);

WideAddr = NextIRB.CreateGEP(Type::getInt64Ty(*DFS.Ctx), WideAddr,

WideAddr = NextIRB.CreateGEP(WideShadowTy, WideAddr,

ConstantInt::get(DFS.IntptrTy, 1));

Value *NextWideShadow =

NextIRB.CreateAlignedLoad(NextIRB.getInt64Ty(), WideAddr, ShadowAlign);

NextIRB.CreateAlignedLoad(WideShadowTy, WideAddr, ShadowAlign);

ShadowsEq = NextIRB.CreateICmpEQ(WideShadow, NextWideShadow);

LastBr->setSuccessor(0, NextBB);

LastBr = NextIRB.CreateCondBr(ShadowsEq, FallbackBB, FallbackBB);

}

LastBr->setSuccessor(0, Tail);

FallbackIRB.CreateBr(Tail);

PHINode *Shadow =

▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines
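The equality test used by loadLegacyShadowFast, (WideShadow rotl ShadowWidthBits) == WideShadow, works because a value that is unchanged by rotating one slot must have identical contents in every slot. A minimal sketch for the 64-bit case, with the hypothetical name `allShadowsEqual`:

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of the rotate-and-compare check on a 64-bit wide
// shadow. Assumes 0 < ShadowBits < 64 and that ShadowBits divides 64.
inline bool allShadowsEqual(uint64_t Wide, unsigned ShadowBits) {
  uint64_t Rot = (Wide << ShadowBits) | (Wide >> (64 - ShadowBits));
  return Rot == Wide;
}
```

The 32-bit variant introduced in this diff follows the same pattern with the shift amount derived from the wide shadow's bit width instead of the constant 64.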

std::pair<Value *, Value *> DFSanFunction::loadShadowOrigin(Value *Addr,

const Align ShadowAlign = getShadowAlign(InstAlignment);

const Align OriginAlign = getOriginAlign(InstAlignment);

Value *Origin = nullptr;

if (ShouldTrackOrigins) {

IRBuilder<> IRB(Pos);

Origin = IRB.CreateAlignedLoad(DFS.OriginTy, OriginAddr, OriginAlign);

}

// When the byte size is small enough, we can load the shadow directly with

stephan.yichao.zhaoUnsubmitted

Done

something like "load shadow directly without optimizing instrumentation".


// just a few instructions.

switch (Size) {

case 1: {

LoadInst *LI = new LoadInst(DFS.PrimitiveShadowTy, ShadowAddr, "", Pos);

LI->setAlignment(ShadowAlign);

return {LI, Origin};

}

case 2: {

IRBuilder<> IRB(Pos);

Value *ShadowAddr1 = IRB.CreateGEP(DFS.PrimitiveShadowTy, ShadowAddr,

ConstantInt::get(DFS.IntptrTy, 1));

Value *Load =

IRB.CreateAlignedLoad(DFS.PrimitiveShadowTy, ShadowAddr, ShadowAlign);

Value *Load1 =

IRB.CreateAlignedLoad(DFS.PrimitiveShadowTy, ShadowAddr1, ShadowAlign);

return {combineShadows(Load, Load1, Pos), Origin};

}

uint64_t ShadowSize = Size * DFS.ShadowWidthBytes;

bool HasSizeForFastPath = ShadowSize % 8 == 0 || ShadowSize == 4;

bool HasFastLabelsEnabled = DFS.hasFastLabelsEnabled();

if (ClFast16Labels && Size % (64 / DFS.ShadowWidthBits) == 0)

if (HasFastLabelsEnabled && HasSizeForFastPath)

return loadFast16ShadowFast(ShadowAddr, OriginAddr, Size, ShadowAlign,

OriginAlign, Origin, Pos);

if (!AvoidNewBlocks && Size % (64 / DFS.ShadowWidthBits) == 0)

if (!AvoidNewBlocks && HasSizeForFastPath)

return {loadLegacyShadowFast(ShadowAddr, Size, ShadowAlign, Pos), Origin};

IRBuilder<> IRB(Pos);

FunctionCallee &UnionLoadFn =

FunctionCallee &UnionLoadFn = HasFastLabelsEnabled

ClFast16Labels ? DFS.DFSanUnionLoadFast16LabelsFn : DFS.DFSanUnionLoadFn;

? DFS.DFSanUnionLoadFastLabelsFn

: DFS.DFSanUnionLoadFn;

CallInst *FallbackCall = IRB.CreateCall(

UnionLoadFn, {ShadowAddr, ConstantInt::get(DFS.IntptrTy, Size)});

FallbackCall->addAttribute(AttributeList::ReturnIndex, Attribute::ZExt);

return {FallbackCall, Origin};

}
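The path selection in the new version of loadShadowOrigin can be summarized as a small decision function. This sketch follows the added lines of the diff (those using HasSizeForFastPath and hasFastLabelsEnabled); the enum and function names are hypothetical, and any handling of sizes in the elided part of the function is not modeled.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical labels for the four ways the shadow can be loaded.
enum class LoadPath { Direct, FastLabels, LegacyFast, RuntimeCall };

inline LoadPath chooseLoadPath(uint64_t Size, unsigned ShadowWidthBytes,
                               bool FastLabels, bool AvoidNewBlocks) {
  // Sizes 1 and 2 are loaded directly with a few instructions.
  if (Size == 1 || Size == 2)
    return LoadPath::Direct;
  // The wide-load paths need a shadow size of 4 bytes or a multiple of 8.
  uint64_t ShadowSize = Size * ShadowWidthBytes;
  bool HasSizeForFastPath = ShadowSize % 8 == 0 || ShadowSize == 4;
  if (FastLabels && HasSizeForFastPath)
    return LoadPath::FastLabels;
  if (!AvoidNewBlocks && HasSizeForFastPath)
    return LoadPath::LegacyFast;
  // Everything else falls back to a runtime union-load call.
  return LoadPath::RuntimeCall;
}
```

For example, a 4-byte load qualifies in both modes (shadow size 4 in fast-8, 8 in fast-16), while a 12-byte load in fast-8 mode falls back to the runtime call.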

static AtomicOrdering addAcquireOrdering(AtomicOrdering AO) {

switch (AO) {

▲ Show 20 Lines • Show All 204 Lines • ▼ Show 20 Lines

if (DFS.isZeroShadow(PrimitiveShadow)) {

return;

}

IRBuilder<> IRB(Pos);

Value *ShadowAddr, *OriginAddr;

std::tie(ShadowAddr, OriginAddr) =

DFS.getShadowOriginAddress(Addr, InstAlignment, Pos);

const unsigned ShadowVecSize = 128 / DFS.ShadowWidthBits;

const unsigned ShadowVecSize = 8;

assert(ShadowVecSize * DFS.ShadowWidthBits <= 128 &&

"Shadow vector is too large!");

stephan.yichao.zhao: nit: if assert is compiled out in an optimized build, ShadowVecBitSize may be reported as an unused variable. In that case it can be inlined into the assert.

uint64_t Offset = 0;

uint64_t LeftSize = Size;

if (LeftSize >= ShadowVecSize) {

auto *ShadowVecTy =

stephan.yichao.zhao: The assert prevents using ShadowBytes > 2, but not every number less than 128 works. Maybe calculating ShadowVecSize = 8bitmode ? 64/ShadowWidthBits : 128/ShadowWidthBits; makes this easier to follow.

gbalats: I'm not sure why the suggestion makes this easier to follow. In my opinion it only helps if you know what the code was before. I think the original intention was that the vector type should fit in 128 bits regardless of the mode being used, which is what this assertion is trying to enforce. Removed the assertion to support ShadowBytes > 2.

stephan.yichao.zhao: My first comment was confusing. It was not suggesting support for ShadowBytes > 2. The first version, with the assertion (ShadowBytes * ShadowVecSize <= 128), is better, because with it the code can only use a 64-bit or 128-bit vector. Please add that assertion back. Sorry for the confusing suggestion.

gbalats: Re-added the assertion.

FixedVectorType::get(DFS.PrimitiveShadowTy, ShadowVecSize);

Value *ShadowVec = UndefValue::get(ShadowVecTy);

for (unsigned I = 0; I != ShadowVecSize; ++I) {

ShadowVec = IRB.CreateInsertElement(

ShadowVec, PrimitiveShadow,

ConstantInt::get(Type::getInt32Ty(*DFS.Ctx), I));

}

Value *ShadowVecAddr =

[762 lines not shown]
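The assertion discussed in the thread above, and the split of a shadow store into vector and scalar parts, can be checked with a small standalone sketch. `fitsIn128Bits`, `StorePlan`, and `planShadowStores` are hypothetical names. The split into `Size / ShadowVecSize` vector stores plus a scalar remainder is an assumption about the elided loop, chosen to agree with the `store_array17` test expectations (two `<8 x iN>` stores, then one scalar store at index 16).

```cpp
#include <cassert>
#include <cstdint>

// Number of primitive shadow elements per vector store, as on the new side
// of the hunk.
constexpr unsigned ShadowVecSize = 8;

// The re-added assertion: the shadow vector must fit in 128 bits.
constexpr bool fitsIn128Bits(unsigned ShadowWidthBits) {
  return ShadowVecSize * ShadowWidthBits <= 128;
}

static_assert(fitsIn128Bits(8), "8-bit shadows give a 64-bit vector");
static_assert(fitsIn128Bits(16), "16-bit shadows give a 128-bit vector");
static_assert(!fitsIn128Bits(32), "wider shadows would trip the assertion");

// Hypothetical summary of how a store of Size shadow elements is emitted.
struct StorePlan {
  uint64_t VectorStores; // full <ShadowVecSize x iN> stores
  uint64_t ScalarStores; // leftover element-by-element stores
};

// Assumed split: as many full vectors as fit, then scalars for the rest.
constexpr StorePlan planShadowStores(uint64_t Size) {
  return {Size / ShadowVecSize, Size % ShadowVecSize};
}
```

With a fixed `ShadowVecSize` of 8, the only vector widths that pass the check are 64 bits (8-bit shadows) and 128 bits (16-bit shadows), which is the property the reviewer asked to keep.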

llvm/test/Instrumentation/DataFlowSanitizer/abilist.ll

	; RUN: opt < %s -dfsan -dfsan-args-abi -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s			; RUN: opt < %s -dfsan -dfsan-args-abi -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s
				; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-args-abi -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-args-abi -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]			; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
	; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]			; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]

	; CHECK: i32 @discard(i32 %a, i32 %b)			; CHECK: i32 @discard(i32 %a, i32 %b)
	define i32 @discard(i32 %a, i32 %b) {			define i32 @discard(i32 %a, i32 %b) {
	[95 lines not shown]

llvm/test/Instrumentation/DataFlowSanitizer/abilist_aggregate.ll

	; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s --check-prefixes=CHECK,TLS_ABI			; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s --check-prefixes=CHECK,TLS_ABI
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s --check-prefixes=CHECK,TLS_ABI
	; RUN: opt < %s -dfsan -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s --check-prefixes=CHECK,LEGACY			; RUN: opt < %s -dfsan -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s --check-prefixes=CHECK,LEGACY
	; RUN: opt < %s -dfsan -dfsan-args-abi -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s --check-prefixes=CHECK,ARGS_ABI			; RUN: opt < %s -dfsan -dfsan-args-abi -dfsan-abilist=%S/Inputs/abilist.txt -S | FileCheck %s --check-prefixes=CHECK,ARGS_ABI
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]			; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
	; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]			; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]

	[289 lines not shown]

llvm/test/Instrumentation/DataFlowSanitizer/array.ll

; RUN: opt < %s -dfsan -S | FileCheck %s --check-prefixes=CHECK,LEGACY		; RUN: opt < %s -dfsan -S | FileCheck %s --check-prefixes=CHECK,LEGACY
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-event-callbacks=true -S | FileCheck %s --check-prefixes=CHECK,EVENT_CALLBACKS		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-event-callbacks=true -S | FileCheck %s --check-prefixes=CHECK,EVENT_CALLBACKS
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-event-callbacks=true -S | FileCheck %s --check-prefixes=CHECK,EVENT_CALLBACKS
; RUN: opt < %s -dfsan -dfsan-args-abi -S | FileCheck %s --check-prefixes=CHECK,ARGS_ABI		; RUN: opt < %s -dfsan -dfsan-args-abi -S | FileCheck %s --check-prefixes=CHECK,ARGS_ABI
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S | FileCheck %s --check-prefixes=CHECK,FAST16		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S | FileCheck %s --check-prefixes=CHECK,FAST
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-combine-pointer-labels-on-load=false -S | FileCheck %s --check-prefixes=CHECK,NO_COMBINE_LOAD_PTR		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-combine-pointer-labels-on-load=false -S | FileCheck %s --check-prefixes=CHECK,NO_COMBINE_LOAD_PTR
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-combine-pointer-labels-on-store=true -S | FileCheck %s --check-prefixes=CHECK,COMBINE_STORE_PTR		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-combine-pointer-labels-on-store=true -S | FileCheck %s --check-prefixes=CHECK,COMBINE_STORE_PTR
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-debug-nonzero-labels -S | FileCheck %s --check-prefixes=CHECK,DEBUG_NONZERO_LABELS		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-debug-nonzero-labels -S | FileCheck %s --check-prefixes=CHECK,DEBUG_NONZERO_LABELS
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -S | FileCheck %s --check-prefixes=CHECK,FAST
		stephan.yichao.zhao: renamed to FAST?
		gbalats: Renamed.
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-combine-pointer-labels-on-load=false -S | FileCheck %s --check-prefixes=CHECK,NO_COMBINE_LOAD_PTR
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-combine-pointer-labels-on-store=true -S | FileCheck %s --check-prefixes=CHECK,COMBINE_STORE_PTR
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-debug-nonzero-labels -S | FileCheck %s --check-prefixes=CHECK,DEBUG_NONZERO_LABELS
target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"		target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"		target triple = "x86_64-unknown-linux-gnu"

; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]		; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]
; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]		; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]
; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]		; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]		; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]

[69 lines not shown]	define [1 x i1] @load_array1([1 x i1]* %p) {
; NO_COMBINE_LOAD_PTR: [[L:%.*]] = load i[[#SBITS]],		; NO_COMBINE_LOAD_PTR: [[L:%.*]] = load i[[#SBITS]],
; NO_COMBINE_LOAD_PTR: [[S:%.*]] = insertvalue [1 x i[[#SBITS]]] undef, i[[#SBITS]] [[L]], 0		; NO_COMBINE_LOAD_PTR: [[S:%.*]] = insertvalue [1 x i[[#SBITS]]] undef, i[[#SBITS]] [[L]], 0
; NO_COMBINE_LOAD_PTR: store [1 x i[[#SBITS]]] [[S]], [1 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [1 x i[[#SBITS]]]*), align 2		; NO_COMBINE_LOAD_PTR: store [1 x i[[#SBITS]]] [[S]], [1 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [1 x i[[#SBITS]]]*), align 2

; EVENT_CALLBACKS: @"dfs$load_array1"		; EVENT_CALLBACKS: @"dfs$load_array1"
; EVENT_CALLBACKS: [[L:%.*]] = or i[[#SBITS]]		; EVENT_CALLBACKS: [[L:%.*]] = or i[[#SBITS]]
; EVENT_CALLBACKS: call void @__dfsan_load_callback(i[[#SBITS]] [[L]], i8* {{.*}})		; EVENT_CALLBACKS: call void @__dfsan_load_callback(i[[#SBITS]] [[L]], i8* {{.*}})

; FAST16: @"dfs$load_array1"		; FAST: @"dfs$load_array1"
; FAST16: [[P:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]		; FAST: [[P:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
; FAST16: [[L:%.*]] = load i[[#SBITS]], i[[#SBITS]]* {{.*}}, align [[#SBYTES]]		; FAST: [[L:%.*]] = load i[[#SBITS]], i[[#SBITS]]* {{.*}}, align [[#SBYTES]]
; FAST16: [[U:%.*]] = or i[[#SBITS]] [[L]], [[P]]		; FAST: [[U:%.*]] = or i[[#SBITS]] [[L]], [[P]]
; FAST16: [[S1:%.*]] = insertvalue [1 x i[[#SBITS]]] undef, i[[#SBITS]] [[U]], 0		; FAST: [[S1:%.*]] = insertvalue [1 x i[[#SBITS]]] undef, i[[#SBITS]] [[U]], 0
; FAST16: store [1 x i[[#SBITS]]] [[S1]], [1 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [1 x i[[#SBITS]]]*), align [[ALIGN]]		; FAST: store [1 x i[[#SBITS]]] [[S1]], [1 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [1 x i[[#SBITS]]]*), align [[ALIGN]]

; LEGACY: @"dfs$load_array1"		; LEGACY: @"dfs$load_array1"
; LEGACY: [[P:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]		; LEGACY: [[P:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
; LEGACY: [[L:%.*]] = load i[[#SBITS]], i[[#SBITS]]* {{.*}}, align [[#SBYTES]]		; LEGACY: [[L:%.*]] = load i[[#SBITS]], i[[#SBITS]]* {{.*}}, align [[#SBYTES]]
; LEGACY: [[U:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union(i[[#SBITS]] zeroext [[L]], i[[#SBITS]] zeroext [[P]])		; LEGACY: [[U:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union(i[[#SBITS]] zeroext [[L]], i[[#SBITS]] zeroext [[P]])
; LEGACY: [[PH:%.*]] = phi i[[#SBITS]] [ [[U]], {{.*}} ], [ [[L]], {{.*}} ]		; LEGACY: [[PH:%.*]] = phi i[[#SBITS]] [ [[U]], {{.*}} ], [ [[L]], {{.*}} ]
; LEGACY: store i[[#SBITS]] [[PH]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; LEGACY: store i[[#SBITS]] [[PH]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]

[11 lines not shown]	define [2 x i1] @load_array2([2 x i1]* %p) {
; NO_COMBINE_LOAD_PTR: [[S2:%.*]] = insertvalue [2 x i[[#SBITS]]] [[S1]], i[[#SBITS]] [[U]], 1		; NO_COMBINE_LOAD_PTR: [[S2:%.*]] = insertvalue [2 x i[[#SBITS]]] [[S1]], i[[#SBITS]] [[U]], 1
; NO_COMBINE_LOAD_PTR: store [2 x i[[#SBITS]]] [[S2]], [2 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [2 x i[[#SBITS]]]*), align [[ALIGN:2]]		; NO_COMBINE_LOAD_PTR: store [2 x i[[#SBITS]]] [[S2]], [2 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [2 x i[[#SBITS]]]*), align [[ALIGN:2]]

; EVENT_CALLBACKS: @"dfs$load_array2"		; EVENT_CALLBACKS: @"dfs$load_array2"
; EVENT_CALLBACKS: [[O1:%.*]] = or i[[#SBITS]]		; EVENT_CALLBACKS: [[O1:%.*]] = or i[[#SBITS]]
; EVENT_CALLBACKS: [[O2:%.*]] = or i[[#SBITS]] [[O1]]		; EVENT_CALLBACKS: [[O2:%.*]] = or i[[#SBITS]] [[O1]]
; EVENT_CALLBACKS: call void @__dfsan_load_callback(i[[#SBITS]] [[O2]], i8* {{.*}})		; EVENT_CALLBACKS: call void @__dfsan_load_callback(i[[#SBITS]] [[O2]], i8* {{.*}})

; FAST16: @"dfs$load_array2"		; FAST: @"dfs$load_array2"
; FAST16: [[P:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]		; FAST: [[P:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
; FAST16: [[O:%.*]] = or i[[#SBITS]]		; FAST: [[O:%.*]] = or i[[#SBITS]]
; FAST16: [[U:%.*]] = or i[[#SBITS]] [[O]], [[P]]		; FAST: [[U:%.*]] = or i[[#SBITS]] [[O]], [[P]]
; FAST16: [[S:%.*]] = insertvalue [2 x i[[#SBITS]]] undef, i[[#SBITS]] [[U]], 0		; FAST: [[S:%.*]] = insertvalue [2 x i[[#SBITS]]] undef, i[[#SBITS]] [[U]], 0
; FAST16: [[S1:%.*]] = insertvalue [2 x i[[#SBITS]]] [[S]], i[[#SBITS]] [[U]], 1		; FAST: [[S1:%.*]] = insertvalue [2 x i[[#SBITS]]] [[S]], i[[#SBITS]] [[U]], 1
; FAST16: store [2 x i[[#SBITS]]] [[S1]], [2 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [2 x i[[#SBITS]]]*), align [[ALIGN]]		; FAST: store [2 x i[[#SBITS]]] [[S1]], [2 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [2 x i[[#SBITS]]]*), align [[ALIGN]]
%a = load [2 x i1], [2 x i1]* %p		%a = load [2 x i1], [2 x i1]* %p
ret [2 x i1] %a		ret [2 x i1] %a
}		}

define [4 x i1] @load_array4([4 x i1]* %p) {		define [4 x i1] @load_array4([4 x i1]* %p) {
; NO_COMBINE_LOAD_PTR: @"dfs$load_array4"		; NO_COMBINE_LOAD_PTR: @"dfs$load_array4"
; NO_COMBINE_LOAD_PTR: [[T:%.*]] = trunc i[[#mul(4, SBITS)]] {{.*}} to i[[#SBITS]]		; NO_COMBINE_LOAD_PTR: [[T:%.*]] = trunc i[[#mul(4, SBITS)]] {{.*}} to i[[#SBITS]]
; NO_COMBINE_LOAD_PTR: [[S1:%.*]] = insertvalue [4 x i[[#SBITS]]] undef, i[[#SBITS]] [[T]], 0		; NO_COMBINE_LOAD_PTR: [[S1:%.*]] = insertvalue [4 x i[[#SBITS]]] undef, i[[#SBITS]] [[T]], 0
; NO_COMBINE_LOAD_PTR: [[S2:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S1]], i[[#SBITS]] [[T]], 1		; NO_COMBINE_LOAD_PTR: [[S2:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S1]], i[[#SBITS]] [[T]], 1
; NO_COMBINE_LOAD_PTR: [[S3:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S2]], i[[#SBITS]] [[T]], 2		; NO_COMBINE_LOAD_PTR: [[S3:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S2]], i[[#SBITS]] [[T]], 2
; NO_COMBINE_LOAD_PTR: [[S4:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S3]], i[[#SBITS]] [[T]], 3		; NO_COMBINE_LOAD_PTR: [[S4:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S3]], i[[#SBITS]] [[T]], 3
; NO_COMBINE_LOAD_PTR: store [4 x i[[#SBITS]]] [[S4]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align 2		; NO_COMBINE_LOAD_PTR: store [4 x i[[#SBITS]]] [[S4]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align 2

; EVENT_CALLBACKS: @"dfs$load_array4"		; EVENT_CALLBACKS: @"dfs$load_array4"
; EVENT_CALLBACKS: [[O0:%.*]] = or i[[#mul(4, SBITS)]]		; EVENT_CALLBACKS: [[O0:%.*]] = or i[[#mul(4, SBITS)]]
; EVENT_CALLBACKS: [[O1:%.*]] = or i[[#mul(4, SBITS)]] [[O0]]		; EVENT_CALLBACKS: [[O1:%.*]] = or i[[#mul(4, SBITS)]] [[O0]]
; EVENT_CALLBACKS: [[O2:%.*]] = trunc i[[#mul(4, SBITS)]] [[O1]] to i[[#SBITS]]		; EVENT_CALLBACKS: [[O2:%.*]] = trunc i[[#mul(4, SBITS)]] [[O1]] to i[[#SBITS]]
; EVENT_CALLBACKS: [[O3:%.*]] = or i[[#SBITS]] [[O2]]		; EVENT_CALLBACKS: [[O3:%.*]] = or i[[#SBITS]] [[O2]]
; EVENT_CALLBACKS: call void @__dfsan_load_callback(i[[#SBITS]] [[O3]], i8* {{.*}})		; EVENT_CALLBACKS: call void @__dfsan_load_callback(i[[#SBITS]] [[O3]], i8* {{.*}})

; FAST16: @"dfs$load_array4"		; FAST: @"dfs$load_array4"
; FAST16: [[T:%.*]] = trunc i[[#mul(4, SBITS)]] {{.*}} to i[[#SBITS]]		; FAST: [[T:%.*]] = trunc i[[#mul(4, SBITS)]] {{.*}} to i[[#SBITS]]
; FAST16: [[O:%.*]] = or i[[#SBITS]] [[T]]		; FAST: [[O:%.*]] = or i[[#SBITS]] [[T]]
; FAST16: [[S1:%.*]] = insertvalue [4 x i[[#SBITS]]] undef, i[[#SBITS]] [[O]], 0		; FAST: [[S1:%.*]] = insertvalue [4 x i[[#SBITS]]] undef, i[[#SBITS]] [[O]], 0
; FAST16: [[S2:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S1]], i[[#SBITS]] [[O]], 1		; FAST: [[S2:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S1]], i[[#SBITS]] [[O]], 1
; FAST16: [[S3:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S2]], i[[#SBITS]] [[O]], 2		; FAST: [[S3:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S2]], i[[#SBITS]] [[O]], 2
; FAST16: [[S4:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S3]], i[[#SBITS]] [[O]], 3		; FAST: [[S4:%.*]] = insertvalue [4 x i[[#SBITS]]] [[S3]], i[[#SBITS]] [[O]], 3
; FAST16: store [4 x i[[#SBITS]]] [[S4]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align 2		; FAST: store [4 x i[[#SBITS]]] [[S4]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align 2

; LEGACY: @"dfs$load_array4"		; LEGACY: @"dfs$load_array4"
; LEGACY: [[P:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]		; LEGACY: [[P:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
; LEGACY: [[PH1:%.*]] = phi i[[#SBITS]]		; LEGACY: [[PH1:%.*]] = phi i[[#SBITS]]
; LEGACY: [[U:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union(i[[#SBITS]] zeroext [[PH1]], i[[#SBITS]] zeroext [[P]])		; LEGACY: [[U:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union(i[[#SBITS]] zeroext [[PH1]], i[[#SBITS]] zeroext [[P]])
; LEGACY: [[PH:%.*]] = phi i[[#SBITS]] [ [[U]], {{.*}} ], [ [[PH1]], {{.*}} ]		; LEGACY: [[PH:%.*]] = phi i[[#SBITS]] [ [[U]], {{.*}} ], [ [[PH1]], {{.*}} ]
; LEGACY: store i[[#SBITS]] [[PH]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; LEGACY: store i[[#SBITS]] [[PH]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]
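The `EVENT_CALLBACKS` checks for `load_array4` expect the four element shadows to be handled as one `i[[#mul(4, SBITS)]]` value that is OR-ed twice on the wide type and then truncated to `i[[#SBITS]]`. A minimal sketch of that folding for 16-bit shadows follows. `foldWideShadow16` is a hypothetical name, and the shift amounts are an assumption, since the check lines do not show the shift instructions.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: fold four 16-bit labels packed in one 64-bit word
// into a single 16-bit label. Each step ORs the upper half of the remaining
// bits onto the lower half, so two steps reduce 64 bits to 16.
constexpr uint16_t foldWideShadow16(uint64_t Wide) {
  return static_cast<uint16_t>((Wide | (Wide >> 32)) |
                               ((Wide | (Wide >> 32)) >> 16));
}

// Four single-bit labels packed into one word fold to their union.
static_assert(foldWideShadow16(0x0001000200040008ULL) == 0x000F, "");
// An all-zero shadow stays zero.
static_assert(foldWideShadow16(0) == 0, "");
```

Two OR steps are enough because the word holds four labels; this matches the two wide `or` lines before the `trunc` in the checks.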

[17 lines not shown]	define [4 x i1] @insert_array([4 x i1] %a, i1 %e2) {
; NO_COMBINE_LOAD_PTR: [[AM:%.*]] = load [4 x i[[#SBITS]]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]		; NO_COMBINE_LOAD_PTR: [[AM:%.*]] = load [4 x i[[#SBITS]]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]
; NO_COMBINE_LOAD_PTR: [[AM1:%.*]] = insertvalue [4 x i[[#SBITS]]] [[AM]], i[[#SBITS]] [[EM]], 0		; NO_COMBINE_LOAD_PTR: [[AM1:%.*]] = insertvalue [4 x i[[#SBITS]]] [[AM]], i[[#SBITS]] [[EM]], 0
; NO_COMBINE_LOAD_PTR: store [4 x i[[#SBITS]]] [[AM1]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]		; NO_COMBINE_LOAD_PTR: store [4 x i[[#SBITS]]] [[AM1]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]
%a1 = insertvalue [4 x i1] %a, i1 %e2, 0		%a1 = insertvalue [4 x i1] %a, i1 %e2, 0
ret [4 x i1] %a1		ret [4 x i1] %a1
}		}

define void @store_alloca_array([4 x i1] %a) {		define void @store_alloca_array([4 x i1] %a) {
; FAST16: @"dfs$store_alloca_array"		; FAST: @"dfs$store_alloca_array"
; FAST16: [[S:%.*]] = load [4 x i[[#SBITS]]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [4 x i[[#SBITS]]]*), align [[ALIGN:2]]		; FAST: [[S:%.*]] = load [4 x i[[#SBITS]]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [4 x i[[#SBITS]]]*), align [[ALIGN:2]]
; FAST16: [[SP:%.*]] = alloca i[[#SBITS]], align [[#SBYTES]]		; FAST: [[SP:%.*]] = alloca i[[#SBITS]], align [[#SBYTES]]
; FAST16: [[E0:%.*]] = extractvalue [4 x i[[#SBITS]]] [[S]], 0		; FAST: [[E0:%.*]] = extractvalue [4 x i[[#SBITS]]] [[S]], 0
; FAST16: [[E1:%.*]] = extractvalue [4 x i[[#SBITS]]] [[S]], 1		; FAST: [[E1:%.*]] = extractvalue [4 x i[[#SBITS]]] [[S]], 1
; FAST16: [[E01:%.*]] = or i[[#SBITS]] [[E0]], [[E1]]		; FAST: [[E01:%.*]] = or i[[#SBITS]] [[E0]], [[E1]]
; FAST16: [[E2:%.*]] = extractvalue [4 x i[[#SBITS]]] [[S]], 2		; FAST: [[E2:%.*]] = extractvalue [4 x i[[#SBITS]]] [[S]], 2
; FAST16: [[E012:%.*]] = or i[[#SBITS]] [[E01]], [[E2]]		; FAST: [[E012:%.*]] = or i[[#SBITS]] [[E01]], [[E2]]
; FAST16: [[E3:%.*]] = extractvalue [4 x i[[#SBITS]]] [[S]], 3		; FAST: [[E3:%.*]] = extractvalue [4 x i[[#SBITS]]] [[S]], 3
; FAST16: [[E0123:%.*]] = or i[[#SBITS]] [[E012]], [[E3]]		; FAST: [[E0123:%.*]] = or i[[#SBITS]] [[E012]], [[E3]]
; FAST16: store i[[#SBITS]] [[E0123]], i[[#SBITS]]* [[SP]], align [[#SBYTES]]		; FAST: store i[[#SBITS]] [[E0123]], i[[#SBITS]]* [[SP]], align [[#SBYTES]]
%p = alloca [4 x i1]		%p = alloca [4 x i1]
store [4 x i1] %a, [4 x i1]* %p		store [4 x i1] %a, [4 x i1]* %p
ret void		ret void
}		}

define void @store_zero_array([4 x i1]* %p) {		define void @store_zero_array([4 x i1]* %p) {
; FAST16: @"dfs$store_zero_array"		; FAST: @"dfs$store_zero_array"
; FAST16: store i[[#mul(4, SBITS)]] 0, i[[#mul(4, SBITS)]]* {{.*}}		; FAST: store i[[#mul(4, SBITS)]] 0, i[[#mul(4, SBITS)]]* {{.*}}
store [4 x i1] zeroinitializer, [4 x i1]* %p		store [4 x i1] zeroinitializer, [4 x i1]* %p
ret void		ret void
}		}

define void @store_array2([2 x i1] %a, [2 x i1]* %p) {		define void @store_array2([2 x i1] %a, [2 x i1]* %p) {
; LEGACY: @"dfs$store_array2"		; LEGACY: @"dfs$store_array2"
; LEGACY: [[S:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]		; LEGACY: [[S:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
; LEGACY: [[SP0:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[SP:%.*]], i32 0		; LEGACY: [[SP0:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[SP:%.*]], i32 0
; LEGACY: store i[[#SBITS]] [[S]], i[[#SBITS]]* [[SP0]], align [[#SBYTES]]		; LEGACY: store i[[#SBITS]] [[S]], i[[#SBITS]]* [[SP0]], align [[#SBYTES]]
; LEGACY: [[SP1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[SP]], i32 1		; LEGACY: [[SP1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[SP]], i32 1
; LEGACY: store i[[#SBITS]] [[S]], i[[#SBITS]]* [[SP1]], align [[#SBYTES]]		; LEGACY: store i[[#SBITS]] [[S]], i[[#SBITS]]* [[SP1]], align [[#SBYTES]]

; EVENT_CALLBACKS: @"dfs$store_array2"		; EVENT_CALLBACKS: @"dfs$store_array2"
; EVENT_CALLBACKS: [[E12:%.*]] = or i[[#SBITS]]		; EVENT_CALLBACKS: [[E12:%.*]] = or i[[#SBITS]]
; EVENT_CALLBACKS: [[P:%.*]] = bitcast [2 x i1]* %p to i8*		; EVENT_CALLBACKS: [[P:%.*]] = bitcast [2 x i1]* %p to i8*
; EVENT_CALLBACKS: call void @__dfsan_store_callback(i[[#SBITS]] [[E12]], i8* [[P]])		; EVENT_CALLBACKS: call void @__dfsan_store_callback(i[[#SBITS]] [[E12]], i8* [[P]])

; FAST16: @"dfs$store_array2"		; FAST: @"dfs$store_array2"
; FAST16: [[S:%.*]] = load [2 x i[[#SBITS]]], [2 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [2 x i[[#SBITS]]]*), align [[ALIGN:2]]		; FAST: [[S:%.*]] = load [2 x i[[#SBITS]]], [2 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [2 x i[[#SBITS]]]*), align [[ALIGN:2]]
; FAST16: [[E1:%.*]] = extractvalue [2 x i[[#SBITS]]] [[S]], 0		; FAST: [[E1:%.*]] = extractvalue [2 x i[[#SBITS]]] [[S]], 0
; FAST16: [[E2:%.*]] = extractvalue [2 x i[[#SBITS]]] [[S]], 1		; FAST: [[E2:%.*]] = extractvalue [2 x i[[#SBITS]]] [[S]], 1
; FAST16: [[E12:%.*]] = or i[[#SBITS]] [[E1]], [[E2]]		; FAST: [[E12:%.*]] = or i[[#SBITS]] [[E1]], [[E2]]
; FAST16: [[SP0:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[SP:%.*]], i32 0		; FAST: [[SP0:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[SP:%.*]], i32 0
; FAST16: store i[[#SBITS]] [[E12]], i[[#SBITS]]* [[SP0]], align [[#SBYTES]]		; FAST: store i[[#SBITS]] [[E12]], i[[#SBITS]]* [[SP0]], align [[#SBYTES]]
; FAST16: [[SP1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[SP]], i32 1		; FAST: [[SP1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[SP]], i32 1
; FAST16: store i[[#SBITS]] [[E12]], i[[#SBITS]]* [[SP1]], align [[#SBYTES]]		; FAST: store i[[#SBITS]] [[E12]], i[[#SBITS]]* [[SP1]], align [[#SBYTES]]

; COMBINE_STORE_PTR: @"dfs$store_array2"		; COMBINE_STORE_PTR: @"dfs$store_array2"
; COMBINE_STORE_PTR: [[O:%.*]] = or i[[#SBITS]]		; COMBINE_STORE_PTR: [[O:%.*]] = or i[[#SBITS]]
; COMBINE_STORE_PTR: [[U:%.*]] = or i[[#SBITS]] [[O]]		; COMBINE_STORE_PTR: [[U:%.*]] = or i[[#SBITS]] [[O]]
; COMBINE_STORE_PTR: [[P1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P:%.*]], i32 0		; COMBINE_STORE_PTR: [[P1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P:%.*]], i32 0
; COMBINE_STORE_PTR: store i[[#SBITS]] [[U]], i[[#SBITS]]* [[P1]], align [[#SBYTES]]		; COMBINE_STORE_PTR: store i[[#SBITS]] [[U]], i[[#SBITS]]* [[P1]], align [[#SBYTES]]
; COMBINE_STORE_PTR: [[P2:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P]], i32 1		; COMBINE_STORE_PTR: [[P2:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P]], i32 1
; COMBINE_STORE_PTR: store i[[#SBITS]] [[U]], i[[#SBITS]]* [[P2]], align [[#SBYTES]]		; COMBINE_STORE_PTR: store i[[#SBITS]] [[U]], i[[#SBITS]]* [[P2]], align [[#SBYTES]]

store [2 x i1] %a, [2 x i1]* %p		store [2 x i1] %a, [2 x i1]* %p
ret void		ret void
}		}

define void @store_array17([17 x i1] %a, [17 x i1]* %p) {		define void @store_array17([17 x i1] %a, [17 x i1]* %p) {
; FAST16: @"dfs$store_array17"		; FAST: @"dfs$store_array17"
; FAST16: %[[#R:]] = load [17 x i[[#SBITS]]], [17 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [17 x i[[#SBITS]]]*), align 2		; FAST: %[[#R:]] = load [17 x i[[#SBITS]]], [17 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [17 x i[[#SBITS]]]*), align 2
; FAST16: %[[#R+1]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 0		; FAST: %[[#R+1]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 0
; FAST16: %[[#R+2]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 1		; FAST: %[[#R+2]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 1
; FAST16: %[[#R+3]] = or i[[#SBITS]] %[[#R+1]], %[[#R+2]]		; FAST: %[[#R+3]] = or i[[#SBITS]] %[[#R+1]], %[[#R+2]]
; FAST16: %[[#R+4]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 2		; FAST: %[[#R+4]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 2
; FAST16: %[[#R+5]] = or i[[#SBITS]] %[[#R+3]], %[[#R+4]]		; FAST: %[[#R+5]] = or i[[#SBITS]] %[[#R+3]], %[[#R+4]]
; FAST16: %[[#R+6]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 3		; FAST: %[[#R+6]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 3
; FAST16: %[[#R+7]] = or i[[#SBITS]] %[[#R+5]], %[[#R+6]]		; FAST: %[[#R+7]] = or i[[#SBITS]] %[[#R+5]], %[[#R+6]]
; FAST16: %[[#R+8]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 4		; FAST: %[[#R+8]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 4
; FAST16: %[[#R+9]] = or i[[#SBITS]] %[[#R+7]], %[[#R+8]]		; FAST: %[[#R+9]] = or i[[#SBITS]] %[[#R+7]], %[[#R+8]]
; FAST16: %[[#R+10]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 5		; FAST: %[[#R+10]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 5
; FAST16: %[[#R+11]] = or i[[#SBITS]] %[[#R+9]], %[[#R+10]]		; FAST: %[[#R+11]] = or i[[#SBITS]] %[[#R+9]], %[[#R+10]]
; FAST16: %[[#R+12]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 6		; FAST: %[[#R+12]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 6
; FAST16: %[[#R+13]] = or i[[#SBITS]] %[[#R+11]], %[[#R+12]]		; FAST: %[[#R+13]] = or i[[#SBITS]] %[[#R+11]], %[[#R+12]]
; FAST16: %[[#R+14]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 7		; FAST: %[[#R+14]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 7
; FAST16: %[[#R+15]] = or i[[#SBITS]] %[[#R+13]], %[[#R+14]]		; FAST: %[[#R+15]] = or i[[#SBITS]] %[[#R+13]], %[[#R+14]]
; FAST16: %[[#R+16]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 8		; FAST: %[[#R+16]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 8
; FAST16: %[[#R+17]] = or i[[#SBITS]] %[[#R+15]], %[[#R+16]]		; FAST: %[[#R+17]] = or i[[#SBITS]] %[[#R+15]], %[[#R+16]]
; FAST16: %[[#R+18]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 9		; FAST: %[[#R+18]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 9
; FAST16: %[[#R+19]] = or i[[#SBITS]] %[[#R+17]], %[[#R+18]]		; FAST: %[[#R+19]] = or i[[#SBITS]] %[[#R+17]], %[[#R+18]]
; FAST16: %[[#R+20]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 10		; FAST: %[[#R+20]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 10
; FAST16: %[[#R+21]] = or i[[#SBITS]] %[[#R+19]], %[[#R+20]]		; FAST: %[[#R+21]] = or i[[#SBITS]] %[[#R+19]], %[[#R+20]]
; FAST16: %[[#R+22]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 11		; FAST: %[[#R+22]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 11
; FAST16: %[[#R+23]] = or i[[#SBITS]] %[[#R+21]], %[[#R+22]]		; FAST: %[[#R+23]] = or i[[#SBITS]] %[[#R+21]], %[[#R+22]]
; FAST16: %[[#R+24]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 12		; FAST: %[[#R+24]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 12
; FAST16: %[[#R+25]] = or i[[#SBITS]] %[[#R+23]], %[[#R+24]]		; FAST: %[[#R+25]] = or i[[#SBITS]] %[[#R+23]], %[[#R+24]]
; FAST16: %[[#R+26]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 13		; FAST: %[[#R+26]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 13
; FAST16: %[[#R+27]] = or i[[#SBITS]] %[[#R+25]], %[[#R+26]]		; FAST: %[[#R+27]] = or i[[#SBITS]] %[[#R+25]], %[[#R+26]]
; FAST16: %[[#R+28]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 14		; FAST: %[[#R+28]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 14
; FAST16: %[[#R+29]] = or i[[#SBITS]] %[[#R+27]], %[[#R+28]]		; FAST: %[[#R+29]] = or i[[#SBITS]] %[[#R+27]], %[[#R+28]]
; FAST16: %[[#R+30]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 15		; FAST: %[[#R+30]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 15
; FAST16: %[[#R+31]] = or i[[#SBITS]] %[[#R+29]], %[[#R+30]]		; FAST: %[[#R+31]] = or i[[#SBITS]] %[[#R+29]], %[[#R+30]]
; FAST16: %[[#R+32]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 16		; FAST: %[[#R+32]] = extractvalue [17 x i[[#SBITS]]] %[[#R]], 16
; FAST16: %[[#R+33]] = or i[[#SBITS]] %[[#R+31]], %[[#R+32]]		; FAST: %[[#R+33]] = or i[[#SBITS]] %[[#R+31]], %[[#R+32]]
; FAST16: %[[#VREG:]] = insertelement <8 x i[[#SBITS]]> undef, i[[#SBITS]] %[[#R+33]], i32 0		; FAST: %[[#VREG:]] = insertelement <8 x i[[#SBITS]]> undef, i[[#SBITS]] %[[#R+33]], i32 0
; FAST16: %[[#VREG+1]] = insertelement <8 x i[[#SBITS]]> %[[#VREG]], i[[#SBITS]] %[[#R+33]], i32 1		; FAST: %[[#VREG+1]] = insertelement <8 x i[[#SBITS]]> %[[#VREG]], i[[#SBITS]] %[[#R+33]], i32 1
; FAST16: %[[#VREG+2]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+1]], i[[#SBITS]] %[[#R+33]], i32 2		; FAST: %[[#VREG+2]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+1]], i[[#SBITS]] %[[#R+33]], i32 2
; FAST16: %[[#VREG+3]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+2]], i[[#SBITS]] %[[#R+33]], i32 3		; FAST: %[[#VREG+3]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+2]], i[[#SBITS]] %[[#R+33]], i32 3
; FAST16: %[[#VREG+4]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+3]], i[[#SBITS]] %[[#R+33]], i32 4		; FAST: %[[#VREG+4]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+3]], i[[#SBITS]] %[[#R+33]], i32 4
; FAST16: %[[#VREG+5]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+4]], i[[#SBITS]] %[[#R+33]], i32 5		; FAST: %[[#VREG+5]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+4]], i[[#SBITS]] %[[#R+33]], i32 5
; FAST16: %[[#VREG+6]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+5]], i[[#SBITS]] %[[#R+33]], i32 6		; FAST: %[[#VREG+6]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+5]], i[[#SBITS]] %[[#R+33]], i32 6
; FAST16: %[[#VREG+7]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+6]], i[[#SBITS]] %[[#R+33]], i32 7		; FAST: %[[#VREG+7]] = insertelement <8 x i[[#SBITS]]> %[[#VREG+6]], i[[#SBITS]] %[[#R+33]], i32 7
; FAST16: %[[#VREG+8]] = bitcast i[[#SBITS]]* %[[P:.*]] to <8 x i[[#SBITS]]>*		; FAST: %[[#VREG+8]] = bitcast i[[#SBITS]]* %[[P:.*]] to <8 x i[[#SBITS]]>*
; FAST16: %[[#VREG+9]] = getelementptr <8 x i[[#SBITS]]>, <8 x i[[#SBITS]]>* %[[#VREG+8]], i32 0		; FAST: %[[#VREG+9]] = getelementptr <8 x i[[#SBITS]]>, <8 x i[[#SBITS]]>* %[[#VREG+8]], i32 0
; FAST16: store <8 x i[[#SBITS]]> %[[#VREG+7]], <8 x i[[#SBITS]]>* %[[#VREG+9]], align [[#SBYTES]]		; FAST: store <8 x i[[#SBITS]]> %[[#VREG+7]], <8 x i[[#SBITS]]>* %[[#VREG+9]], align [[#SBYTES]]
; FAST16: %[[#VREG+10]] = getelementptr <8 x i[[#SBITS]]>, <8 x i[[#SBITS]]>* %[[#VREG+8]], i32 1		; FAST: %[[#VREG+10]] = getelementptr <8 x i[[#SBITS]]>, <8 x i[[#SBITS]]>* %[[#VREG+8]], i32 1
; FAST16: store <8 x i[[#SBITS]]> %[[#VREG+7]], <8 x i[[#SBITS]]>* %[[#VREG+10]], align [[#SBYTES]]		; FAST: store <8 x i[[#SBITS]]> %[[#VREG+7]], <8 x i[[#SBITS]]>* %[[#VREG+10]], align [[#SBYTES]]
; FAST16: %[[#VREG+11]] = getelementptr i[[#SBITS]], i[[#SBITS]]* %[[P]], i32 16		; FAST: %[[#VREG+11]] = getelementptr i[[#SBITS]], i[[#SBITS]]* %[[P]], i32 16
; FAST16: store i[[#SBITS]] %[[#R+33]], i[[#SBITS]]* %[[#VREG+11]], align [[#SBYTES]]		; FAST: store i[[#SBITS]] %[[#R+33]], i[[#SBITS]]* %[[#VREG+11]], align [[#SBYTES]]
store [17 x i1] %a, [17 x i1]* %p		store [17 x i1] %a, [17 x i1]* %p
ret void		ret void
}		}

define [2 x i32] @const_array() {		define [2 x i32] @const_array() {
; FAST16: @"dfs$const_array"		; FAST: @"dfs$const_array"
; FAST16: store [2 x i[[#SBITS]]] zeroinitializer, [2 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [2 x i[[#SBITS]]]*), align 2		; FAST: store [2 x i[[#SBITS]]] zeroinitializer, [2 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [2 x i[[#SBITS]]]*), align 2
ret [2 x i32] [ i32 42, i32 11 ]		ret [2 x i32] [ i32 42, i32 11 ]
}		}

define [4 x i8] @call_array([4 x i8] %a) {		define [4 x i8] @call_array([4 x i8] %a) {
; FAST16-LABEL: @"dfs$call_array"		; FAST-LABEL: @"dfs$call_array"
; FAST16: %[[#R:]] = load [4 x i[[#SBITS]]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [4 x i[[#SBITS]]]*), align [[ALIGN:2]]		; FAST: %[[#R:]] = load [4 x i[[#SBITS]]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [4 x i[[#SBITS]]]*), align [[ALIGN:2]]
; FAST16: store [4 x i[[#SBITS]]] %[[#R]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]		; FAST: store [4 x i[[#SBITS]]] %[[#R]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]
; FAST16: %_dfsret = load [4 x i[[#SBITS]]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]		; FAST: %_dfsret = load [4 x i[[#SBITS]]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]
; FAST16: store [4 x i[[#SBITS]]] %_dfsret, [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]		; FAST: store [4 x i[[#SBITS]]] %_dfsret, [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]

%r = call [4 x i8] @pass_array([4 x i8] %a)		%r = call [4 x i8] @pass_array([4 x i8] %a)
ret [4 x i8] %r		ret [4 x i8] %r
}		}

%LargeArr = type [1000 x i8]		%LargeArr = type [1000 x i8]

define i8 @fun_with_large_args(i1 %i, %LargeArr %a) {		define i8 @fun_with_large_args(i1 %i, %LargeArr %a) {
; FAST16: @"dfs$fun_with_large_args"		; FAST: @"dfs$fun_with_large_args"
; FAST16: store i[[#SBITS]] 0, i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align 2		; FAST: store i[[#SBITS]] 0, i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align 2
%r = extractvalue %LargeArr %a, 0		%r = extractvalue %LargeArr %a, 0
ret i8 %r		ret i8 %r
}		}

define %LargeArr @fun_with_large_ret() {		define %LargeArr @fun_with_large_ret() {
; FAST16: @"dfs$fun_with_large_ret"		; FAST: @"dfs$fun_with_large_ret"
; FAST16-NEXT: ret [1000 x i8] zeroinitializer		; FAST-NEXT: ret [1000 x i8] zeroinitializer
ret %LargeArr zeroinitializer		ret %LargeArr zeroinitializer
}		}

define i8 @call_fun_with_large_ret() {		define i8 @call_fun_with_large_ret() {
; FAST16: @"dfs$call_fun_with_large_ret"		; FAST: @"dfs$call_fun_with_large_ret"
; FAST16: store i[[#SBITS]] 0, i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align 2		; FAST: store i[[#SBITS]] 0, i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align 2
%r = call %LargeArr @fun_with_large_ret()		%r = call %LargeArr @fun_with_large_ret()
%e = extractvalue %LargeArr %r, 0		%e = extractvalue %LargeArr %r, 0
ret i8 %e		ret i8 %e
}		}

define i8 @call_fun_with_large_args(i1 %i, %LargeArr %a) {		define i8 @call_fun_with_large_args(i1 %i, %LargeArr %a) {
; FAST16: @"dfs$call_fun_with_large_args"		; FAST: @"dfs$call_fun_with_large_args"
; FAST16: [[I:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]		; FAST: [[I:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
; FAST16: store i[[#SBITS]] [[I]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]		; FAST: store i[[#SBITS]] [[I]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]
; FAST16: %r = call i8 @"dfs$fun_with_large_args"(i1 %i, [1000 x i8] %a)		; FAST: %r = call i8 @"dfs$fun_with_large_args"(i1 %i, [1000 x i8] %a)

%r = call i8 @fun_with_large_args(i1 %i, %LargeArr %a)		%r = call i8 @fun_with_large_args(i1 %i, %LargeArr %a)
ret i8 %r		ret i8 %r
}		}

llvm/test/Instrumentation/DataFlowSanitizer/atomics.ll

	; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S \| FileCheck %s --check-prefixes=CHECK,CHECK16			; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S \| FileCheck %s --check-prefixes=CHECK,CHECK16
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -S \| FileCheck %s --check-prefixes=CHECK
	;			;
	; The patterns about origins cannot be tested until the origin tracking feature is complete.			; The patterns about origins cannot be tested until the origin tracking feature is complete.
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]			; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]
	; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]			; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]
	; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]			; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]

llvm/test/Instrumentation/DataFlowSanitizer/basic.ll

	; RUN: opt < %s -dfsan -S \| FileCheck %s --check-prefixes=CHECK,CHECK_NO_ORIGIN -DSHADOW_MASK=-123145302310913			; RUN: opt < %s -dfsan -S \| FileCheck %s --check-prefixes=CHECK,CHECK_NO_ORIGIN -DSHADOW_MASK=-123145302310913
	; RUN: opt < %s -dfsan -dfsan-track-origins=1 -dfsan-fast-16-labels=true -S \| FileCheck %s --check-prefixes=CHECK,CHECK_ORIGIN -DSHADOW_MASK=-123145302310913			; RUN: opt < %s -dfsan -dfsan-track-origins=1 -dfsan-fast-16-labels=true -S \| FileCheck %s --check-prefixes=CHECK,CHECK_ORIGIN -DSHADOW_MASK=-123145302310913
				; RUN: opt < %s -dfsan -dfsan-track-origins=1 -dfsan-fast-8-labels=true -S \| FileCheck %s --check-prefixes=CHECK,CHECK_NO_ORIGIN -DSHADOW_MASK=-105553116266497
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [100 x i64]			; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [100 x i64]
	; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [100 x i64]			; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [100 x i64]
	; CHECK: @__dfsan_arg_origin_tls = external thread_local(initialexec) global [200 x i32]			; CHECK: @__dfsan_arg_origin_tls = external thread_local(initialexec) global [200 x i32]
	; CHECK: @__dfsan_retval_origin_tls = external thread_local(initialexec) global i32			; CHECK: @__dfsan_retval_origin_tls = external thread_local(initialexec) global i32
	; CHECK_NO_ORIGIN: @__dfsan_track_origins = weak_odr constant i32 0			; CHECK_NO_ORIGIN: @__dfsan_track_origins = weak_odr constant i32 0

llvm/test/Instrumentation/DataFlowSanitizer/call.ll

	; RUN: opt < %s -dfsan -S \| FileCheck %s			; RUN: opt < %s -dfsan -S \| FileCheck %s
	; RUN: opt < %s -dfsan -dfsan-fast-16-labels -S \| FileCheck %s			; RUN: opt < %s -dfsan -dfsan-fast-16-labels -S \| FileCheck %s
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels -S \| FileCheck %s
	; RUN: opt < %s -passes=dfsan -S \| FileCheck %s			; RUN: opt < %s -passes=dfsan -S \| FileCheck %s
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK-LABEL: @__dfsan_arg_tls			; CHECK-LABEL: @__dfsan_arg_tls
	; CHECK: = external thread_local(initialexec) global [100 x i64]			; CHECK: = external thread_local(initialexec) global [100 x i64]

	; CHECK-LABEL: @__dfsan_retval_tls			; CHECK-LABEL: @__dfsan_retval_tls

llvm/test/Instrumentation/DataFlowSanitizer/external_mask.ll

	; RUN: opt < %s -dfsan -S \| FileCheck %s --check-prefixes=CHECK,CHECK16			; RUN: opt < %s -dfsan -S \| FileCheck %s --check-prefixes=CHECK,CHECK16
	; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S \| FileCheck %s --check-prefixes=CHECK,CHECK16			; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S \| FileCheck %s --check-prefixes=CHECK,CHECK16
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -S \| FileCheck %s --check-prefixes=CHECK
	target datalayout = "e-m:e-i64:64-i128:128-n32:64-S128"			target datalayout = "e-m:e-i64:64-i128:128-n32:64-S128"
	target triple = "aarch64-unknown-linux-gnu"			target triple = "aarch64-unknown-linux-gnu"

	define i32 @test(i32 %a, i32* nocapture readonly %b) #0 {			define i32 @test(i32 %a, i32* nocapture readonly %b) #0 {
	; CHECK: @"dfs$test"			; CHECK: @"dfs$test"
	; CHECK: %[[RV:.*]] load{{.*}}__dfsan_shadow_ptr_mask			; CHECK: %[[RV:.*]] load{{.*}}__dfsan_shadow_ptr_mask
	; CHECK: ptrtoint i32* {{.*}} to i64			; CHECK: ptrtoint i32* {{.*}} to i64
	; CHECK: and {{.*}}%[[RV:.*]]			; CHECK: and {{.*}}%[[RV:.*]]
	; CHECK16: mul i64			; CHECK16: mul i64
	%1 = load i32, i32* %b, align 4			%1 = load i32, i32* %b, align 4
	%2 = add nsw i32 %1, %a			%2 = add nsw i32 %1, %a
	ret i32 %2			ret i32 %2
	}			}
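
Note on the `and` / `mul` checks: a minimal C++ sketch of the address arithmetic these patterns appear to exercise, assuming the shadow address is `(app_addr & mask) * shadow_bytes`. The names `shadowAddress`, `kMask16` and `kMask8` are illustrative and do not come from the patch; only the numeric mask values are taken from the `SHADOW_MASK` definitions in the basic.ll RUN lines. In this aarch64 test the mask is loaded from `__dfsan_shadow_ptr_mask` rather than being a constant.

```cpp
#include <cstdint>

// Illustrative names; only the numeric mask values come from the tests.
// -123145302310913 == ~0x700000000000 (legacy and fast16 RUN lines in basic.ll)
// -105553116266497 == ~0x600000000000 (fast8 RUN line in basic.ll)
constexpr uint64_t kMask16 = ~UINT64_C(0x700000000000);
constexpr uint64_t kMask8 = ~UINT64_C(0x600000000000);

// Assumed mapping: clear the masked bits, then scale by the shadow width in
// bytes. With a one-byte shadow the multiply is a no-op, which would explain
// why `mul i64` is checked only under the CHECK16 prefix.
inline uint64_t shadowAddress(uint64_t appAddr, uint64_t mask,
                              uint64_t shadowBytes) {
  return (appAddr & mask) * shadowBytes;
}
```

Under these assumptions, `shadowAddress(0x700000008000, kMask8, 1)` yields `0x100000008000`, which lines up with `kAppAddr` and `kShadowAddr` in the fast8 layout comment.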

llvm/test/Instrumentation/DataFlowSanitizer/fast16labels.ll

; Test that -dfsan-fast-16-labels mode uses inline ORs rather than calling		; Test that -dfsan-fast-16-labels mode uses inline ORs rather than calling
; __dfsan_union or __dfsan_union_load.		; __dfsan_union or __dfsan_union_load.
; RUN: opt < %s -dfsan -dfsan-fast-16-labels -S \| FileCheck %s --implicit-check-not="call{{.*}}__dfsan_union" --check-prefixes=CHECK,CHECK16		; RUN: opt < %s -dfsan -dfsan-fast-16-labels -S \| FileCheck %s --implicit-check-not="call{{.*}}__dfsan_union" --check-prefixes=CHECK,CHECK16
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels -S \| FileCheck %s --implicit-check-not="call{{.*}}__dfsan_union" --check-prefixes=CHECK,CHECK8
target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"		target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"		target triple = "x86_64-unknown-linux-gnu"

; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]		; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]
; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]		; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]
; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]		; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]		; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]

define i8 @add(i8 %a, i8 %b) {		define i8 @add(i8 %a, i8 %b) {
; CHECK-LABEL: define i8 @"dfs$add"		; CHECK-LABEL: define i8 @"dfs$add"
; CHECK-DAG: %[[ALABEL:.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]		; CHECK-DAG: %[[ALABEL:.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
; CHECK-DAG: %[[BLABEL:.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to i[[#SBITS]]*), align [[ALIGN]]		; CHECK-DAG: %[[BLABEL:.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to i[[#SBITS]]*), align [[ALIGN]]
; CHECK: %[[ADDLABEL:.*]] = or i16 %[[ALABEL]], %[[BLABEL]]		; CHECK: %[[ADDLABEL:.*]] = or i[[#SBITS]] %[[ALABEL]], %[[BLABEL]]
; CHECK: %c = add i8 %a, %b		; CHECK: %c = add i8 %a, %b
; CHECK: store i[[#SBITS]] %[[ADDLABEL]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; CHECK: store i[[#SBITS]] %[[ADDLABEL]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]
; CHECK: ret i8 %c		; CHECK: ret i8 %c
%c = add i8 %a, %b		%c = add i8 %a, %b
ret i8 %c		ret i8 %c
}		}

define i8 @load8(i8* %p) {		define i8 @load8(i8* %p) {
; CHECK-LABEL: define i8 @"dfs$load8"		; CHECK-LABEL: define i8 @"dfs$load8"
; CHECK-SAME: (i8* %[[PADDR:.*]])		; CHECK-SAME: (i8* %[[PADDR:.*]])
; CHECK-NEXT: %[[#ARG:]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i16*), align [[ALIGN]]		; CHECK-NEXT: %[[#ARG:]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]
; CHECK-NEXT: %[[#R:]] = ptrtoint i8* %[[PADDR]] to i64		; CHECK-NEXT: %[[#R:]] = ptrtoint i8* %[[PADDR]] to i64
; CHECK-NEXT: %[[#PS:R+1]] = and i64 %[[#R]], [[#%.10d,MASK:]]		; CHECK-NEXT: %[[#PS:R+1]] = and i64 %[[#R]], [[#%.10d,MASK:]]
; CHECK16-NEXT: %[[#PS:R+2]] = mul i64 %[[#R+1]], 2		; CHECK16-NEXT: %[[#PS:R+2]] = mul i64 %[[#R+1]], 2
; CHECK-NEXT: %[[#SADDR:]] = inttoptr i64 %[[#PS]] to i[[#SBITS]]*		; CHECK-NEXT: %[[#SADDR:]] = inttoptr i64 %[[#PS]] to i[[#SBITS]]*
; CHECK-NEXT: %[[#S:]] = load i[[#SBITS]], i[[#SBITS]]* %[[#SADDR]]		; CHECK-NEXT: %[[#S:]] = load i[[#SBITS]], i[[#SBITS]]* %[[#SADDR]]
; CHECK-NEXT: %[[#S_OUT:S+1]] = or i[[#SBITS]] %[[#S]], %[[#ARG]]		; CHECK-NEXT: %[[#S_OUT:S+1]] = or i[[#SBITS]] %[[#S]], %[[#ARG]]
; CHECK-NEXT: %a = load i8, i8* %p		; CHECK-NEXT: %a = load i8, i8* %p
; CHECK-NEXT: store i[[#SBITS]] %[[#S_OUT]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; CHECK-NEXT: store i[[#SBITS]] %[[#S_OUT]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]
define i64 @load64(i64* %p) {
; CHECK16-NEXT: %[[#WS:]] = or i64 %[[#WS]], %[[#WS_NEXT]]		; CHECK16-NEXT: %[[#WS:]] = or i64 %[[#WS]], %[[#WS_NEXT]]
; CHECK16-NEXT: %[[#WS+1]] = lshr i64 %[[#WS]], 32		; CHECK16-NEXT: %[[#WS+1]] = lshr i64 %[[#WS]], 32
; CHECK16-NEXT: %[[#WS+2]] = or i64 %[[#WS]], %[[#WS+1]]		; CHECK16-NEXT: %[[#WS+2]] = or i64 %[[#WS]], %[[#WS+1]]
; CHECK16-NEXT: %[[#WS+3]] = lshr i64 %[[#WS+2]], 16		; CHECK16-NEXT: %[[#WS+3]] = lshr i64 %[[#WS+2]], 16
; CHECK16-NEXT: %[[#WS+4]] = or i64 %[[#WS+2]], %[[#WS+3]]		; CHECK16-NEXT: %[[#WS+4]] = or i64 %[[#WS+2]], %[[#WS+3]]
; CHECK16-NEXT: %[[#WS+5]] = trunc i64 %[[#WS+4]] to i[[#SBITS]]		; CHECK16-NEXT: %[[#WS+5]] = trunc i64 %[[#WS+4]] to i[[#SBITS]]
; CHECK16-NEXT: %[[#S_OUT:]] = or i[[#SBITS]] %[[#WS+5]], %[[#ARG]]		; CHECK16-NEXT: %[[#S_OUT:]] = or i[[#SBITS]] %[[#WS+5]], %[[#ARG]]

		; COMM: On fast8, no need to OR the wide shadow but one more shift is needed.
		; CHECK8-NEXT: %[[#WS+1]] = lshr i64 %[[#WS]], 32
		; CHECK8-NEXT: %[[#WS+2]] = or i64 %[[#WS]], %[[#WS+1]]
		; CHECK8-NEXT: %[[#WS+3]] = lshr i64 %[[#WS+2]], 16
		; CHECK8-NEXT: %[[#WS+4]] = or i64 %[[#WS+2]], %[[#WS+3]]
		; CHECK8-NEXT: %[[#WS+5]] = lshr i64 %[[#WS+4]], 8
		; CHECK8-NEXT: %[[#WS+6]] = or i64 %[[#WS+4]], %[[#WS+5]]
		; CHECK8-NEXT: %[[#WS+7]] = trunc i64 %[[#WS+6]] to i[[#SBITS]]
		; CHECK8-NEXT: %[[#S_OUT:]] = or i[[#SBITS]] %[[#WS+7]], %[[#ARG]]

; CHECK-NEXT: %a = load i64, i64* %p		; CHECK-NEXT: %a = load i64, i64* %p
; CHECK-NEXT: store i[[#SBITS]] %[[#S_OUT]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; CHECK-NEXT: store i[[#SBITS]] %[[#S_OUT]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]
; CHECK-NEXT: ret i64 %a		; CHECK-NEXT: ret i64 %a

%a = load i64, i64* %p		%a = load i64, i64* %p
ret i64 %a		ret i64 %a
}		}

define i128 @load128(i128* %p) {
; CHECK16-NEXT: %[[#WS:S+9]] = or i64 %[[#S+6]], %[[#S+8]]		; CHECK16-NEXT: %[[#WS:S+9]] = or i64 %[[#S+6]], %[[#S+8]]
; CHECK16-NEXT: %[[#WS+1]] = lshr i64 %[[#WS]], 32		; CHECK16-NEXT: %[[#WS+1]] = lshr i64 %[[#WS]], 32
; CHECK16-NEXT: %[[#WS+2]] = or i64 %[[#WS]], %[[#WS+1]]		; CHECK16-NEXT: %[[#WS+2]] = or i64 %[[#WS]], %[[#WS+1]]
; CHECK16-NEXT: %[[#WS+3]] = lshr i64 %[[#WS+2]], 16		; CHECK16-NEXT: %[[#WS+3]] = lshr i64 %[[#WS+2]], 16
; CHECK16-NEXT: %[[#WS+4]] = or i64 %[[#WS+2]], %[[#WS+3]]		; CHECK16-NEXT: %[[#WS+4]] = or i64 %[[#WS+2]], %[[#WS+3]]
; CHECK16-NEXT: %[[#WS+5]] = trunc i64 %[[#WS+4]] to i[[#SBITS]]		; CHECK16-NEXT: %[[#WS+5]] = trunc i64 %[[#WS+4]] to i[[#SBITS]]
; CHECK16-NEXT: %[[#S_OUT:]] = or i[[#SBITS]] %[[#WS+5]], %[[#ARG]]		; CHECK16-NEXT: %[[#S_OUT:]] = or i[[#SBITS]] %[[#WS+5]], %[[#ARG]]

		; COMM: On fast8, we need to OR 2x64bits for the wide shadow, before ORing its bytes (one more shift).
		; CHECK8-NEXT: %[[#WS+1]] = lshr i64 %[[#WS]], 32
		; CHECK8-NEXT: %[[#WS+2]] = or i64 %[[#WS]], %[[#WS+1]]
		; CHECK8-NEXT: %[[#WS+3]] = lshr i64 %[[#WS+2]], 16
		; CHECK8-NEXT: %[[#WS+4]] = or i64 %[[#WS+2]], %[[#WS+3]]
		; CHECK8-NEXT: %[[#WS+5]] = lshr i64 %[[#WS+4]], 8
		; CHECK8-NEXT: %[[#WS+6]] = or i64 %[[#WS+4]], %[[#WS+5]]
		; CHECK8-NEXT: %[[#WS+7]] = trunc i64 %[[#WS+6]] to i[[#SBITS]]
		; CHECK8-NEXT: %[[#S_OUT:]] = or i[[#SBITS]] %[[#WS+7]], %[[#ARG]]

; CHECK-NEXT: %a = load i128, i128* %p		; CHECK-NEXT: %a = load i128, i128* %p
; CHECK-NEXT: store i[[#SBITS]] %[[#S_OUT]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; CHECK-NEXT: store i[[#SBITS]] %[[#S_OUT]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]
; CHECK-NEXT: ret i128 %a		; CHECK-NEXT: ret i128 %a

%a = load i128, i128* %p		%a = load i128, i128* %p
ret i128 %a		ret i128 %a
}		}
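
Note on the two COMM lines: a minimal C++ sketch of the reduction they describe, assuming the 64-bit wide shadow packs `64 / labelBits` labels and that a bitwise OR is the union operation in the fast modes. `foldWideShadow` is a hypothetical helper used only for illustration; it is not a function in the patch.

```cpp
#include <cstdint>

// Fold a 64-bit wide shadow word into a single label by ORing its lanes.
// labelBits is the shadow width per application byte and is assumed to be
// 8 or 16. The shift sequence is 32, 16 for 16-bit labels and 32, 16, 8 for
// 8-bit labels, mirroring the lshr/or pairs under CHECK16 and CHECK8.
inline uint64_t foldWideShadow(uint64_t wideShadow, unsigned labelBits) {
  for (unsigned shift = 32; shift >= labelBits; shift /= 2)
    wideShadow |= wideShadow >> shift;
  // Equivalent of the final `trunc` to i[[#SBITS]].
  return wideShadow & ((UINT64_C(1) << labelBits) - 1);
}
```

For example, `foldWideShadow(0x0001000200040008, 16)` gives `0x000F` and `foldWideShadow(0x0102040810204080, 8)` gives `0xFF`. With 8-bit labels the fold needs one extra shift step, which is the additional `lshr ... 8` that the CHECK8 lines expect.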

llvm/test/Instrumentation/DataFlowSanitizer/phi.ll

	; RUN: opt < %s -dfsan -S \| FileCheck %s --check-prefixes=CHECK,LEGACY			; RUN: opt < %s -dfsan -S \| FileCheck %s --check-prefixes=CHECK,LEGACY
	; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S \| FileCheck %s --check-prefixes=CHECK,FAST			; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S \| FileCheck %s --check-prefixes=CHECK,FAST
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -S \| FileCheck %s --check-prefixes=CHECK,FAST
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]			; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
	; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]			; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]

	define {i32, i32} @test({i32, i32} %a, i1 %c) {			define {i32, i32} @test({i32, i32} %a, i1 %c) {
	; LEGACY: [[AL:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([100 x i64]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]			; LEGACY: [[AL:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([100 x i64]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
	; LEGACY: [[PL:%.*]] = phi i[[#SBITS]] [ [[AL]], %T ], [ [[AL]], %F ]			; LEGACY: [[PL:%.*]] = phi i[[#SBITS]] [ [[AL]], %T ], [ [[AL]], %F ]
	; LEGACY: store i[[#SBITS]] [[PL]], i[[#SBITS]]* bitcast ([100 x i64]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]			; LEGACY: store i[[#SBITS]] [[PL]], i[[#SBITS]]* bitcast ([100 x i64]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]

	; FAST: [[AL:%.*]] = load { [[ST:i[0-9]+]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([100 x i64]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]			; FAST: [[AL:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([100 x i64]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]
	; FAST: [[AL0:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } [[AL]], i[[#SBITS]] 0, 0			; FAST: [[AL0:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } [[AL]], i[[#SBITS]] 0, 0
	; FAST: [[AL1:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } [[AL]], i[[#SBITS]] 0, 1			; FAST: [[AL1:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } [[AL]], i[[#SBITS]] 0, 1
	; FAST: [[PL:%.*]] = phi { i[[#SBITS]], i[[#SBITS]] } [ [[AL0]], %T ], [ [[AL1]], %F ]			; FAST: [[PL:%.*]] = phi { i[[#SBITS]], i[[#SBITS]] } [ [[AL0]], %T ], [ [[AL1]], %F ]
	; FAST: store { i[[#SBITS]], i[[#SBITS]] } [[PL]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([100 x i64]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]			; FAST: store { i[[#SBITS]], i[[#SBITS]] } [[PL]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([100 x i64]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]

	entry:			entry:
	br i1 %c, label %T, label %F			br i1 %c, label %T, label %F


llvm/test/Instrumentation/DataFlowSanitizer/select.ll

	; RUN: opt < %s -dfsan -dfsan-track-select-control-flow=1 -S \| FileCheck %s --check-prefixes=CHECK,TRACK_CF,TRACK_CF_LEGACY			; RUN: opt < %s -dfsan -dfsan-track-select-control-flow=1 -S \| FileCheck %s --check-prefixes=CHECK,TRACK_CF,TRACK_CF_LEGACY
	; RUN: opt < %s -dfsan -dfsan-track-select-control-flow=0 -S \| FileCheck %s --check-prefixes=CHECK,NO_TRACK_CF,NO_TRACK_CF_LEGACY			; RUN: opt < %s -dfsan -dfsan-track-select-control-flow=0 -S \| FileCheck %s --check-prefixes=CHECK,NO_TRACK_CF,NO_TRACK_CF_LEGACY
	; RUN: opt < %s -dfsan -dfsan-fast-16-labels -dfsan-track-select-control-flow=1 -S \| FileCheck %s --check-prefixes=CHECK,TRACK_CF,TRACK_CF_FAST			; RUN: opt < %s -dfsan -dfsan-fast-16-labels -dfsan-track-select-control-flow=1 -S \| FileCheck %s --check-prefixes=CHECK,TRACK_CF,TRACK_CF_FAST
	; RUN: opt < %s -dfsan -dfsan-fast-16-labels -dfsan-track-select-control-flow=0 -S \| FileCheck %s --check-prefixes=CHECK,NO_TRACK_CF,NO_TRACK_CF_FAST			; RUN: opt < %s -dfsan -dfsan-fast-16-labels -dfsan-track-select-control-flow=0 -S \| FileCheck %s --check-prefixes=CHECK,NO_TRACK_CF,NO_TRACK_CF_FAST
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels -dfsan-track-select-control-flow=1 -S \| FileCheck %s --check-prefixes=CHECK,TRACK_CF,TRACK_CF_FAST
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels -dfsan-track-select-control-flow=0 -S \| FileCheck %s --check-prefixes=CHECK,NO_TRACK_CF,NO_TRACK_CF_FAST
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]			; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]
	; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]			; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]
	; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]			; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
	; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]			; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]


llvm/test/Instrumentation/DataFlowSanitizer/shadow-args-zext.ll

	; RUN: opt -mtriple=x86_64-unknown-linux-gnu < %s -dfsan -S --dfsan-abilist=%S/Inputs/shadow-args-abilist.txt \| FileCheck %s			; RUN: opt -mtriple=x86_64-unknown-linux-gnu < %s -dfsan -S --dfsan-abilist=%S/Inputs/shadow-args-abilist.txt \| FileCheck %s
	; RUN: opt -mtriple=x86_64-unknown-linux-gnu < %s -dfsan -S --dfsan-abilist=%S/Inputs/shadow-args-abilist.txt -dfsan-fast-16-labels \| FileCheck %s			; RUN: opt -mtriple=x86_64-unknown-linux-gnu < %s -dfsan -S --dfsan-abilist=%S/Inputs/shadow-args-abilist.txt -dfsan-fast-16-labels \| FileCheck %s
				; RUN: opt -mtriple=x86_64-unknown-linux-gnu < %s -dfsan -S --dfsan-abilist=%S/Inputs/shadow-args-abilist.txt -dfsan-fast-8-labels \| FileCheck %s

	; REQUIRES: x86-registered-target			; REQUIRES: x86-registered-target

	; Test that the custom abi marks shadow parameters as zero extended.			; Test that the custom abi marks shadow parameters as zero extended.

	; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]			; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
	; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]			; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]


llvm/test/Instrumentation/DataFlowSanitizer/store.ll

	; RUN: opt < %s -dfsan -dfsan-combine-pointer-labels-on-store=1 -S \| FileCheck %s --check-prefixes=CHECK,CHECK16,COMBINE_PTR_LABEL			; RUN: opt < %s -dfsan -dfsan-combine-pointer-labels-on-store=1 -S \| FileCheck %s --check-prefixes=CHECK,CHECK16,COMBINE_PTR_LABEL
	; RUN: opt < %s -dfsan -dfsan-combine-pointer-labels-on-store=0 -S \| FileCheck %s --check-prefixes=CHECK,CHECK16,NO_COMBINE_PTR_LABEL			; RUN: opt < %s -dfsan -dfsan-combine-pointer-labels-on-store=0 -S \| FileCheck %s --check-prefixes=CHECK,CHECK16,NO_COMBINE_PTR_LABEL
	; RUN: opt < %s -dfsan -dfsan-fast-16-labels -dfsan-combine-pointer-labels-on-store=1 -S \| FileCheck %s --check-prefixes=CHECK,CHECK16,COMBINE_PTR_LABEL_FAST			; RUN: opt < %s -dfsan -dfsan-fast-16-labels -dfsan-combine-pointer-labels-on-store=1 -S \| FileCheck %s --check-prefixes=CHECK,CHECK16,COMBINE_PTR_LABEL_FAST
	; RUN: opt < %s -dfsan -dfsan-fast-16-labels -dfsan-combine-pointer-labels-on-store=0 -S \| FileCheck %s --check-prefixes=CHECK,CHECK16,NO_COMBINE_PTR_LABEL			; RUN: opt < %s -dfsan -dfsan-fast-16-labels -dfsan-combine-pointer-labels-on-store=0 -S \| FileCheck %s --check-prefixes=CHECK,CHECK16,NO_COMBINE_PTR_LABEL
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels -dfsan-combine-pointer-labels-on-store=1 -S \| FileCheck %s --check-prefixes=CHECK,COMBINE_PTR_LABEL_FAST
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels -dfsan-combine-pointer-labels-on-store=0 -S \| FileCheck %s --check-prefixes=CHECK,NO_COMBINE_PTR_LABEL
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]			; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
	; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]			; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]

	define void @store0({} %v, {}* %p) {			define void @store0({} %v, {}* %p) {
	; CHECK-LABEL: @"dfs$store0"			; CHECK-LABEL: @"dfs$store0"

llvm/test/Instrumentation/DataFlowSanitizer/struct.ll

; RUN: opt < %s -dfsan -S | FileCheck %s --check-prefixes=CHECK,LEGACY		; RUN: opt < %s -dfsan -S | FileCheck %s --check-prefixes=CHECK,LEGACY
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-event-callbacks=true -S | FileCheck %s --check-prefixes=CHECK,EVENT_CALLBACKS		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-event-callbacks=true -S | FileCheck %s --check-prefixes=CHECK,EVENT_CALLBACKS
; RUN: opt < %s -dfsan -dfsan-args-abi -S | FileCheck %s --check-prefixes=CHECK,ARGS_ABI		; RUN: opt < %s -dfsan -dfsan-args-abi -S | FileCheck %s --check-prefixes=CHECK,ARGS_ABI
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S | FileCheck %s --check-prefixes=CHECK,FAST16		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S | FileCheck %s --check-prefixes=CHECK,FAST
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-combine-pointer-labels-on-load=false -S | FileCheck %s --check-prefixes=CHECK,NO_COMBINE_LOAD_PTR		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-combine-pointer-labels-on-load=false -S | FileCheck %s --check-prefixes=CHECK,NO_COMBINE_LOAD_PTR
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-combine-pointer-labels-on-store=true -S | FileCheck %s --check-prefixes=CHECK,COMBINE_STORE_PTR		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-combine-pointer-labels-on-store=true -S | FileCheck %s --check-prefixes=CHECK,COMBINE_STORE_PTR
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-track-select-control-flow=false -S | FileCheck %s --check-prefixes=CHECK,NO_SELECT_CONTROL		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-track-select-control-flow=false -S | FileCheck %s --check-prefixes=CHECK,NO_SELECT_CONTROL
; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-debug-nonzero-labels -S | FileCheck %s --check-prefixes=CHECK,DEBUG_NONZERO_LABELS		; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -dfsan-debug-nonzero-labels -S | FileCheck %s --check-prefixes=CHECK,DEBUG_NONZERO_LABELS
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -S | FileCheck %s --check-prefixes=CHECK,FAST
		stephan.yichao.zhao: FAST16->FAST?
		gbalats (Author): Renamed them.
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-combine-pointer-labels-on-load=false -S | FileCheck %s --check-prefixes=CHECK,NO_COMBINE_LOAD_PTR
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-combine-pointer-labels-on-store=true -S | FileCheck %s --check-prefixes=CHECK,COMBINE_STORE_PTR
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-track-select-control-flow=false -S | FileCheck %s --check-prefixes=CHECK,NO_SELECT_CONTROL
		; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -dfsan-debug-nonzero-labels -S | FileCheck %s --check-prefixes=CHECK,DEBUG_NONZERO_LABELS
target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"		target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"		target triple = "x86_64-unknown-linux-gnu"

; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]		; CHECK: @__dfsan_arg_tls = external thread_local(initialexec) global [[TLS_ARR:\[100 x i64\]]]
; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]		; CHECK: @__dfsan_retval_tls = external thread_local(initialexec) global [[TLS_ARR]]
; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]		; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]		; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]

define {i1, i32} @select_struct(i1 %c, {i1, i32} %a, {i1, i32} %b) {		define {i1, i32} @select_struct(i1 %c, {i1, i32} %a, {i1, i32} %b) {
; NO_SELECT_CONTROL: @"dfs$select_struct"		; NO_SELECT_CONTROL: @"dfs$select_struct"
; NO_SELECT_CONTROL: [[B:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES) + 2]]) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]		; NO_SELECT_CONTROL: [[B:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES) + 2]]) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]
; NO_SELECT_CONTROL: [[A:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; NO_SELECT_CONTROL: [[A:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]
; NO_SELECT_CONTROL: [[C:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]		; NO_SELECT_CONTROL: [[C:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]
; NO_SELECT_CONTROL: [[S:%.*]] = select i1 %c, { i[[#SBITS]], i[[#SBITS]] } [[A]], { i[[#SBITS]], i[[#SBITS]] } [[B]]		; NO_SELECT_CONTROL: [[S:%.*]] = select i1 %c, { i[[#SBITS]], i[[#SBITS]] } [[A]], { i[[#SBITS]], i[[#SBITS]] } [[B]]
; NO_SELECT_CONTROL: store { i[[#SBITS]], i[[#SBITS]] } [[S]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; NO_SELECT_CONTROL: store { i[[#SBITS]], i[[#SBITS]] } [[S]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]

; FAST16: @"dfs$select_struct"		; FAST: @"dfs$select_struct"
; FAST16: %[[#R:]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES) + 2]]) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]		; FAST: %[[#R:]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES) + 2]]) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]
; FAST16: %[[#R+1]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: %[[#R+1]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]
; FAST16: %[[#R+2]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]		; FAST: %[[#R+2]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]
; FAST16: %[[#R+3]] = select i1 %c, { i[[#SBITS]], i[[#SBITS]] } %[[#R+1]], { i[[#SBITS]], i[[#SBITS]] } %[[#R]]		; FAST: %[[#R+3]] = select i1 %c, { i[[#SBITS]], i[[#SBITS]] } %[[#R+1]], { i[[#SBITS]], i[[#SBITS]] } %[[#R]]
; FAST16: %[[#R+4]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } %[[#R+3]], 0		; FAST: %[[#R+4]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } %[[#R+3]], 0
; FAST16: %[[#R+5]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } %[[#R+3]], 1		; FAST: %[[#R+5]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } %[[#R+3]], 1
; FAST16: %[[#R+6]] = or i[[#SBITS]] %[[#R+4]], %[[#R+5]]		; FAST: %[[#R+6]] = or i[[#SBITS]] %[[#R+4]], %[[#R+5]]
; FAST16: %[[#R+7]] = or i[[#SBITS]] %[[#R+2]], %[[#R+6]]		; FAST: %[[#R+7]] = or i[[#SBITS]] %[[#R+2]], %[[#R+6]]
; FAST16: %[[#R+8]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } undef, i[[#SBITS]] %[[#R+7]], 0		; FAST: %[[#R+8]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } undef, i[[#SBITS]] %[[#R+7]], 0
; FAST16: %[[#R+9]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } %[[#R+8]], i[[#SBITS]] %[[#R+7]], 1		; FAST: %[[#R+9]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } %[[#R+8]], i[[#SBITS]] %[[#R+7]], 1
; FAST16: store { i[[#SBITS]], i[[#SBITS]] } %[[#R+9]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: store { i[[#SBITS]], i[[#SBITS]] } %[[#R+9]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]

; LEGACY: @"dfs$select_struct"		; LEGACY: @"dfs$select_struct"
; LEGACY: [[U:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union		; LEGACY: [[U:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union
; LEGACY: [[P:%.*]] = phi i[[#SBITS]] [ [[U]],		; LEGACY: [[P:%.*]] = phi i[[#SBITS]] [ [[U]],
; LEGACY: store i[[#SBITS]] [[P]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align 2		; LEGACY: store i[[#SBITS]] [[P]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align 2

%s = select i1 %c, {i1, i32} %a, {i1, i32} %b		%s = select i1 %c, {i1, i32} %a, {i1, i32} %b
ret {i1, i32} %s		ret {i1, i32} %s
}		}

define { i32, i32 } @asm_struct(i32 %0, i32 %1) {		define { i32, i32 } @asm_struct(i32 %0, i32 %1) {
; FAST16: @"dfs$asm_struct"		; FAST: @"dfs$asm_struct"
; FAST16: [[E1:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to i[[#SBITS]]*), align [[ALIGN:2]]		; FAST: [[E1:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to i[[#SBITS]]*), align [[ALIGN:2]]
; FAST16: [[E0:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]		; FAST: [[E0:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]
; FAST16: [[E01:%.*]] = or i[[#SBITS]] [[E0]], [[E1]]		; FAST: [[E01:%.*]] = or i[[#SBITS]] [[E0]], [[E1]]
; FAST16: [[S0:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } undef, i[[#SBITS]] [[E01]], 0		; FAST: [[S0:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } undef, i[[#SBITS]] [[E01]], 0
; FAST16: [[S1:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } [[S0]], i[[#SBITS]] [[E01]], 1		; FAST: [[S1:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } [[S0]], i[[#SBITS]] [[E01]], 1
; FAST16: store { i[[#SBITS]], i[[#SBITS]] } [[S1]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: store { i[[#SBITS]], i[[#SBITS]] } [[S1]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]

; LEGACY: @"dfs$asm_struct"		; LEGACY: @"dfs$asm_struct"
; LEGACY: [[E1:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to i[[#SBITS]]*), align [[ALIGN:2]]		; LEGACY: [[E1:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to i[[#SBITS]]*), align [[ALIGN:2]]
; LEGACY: [[E0:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]		; LEGACY: [[E0:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]
; LEGACY: [[E01:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union(i[[#SBITS]] zeroext [[E0]], i[[#SBITS]] zeroext [[E1]])		; LEGACY: [[E01:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union(i[[#SBITS]] zeroext [[E0]], i[[#SBITS]] zeroext [[E1]])
; LEGACY: [[P:%.*]] = phi i[[#SBITS]] [ [[E01]], {{.*}} ], [ [[E0]], {{.*}} ]		; LEGACY: [[P:%.*]] = phi i[[#SBITS]] [ [[E01]], {{.*}} ], [ [[E0]], {{.*}} ]
; LEGACY: store i[[#SBITS]] [[P]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; LEGACY: store i[[#SBITS]] [[P]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]

entry:		entry:
%a = call { i32, i32 } asm "", "=r,=r,r,r,~{dirflag},~{fpsr},~{flags}"(i32 %0, i32 %1)		%a = call { i32, i32 } asm "", "=r,=r,r,r,~{dirflag},~{fpsr},~{flags}"(i32 %0, i32 %1)
ret { i32, i32 } %a		ret { i32, i32 } %a
}		}

define {i32, i32} @const_struct() {		define {i32, i32} @const_struct() {
; FAST16: @"dfs$const_struct"		; FAST: @"dfs$const_struct"
; FAST16: store { i[[#SBITS]], i[[#SBITS]] } zeroinitializer, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align 2		; FAST: store { i[[#SBITS]], i[[#SBITS]] } zeroinitializer, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align 2

; LEGACY: @"dfs$const_struct"		; LEGACY: @"dfs$const_struct"
; LEGACY: store i[[#SBITS]] 0, i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align 2		; LEGACY: store i[[#SBITS]] 0, i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align 2
ret {i32, i32} { i32 42, i32 11 }		ret {i32, i32} { i32 42, i32 11 }
}		}

define i1 @extract_struct({i1, i5} %s) {		define i1 @extract_struct({i1, i5} %s) {
; FAST16: @"dfs$extract_struct"		; FAST: @"dfs$extract_struct"
; FAST16: [[SM:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]		; FAST: [[SM:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]
; FAST16: [[EM:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[SM]], 0		; FAST: [[EM:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[SM]], 0
; FAST16: store i[[#SBITS]] [[EM]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; FAST: store i[[#SBITS]] [[EM]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]

; LEGACY: @"dfs$extract_struct"		; LEGACY: @"dfs$extract_struct"
; LEGACY: [[SM:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]		; LEGACY: [[SM:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
; LEGACY: store i[[#SBITS]] [[SM]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; LEGACY: store i[[#SBITS]] [[SM]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]
%e2 = extractvalue {i1, i5} %s, 0		%e2 = extractvalue {i1, i5} %s, 0
ret i1 %e2		ret i1 %e2
}		}

define {i1, i5} @insert_struct({i1, i5} %s, i5 %e1) {		define {i1, i5} @insert_struct({i1, i5} %s, i5 %e1) {
; FAST16: @"dfs$insert_struct"		; FAST: @"dfs$insert_struct"
; FAST16: [[EM:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES)]]) to i[[#SBITS]]*), align [[ALIGN:2]]		; FAST: [[EM:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES)]]) to i[[#SBITS]]*), align [[ALIGN:2]]
; FAST16: [[SM:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: [[SM:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]
; FAST16: [[SM1:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } [[SM]], i[[#SBITS]] [[EM]], 1		; FAST: [[SM1:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } [[SM]], i[[#SBITS]] [[EM]], 1
; FAST16: store { i[[#SBITS]], i[[#SBITS]] } [[SM1]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: store { i[[#SBITS]], i[[#SBITS]] } [[SM1]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]

; LEGACY: @"dfs$insert_struct"		; LEGACY: @"dfs$insert_struct"
; LEGACY: [[EM:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to i[[#SBITS]]*), align [[ALIGN:2]]		; LEGACY: [[EM:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to i[[#SBITS]]*), align [[ALIGN:2]]
; LEGACY: [[SM:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]		; LEGACY: [[SM:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]
; LEGACY: [[U:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union(i[[#SBITS]] zeroext [[SM]], i[[#SBITS]] zeroext [[EM]])		; LEGACY: [[U:%.*]] = call zeroext i[[#SBITS]] @__dfsan_union(i[[#SBITS]] zeroext [[SM]], i[[#SBITS]] zeroext [[EM]])
; LEGACY: [[P:%.*]] = phi i[[#SBITS]] [ [[U]], {{.*}} ], [ [[SM]], {{.*}} ]		; LEGACY: [[P:%.*]] = phi i[[#SBITS]] [ [[U]], {{.*}} ], [ [[SM]], {{.*}} ]
; LEGACY: store i[[#SBITS]] [[P]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; LEGACY: store i[[#SBITS]] [[P]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]
%s1 = insertvalue {i1, i5} %s, i5 %e1, 1		%s1 = insertvalue {i1, i5} %s, i5 %e1, 1
		define {i1, i1} @load_struct({i1, i1}* %p) {
; EVENT_CALLBACKS: [[S0:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } undef, i[[#SBITS]] [[OL1]], 0		; EVENT_CALLBACKS: [[S0:%.*]] = insertvalue { i[[#SBITS]], i[[#SBITS]] } undef, i[[#SBITS]] [[OL1]], 0
; EVENT_CALLBACKS: call void @__dfsan_load_callback(i[[#SBITS]] [[OL1]]		; EVENT_CALLBACKS: call void @__dfsan_load_callback(i[[#SBITS]] [[OL1]]

%s = load {i1, i1}, {i1, i1}* %p		%s = load {i1, i1}, {i1, i1}* %p
ret {i1, i1} %s		ret {i1, i1} %s
}		}

define void @store_struct({i1, i1}* %p, {i1, i1} %s) {		define void @store_struct({i1, i1}* %p, {i1, i1} %s) {
; FAST16: @"dfs$store_struct"		; FAST: @"dfs$store_struct"
; FAST16: [[S:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]		; FAST: [[S:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]
; FAST16: [[E0:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[S]], 0		; FAST: [[E0:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[S]], 0
; FAST16: [[E1:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[S]], 1		; FAST: [[E1:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[S]], 1
; FAST16: [[E:%.*]] = or i[[#SBITS]] [[E0]], [[E1]]		; FAST: [[E:%.*]] = or i[[#SBITS]] [[E0]], [[E1]]
; FAST16: [[P0:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P:%.*]], i32 0		; FAST: [[P0:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P:%.*]], i32 0
; FAST16: store i[[#SBITS]] [[E]], i[[#SBITS]]* [[P0]], align [[#SBYTES]]		; FAST: store i[[#SBITS]] [[E]], i[[#SBITS]]* [[P0]], align [[#SBYTES]]
; FAST16: [[P1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P]], i32 1		; FAST: [[P1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P]], i32 1
; FAST16: store i[[#SBITS]] [[E]], i[[#SBITS]]* [[P1]], align [[#SBYTES]]		; FAST: store i[[#SBITS]] [[E]], i[[#SBITS]]* [[P1]], align [[#SBYTES]]

; EVENT_CALLBACKS: @"dfs$store_struct"		; EVENT_CALLBACKS: @"dfs$store_struct"
; EVENT_CALLBACKS: [[OL:%.*]] = or i[[#SBITS]]		; EVENT_CALLBACKS: [[OL:%.*]] = or i[[#SBITS]]
; EVENT_CALLBACKS: call void @__dfsan_store_callback(i[[#SBITS]] [[OL]]		; EVENT_CALLBACKS: call void @__dfsan_store_callback(i[[#SBITS]] [[OL]]

; COMBINE_STORE_PTR: @"dfs$store_struct"		; COMBINE_STORE_PTR: @"dfs$store_struct"
; COMBINE_STORE_PTR: [[PL:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]		; COMBINE_STORE_PTR: [[PL:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN:2]]
; COMBINE_STORE_PTR: [[SL:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; COMBINE_STORE_PTR: [[SL:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]
; COMBINE_STORE_PTR: [[SL0:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[SL]], 0		; COMBINE_STORE_PTR: [[SL0:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[SL]], 0
; COMBINE_STORE_PTR: [[SL1:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[SL]], 1		; COMBINE_STORE_PTR: [[SL1:%.*]] = extractvalue { i[[#SBITS]], i[[#SBITS]] } [[SL]], 1
; COMBINE_STORE_PTR: [[SL01:%.*]] = or i[[#SBITS]] [[SL0]], [[SL1]]		; COMBINE_STORE_PTR: [[SL01:%.*]] = or i[[#SBITS]] [[SL0]], [[SL1]]
; COMBINE_STORE_PTR: [[E:%.*]] = or i[[#SBITS]] [[SL01]], [[PL]]		; COMBINE_STORE_PTR: [[E:%.*]] = or i[[#SBITS]] [[SL01]], [[PL]]
; COMBINE_STORE_PTR: [[P0:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P:%.*]], i32 0		; COMBINE_STORE_PTR: [[P0:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P:%.*]], i32 0
; COMBINE_STORE_PTR: store i[[#SBITS]] [[E]], i[[#SBITS]]* [[P0]], align [[#SBYTES]]		; COMBINE_STORE_PTR: store i[[#SBITS]] [[E]], i[[#SBITS]]* [[P0]], align [[#SBYTES]]
; COMBINE_STORE_PTR: [[P1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P]], i32 1		; COMBINE_STORE_PTR: [[P1:%.*]] = getelementptr i[[#SBITS]], i[[#SBITS]]* [[P]], i32 1
; COMBINE_STORE_PTR: store i[[#SBITS]] [[E]], i[[#SBITS]]* [[P1]], align [[#SBYTES]]		; COMBINE_STORE_PTR: store i[[#SBITS]] [[E]], i[[#SBITS]]* [[P1]], align [[#SBYTES]]

store {i1, i1} %s, {i1, i1}* %p		store {i1, i1} %s, {i1, i1}* %p
ret void		ret void
}		}

define i2 @extract_struct_of_aggregate11(%StructOfAggr %s) {		define i2 @extract_struct_of_aggregate11(%StructOfAggr %s) {
; FAST16: @"dfs$extract_struct_of_aggregate11"		; FAST: @"dfs$extract_struct_of_aggregate11"
; FAST16: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]		; FAST: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]
; FAST16: [[E11:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 1, 1		; FAST: [[E11:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 1, 1
; FAST16: store i[[#SBITS]] [[E11]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; FAST: store i[[#SBITS]] [[E11]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]

%e11 = extractvalue %StructOfAggr %s, 1, 1		%e11 = extractvalue %StructOfAggr %s, 1, 1
ret i2 %e11		ret i2 %e11
}		}

define [4 x i2] @extract_struct_of_aggregate1(%StructOfAggr %s) {		define [4 x i2] @extract_struct_of_aggregate1(%StructOfAggr %s) {
; FAST16: @"dfs$extract_struct_of_aggregate1"		; FAST: @"dfs$extract_struct_of_aggregate1"
; FAST16: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]		; FAST: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]
; FAST16: [[E1:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 1		; FAST: [[E1:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 1
; FAST16: store [4 x i[[#SBITS]]] [[E1]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]		; FAST: store [4 x i[[#SBITS]]] [[E1]], [4 x i[[#SBITS]]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to [4 x i[[#SBITS]]]*), align [[ALIGN]]
%e1 = extractvalue %StructOfAggr %s, 1		%e1 = extractvalue %StructOfAggr %s, 1
ret [4 x i2] %e1		ret [4 x i2] %e1
}		}

define <4 x i3> @extract_struct_of_aggregate2(%StructOfAggr %s) {		define <4 x i3> @extract_struct_of_aggregate2(%StructOfAggr %s) {
; FAST16: @"dfs$extract_struct_of_aggregate2"		; FAST: @"dfs$extract_struct_of_aggregate2"
; FAST16: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]		; FAST: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]
; FAST16: [[E2:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 2		; FAST: [[E2:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 2
; FAST16: store i[[#SBITS]] [[E2]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; FAST: store i[[#SBITS]] [[E2]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]
%e2 = extractvalue %StructOfAggr %s, 2		%e2 = extractvalue %StructOfAggr %s, 2
ret <4 x i3> %e2		ret <4 x i3> %e2
}		}

define { i1, i1 } @extract_struct_of_aggregate3(%StructOfAggr %s) {		define { i1, i1 } @extract_struct_of_aggregate3(%StructOfAggr %s) {
; FAST16: @"dfs$extract_struct_of_aggregate3"		; FAST: @"dfs$extract_struct_of_aggregate3"
; FAST16: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]		; FAST: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]
; FAST16: [[E3:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 3		; FAST: [[E3:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 3
; FAST16: store { i[[#SBITS]], i[[#SBITS]] } [[E3]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: store { i[[#SBITS]], i[[#SBITS]] } [[E3]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]
%e3 = extractvalue %StructOfAggr %s, 3		%e3 = extractvalue %StructOfAggr %s, 3
ret { i1, i1 } %e3		ret { i1, i1 } %e3
}		}

define i1 @extract_struct_of_aggregate31(%StructOfAggr %s) {		define i1 @extract_struct_of_aggregate31(%StructOfAggr %s) {
; FAST16: @"dfs$extract_struct_of_aggregate31"		; FAST: @"dfs$extract_struct_of_aggregate31"
; FAST16: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]		; FAST: [[E:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN:2]]
; FAST16: [[E31:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 3, 1		; FAST: [[E31:%.*]] = extractvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[E]], 3, 1
; FAST16: store i[[#SBITS]] [[E31]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]		; FAST: store i[[#SBITS]] [[E31]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to i[[#SBITS]]*), align [[ALIGN]]
%e31 = extractvalue %StructOfAggr %s, 3, 1		%e31 = extractvalue %StructOfAggr %s, 3, 1
ret i1 %e31		ret i1 %e31
}		}

define %StructOfAggr @insert_struct_of_aggregate11(%StructOfAggr %s, i2 %e11) {		define %StructOfAggr @insert_struct_of_aggregate11(%StructOfAggr %s, i2 %e11) {
; FAST16: @"dfs$insert_struct_of_aggregate11"		; FAST: @"dfs$insert_struct_of_aggregate11"
; FAST16: [[E11:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(8, SBYTES)]]) to i[[#SBITS]]*), align [[ALIGN:2]]		; FAST: [[E11:%.*]] = load i[[#SBITS]], i[[#SBITS]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(8, SBYTES)]]) to i[[#SBITS]]*), align [[ALIGN:2]]
; FAST16: [[S:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN]]		; FAST: [[S:%.*]] = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN]]
; FAST16: [[S1:%.*]] = insertvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[S]], i[[#SBITS]] [[E11]], 1, 1		; FAST: [[S1:%.*]] = insertvalue { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[S]], i[[#SBITS]] [[E11]], 1, 1
; FAST16: store { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[S1]], { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN]]		; FAST: store { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } [[S1]], { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN]]

%s1 = insertvalue %StructOfAggr %s, i2 %e11, 1, 1		%s1 = insertvalue %StructOfAggr %s, i2 %e11, 1, 1
ret %StructOfAggr %s1		ret %StructOfAggr %s1
}		}

define {i8*, i32} @call_struct({i8*, i32} %s) {		define {i8*, i32} @call_struct({i8*, i32} %s) {
; FAST16: @"dfs$call_struct"		; FAST: @"dfs$call_struct"
; FAST16: [[S:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]		; FAST: [[S:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]
; FAST16: store { i[[#SBITS]], i[[#SBITS]] } [[S]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: store { i[[#SBITS]], i[[#SBITS]] } [[S]], { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]
; FAST16: %_dfsret = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: %_dfsret = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]
; FAST16: store { i[[#SBITS]], i[[#SBITS]] } %_dfsret, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: store { i[[#SBITS]], i[[#SBITS]] } %_dfsret, { i[[#SBITS]], i[[#SBITS]] }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]

%r = call {i8, i32} @pass_struct({i8, i32} %s)		%r = call {i8, i32} @pass_struct({i8, i32} %s)
ret {i8*, i32} %r		ret {i8*, i32} %r
}		}

declare %StructOfAggr @fun_with_many_aggr_args(<2 x i7> %v, [2 x i5] %a, {i3, i3} %s)		declare %StructOfAggr @fun_with_many_aggr_args(<2 x i7> %v, [2 x i5] %a, {i3, i3} %s)

define %StructOfAggr @call_many_aggr_args(<2 x i7> %v, [2 x i5] %a, {i3, i3} %s) {		define %StructOfAggr @call_many_aggr_args(<2 x i7> %v, [2 x i5] %a, {i3, i3} %s) {
; FAST16: @"dfs$call_many_aggr_args"		; FAST: @"dfs$call_many_aggr_args"
; FAST16: [[S:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES) + 2]]) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]		; FAST: [[S:%.*]] = load { i[[#SBITS]], i[[#SBITS]] }, { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES) + 2]]) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN:2]]
; FAST16: [[A:%.*]] = load [2 x i[[#SBITS]]], [2 x i[[#SBITS]]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to [2 x i[[#SBITS]]]*), align [[ALIGN]]		; FAST: [[A:%.*]] = load [2 x i[[#SBITS]]], [2 x i[[#SBITS]]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to [2 x i[[#SBITS]]]*), align [[ALIGN]]
; FAST16: [[V:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]		; FAST: [[V:%.*]] = load i[[#SBITS]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]
; FAST16: store i[[#SBITS]] [[V]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]		; FAST: store i[[#SBITS]] [[V]], i[[#SBITS]]* bitcast ([[TLS_ARR]]* @__dfsan_arg_tls to i[[#SBITS]]*), align [[ALIGN]]
; FAST16: store [2 x i[[#SBITS]]] [[A]], [2 x i[[#SBITS]]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to [2 x i[[#SBITS]]]*), align [[ALIGN]]		; FAST: store [2 x i[[#SBITS]]] [[A]], [2 x i[[#SBITS]]]* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 2) to [2 x i[[#SBITS]]]*), align [[ALIGN]]
; FAST16: store { i[[#SBITS]], i[[#SBITS]] } [[S]], { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES) + 2]]) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]		; FAST: store { i[[#SBITS]], i[[#SBITS]] } [[S]], { i[[#SBITS]], i[[#SBITS]] }* inttoptr (i64 add (i64 ptrtoint ([[TLS_ARR]]* @__dfsan_arg_tls to i64), i64 [[#mul(2, SBYTES) + 2]]) to { i[[#SBITS]], i[[#SBITS]] }*), align [[ALIGN]]
; FAST16: %_dfsret = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN]]		; FAST: %_dfsret = load { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN]]
; FAST16: store { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } %_dfsret, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN]]		; FAST: store { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } } %_dfsret, { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }* bitcast ([[TLS_ARR]]* @__dfsan_retval_tls to { i[[#SBITS]], [4 x i[[#SBITS]]], i[[#SBITS]], { i[[#SBITS]], i[[#SBITS]] } }*), align [[ALIGN]]

%r = call %StructOfAggr @fun_with_many_aggr_args(<2 x i7> %v, [2 x i5] %a, {i3, i3} %s)		%r = call %StructOfAggr @fun_with_many_aggr_args(<2 x i7> %v, [2 x i5] %a, {i3, i3} %s)
ret %StructOfAggr %r		ret %StructOfAggr %r
}		}

llvm/test/Instrumentation/DataFlowSanitizer/vector.ll

	; RUN: opt < %s -dfsan -S | FileCheck %s --check-prefixes=CHECK,TLS_ABI,TLS_ABI_LEGACY			; RUN: opt < %s -dfsan -S | FileCheck %s --check-prefixes=CHECK,TLS_ABI,TLS_ABI_LEGACY
	; RUN: opt < %s -dfsan -dfsan-args-abi -S | FileCheck %s --check-prefixes=CHECK,ARGS_ABI			; RUN: opt < %s -dfsan -dfsan-args-abi -S | FileCheck %s --check-prefixes=CHECK,ARGS_ABI
	; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S | FileCheck %s --check-prefixes=CHECK,TLS_ABI,TLS_ABI_FAST			; RUN: opt < %s -dfsan -dfsan-fast-16-labels=true -S | FileCheck %s --check-prefixes=CHECK,TLS_ABI,TLS_ABI_FAST
				; RUN: opt < %s -dfsan -dfsan-fast-8-labels=true -S | FileCheck %s --check-prefixes=CHECK,TLS_ABI,TLS_ABI_FAST
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]			; CHECK: @__dfsan_shadow_width_bits = weak_odr constant i32 [[#SBITS:]]
	; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]			; CHECK: @__dfsan_shadow_width_bytes = weak_odr constant i32 [[#SBYTES:]]

	define <4 x i4> @pass_vector(<4 x i4> %v) {			define <4 x i4> @pass_vector(<4 x i4> %v) {
	; ARGS_ABI-LABEL: @"dfs$pass_vector"			; ARGS_ABI-LABEL: @"dfs$pass_vector"

This is an archive of the discontinued LLVM Phabricator instance.

[dfsan] Add -dfsan-fast-8-labels flag
ClosedPublic

Details

Diff Detail

Unit Tests: Failed

Event Timeline

Revision Contents

Diff 331707

llvm/lib/Transforms/Instrumentation/DataFlowSanitizer.cpp

llvm/test/Instrumentation/DataFlowSanitizer/abilist.ll

llvm/test/Instrumentation/DataFlowSanitizer/abilist_aggregate.ll

llvm/test/Instrumentation/DataFlowSanitizer/array.ll

llvm/test/Instrumentation/DataFlowSanitizer/atomics.ll

llvm/test/Instrumentation/DataFlowSanitizer/basic.ll

llvm/test/Instrumentation/DataFlowSanitizer/call.ll

llvm/test/Instrumentation/DataFlowSanitizer/external_mask.ll

llvm/test/Instrumentation/DataFlowSanitizer/fast16labels.ll

llvm/test/Instrumentation/DataFlowSanitizer/phi.ll

llvm/test/Instrumentation/DataFlowSanitizer/select.ll

llvm/test/Instrumentation/DataFlowSanitizer/shadow-args-zext.ll

llvm/test/Instrumentation/DataFlowSanitizer/store.ll

llvm/test/Instrumentation/DataFlowSanitizer/struct.ll

llvm/test/Instrumentation/DataFlowSanitizer/vector.ll
