This is an archive of the discontinued LLVM Phabricator instance.

[compiler-rt] [tsan] Unify aarch64 mapping
ClosedPublic

Authored by zatrazz on Oct 30 2015, 6:42 AM.

Download Raw Diff

Details

Reviewers

rengolin
dvyukov
aemerson
samsonov
eugenis

Summary

This patch unify the 39 and 42-bit VMA support for AArch64 by using indirect
calls instead of inline the address function transformation. The already
defined maps for both VMA are used. Although slower, this leads to same
instrumented binary to be independent of the kernel. It also has the advantage
to be easier to enable the remaining VMA for AArch64 (48-bits).

Along with this change this patch also fix some 42-bit failures with
ALSR disable by increasing the upper high app memory threshold and also
the 42-bit madvise value for non large page set.

Regarding performance, I did a run with speccpu2006 using 'test' set (mainly
to evaluate faster) and the results shows the new indirect calls are roughly
4.6% slower than the inline version:

                 DIFF
401.bzip2        5.12 
403.gcc          5.24 
429.mcf          1.00 
445.gobmk        3.36 
456.hmmer        6.72 
458.sjeng        6.15 
462.libquantum   4.55 
464.h264ref      6.82 
473.astar        5.68 
483.xalancbmk    6.77 
GEOMEAN          4.62

PS: 400.perlbench fails due missing longjmp family instrumentation for aarch64 and 471.omnetpp
the know issue with user defined new/delete operators.

Diff Detail

Event Timeline

zatrazz updated this revision to Diff 38804.Oct 30 2015, 6:42 AM

zatrazz retitled this revision from to [compiler-rt] [tsan] Unify aarch64 mapping.

zatrazz updated this object.

zatrazz added reviewers: aemerson, rengolin, samsonov, eugenis.

zatrazz added a subscriber: llvm-commits.

Herald added subscribers: rengolin, aemerson. · View Herald TranscriptOct 30 2015, 6:42 AM

zatrazz updated this object.Oct 30 2015, 6:47 AM

rengolin updated this object.Oct 30 2015, 6:51 AM

rengolin updated this object.

eugenis added a reviewer: dvyukov.Nov 3 2015, 5:22 PM

dvyukov added inline comments.Nov 4 2015, 4:11 AM

lib/tsan/rtl/tsan_platform.h
159	You introduce both indirect functions and these non-const variables. This looks excessive. Let's do one or another. I would prefer global variables as it looks less intrusive. You can do something along the lines of: Here declare the consts: extern uptr kLoAppMemBeg; ... And then in InitializePlatformEarly initialize them: kLoAppMemEnd = vma_39_42(0x0000400000ull, 0x00000400000ull); And that's it.
175	All identifiers with double underscores are reserved for language implementation. TSAN_ is a fine prefix, and also consistent with all macros in the codebase.
580	Please rename this to InitializePlatformEarly. I don't understand what is "Modules" (dynamic libraries?). And "Specific" is excessive, "Platform" already implies platform-specific stuff.

Changes from previous version:

Increase the VMA range for application on both 39-bit and 42-bit VMA. Now 39-bit ranges from 0x7D00000000-0x7FFFFFFFFF (previously 0x7F00000000-0x7FFFFFFFFF) and 42-bits from 0x3f000000000-0x3FFFFFFFFFF (previously 0x3FF00000000-0x3FFFFFFFFFF);
Fixed a printf format specifier in test/tsan/test.h.

zatrazz added inline comments.Nov 4 2015, 1:02 PM

lib/tsan/rtl/tsan_platform.h
159	I declared them in the header because I used the constants in the function definitions at lib/tsan/rtl/tsan_platform_linux.cc. I can move them to the cc file if it desirable, but I think by removing the definitions will require to hardcode its value on the function definition somehow (and I think current approach in simple than replicate the hexadecimals values in some places). Also, I use them so to add another VMA (48-bits) would not require much work. Instead of adding now 'vma_39_42' and then moving it to 'vma_39_42_48', the patch will just require to check and add the 48-bits definitions.
175	I will change it.
580	I will change it (I used 'modules' and 'specific' because I could not came up with a better naming).

This is an updated version based on comments:

Moved aarch64 const VMA definition from platform header to platform file
Rename InitializePlatformInitModules to InitializePlatformEarly

dvyukov added inline comments.Nov 5 2015, 4:33 AM

lib/tsan/rtl/tsan_platform.h
157	But can we have just global vars? It seems to be enough. The same generic functions will just operate with different values stores in the vars. I am just frighten by the size of this change, amount of logical indirections and macros.

zatrazz added inline comments.Nov 5 2015, 5:52 AM

lib/tsan/rtl/tsan_platform.h
157	We can, but it will be slower: every function transformation will require not only an indirection but also potentially memory reads to actually get the global variables value. The idea of the patch to create multiple functions is to try to minimize the performance hit this same indirection occurs by creating functions is fast as possible.

Small update on previous patch to initialize kMadviseRangeBeg and kMadviseRangeSize.

dvyukov added inline comments.Nov 5 2015, 8:20 AM

lib/tsan/rtl/tsan_platform.h

143–144

Won't it then be faster to inline all 3 variants?

E.g.

struct Mapping39 {
  static const uptr kLoAppMemBeg = 0x...;
  static const uptr kLoAppMemEnd = 0x...;
  static const uptr kShadowBeg = 0x...;
  ...
};

struct Mapping42 {
  static const uptr kLoAppMemBeg = 0x...;
  static const uptr kLoAppMemEnd = 0x...;
  static const uptr kShadowBeg = 0x...;
  ...
};

template<typename Mapping>
uptr MemToShadowImpl(uptr x) {
  return (((x) & ~(Mapping::kAppMemMsk | (kShadowCell - 1)))
      ^ Mapping::kAppMemXor) * kShadowCnt;
}

uptr MemToShadow(uptr x) {
  if (vma_size == 39)
    return MemToShadowImpl<Mapping39>(x);
  else
    return MemToShadowImpl<Mapping42>(x);
  DCHECK(0);
}

I suspect it can be faster as it is inlinable. And probably compiler will be able to do some common expressions merging, so the code won't be too large.

zatrazz added inline comments.Nov 5 2015, 8:28 AM

lib/tsan/rtl/tsan_platform.h
143–144	I will check this out, thanks!

The mapping using inline function is indeed slight faster than the previous attempt (I used the same benchmark as before, speccpu2006 test size). Changes from previous version:

Use a different strategy for mapping functions: instead of indirect functions inline version based on an external VMA detected at runtime.
Fixed the 39-bit shadow memory upper bound.

Much nicer now. Thanks.

Do we still need these extern vars in header
extern uptr kLoAppMemBeg
?
I would prefer if we get rid of any remaining usages of these and introduce more functions instead if needed.
They introduce significant amount of boilerplate. And also they are not necessary consts now, so it can be dangerous (performance-wise) to use them occasionally on some performance-critical path.

Also we now have duplication of complex code between MemToShadow for x86 and MemToShadowImpl for power.
Please move x86 consts to a Mapping struct as well. Then we can have a single MemToShadowImpl implementation and then define MemToShadow as:

uptr MemToShadowImpl(uptr x) {
#ifdef power

if (vmaSize == 39)
  return MemToShadowImpl<Mapping39>(x);
else
  return MemToShadowImpl<Mapping42>(x);

#else

return MemToShadowImpl<Mapping>(x);

#endif
}

Changes from previous patch:

Unify all implementations using the Mapping scheme.
Remove redundant code for go implementation by unify windows and non-windows transformation.
All accesses to mapping const definition are done through functions now. For architecture with only one mapping this won't change code generation.

For architecture with static mapping I had to still define kHeapMemBeg and kHeapMemEnd because they are used on SizeClassAllocator64 definition. I think a possible cleanup is possible (prob when I adjust tsan for aarch64 48-bit VMA).

Ping.

Sorry for long delays.
I've looked at it several times and it still feels somewhat awkward, but at the same time I cannot come up with something better (we could remove lots of boilerplate and replication with advanced C++ templates, but I doubt it worth it, advanced C++ templates is an issue in itself). So probably some amount of duplication and boilerplate here is the way to go.
I will take a final look today.

dvyukov added inline comments.Nov 17 2015, 8:44 AM

lib/tsan/rtl/tsan_platform.h
224	UserRegions are used only once during startup, there is no point in having both const and dynamic versions of it. Also you initialize the dynamic variable only on linux, but it actually needs to be initialized when TSAN_RUNTIME_VMA (e.g. on mac/aarch64 in future). Please leave only one version and define it in this header. Something along the lines of: bool GetUserRegion(int i, uptr start, uptr end) { switch (i) { default: return false; case 0: start = LoAppMemBeg(); end = LoAppMemEnd(); case 1: ... case 2: ... } } And loop in CheckShadowMapping until this function returns false.
272	Why !SANITIZER_WINDOWS? There is also no windows-Go-specific code, so these definitions must be necessary either on all Go platforms or on no Go platforms.
lib/tsan/rtl/tsan_platform_posix.cc
50–60	vmaSize should be already set by now, so use it. Add } else { CHECK(0); } branch below. It will catch both case when vmaSize is somehow not initialized by now and case when support for a new vma size is added.

zatrazz added inline comments.Nov 19 2015, 7:02 AM

lib/tsan/rtl/tsan_platform.h
224	I will change that.
272	This is wrong, I will fix it.
lib/tsan/rtl/tsan_platform_posix.cc
50–60	I will change that.

Changes from previous version:

Remove UserRegions struct and change the interface to call GetUserRegion.
Remove a wrong windows SANITIZER_GO define
Use vmaSize on InitializeShadowMemory for aarch64.

LGTM with nits

sorry for delays and thanks for bearing with me

lib/tsan/rtl/tsan_platform.h
248	#ifndef SANITIZER_GO that's more common throughout tsan
256	#else
315	drop !SANITIZER_WINDOWS part
lib/tsan/rtl/tsan_rtl.cc
291	please replace this with: uptr beg, end; for (int i = 0; GetUserRegion(i, &beg, &end); i++) { it is shorter and more idiomatic for loops.

This revision is now accepted and ready to land.Nov 20 2015, 11:40 AM

zatrazz closed this revision.Nov 26 2015, 5:33 AM

eugenis added inline comments.Dec 9 2015, 4:00 PM

lib/tsan/rtl/tsan_platform.h
129	I've noticed that this mapping does not have an application region at 0x55... That's where PIE executables are mapped by the newer kernels. Did you test this on an older kernel (I don't know exactly when the transition happened; it was around 4.1 for X86 and AFAIK a bit earlier for AArch 64)? See http://reviews.llvm.org/D15308 for more context.

zatrazz added inline comments.Dec 10 2015, 8:30 AM

lib/tsan/rtl/tsan_platform.h
129	I did check the PIE executable with a full TSAN test built with '-pie -fpic' and I saw no issue. Analyzing the trace for 39-bits using the mutexset1.cc (I picked it at random) I see: Non-pie build: 00400000-004d1000 r-xp 00000000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 004e0000-004e3000 r--p 000d0000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 004e3000-004e6000 rw-p 000d3000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 PIE build: 7fae119000-7fae1ee000 r-xp 00000000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 7fae1ee000-7fae1fb000 rw-p 00000000 00:00 0 7fae1fd000-7fae200000 r--p 000d4000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 7fae200000-7fae203000 rw-p 000d7000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 So at least for 39-bit PIE addresses are being covered. I will check again on 42-bit VMA.

zatrazz added inline comments.Dec 10 2015, 8:53 AM

lib/tsan/rtl/tsan_platform.h
129	I and I checked on a fairly recent kernel, 3.19. I am not aware if the mappings for PIE has changed in recent kernels, neither I checked on recent ones. Do you have more information?

eugenis added inline comments.Dec 10 2015, 11:14 AM

lib/tsan/rtl/tsan_platform.h
129	See this bug for the linux 4.1.2 problem with MSan: https://llvm.org/bugs/show_bug.cgi?id=24155 I see 0x55... mappings with 3.10.40 kernel on Android/AAarch64. git tag --contains in the torvalds repo tells me that this change appears in v4.1 and newer. Android must have cherry-picked it in 3.10. What happens if you disable ASLR? We used to get the main executable at 0x55... under gdb (i.e. with disabled randomization) long before 3.9.

zatrazz added inline comments.Dec 11 2015, 3:33 AM

lib/tsan/rtl/tsan_platform.h
129	My previous analysis was in fact wrong, even for the kernels I am currently using (3.19 for 39-bit and 3.17 for 42-bit I am seeing): 0x00000000000-0x00010000000: 39/42-bits program own segments 0x05500000000-0x05600000000: 39-bits PIE program segments 0x07f80000000-0x07fffffffff: 39-bits libraries segments 0x2aa00000000-0x2ab00000000: 42-bits PIE program segments 0x3ff00000000-0x3ffffffffff: 42-bits libraries segments Fortunately I could adjust aarch64 mapping to include these segments using the current mapping. I am preparing a patch.

Revision Contents

Path

Size

lib/

tsan/

rtl/

tsan_platform.h

612 lines

tsan_platform_linux.cc

33 lines

tsan_platform_mac.cc

3 lines

tsan_platform_posix.cc

52 lines

tsan_platform_windows.cc

3 lines

tsan_rtl.h

5 lines

tsan_rtl.cc

16 lines

test/

tsan/

test.h

2 lines

Diff 40643

lib/tsan/rtl/tsan_platform.h

	Show All 35 Lines
	3000 0000 0000 - 4000 0000 0000: metainfo (memory blocks and sync objects)			3000 0000 0000 - 4000 0000 0000: metainfo (memory blocks and sync objects)
	4000 0000 0000 - 6000 0000 0000: -			4000 0000 0000 - 6000 0000 0000: -
	6000 0000 0000 - 6200 0000 0000: traces			6000 0000 0000 - 6200 0000 0000: traces
	6200 0000 0000 - 7d00 0000 0000: -			6200 0000 0000 - 7d00 0000 0000: -
	7d00 0000 0000 - 7e00 0000 0000: heap			7d00 0000 0000 - 7e00 0000 0000: heap
	7e00 0000 0000 - 7e80 0000 0000: -			7e00 0000 0000 - 7e80 0000 0000: -
	7e80 0000 0000 - 8000 0000 0000: modules and main thread stack			7e80 0000 0000 - 8000 0000 0000: modules and main thread stack
	*/			*/
	const uptr kMetaShadowBeg = 0x300000000000ull;			struct Mapping {
	const uptr kMetaShadowEnd = 0x400000000000ull;			static const uptr kMetaShadowBeg = 0x300000000000ull;
	const uptr kTraceMemBeg = 0x600000000000ull;			static const uptr kMetaShadowEnd = 0x400000000000ull;
	const uptr kTraceMemEnd = 0x620000000000ull;			static const uptr kTraceMemBeg = 0x600000000000ull;
	const uptr kShadowBeg = 0x020000000000ull;			static const uptr kTraceMemEnd = 0x620000000000ull;
	const uptr kShadowEnd = 0x100000000000ull;			static const uptr kShadowBeg = 0x020000000000ull;
	const uptr kHeapMemBeg = 0x7d0000000000ull;			static const uptr kShadowEnd = 0x100000000000ull;
	const uptr kHeapMemEnd = 0x7e0000000000ull;			static const uptr kHeapMemBeg = 0x7d0000000000ull;
	const uptr kLoAppMemBeg = 0x000000001000ull;			static const uptr kHeapMemEnd = 0x7e0000000000ull;
	const uptr kLoAppMemEnd = 0x010000000000ull;			static const uptr kLoAppMemBeg = 0x000000001000ull;
	const uptr kHiAppMemBeg = 0x7e8000000000ull;			static const uptr kLoAppMemEnd = 0x010000000000ull;
	const uptr kHiAppMemEnd = 0x800000000000ull;			static const uptr kHiAppMemBeg = 0x7e8000000000ull;
	const uptr kAppMemMsk = 0x7c0000000000ull;			static const uptr kHiAppMemEnd = 0x800000000000ull;
	const uptr kAppMemXor = 0x020000000000ull;			static const uptr kAppMemMsk = 0x7c0000000000ull;
	const uptr kVdsoBeg = 0xf000000000000000ull;			static const uptr kAppMemXor = 0x020000000000ull;
				static const uptr kVdsoBeg = 0xf000000000000000ull;
				};
	#elif defined(__mips64)			#elif defined(__mips64)
	/*			/*
	C/C++ on linux/mips64			C/C++ on linux/mips64
	0100 0000 00 - 0200 0000 00: main binary			0100 0000 00 - 0200 0000 00: main binary
	0200 0000 00 - 1400 0000 00: -			0200 0000 00 - 1400 0000 00: -
	1400 0000 00 - 2400 0000 00: shadow			1400 0000 00 - 2400 0000 00: shadow
	2400 0000 00 - 3000 0000 00: -			2400 0000 00 - 3000 0000 00: -
	3000 0000 00 - 4000 0000 00: metainfo (memory blocks and sync objects)			3000 0000 00 - 4000 0000 00: metainfo (memory blocks and sync objects)
	4000 0000 00 - 6000 0000 00: -			4000 0000 00 - 6000 0000 00: -
	6000 0000 00 - 6200 0000 00: traces			6000 0000 00 - 6200 0000 00: traces
	6200 0000 00 - fe00 0000 00: -			6200 0000 00 - fe00 0000 00: -
	fe00 0000 00 - ff00 0000 00: heap			fe00 0000 00 - ff00 0000 00: heap
	ff00 0000 00 - ff80 0000 00: -			ff00 0000 00 - ff80 0000 00: -
	ff80 0000 00 - ffff ffff ff: modules and main thread stack			ff80 0000 00 - ffff ffff ff: modules and main thread stack
	*/			*/
	const uptr kMetaShadowBeg = 0x3000000000ull;			struct Mapping {
	const uptr kMetaShadowEnd = 0x4000000000ull;			static const uptr kMetaShadowBeg = 0x3000000000ull;
	const uptr kTraceMemBeg = 0x6000000000ull;			static const uptr kMetaShadowEnd = 0x4000000000ull;
	const uptr kTraceMemEnd = 0x6200000000ull;			static const uptr kTraceMemBeg = 0x6000000000ull;
	const uptr kShadowBeg = 0x1400000000ull;			static const uptr kTraceMemEnd = 0x6200000000ull;
	const uptr kShadowEnd = 0x2400000000ull;			static const uptr kShadowBeg = 0x1400000000ull;
	const uptr kHeapMemBeg = 0xfe00000000ull;			static const uptr kShadowEnd = 0x2400000000ull;
	const uptr kHeapMemEnd = 0xff00000000ull;			static const uptr kHeapMemBeg = 0xfe00000000ull;
	const uptr kLoAppMemBeg = 0x0100000000ull;			static const uptr kHeapMemEnd = 0xff00000000ull;
	const uptr kLoAppMemEnd = 0x0200000000ull;			static const uptr kLoAppMemBeg = 0x0100000000ull;
	const uptr kHiAppMemBeg = 0xff80000000ull;			static const uptr kLoAppMemEnd = 0x0200000000ull;
	const uptr kHiAppMemEnd = 0xffffffffffull;			static const uptr kHiAppMemBeg = 0xff80000000ull;
	const uptr kAppMemMsk = 0xfc00000000ull;			static const uptr kHiAppMemEnd = 0xffffffffffull;
	const uptr kAppMemXor = 0x0400000000ull;			static const uptr kAppMemMsk = 0xfc00000000ull;
	const uptr kVdsoBeg = 0xfffff00000ull;			static const uptr kAppMemXor = 0x0400000000ull;
				static const uptr kVdsoBeg = 0xfffff00000ull;
				};
	#elif defined(__aarch64__)			#elif defined(__aarch64__)
	# if SANITIZER_AARCH64_VMA == 39			// AArch64 supports multiple VMA which leads to multiple address transformation
				// functions. To support these multiple VMAS transformations and mappings TSAN
				// runtime for AArch64 uses an external memory read (vmaSize) to select which
				// mapping to use. Although slower, it make a same instrumented binary run on
				// multiple kernels.

	/*			/*
	C/C++ on linux/aarch64 (39-bit VMA)			C/C++ on linux/aarch64 (39-bit VMA)
	0000 4000 00 - 0200 0000 00: main binary			0000 0010 00 - 0100 0000 00: main binary
	2000 0000 00 - 4000 0000 00: shadow memory			0100 0000 00 - 0800 0000 00: -
	4000 0000 00 - 5000 0000 00: metainfo			0800 0000 00 - 1F00 0000 00: shadow memory
	5000 0000 00 - 6000 0000 00: -			1C00 0000 00 - 3100 0000 00: -
				3100 0000 00 - 3400 0000 00: metainfo
				3400 0000 00 - 6000 0000 00: -
	6000 0000 00 - 6200 0000 00: traces			6000 0000 00 - 6200 0000 00: traces
	6200 0000 00 - 7d00 0000 00: -			6200 0000 00 - 7d00 0000 00: -
	7d00 0000 00 - 7e00 0000 00: heap			7c00 0000 00 - 7d00 0000 00: heap
	7e00 0000 00 - 7fff ffff ff: modules and main thread stack			7d00 0000 00 - 7fff ffff ff: modules and main thread stack
	*/			*/
	const uptr kLoAppMemBeg = 0x0000400000ull;			struct Mapping39 {
	const uptr kLoAppMemEnd = 0x0200000000ull;			static const uptr kLoAppMemBeg = 0x0000001000ull;
	const uptr kShadowBeg = 0x2000000000ull;			static const uptr kLoAppMemEnd = 0x0100000000ull;
	const uptr kShadowEnd = 0x4000000000ull;			static const uptr kShadowBeg = 0x0800000000ull;
	const uptr kMetaShadowBeg = 0x4000000000ull;			static const uptr kShadowEnd = 0x1F00000000ull;
	const uptr kMetaShadowEnd = 0x5000000000ull;			static const uptr kMetaShadowBeg = 0x3100000000ull;
	const uptr kTraceMemBeg = 0x6000000000ull;			static const uptr kMetaShadowEnd = 0x3400000000ull;
	const uptr kTraceMemEnd = 0x6200000000ull;			static const uptr kTraceMemBeg = 0x6000000000ull;
	const uptr kHeapMemBeg = 0x7d00000000ull;			static const uptr kTraceMemEnd = 0x6200000000ull;
	const uptr kHeapMemEnd = 0x7e00000000ull;			static const uptr kHeapMemBeg = 0x7c00000000ull;
	const uptr kHiAppMemBeg = 0x7e00000000ull;			static const uptr kHeapMemEnd = 0x7d00000000ull;
	const uptr kHiAppMemEnd = 0x7fffffffffull;			static const uptr kHiAppMemBeg = 0x7d00000000ull;
	const uptr kAppMemMsk = 0x7800000000ull;			static const uptr kHiAppMemEnd = 0x7fffffffffull;
	const uptr kAppMemXor = 0x0800000000ull;			static const uptr kAppMemMsk = 0x7800000000ull;
	const uptr kVdsoBeg = 0x7f00000000ull;			static const uptr kAppMemXor = 0x0200000000ull;
	# elif SANITIZER_AARCH64_VMA == 42			static const uptr kVdsoBeg = 0x7f00000000ull;
				};
				eugenisUnsubmitted Not Done Reply Inline Actions I've noticed that this mapping does not have an application region at 0x55... That's where PIE executables are mapped by the newer kernels. Did you test this on an older kernel (I don't know exactly when the transition happened; it was around 4.1 for X86 and AFAIK a bit earlier for AArch 64)? See http://reviews.llvm.org/D15308 for more context. eugenis: I've noticed that this mapping does not have an application region at 0x55... That's where PIE…
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions I did check the PIE executable with a full TSAN test built with '-pie -fpic' and I saw no issue. Analyzing the trace for 39-bits using the mutexset1.cc (I picked it at random) I see: Non-pie build: 00400000-004d1000 r-xp 00000000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 004e0000-004e3000 r--p 000d0000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 004e3000-004e6000 rw-p 000d3000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 PIE build: 7fae119000-7fae1ee000 r-xp 00000000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 7fae1ee000-7fae1fb000 rw-p 00000000 00:00 0 7fae1fd000-7fae200000 r--p 000d4000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 7fae200000-7fae203000 rw-p 000d7000 08:02 21008174 /home/adhemerval.zanella/llvm/llvm-git-aarch64-build-release/mutexset1 So at least for 39-bit PIE addresses are being covered. I will check again on 42-bit VMA. zatrazz: I did check the PIE executable with a full TSAN test built with '-pie -fpic' and I saw no issue.
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions I and I checked on a fairly recent kernel, 3.19. I am not aware if the mappings for PIE has changed in recent kernels, neither I checked on recent ones. Do you have more information? zatrazz: I and I checked on a fairly recent kernel, 3.19. I am not aware if the mappings for PIE has…
				eugenisUnsubmitted Not Done Reply Inline Actions See this bug for the linux 4.1.2 problem with MSan: https://llvm.org/bugs/show_bug.cgi?id=24155 I see 0x55... mappings with 3.10.40 kernel on Android/AAarch64. git tag --contains in the torvalds repo tells me that this change appears in v4.1 and newer. Android must have cherry-picked it in 3.10. What happens if you disable ASLR? We used to get the main executable at 0x55... under gdb (i.e. with disabled randomization) long before 3.9. eugenis: See this bug for the linux 4.1.2 problem with MSan: https://llvm.org/bugs/show_bug.cgi?id=24155…
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions My previous analysis was in fact wrong, even for the kernels I am currently using (3.19 for 39-bit and 3.17 for 42-bit I am seeing): 0x00000000000-0x00010000000: 39/42-bits program own segments 0x05500000000-0x05600000000: 39-bits PIE program segments 0x07f80000000-0x07fffffffff: 39-bits libraries segments 0x2aa00000000-0x2ab00000000: 42-bits PIE program segments 0x3ff00000000-0x3ffffffffff: 42-bits libraries segments Fortunately I could adjust aarch64 mapping to include these segments using the current mapping. I am preparing a patch. zatrazz: My previous analysis was in fact wrong, even for the kernels I am currently using (3.19 for 39…

	/*			/*
	C/C++ on linux/aarch64 (42-bit VMA)			C/C++ on linux/aarch64 (42-bit VMA)
	00000 4000 00 - 01000 0000 00: main binary			00000 0010 00 - 01000 0000 00: main binary
	01000 0000 00 - 10000 0000 00: -			01000 0000 00 - 10000 0000 00: -
	10000 0000 00 - 20000 0000 00: shadow memory			10000 0000 00 - 20000 0000 00: shadow memory
	20000 0000 00 - 26000 0000 00: -			20000 0000 00 - 26000 0000 00: -
	26000 0000 00 - 28000 0000 00: metainfo			26000 0000 00 - 28000 0000 00: metainfo
	28000 0000 00 - 36200 0000 00: -			28000 0000 00 - 36200 0000 00: -
	36200 0000 00 - 36240 0000 00: traces			36200 0000 00 - 36240 0000 00: traces
	36240 0000 00 - 3e000 0000 00: -			36240 0000 00 - 3e000 0000 00: -
	3e000 0000 00 - 3f000 0000 00: heap			3e000 0000 00 - 3f000 0000 00: heap
	3c000 0000 00 - 3ff00 0000 00: -			3f000 0000 00 - 3ffff ffff ff: modules and main thread stack
	3ff00 0000 00 - 3ffff f000 00: modules and main thread stack
	*/			*/
	const uptr kLoAppMemBeg = 0x00000400000ull;			struct Mapping42 {
				dvyukovUnsubmitted Not Done Reply Inline Actions Won't it then be faster to inline all 3 variants? E.g. struct Mapping39 { static const uptr kLoAppMemBeg = 0x...; static const uptr kLoAppMemEnd = 0x...; static const uptr kShadowBeg = 0x...; ... }; struct Mapping42 { static const uptr kLoAppMemBeg = 0x...; static const uptr kLoAppMemEnd = 0x...; static const uptr kShadowBeg = 0x...; ... }; template<typename Mapping> uptr MemToShadowImpl(uptr x) { return (((x) & ~(Mapping::kAppMemMsk \| (kShadowCell - 1))) ^ Mapping::kAppMemXor) * kShadowCnt; } uptr MemToShadow(uptr x) { if (vma_size == 39) return MemToShadowImpl<Mapping39>(x); else return MemToShadowImpl<Mapping42>(x); DCHECK(0); } I suspect it can be faster as it is inlinable. And probably compiler will be able to do some common expressions merging, so the code won't be too large. dvyukov: Won't it then be faster to inline all 3 variants? E.g. struct Mapping39 { static const…
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions I will check this out, thanks! zatrazz: I will check this out, thanks!
	const uptr kLoAppMemEnd = 0x01000000000ull;			static const uptr kLoAppMemBeg = 0x00000001000ull;
	const uptr kShadowBeg = 0x10000000000ull;			static const uptr kLoAppMemEnd = 0x01000000000ull;
	const uptr kShadowEnd = 0x20000000000ull;			static const uptr kShadowBeg = 0x10000000000ull;
	const uptr kMetaShadowBeg = 0x26000000000ull;			static const uptr kShadowEnd = 0x20000000000ull;
	const uptr kMetaShadowEnd = 0x28000000000ull;			static const uptr kMetaShadowBeg = 0x26000000000ull;
	const uptr kTraceMemBeg = 0x36200000000ull;			static const uptr kMetaShadowEnd = 0x28000000000ull;
	const uptr kTraceMemEnd = 0x36400000000ull;			static const uptr kTraceMemBeg = 0x36200000000ull;
	const uptr kHeapMemBeg = 0x3e000000000ull;			static const uptr kTraceMemEnd = 0x36400000000ull;
	const uptr kHeapMemEnd = 0x3f000000000ull;			static const uptr kHeapMemBeg = 0x3e000000000ull;
	const uptr kHiAppMemBeg = 0x3ff00000000ull;			static const uptr kHeapMemEnd = 0x3f000000000ull;
	const uptr kHiAppMemEnd = 0x3fffff00000ull;			static const uptr kHiAppMemBeg = 0x3f000000000ull;
	const uptr kAppMemMsk = 0x3c000000000ull;			static const uptr kHiAppMemEnd = 0x3ffffffffffull;
	const uptr kAppMemXor = 0x04000000000ull;			static const uptr kAppMemMsk = 0x3c000000000ull;
				dvyukovUnsubmitted Not Done Reply Inline Actions But can we have just global vars? It seems to be enough. The same generic functions will just operate with different values stores in the vars. I am just frighten by the size of this change, amount of logical indirections and macros. dvyukov: But can we have just global vars? It seems to be enough. The same generic functions will just…
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions We can, but it will be slower: every function transformation will require not only an indirection but also potentially memory reads to actually get the global variables value. The idea of the patch to create multiple functions is to try to minimize the performance hit this same indirection occurs by creating functions is fast as possible. zatrazz: We can, but it will be slower: every function transformation will require not only an…
	const uptr kVdsoBeg = 0x37f00000000ull;			static const uptr kAppMemXor = 0x04000000000ull;
	# endif			static const uptr kVdsoBeg = 0x37f00000000ull;
				dvyukovUnsubmitted Not Done Reply Inline Actions You introduce both indirect functions and these non-const variables. This looks excessive. Let's do one or another. I would prefer global variables as it looks less intrusive. You can do something along the lines of: Here declare the consts: extern uptr kLoAppMemBeg; ... And then in InitializePlatformEarly initialize them: kLoAppMemEnd = vma_39_42(0x0000400000ull, 0x00000400000ull); And that's it. dvyukov: You introduce both indirect functions and these non-const variables. This looks excessive.
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions I declared them in the header because I used the constants in the function definitions at lib/tsan/rtl/tsan_platform_linux.cc. I can move them to the cc file if it desirable, but I think by removing the definitions will require to hardcode its value on the function definition somehow (and I think current approach in simple than replicate the hexadecimals values in some places). Also, I use them so to add another VMA (48-bits) would not require much work. Instead of adding now 'vma_39_42' and then moving it to 'vma_39_42_48', the patch will just require to check and add the 48-bits definitions. zatrazz: I declared them in the header because I used the constants in the function definitions at…
	#endif

	ALWAYS_INLINE
	bool IsAppMem(uptr mem) {
	return (mem >= kHeapMemBeg && mem < kHeapMemEnd) \|\|
	(mem >= kLoAppMemBeg && mem < kLoAppMemEnd) \|\|
	(mem >= kHiAppMemBeg && mem < kHiAppMemEnd);
	}

	ALWAYS_INLINE
	bool IsShadowMem(uptr mem) {
	return mem >= kShadowBeg && mem <= kShadowEnd;
	}

	ALWAYS_INLINE
	bool IsMetaMem(uptr mem) {
	return mem >= kMetaShadowBeg && mem <= kMetaShadowEnd;
	}

	ALWAYS_INLINE
	uptr MemToShadow(uptr x) {
	DCHECK(IsAppMem(x));
	return (((x) & ~(kAppMemMsk \| (kShadowCell - 1)))
	^ kAppMemXor) * kShadowCnt;
	}

	ALWAYS_INLINE
	u32 *MemToMeta(uptr x) {
	DCHECK(IsAppMem(x));
	return (u32*)(((((x) & ~(kAppMemMsk \| (kMetaShadowCell - 1)))
	^ kAppMemXor) / kMetaShadowCell * kMetaShadowSize) \| kMetaShadowBeg);
	}

	ALWAYS_INLINE
	uptr ShadowToMem(uptr s) {
	CHECK(IsShadowMem(s));
	if (s >= MemToShadow(kLoAppMemBeg) && s <= MemToShadow(kLoAppMemEnd - 1))
	return (s / kShadowCnt) ^ kAppMemXor;
	else
	return ((s / kShadowCnt) ^ kAppMemXor) \| kAppMemMsk;
	}

	static USED uptr UserRegions[] = {
	kLoAppMemBeg, kLoAppMemEnd,
	kHiAppMemBeg, kHiAppMemEnd,
	kHeapMemBeg, kHeapMemEnd,
	};			};

				// Indicates the runtime will define the memory regions at runtime.
				#define TSAN_RUNTIME_VMA 1
				#endif

	#elif defined(SANITIZER_GO) && !SANITIZER_WINDOWS			#elif defined(SANITIZER_GO) && !SANITIZER_WINDOWS

	/* Go on linux, darwin and freebsd			/* Go on linux, darwin and freebsd
	0000 0000 1000 - 0000 1000 0000: executable			0000 0000 1000 - 0000 1000 0000: executable
	0000 1000 0000 - 00c0 0000 0000: -			0000 1000 0000 - 00c0 0000 0000: -
	00c0 0000 0000 - 00e0 0000 0000: heap			00c0 0000 0000 - 00e0 0000 0000: heap
	00e0 0000 0000 - 2000 0000 0000: -			00e0 0000 0000 - 2000 0000 0000: -
	2000 0000 0000 - 2380 0000 0000: shadow			2000 0000 0000 - 2380 0000 0000: shadow
	2380 0000 0000 - 3000 0000 0000: -			2380 0000 0000 - 3000 0000 0000: -
	3000 0000 0000 - 4000 0000 0000: metainfo (memory blocks and sync objects)			3000 0000 0000 - 4000 0000 0000: metainfo (memory blocks and sync objects)
				dvyukovUnsubmitted Not Done Reply Inline Actions All identifiers with double underscores are reserved for language implementation. TSAN_ is a fine prefix, and also consistent with all macros in the codebase. dvyukov: All identifiers with double underscores are reserved for language implementation. TSAN_ is a…
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions I will change it. zatrazz: I will change it.
	4000 0000 0000 - 6000 0000 0000: -			4000 0000 0000 - 6000 0000 0000: -
	6000 0000 0000 - 6200 0000 0000: traces			6000 0000 0000 - 6200 0000 0000: traces
	6200 0000 0000 - 8000 0000 0000: -			6200 0000 0000 - 8000 0000 0000: -
	*/			*/

	const uptr kMetaShadowBeg = 0x300000000000ull;			struct Mapping {
	const uptr kMetaShadowEnd = 0x400000000000ull;			static const uptr kMetaShadowBeg = 0x300000000000ull;
	const uptr kTraceMemBeg = 0x600000000000ull;			static const uptr kMetaShadowEnd = 0x400000000000ull;
	const uptr kTraceMemEnd = 0x620000000000ull;			static const uptr kTraceMemBeg = 0x600000000000ull;
	const uptr kShadowBeg = 0x200000000000ull;			static const uptr kTraceMemEnd = 0x620000000000ull;
	const uptr kShadowEnd = 0x238000000000ull;			static const uptr kShadowBeg = 0x200000000000ull;
	const uptr kAppMemBeg = 0x000000001000ull;			static const uptr kShadowEnd = 0x238000000000ull;
	const uptr kAppMemEnd = 0x00e000000000ull;			static const uptr kAppMemBeg = 0x000000001000ull;
				static const uptr kAppMemEnd = 0x00e000000000ull;
				};

				#elif defined(SANITIZER_GO) && SANITIZER_WINDOWS

				/* Go on windows
				0000 0000 1000 - 0000 1000 0000: executable
				0000 1000 0000 - 00f8 0000 0000: -
				00c0 0000 0000 - 00e0 0000 0000: heap
				00e0 0000 0000 - 0100 0000 0000: -
				0100 0000 0000 - 0500 0000 0000: shadow
				0500 0000 0000 - 0560 0000 0000: -
				0560 0000 0000 - 0760 0000 0000: traces
				0760 0000 0000 - 07d0 0000 0000: metainfo (memory blocks and sync objects)
				07d0 0000 0000 - 8000 0000 0000: -
				*/

				struct Mapping {
				static const uptr kMetaShadowBeg = 0x076000000000ull;
				static const uptr kMetaShadowEnd = 0x07d000000000ull;
				static const uptr kTraceMemBeg = 0x056000000000ull;
				static const uptr kTraceMemEnd = 0x076000000000ull;
				static const uptr kShadowBeg = 0x010000000000ull;
				static const uptr kShadowEnd = 0x050000000000ull;
				static const uptr kAppMemBeg = 0x000000001000ull;
				static const uptr kAppMemEnd = 0x00e000000000ull;
				}

				#else
				# error "Unknown platform"
				#endif


				#ifdef TSAN_RUNTIME_VMA
				extern uptr vmaSize;
				#endif
				dvyukovUnsubmitted Not Done Reply Inline Actions UserRegions are used only once during startup, there is no point in having both const and dynamic versions of it. Also you initialize the dynamic variable only on linux, but it actually needs to be initialized when TSAN_RUNTIME_VMA (e.g. on mac/aarch64 in future). Please leave only one version and define it in this header. Something along the lines of: bool GetUserRegion(int i, uptr start, uptr end) { switch (i) { default: return false; case 0: start = LoAppMemBeg(); end = LoAppMemEnd(); case 1: ... case 2: ... } } And loop in CheckShadowMapping until this function returns false. dvyukov: UserRegions are used only once during startup, there is no point in having both const and…
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions I will change that. zatrazz: I will change that.


				enum MappingType {
				MAPPING_LO_APP_BEG,
				MAPPING_LO_APP_END,
				MAPPING_HI_APP_BEG,
				MAPPING_HI_APP_END,
				MAPPING_HEAP_BEG,
				MAPPING_HEAP_END,
				MAPPING_APP_BEG,
				MAPPING_APP_END,
				MAPPING_SHADOW_BEG,
				MAPPING_SHADOW_END,
				MAPPING_META_SHADOW_BEG,
				MAPPING_META_SHADOW_END,
				MAPPING_TRACE_BEG,
				MAPPING_TRACE_END,
				MAPPING_VDSO_BEG,
				};

				template<typename Mapping, int Type>
				uptr MappingImpl(void) {
				switch (Type) {
				#if !defined(SANITIZER_GO)
				dvyukovUnsubmitted Not Done Reply Inline Actions #ifndef SANITIZER_GO that's more common throughout tsan dvyukov: #ifndef SANITIZER_GO that's more common throughout tsan
				case MAPPING_LO_APP_BEG: return Mapping::kLoAppMemBeg;
				case MAPPING_LO_APP_END: return Mapping::kLoAppMemEnd;
				case MAPPING_HI_APP_BEG: return Mapping::kHiAppMemBeg;
				case MAPPING_HI_APP_END: return Mapping::kHiAppMemEnd;
				case MAPPING_HEAP_BEG: return Mapping::kHeapMemBeg;
				case MAPPING_HEAP_END: return Mapping::kHeapMemEnd;
				case MAPPING_VDSO_BEG: return Mapping::kVdsoBeg;
				#elif defined(SANITIZER_GO)
				dvyukovUnsubmitted Not Done Reply Inline Actions #else dvyukov: #else
				case MAPPING_APP_BEG: return Mapping::kAppMemBeg;
				case MAPPING_APP_END: return Mapping::kAppMemEnd;
				#endif
				case MAPPING_SHADOW_BEG: return Mapping::kShadowBeg;
				case MAPPING_SHADOW_END: return Mapping::kShadowEnd;
				case MAPPING_META_SHADOW_BEG: return Mapping::kMetaShadowBeg;
				case MAPPING_META_SHADOW_END: return Mapping::kMetaShadowEnd;
				case MAPPING_TRACE_BEG: return Mapping::kTraceMemBeg;
				case MAPPING_TRACE_END: return Mapping::kTraceMemEnd;
				}
				}

				template<int Type>
				uptr MappingArchImpl(void) {
				#ifdef __aarch64__
				if (vmaSize == 39)
				dvyukovUnsubmitted Not Done Reply Inline Actions Why !SANITIZER_WINDOWS? There is also no windows-Go-specific code, so these definitions must be necessary either on all Go platforms or on no Go platforms. dvyukov: Why !SANITIZER_WINDOWS? There is also no windows-Go-specific code, so these definitions must be…
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions This is wrong, I will fix it. zatrazz: This is wrong, I will fix it.
				return MappingImpl<Mapping39, Type>();
				else
				return MappingImpl<Mapping42, Type>();
				DCHECK(0);
				#else
				return MappingImpl<Mapping, Type>();
				#endif
				}

				#if !defined(SANITIZER_GO)
	ALWAYS_INLINE			ALWAYS_INLINE
	bool IsAppMem(uptr mem) {			uptr LoAppMemBeg(void) {
	return mem >= kAppMemBeg && mem < kAppMemEnd;			return MappingArchImpl<MAPPING_LO_APP_BEG>();
				}
				ALWAYS_INLINE
				uptr LoAppMemEnd(void) {
				return MappingArchImpl<MAPPING_LO_APP_END>();
	}			}

	ALWAYS_INLINE			ALWAYS_INLINE
	bool IsShadowMem(uptr mem) {			uptr HeapMemBeg(void) {
	return mem >= kShadowBeg && mem <= kShadowEnd;			return MappingArchImpl<MAPPING_HEAP_BEG>();
				}
				ALWAYS_INLINE
				uptr HeapMemEnd(void) {
				return MappingArchImpl<MAPPING_HEAP_END>();
	}			}

	ALWAYS_INLINE			ALWAYS_INLINE
	bool IsMetaMem(uptr mem) {			uptr HiAppMemBeg(void) {
	return mem >= kMetaShadowBeg && mem <= kMetaShadowEnd;			return MappingArchImpl<MAPPING_HI_APP_BEG>();
				}
				ALWAYS_INLINE
				uptr HiAppMemEnd(void) {
				return MappingArchImpl<MAPPING_HI_APP_END>();
	}			}

	ALWAYS_INLINE			ALWAYS_INLINE
	uptr MemToShadow(uptr x) {			uptr VdsoBeg(void) {
	DCHECK(IsAppMem(x));			return MappingArchImpl<MAPPING_VDSO_BEG>();
	return ((x & ~(kShadowCell - 1)) * kShadowCnt) \| kShadowBeg;
	}			}

				#elif defined(SANITIZER_GO) && !SANITIZER_WINDOWS
				dvyukovUnsubmitted Not Done Reply Inline Actions drop !SANITIZER_WINDOWS part dvyukov: drop !SANITIZER_WINDOWS part

	ALWAYS_INLINE			ALWAYS_INLINE
	u32 *MemToMeta(uptr x) {			uptr AppMemBeg(void) {
	DCHECK(IsAppMem(x));			return MappingArchImpl<MAPPING_APP_BEG>();
	return (u32*)(((x & ~(kMetaShadowCell - 1)) / \			}
	kMetaShadowCell * kMetaShadowSize) \| kMetaShadowBeg);			ALWAYS_INLINE
				uptr AppMemEnd(void) {
				return MappingArchImpl<MAPPING_APP_END>();
				}

				#endif

				static inline
				bool GetUserRegion(int i, uptr start, uptr end) {
				switch (i) {
				default:
				return false;
				#ifndef SANITIZER_GO
				case 0:
				*start = LoAppMemBeg();
				*end = LoAppMemEnd();
				return true;
				case 1:
				*start = HiAppMemBeg();
				*end = HiAppMemEnd();
				return true;
				case 2:
				*start = HeapMemBeg();
				*end = HeapMemEnd();
				return true;
				#else
				case 0:
				*start = AppMemBeg();
				*end = AppMemEnd();
				return true;
				#endif
				}
	}			}

	ALWAYS_INLINE			ALWAYS_INLINE
	uptr ShadowToMem(uptr s) {			uptr ShadowBeg(void) {
	CHECK(IsShadowMem(s));			return MappingArchImpl<MAPPING_SHADOW_BEG>();
	return (s & ~kShadowBeg) / kShadowCnt;			}
				ALWAYS_INLINE
				uptr ShadowEnd(void) {
				return MappingArchImpl<MAPPING_SHADOW_END>();
	}			}

	static USED uptr UserRegions[] = {			ALWAYS_INLINE
	kAppMemBeg, kAppMemEnd,			uptr MetaShadowBeg(void) {
	};			return MappingArchImpl<MAPPING_META_SHADOW_BEG>();
				}
				ALWAYS_INLINE
				uptr MetaShadowEnd(void) {
				return MappingArchImpl<MAPPING_META_SHADOW_END>();
				}

	#elif defined(SANITIZER_GO) && SANITIZER_WINDOWS			ALWAYS_INLINE
				uptr TraceMemBeg(void) {
				return MappingArchImpl<MAPPING_TRACE_BEG>();
				}
				ALWAYS_INLINE
				uptr TraceMemEnd(void) {
				return MappingArchImpl<MAPPING_TRACE_END>();
				}

	/* Go on windows
	0000 0000 1000 - 0000 1000 0000: executable
	0000 1000 0000 - 00f8 0000 0000: -
	00c0 0000 0000 - 00e0 0000 0000: heap
	00e0 0000 0000 - 0100 0000 0000: -
	0100 0000 0000 - 0500 0000 0000: shadow
	0500 0000 0000 - 0560 0000 0000: -
	0560 0000 0000 - 0760 0000 0000: traces
	0760 0000 0000 - 07d0 0000 0000: metainfo (memory blocks and sync objects)
	07d0 0000 0000 - 8000 0000 0000: -
	*/

	const uptr kMetaShadowBeg = 0x076000000000ull;			template<typename Mapping>
	const uptr kMetaShadowEnd = 0x07d000000000ull;			bool IsAppMemImpl(uptr mem) {
	const uptr kTraceMemBeg = 0x056000000000ull;			#if !defined(SANITIZER_GO)
	const uptr kTraceMemEnd = 0x076000000000ull;			return (mem >= Mapping::kHeapMemBeg && mem < Mapping::kHeapMemEnd) \|\|
	const uptr kShadowBeg = 0x010000000000ull;			(mem >= Mapping::kLoAppMemBeg && mem < Mapping::kLoAppMemEnd) \|\|
	const uptr kShadowEnd = 0x050000000000ull;			(mem >= Mapping::kHiAppMemBeg && mem < Mapping::kHiAppMemEnd);
	const uptr kAppMemBeg = 0x000000001000ull;			#else
	const uptr kAppMemEnd = 0x00e000000000ull;			return mem >= Mapping::kAppMemBeg && mem < Mapping::kAppMemEnd;
				#endif
				}

	ALWAYS_INLINE			ALWAYS_INLINE
	bool IsAppMem(uptr mem) {			bool IsAppMem(uptr mem) {
	return mem >= kAppMemBeg && mem < kAppMemEnd;			#ifdef __aarch64__
				if (vmaSize == 39)
				return IsAppMemImpl<Mapping39>(mem);
				else
				return IsAppMemImpl<Mapping42>(mem);
				DCHECK(0);
				#else
				return IsAppMemImpl<Mapping>(mem);
				#endif
				}


				template<typename Mapping>
				bool IsShadowMemImpl(uptr mem) {
				return mem >= Mapping::kShadowBeg && mem <= Mapping::kShadowEnd;
	}			}

	ALWAYS_INLINE			ALWAYS_INLINE
	bool IsShadowMem(uptr mem) {			bool IsShadowMem(uptr mem) {
	return mem >= kShadowBeg && mem <= kShadowEnd;			#ifdef __aarch64__
				if (vmaSize == 39)
				return IsShadowMemImpl<Mapping39>(mem);
				else
				return IsShadowMemImpl<Mapping42>(mem);
				DCHECK(0);
				#else
				return IsShadowMemImpl<Mapping>(mem);
				#endif
				}


				template<typename Mapping>
				bool IsMetaMemImpl(uptr mem) {
				return mem >= Mapping::kMetaShadowBeg && mem <= Mapping::kMetaShadowEnd;
	}			}

	ALWAYS_INLINE			ALWAYS_INLINE
	bool IsMetaMem(uptr mem) {			bool IsMetaMem(uptr mem) {
	return mem >= kMetaShadowBeg && mem <= kMetaShadowEnd;			#ifdef __aarch64__
				if (vmaSize == 39)
				return IsMetaMemImpl<Mapping39>(mem);
				else
				return IsMetaMemImpl<Mapping42>(mem);
				DCHECK(0);
				#else
				return IsMetaMemImpl<Mapping>(mem);
				#endif
	}			}

	ALWAYS_INLINE
	uptr MemToShadow(uptr x) {			template<typename Mapping>
				uptr MemToShadowImpl(uptr x) {
	DCHECK(IsAppMem(x));			DCHECK(IsAppMem(x));
	return ((x & ~(kShadowCell - 1)) * kShadowCnt) + kShadowBeg;			#if !defined(SANITIZER_GO)
				return (((x) & ~(Mapping::kAppMemMsk \| (kShadowCell - 1)))
				^ Mapping::kAppMemXor) * kShadowCnt;
				#else
				return ((x & ~(kShadowCell - 1)) * kShadowCnt) \| Mapping::kShadowBeg;
				#endif
	}			}

	ALWAYS_INLINE			ALWAYS_INLINE
	u32 *MemToMeta(uptr x) {			uptr MemToShadow(uptr x) {
				#ifdef __aarch64__
				if (vmaSize == 39)
				return MemToShadowImpl<Mapping39>(x);
				else
				return MemToShadowImpl<Mapping42>(x);
				DCHECK(0);
				#else
				return MemToShadowImpl<Mapping>(x);
				#endif
				}


				template<typename Mapping>
				u32 *MemToMetaImpl(uptr x) {
	DCHECK(IsAppMem(x));			DCHECK(IsAppMem(x));
				#if !defined(SANITIZER_GO)
				return (u32*)(((((x) & ~(Mapping::kAppMemMsk \| (kMetaShadowCell - 1)))
				^ Mapping::kAppMemXor) / kMetaShadowCell * kMetaShadowSize)
				\| Mapping::kMetaShadowBeg);
				#else
	return (u32*)(((x & ~(kMetaShadowCell - 1)) / \			return (u32*)(((x & ~(kMetaShadowCell - 1)) / \
	kMetaShadowCell * kMetaShadowSize) \| kMetaShadowBeg);			kMetaShadowCell * kMetaShadowSize) \| Mapping::kMetaShadowBeg);
				#endif
	}			}

	ALWAYS_INLINE			ALWAYS_INLINE
	uptr ShadowToMem(uptr s) {			u32 *MemToMeta(uptr x) {
	CHECK(IsShadowMem(s));			#ifdef __aarch64__
	// FIXME(dvyukov): this is most likely wrong as the mapping is not bijection.			if (vmaSize == 39)
	return (s - kShadowBeg) / kShadowCnt;			return MemToMetaImpl<Mapping39>(x);
				else
				return MemToMetaImpl<Mapping42>(x);
				DCHECK(0);
				#else
				return MemToMetaImpl<Mapping>(x);
				#endif
	}			}

	static USED uptr UserRegions[] = {
	kAppMemBeg, kAppMemEnd,
	};

				template<typename Mapping>
				uptr ShadowToMemImpl(uptr s) {
				DCHECK(IsShadowMem(s));
				#if !defined(SANITIZER_GO)
				if (s >= MemToShadow(Mapping::kLoAppMemBeg)
				&& s <= MemToShadow(Mapping::kLoAppMemEnd - 1))
				return (s / kShadowCnt) ^ Mapping::kAppMemXor;
				else
				return ((s / kShadowCnt) ^ Mapping::kAppMemXor) \| Mapping::kAppMemMsk;
				#elif defined(SANITIZER_GO)
				# ifndef SANITIZER_WINDOWS
				return (s & ~Mapping::kShadowBeg) / kShadowCnt;
	#else			# else
	# error "Unknown platform"			// FIXME(dvyukov): this is most likely wrong as the mapping is not bijection.
				return (s - Mapping::kShadowBeg) / kShadowCnt;
				# endif // SANITIZER_WINDOWS
				#endif
				}

				ALWAYS_INLINE
				uptr ShadowToMem(uptr s) {
				#ifdef __aarch64__
				if (vmaSize == 39)
				return ShadowToMemImpl<Mapping39>(s);
				else
				return ShadowToMemImpl<Mapping42>(s);
				DCHECK(0);
				#else
				return ShadowToMemImpl<Mapping>(s);
	#endif			#endif
				}



	// The additional page is to catch shadow stack overflow as paging fault.			// The additional page is to catch shadow stack overflow as paging fault.
	// Windows wants 64K alignment for mmaps.			// Windows wants 64K alignment for mmaps.
	const uptr kTotalTraceSize = (kTraceSize * sizeof(Event) + sizeof(Trace)			const uptr kTotalTraceSize = (kTraceSize * sizeof(Event) + sizeof(Trace)
	+ (64 << 10) + (64 << 10) - 1) & ~((64 << 10) - 1);			+ (64 << 10) + (64 << 10) - 1) & ~((64 << 10) - 1);

	uptr ALWAYS_INLINE GetThreadTrace(int tid) {			template<typename Mapping>
	uptr p = kTraceMemBeg + (uptr)tid * kTotalTraceSize;			uptr GetThreadTraceImpl(int tid) {
	DCHECK_LT(p, kTraceMemEnd);			uptr p = Mapping::kTraceMemBeg + (uptr)tid * kTotalTraceSize;
				DCHECK_LT(p, Mapping::kTraceMemEnd);
	return p;			return p;
	}			}

	uptr ALWAYS_INLINE GetThreadTraceHeader(int tid) {			ALWAYS_INLINE
	uptr p = kTraceMemBeg + (uptr)tid * kTotalTraceSize			uptr GetThreadTrace(int tid) {
				#ifdef __aarch64__
				if (vmaSize == 39)
				return GetThreadTraceImpl<Mapping39>(tid);
				else
				return GetThreadTraceImpl<Mapping42>(tid);
				DCHECK(0);
				#else
				return GetThreadTraceImpl<Mapping>(tid);
				#endif
				}


				template<typename Mapping>
				uptr GetThreadTraceHeaderImpl(int tid) {
				uptr p = Mapping::kTraceMemBeg + (uptr)tid * kTotalTraceSize
	+ kTraceSize * sizeof(Event);			+ kTraceSize * sizeof(Event);
	DCHECK_LT(p, kTraceMemEnd);			DCHECK_LT(p, Mapping::kTraceMemEnd);
	return p;			return p;
	}			}

				ALWAYS_INLINE
				uptr GetThreadTraceHeader(int tid) {
				#ifdef __aarch64__
				if (vmaSize == 39)
				return GetThreadTraceHeaderImpl<Mapping39>(tid);
				else
				return GetThreadTraceHeaderImpl<Mapping42>(tid);
				DCHECK(0);
				#else
				return GetThreadTraceHeaderImpl<Mapping>(tid);
				#endif
				}

	void InitializePlatform();			void InitializePlatform();
				void InitializePlatformEarly();
				dvyukovUnsubmitted Not Done Reply Inline Actions Please rename this to InitializePlatformEarly. I don't understand what is "Modules" (dynamic libraries?). And "Specific" is excessive, "Platform" already implies platform-specific stuff. dvyukov: Please rename this to InitializePlatformEarly. I don't understand what is "Modules" (dynamic…
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions I will change it (I used 'modules' and 'specific' because I could not came up with a better naming). zatrazz: I will change it (I used 'modules' and 'specific' because I could not came up with a better…
	void CheckAndProtect();			void CheckAndProtect();
	void InitializeShadowMemoryPlatform();			void InitializeShadowMemoryPlatform();
	void FlushShadowMemory();			void FlushShadowMemory();
	void WriteMemoryProfile(char *buf, uptr buf_size, uptr nthread, uptr nlive);			void WriteMemoryProfile(char *buf, uptr buf_size, uptr nthread, uptr nlive);

	// Says whether the addr relates to a global var.			// Says whether the addr relates to a global var.
	// Guesses with high probability, may yield both false positives and negatives.			// Guesses with high probability, may yield both false positives and negatives.
	bool IsGlobalVar(uptr addr);			bool IsGlobalVar(uptr addr);
	Show All 12 Lines

lib/tsan/rtl/tsan_platform_linux.cc

Show First 20 Lines • Show All 61 Lines • ▼ Show 20 Lines
void *__libc_stack_end = 0;		void *__libc_stack_end = 0;
#endif		#endif

namespace __tsan {		namespace __tsan {

static uptr g_data_start;		static uptr g_data_start;
static uptr g_data_end;		static uptr g_data_end;

		#ifdef TSAN_RUNTIME_VMA
		// Runtime detected VMA size.
		uptr vmaSize;
		#endif

enum {		enum {
MemTotal = 0,		MemTotal = 0,
MemShadow = 1,		MemShadow = 1,
MemMeta = 2,		MemMeta = 2,
MemFile = 3,		MemFile = 3,
MemMmap = 4,		MemMmap = 4,
MemTrace = 5,		MemTrace = 5,
MemHeap = 6,		MemHeap = 6,
MemOther = 7,		MemOther = 7,
MemCount = 8,		MemCount = 8,
};		};

void FillProfileCallback(uptr p, uptr rss, bool file,		void FillProfileCallback(uptr p, uptr rss, bool file,
uptr *mem, uptr stats_size) {		uptr *mem, uptr stats_size) {
mem[MemTotal] += rss;		mem[MemTotal] += rss;
if (p >= kShadowBeg && p < kShadowEnd)		if (p >= ShadowBeg() && p < ShadowEnd())
mem[MemShadow] += rss;		mem[MemShadow] += rss;
else if (p >= kMetaShadowBeg && p < kMetaShadowEnd)		else if (p >= MetaShadowBeg() && p < MetaShadowEnd())
mem[MemMeta] += rss;		mem[MemMeta] += rss;
#ifndef SANITIZER_GO		#ifndef SANITIZER_GO
else if (p >= kHeapMemBeg && p < kHeapMemEnd)		else if (p >= HeapMemBeg() && p < HeapMemEnd())
mem[MemHeap] += rss;		mem[MemHeap] += rss;
else if (p >= kLoAppMemBeg && p < kLoAppMemEnd)		else if (p >= LoAppMemBeg() && p < LoAppMemEnd())
mem[file ? MemFile : MemMmap] += rss;		mem[file ? MemFile : MemMmap] += rss;
else if (p >= kHiAppMemBeg && p < kHiAppMemEnd)		else if (p >= HiAppMemBeg() && p < HiAppMemEnd())
mem[file ? MemFile : MemMmap] += rss;		mem[file ? MemFile : MemMmap] += rss;
#else		#else
else if (p >= kAppMemBeg && p < kAppMemEnd)		else if (p >= AppMemBeg() && p < AppMemEnd())
mem[file ? MemFile : MemMmap] += rss;		mem[file ? MemFile : MemMmap] += rss;
#endif		#endif
else if (p >= kTraceMemBeg && p < kTraceMemEnd)		else if (p >= TraceMemBeg() && p < TraceMemEnd())
mem[MemTrace] += rss;		mem[MemTrace] += rss;
else		else
mem[MemOther] += rss;		mem[MemOther] += rss;
}		}

void WriteMemoryProfile(char *buf, uptr buf_size, uptr nthread, uptr nlive) {		void WriteMemoryProfile(char *buf, uptr buf_size, uptr nthread, uptr nlive) {
uptr mem[MemCount] = {};		uptr mem[MemCount] = {};
__sanitizer::GetMemoryProfile(FillProfileCallback, mem, 7);		__sanitizer::GetMemoryProfile(FillProfileCallback, mem, 7);
StackDepotStats *stacks = StackDepotGetStats();		StackDepotStats *stacks = StackDepotGetStats();
internal_snprintf(buf, buf_size,		internal_snprintf(buf, buf_size,
"RSS %zd MB: shadow:%zd meta:%zd file:%zd mmap:%zd"		"RSS %zd MB: shadow:%zd meta:%zd file:%zd mmap:%zd"
" trace:%zd heap:%zd other:%zd stacks=%zd[%zd] nthr=%zd/%zd\n",		" trace:%zd heap:%zd other:%zd stacks=%zd[%zd] nthr=%zd/%zd\n",
mem[MemTotal] >> 20, mem[MemShadow] >> 20, mem[MemMeta] >> 20,		mem[MemTotal] >> 20, mem[MemShadow] >> 20, mem[MemMeta] >> 20,
mem[MemFile] >> 20, mem[MemMmap] >> 20, mem[MemTrace] >> 20,		mem[MemFile] >> 20, mem[MemMmap] >> 20, mem[MemTrace] >> 20,
mem[MemHeap] >> 20, mem[MemOther] >> 20,		mem[MemHeap] >> 20, mem[MemOther] >> 20,
stacks->allocated >> 20, stacks->n_uniq_ids,		stacks->allocated >> 20, stacks->n_uniq_ids,
nlive, nthread);		nlive, nthread);
}		}

#if SANITIZER_LINUX		#if SANITIZER_LINUX
void FlushShadowMemoryCallback(		void FlushShadowMemoryCallback(
const SuspendedThreadsList &suspended_threads_list,		const SuspendedThreadsList &suspended_threads_list,
void *argument) {		void *argument) {
FlushUnneededShadowMemory(kShadowBeg, kShadowEnd - kShadowBeg);		FlushUnneededShadowMemory(ShadowBeg(), ShadowEnd() - ShadowBeg());
}		}
#endif		#endif

void FlushShadowMemory() {		void FlushShadowMemory() {
#if SANITIZER_LINUX		#if SANITIZER_LINUX
StopTheWorld(FlushShadowMemoryCallback, 0);		StopTheWorld(FlushShadowMemoryCallback, 0);
#endif		#endif
}		}
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	#endif
DPrintf("guessed data_start=%p data_end=%p\n", g_data_start, g_data_end);		DPrintf("guessed data_start=%p data_end=%p\n", g_data_start, g_data_end);
CHECK_LT(g_data_start, g_data_end);		CHECK_LT(g_data_start, g_data_end);
CHECK_GE((uptr)&g_data_start, g_data_start);		CHECK_GE((uptr)&g_data_start, g_data_start);
CHECK_LT((uptr)&g_data_start, g_data_end);		CHECK_LT((uptr)&g_data_start, g_data_end);
}		}

#endif // #ifndef SANITIZER_GO		#endif // #ifndef SANITIZER_GO

		void InitializePlatformEarly() {
		#ifdef TSAN_RUNTIME_VMA
		vmaSize =
		(MostSignificantSetBitIndex(GET_CURRENT_FRAME()) + 1);
		if (vmaSize != 39 && vmaSize != 42) {
		Printf("FATAL: ThreadSanitizer: unsupported VMA range\n");
		Printf("FATAL: Found %d - Supported 39 and 42\n", vmaSize);
		Die();
		}
		#endif
		}

void InitializePlatform() {		void InitializePlatform() {
DisableCoreDumperIfNecessary();		DisableCoreDumperIfNecessary();

// Go maps shadow memory lazily and works fine with limited address space.		// Go maps shadow memory lazily and works fine with limited address space.
// Unlimited stack is not a problem as well, because the executable		// Unlimited stack is not a problem as well, because the executable
// is not compiled with -pie.		// is not compiled with -pie.
if (kCppMode) {		if (kCppMode) {
bool reexec = false;		bool reexec = false;
▲ Show 20 Lines • Show All 95 Lines • Show Last 20 Lines

lib/tsan/rtl/tsan_platform_mac.cc

Show First 20 Lines • Show All 145 Lines • ▼ Show 20 Lines	if (event == PTHREAD_INTROSPECTION_THREAD_CREATE) {
}		}
}		}

if (prev_pthread_introspection_hook != nullptr)		if (prev_pthread_introspection_hook != nullptr)
prev_pthread_introspection_hook(event, thread, addr, size);		prev_pthread_introspection_hook(event, thread, addr, size);
}		}
#endif		#endif

		void InitializePlatformEarly() {
		}

void InitializePlatform() {		void InitializePlatform() {
DisableCoreDumperIfNecessary();		DisableCoreDumperIfNecessary();
#ifndef SANITIZER_GO		#ifndef SANITIZER_GO
CheckAndProtect();		CheckAndProtect();

CHECK_EQ(main_thread_identity, 0);		CHECK_EQ(main_thread_identity, 0);
main_thread_identity = (uptr)pthread_self();		main_thread_identity = (uptr)pthread_self();

Show All 28 Lines

lib/tsan/rtl/tsan_platform_posix.cc

	Show All 21 Lines
	#include "tsan_rtl.h"			#include "tsan_rtl.h"

	namespace __tsan {			namespace __tsan {

	#ifndef SANITIZER_GO			#ifndef SANITIZER_GO
	void InitializeShadowMemory() {			void InitializeShadowMemory() {
	// Map memory shadow.			// Map memory shadow.
	uptr shadow =			uptr shadow =
	(uptr)MmapFixedNoReserve(kShadowBeg, kShadowEnd - kShadowBeg, "shadow");			(uptr)MmapFixedNoReserve(ShadowBeg(), ShadowEnd() - ShadowBeg(),
	if (shadow != kShadowBeg) {			"shadow");
				if (shadow != ShadowBeg()) {
	Printf("FATAL: ThreadSanitizer can not mmap the shadow memory\n");			Printf("FATAL: ThreadSanitizer can not mmap the shadow memory\n");
	Printf("FATAL: Make sure to compile with -fPIE and "			Printf("FATAL: Make sure to compile with -fPIE and "
	"to link with -pie (%p, %p).\n", shadow, kShadowBeg);			"to link with -pie (%p, %p).\n", shadow, ShadowBeg());
	Die();			Die();
	}			}
	// This memory range is used for thread stacks and large user mmaps.			// This memory range is used for thread stacks and large user mmaps.
	// Frequently a thread uses only a small part of stack and similarly			// Frequently a thread uses only a small part of stack and similarly
	// a program uses a small part of large mmap. On some programs			// a program uses a small part of large mmap. On some programs
	// we see 20% memory usage reduction without huge pages for this range.			// we see 20% memory usage reduction without huge pages for this range.
	// FIXME: don't use constants here.			// FIXME: don't use constants here.
	#if defined(__x86_64__)			#if defined(__x86_64__)
	const uptr kMadviseRangeBeg = 0x7f0000000000ull;			const uptr kMadviseRangeBeg = 0x7f0000000000ull;
	const uptr kMadviseRangeSize = 0x010000000000ull;			const uptr kMadviseRangeSize = 0x010000000000ull;
	#elif defined(__mips64)			#elif defined(__mips64)
	const uptr kMadviseRangeBeg = 0xff00000000ull;			const uptr kMadviseRangeBeg = 0xff00000000ull;
	const uptr kMadviseRangeSize = 0x0100000000ull;			const uptr kMadviseRangeSize = 0x0100000000ull;
	#elif defined(__aarch64__)			#elif defined(__aarch64__)
	const uptr kMadviseRangeBeg = 0x7e00000000ull;			uptr kMadviseRangeBeg = 0;
	const uptr kMadviseRangeSize = 0x0100000000ull;			uptr kMadviseRangeSize = 0;
				if (vmaSize == 39) {
				kMadviseRangeBeg = 0x7d00000000ull;
				kMadviseRangeSize = 0x0300000000ull;
				} else if (vmaSize == 42) {
				kMadviseRangeBeg = 0x3f000000000ull;
				kMadviseRangeSize = 0x01000000000ull;
				} else {
				DCHECK(0);
				}
				dvyukovUnsubmitted Not Done Reply Inline Actions vmaSize should be already set by now, so use it. Add } else { CHECK(0); } branch below. It will catch both case when vmaSize is somehow not initialized by now and case when support for a new vma size is added. dvyukov: vmaSize should be already set by now, so use it. Add } else { CHECK(0); } branch…
				zatrazzAuthorUnsubmitted Not Done Reply Inline Actions I will change that. zatrazz: I will change that.
	#endif			#endif
	NoHugePagesInRegion(MemToShadow(kMadviseRangeBeg),			NoHugePagesInRegion(MemToShadow(kMadviseRangeBeg),
	kMadviseRangeSize * kShadowMultiplier);			kMadviseRangeSize * kShadowMultiplier);
	// Meta shadow is compressing and we don't flush it,			// Meta shadow is compressing and we don't flush it,
	// so it makes sense to mark it as NOHUGEPAGE to not over-allocate memory.			// so it makes sense to mark it as NOHUGEPAGE to not over-allocate memory.
	// On one program it reduces memory consumption from 5GB to 2.5GB.			// On one program it reduces memory consumption from 5GB to 2.5GB.
	NoHugePagesInRegion(kMetaShadowBeg, kMetaShadowEnd - kMetaShadowBeg);			NoHugePagesInRegion(MetaShadowBeg(), MetaShadowEnd() - MetaShadowBeg());
	if (common_flags()->use_madv_dontdump)			if (common_flags()->use_madv_dontdump)
	DontDumpShadowMemory(kShadowBeg, kShadowEnd - kShadowBeg);			DontDumpShadowMemory(ShadowBeg(), ShadowEnd() - ShadowBeg());
	DPrintf("memory shadow: %zx-%zx (%zuGB)\n",			DPrintf("memory shadow: %zx-%zx (%zuGB)\n",
	kShadowBeg, kShadowEnd,			ShadowBeg(), ShadowEnd(),
	(kShadowEnd - kShadowBeg) >> 30);			(ShadowEnd() - ShadowBeg()) >> 30);

	// Map meta shadow.			// Map meta shadow.
	uptr meta_size = kMetaShadowEnd - kMetaShadowBeg;			uptr meta_size = MetaShadowEnd() - MetaShadowBeg();
	uptr meta =			uptr meta =
	(uptr)MmapFixedNoReserve(kMetaShadowBeg, meta_size, "meta shadow");			(uptr)MmapFixedNoReserve(MetaShadowBeg(), meta_size, "meta shadow");
	if (meta != kMetaShadowBeg) {			if (meta != MetaShadowBeg()) {
	Printf("FATAL: ThreadSanitizer can not mmap the shadow memory\n");			Printf("FATAL: ThreadSanitizer can not mmap the shadow memory\n");
	Printf("FATAL: Make sure to compile with -fPIE and "			Printf("FATAL: Make sure to compile with -fPIE and "
	"to link with -pie (%p, %p).\n", meta, kMetaShadowBeg);			"to link with -pie (%p, %p).\n", meta, MetaShadowBeg());
	Die();			Die();
	}			}
	if (common_flags()->use_madv_dontdump)			if (common_flags()->use_madv_dontdump)
	DontDumpShadowMemory(meta, meta_size);			DontDumpShadowMemory(meta, meta_size);
	DPrintf("meta shadow: %zx-%zx (%zuGB)\n",			DPrintf("meta shadow: %zx-%zx (%zuGB)\n",
	meta, meta + meta_size, meta_size >> 30);			meta, meta + meta_size, meta_size >> 30);

	InitializeShadowMemoryPlatform();			InitializeShadowMemoryPlatform();
	Show All 12 Lines

	void CheckAndProtect() {			void CheckAndProtect() {
	// Ensure that the binary is indeed compiled with -pie.			// Ensure that the binary is indeed compiled with -pie.
	MemoryMappingLayout proc_maps(true);			MemoryMappingLayout proc_maps(true);
	uptr p, end, prot;			uptr p, end, prot;
	while (proc_maps.Next(&p, &end, 0, 0, 0, &prot)) {			while (proc_maps.Next(&p, &end, 0, 0, 0, &prot)) {
	if (IsAppMem(p))			if (IsAppMem(p))
	continue;			continue;
	if (p >= kHeapMemEnd &&			if (p >= HeapMemEnd() &&
	p < HeapEnd())			p < HeapEnd())
	continue;			continue;
	if (prot == 0) // Zero page or mprotected.			if (prot == 0) // Zero page or mprotected.
	continue;			continue;
	if (p >= kVdsoBeg) // vdso			if (p >= VdsoBeg()) // vdso
	break;			break;
	Printf("FATAL: ThreadSanitizer: unexpected memory mapping %p-%p\n", p, end);			Printf("FATAL: ThreadSanitizer: unexpected memory mapping %p-%p\n", p, end);
	Die();			Die();
	}			}

	ProtectRange(kLoAppMemEnd, kShadowBeg);			ProtectRange(LoAppMemEnd(), ShadowBeg());
	ProtectRange(kShadowEnd, kMetaShadowBeg);			ProtectRange(ShadowEnd(), MetaShadowBeg());
	ProtectRange(kMetaShadowEnd, kTraceMemBeg);			ProtectRange(MetaShadowEnd(), TraceMemBeg());
	// Memory for traces is mapped lazily in MapThreadTrace.			// Memory for traces is mapped lazily in MapThreadTrace.
	// Protect the whole range for now, so that user does not map something here.			// Protect the whole range for now, so that user does not map something here.
	ProtectRange(kTraceMemBeg, kTraceMemEnd);			ProtectRange(TraceMemBeg(), TraceMemEnd());
	ProtectRange(kTraceMemEnd, kHeapMemBeg);			ProtectRange(TraceMemEnd(), HeapMemBeg());
	ProtectRange(HeapEnd(), kHiAppMemBeg);			ProtectRange(HeapEnd(), HiAppMemBeg());
	}			}
	#endif			#endif

	} // namespace __tsan			} // namespace __tsan

	#endif // SANITIZER_POSIX			#endif // SANITIZER_POSIX

lib/tsan/rtl/tsan_platform_windows.cc

	Show All 25 Lines
	}			}

	void FlushShadowMemory() {			void FlushShadowMemory() {
	}			}

	void WriteMemoryProfile(char *buf, uptr buf_size, uptr nthread, uptr nlive) {			void WriteMemoryProfile(char *buf, uptr buf_size, uptr nthread, uptr nlive) {
	}			}

				void InitializePlatformEarly() {
				}

	void InitializePlatform() {			void InitializePlatform() {
	}			}

	} // namespace __tsan			} // namespace __tsan

	#endif // SANITIZER_WINDOWS			#endif // SANITIZER_WINDOWS

lib/tsan/rtl/tsan_rtl.h

Show First 20 Lines • Show All 60 Lines • ▼ Show 20 Lines
static const uptr kAllocatorNumRegions =		static const uptr kAllocatorNumRegions =
kAllocatorSize >> kAllocatorRegionSizeLog;		kAllocatorSize >> kAllocatorRegionSizeLog;
typedef TwoLevelByteMap<(kAllocatorNumRegions >> 12), 1 << 12,		typedef TwoLevelByteMap<(kAllocatorNumRegions >> 12), 1 << 12,
MapUnmapCallback> ByteMap;		MapUnmapCallback> ByteMap;
typedef SizeClassAllocator32<kAllocatorSpace, kAllocatorSize, 0,		typedef SizeClassAllocator32<kAllocatorSpace, kAllocatorSize, 0,
CompactSizeClassMap, kAllocatorRegionSizeLog, ByteMap,		CompactSizeClassMap, kAllocatorRegionSizeLog, ByteMap,
MapUnmapCallback> PrimaryAllocator;		MapUnmapCallback> PrimaryAllocator;
#else		#else
typedef SizeClassAllocator64<kHeapMemBeg, kHeapMemEnd - kHeapMemBeg, 0,		typedef SizeClassAllocator64<Mapping::kHeapMemBeg,
		Mapping::kHeapMemEnd - Mapping::kHeapMemBeg, 0,
DefaultSizeClassMap, MapUnmapCallback> PrimaryAllocator;		DefaultSizeClassMap, MapUnmapCallback> PrimaryAllocator;
#endif		#endif
typedef SizeClassAllocatorLocalCache<PrimaryAllocator> AllocatorCache;		typedef SizeClassAllocatorLocalCache<PrimaryAllocator> AllocatorCache;
typedef LargeMmapAllocator<MapUnmapCallback> SecondaryAllocator;		typedef LargeMmapAllocator<MapUnmapCallback> SecondaryAllocator;
typedef CombinedAllocator<PrimaryAllocator, AllocatorCache,		typedef CombinedAllocator<PrimaryAllocator, AllocatorCache,
SecondaryAllocator> Allocator;		SecondaryAllocator> Allocator;
Allocator *allocator();		Allocator *allocator();
#endif		#endif
▲ Show 20 Lines • Show All 678 Lines • ▼ Show 20 Lines	#endif
Event trace = (Event)GetThreadTrace(fs.tid());		Event trace = (Event)GetThreadTrace(fs.tid());
Event *evp = &trace[pos];		Event *evp = &trace[pos];
Event ev = (u64)addr \| ((u64)typ << 61);		Event ev = (u64)addr \| ((u64)typ << 61);
*evp = ev;		*evp = ev;
}		}

#ifndef SANITIZER_GO		#ifndef SANITIZER_GO
uptr ALWAYS_INLINE HeapEnd() {		uptr ALWAYS_INLINE HeapEnd() {
return kHeapMemEnd + PrimaryAllocator::AdditionalSize();		return HeapMemEnd() + PrimaryAllocator::AdditionalSize();
}		}
#endif		#endif

} // namespace __tsan		} // namespace __tsan

#endif // TSAN_RTL_H		#endif // TSAN_RTL_H

lib/tsan/rtl/tsan_rtl.cc

Show First 20 Lines • Show All 267 Lines • ▼ Show 20 Lines	if (!data_mapped) {
mapped_meta_end = meta_end;		mapped_meta_end = meta_end;
}		}
VPrintf(2, "mapped meta shadow for (%p-%p) at (%p-%p)\n",		VPrintf(2, "mapped meta shadow for (%p-%p) at (%p-%p)\n",
addr, addr+size, meta_begin, meta_end);		addr, addr+size, meta_begin, meta_end);
}		}

void MapThreadTrace(uptr addr, uptr size, const char *name) {		void MapThreadTrace(uptr addr, uptr size, const char *name) {
DPrintf("#0: Mapping trace at %p-%p(0x%zx)\n", addr, addr + size, size);		DPrintf("#0: Mapping trace at %p-%p(0x%zx)\n", addr, addr + size, size);
CHECK_GE(addr, kTraceMemBeg);		CHECK_GE(addr, TraceMemBeg());
CHECK_LE(addr + size, kTraceMemEnd);		CHECK_LE(addr + size, TraceMemEnd());
CHECK_EQ(addr, addr & ~((64 << 10) - 1)); // windows wants 64K alignment		CHECK_EQ(addr, addr & ~((64 << 10) - 1)); // windows wants 64K alignment
uptr addr1 = (uptr)MmapFixedNoReserve(addr, size, name);		uptr addr1 = (uptr)MmapFixedNoReserve(addr, size, name);
if (addr1 != addr) {		if (addr1 != addr) {
Printf("FATAL: ThreadSanitizer can not mmap thread trace (%p/%p->%p)\n",		Printf("FATAL: ThreadSanitizer can not mmap thread trace (%p/%p->%p)\n",
addr, size, addr1);		addr, size, addr1);
Die();		Die();
}		}
}		}

static void CheckShadowMapping() {		static void CheckShadowMapping() {
for (uptr i = 0; i < ARRAY_SIZE(UserRegions); i += 2) {		int i = 0;
const uptr beg = UserRegions[i];		while (1) {
const uptr end = UserRegions[i + 1];		uptr beg, end;
		if (!GetUserRegion (i, &beg, &end))
		dvyukovUnsubmitted Not Done Reply Inline Actions please replace this with: uptr beg, end; for (int i = 0; GetUserRegion(i, &beg, &end); i++) { it is shorter and more idiomatic for loops. dvyukov: please replace this with: uptr beg, end; for (int i = 0; GetUserRegion(i, &beg, &end); i++)…
		break;
		i++;

VPrintf(3, "checking shadow region %p-%p\n", beg, end);		VPrintf(3, "checking shadow region %p-%p\n", beg, end);
for (uptr p0 = beg; p0 <= end; p0 += (end - beg) / 4) {		for (uptr p0 = beg; p0 <= end; p0 += (end - beg) / 4) {
for (int x = -1; x <= 1; x++) {		for (int x = -1; x <= 1; x++) {
const uptr p = p0 + x;		const uptr p = p0 + x;
if (p < beg \|\| p >= end)		if (p < beg \|\| p >= end)
continue;		continue;
const uptr s = MemToShadow(p);		const uptr s = MemToShadow(p);
const uptr m = (uptr)MemToMeta(p);		const uptr m = (uptr)MemToMeta(p);
Show All 18 Lines	void Initialize(ThreadState *thr) {
SanitizerToolName = "ThreadSanitizer";		SanitizerToolName = "ThreadSanitizer";
// Install tool-specific callbacks in sanitizer_common.		// Install tool-specific callbacks in sanitizer_common.
SetCheckFailedCallback(TsanCheckFailed);		SetCheckFailedCallback(TsanCheckFailed);

ctx = new(ctx_placeholder) Context;		ctx = new(ctx_placeholder) Context;
const char *options = GetEnv(kTsanOptionsEnv);		const char *options = GetEnv(kTsanOptionsEnv);
CacheBinaryName();		CacheBinaryName();
InitializeFlags(&ctx->flags, options);		InitializeFlags(&ctx->flags, options);
CheckVMASize();		InitializePlatformEarly();
#ifndef SANITIZER_GO		#ifndef SANITIZER_GO
InitializeAllocator();		InitializeAllocator();
ReplaceSystemMalloc();		ReplaceSystemMalloc();
#endif		#endif
InitializeInterceptors();		InitializeInterceptors();
CheckShadowMapping();		CheckShadowMapping();
InitializePlatform();		InitializePlatform();
InitializeMutex();		InitializeMutex();
▲ Show 20 Lines • Show All 685 Lines • Show Last 20 Lines

test/tsan/test.h

Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	#elif defined(__aarch64__)
const char *format = 0;		const char *format = 0;
unsigned long vma = (unsigned long)__builtin_frame_address(0);		unsigned long vma = (unsigned long)__builtin_frame_address(0);
vma = 64 - __builtin_clzll(vma);		vma = 64 - __builtin_clzll(vma);
if (vma == 39)		if (vma == 39)
format = "0x%010lx";		format = "0x%010lx";
else if (vma == 42)		else if (vma == 42)
format = "0x%011lx";		format = "0x%011lx";
else {		else {
fprintf(stderr, "unsupported vma: %ul\n", vma);		fprintf(stderr, "unsupported vma: %lu\n", vma);
exit(1);		exit(1);
}		}

fprintf(stderr, format, (unsigned long) address);		fprintf(stderr, format, (unsigned long) address);
#endif		#endif
}		}

This is an archive of the discontinued LLVM Phabricator instance.

[compiler-rt] [tsan] Unify aarch64 mappingClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 40643

lib/tsan/rtl/tsan_platform.h

lib/tsan/rtl/tsan_platform_linux.cc

lib/tsan/rtl/tsan_platform_mac.cc

lib/tsan/rtl/tsan_platform_posix.cc

lib/tsan/rtl/tsan_platform_windows.cc

lib/tsan/rtl/tsan_rtl.h

lib/tsan/rtl/tsan_rtl.cc

test/tsan/test.h

[compiler-rt] [tsan] Unify aarch64 mapping
ClosedPublic