Download Raw Diff

Details

Reviewers

seaneveson
hfinkel
MatzeB
jhenderson
javed.absar

Commits

rGdcf59c548009: Recommit r335333 "[MC] - Add .stack_size sections into groups and link them…
rGe14485a0c61a: [MC] - Add .stack_size sections into groups and link them with .text
rL335336: Recommit r335333 "[MC] - Add .stack_size sections into groups and link them…
rL335332: [MC] - Add .stack_size sections into groups and link them with .text

Summary

D39788 added a '.stack-size' section containing metadata on function stack sizes
to output ELF files behind the new -stack-size-section flag.

This change does following two things on top:

Imagine the case when there are -ffunction-sections flag given and there are text sections in COMDATs. The patch adds a '.stack-size' section into corresponding COMDAT group, so that linker will be able to eliminate them fast during resolving the COMDATs.
Patch sets a SHF_LINK_ORDER flag and links '.stack-size' with the corresponding .text. With that linker will be able to do -gc-sections on dead stack sizes sections.

Diff Detail

Repository: rL LLVM

Event Timeline

grimar created this revision.May 15 2018, 5:49 AM

This is something that has needed doing, thanks! The code looks plausible to me, but I am not really an ELF expert so I am reluctant to formally approve it.

The test source needs to be IR, not C++; presumably you can just -emit-llvm -S on this test to get it.

Thanks, Paul!

I switched to IR test case (and also fixed another one that turned out to be failing).

grimar mentioned this in D46880: [ELF] - Do not crash when do --gc-sections for non-allocatable metadata sections..May 16 2018, 2:13 AM

jhenderson added a subscriber: jhenderson.May 17 2018, 1:28 AM

@grimar, what is the link-time performance of doing this on something with many functions?

Also, I think it would make more sense for the stack sizes section names to be derived from the "parent" section, a bit like relocation sections, so they'd be called something like .stack_sizes.text._Z3foov or possibly simply .stack_sizes._Z3foov. That way dumping tools can more easily dump the specific individual stack_sizes sections.

test/CodeGen/X86/stack-size-section.ll
16 ↗	(On Diff #147013)	I might be misunderstanding something here, but I don't think these should be uniqued, i.e. in my opinion, in this case, there should only be one stack_sizes section containing both entries, since there is only one text section.

In D46874#1102694, @jhenderson wrote:

@grimar, what is the link-time performance of doing this on something with many functions?

I did not yet measure. I can do the benchmark and return with the results.

Also, I think it would make more sense for the stack sizes section names to be derived from the "parent" section, a bit like relocation sections, so they'd be called something like .stack_sizes.text._Z3foov or possibly simply .stack_sizes._Z3foov. That way dumping tools can more easily dump the specific individual stack_sizes sections.

Maybe, but renaming is a subject for a different patch.
Also, they linked with a sh_link field to their parents already. Dumping tools can use that.
And renaming to something like .stack_sizes.XXX would require linker side change to place them into the single output section I think,
as currently they are merged by name, just like other regular sections.

test/CodeGen/X86/stack-size-section.ll
16 ↗	(On Diff #147013)	Probably that would be ideal. I am doing unification unconditionally. I think just everyone use -ffunction-sections. So I am not sure if it worth to add code to avoid doing unification for that particular case right now. (It just a few lines probably though) Without doing unification asm produced would contain several declarations of `.section .stack_sizes,"",@progbits` which are combined into the single section in the object finally. (see original stack-size-section.ll)

In D46874#1102734, @grimar wrote:

In D46874#1102694, @jhenderson wrote:

@grimar, what is the link-time performance of doing this on something with many functions?

I did not yet measure. I can do the benchmark and return with the results.

Please do - we probably need to consider the impact on bfd and gold as well, although I personally am less concerned there.

Also, I think it would make more sense for the stack sizes section names to be derived from the "parent" section, a bit like relocation sections, so they'd be called something like .stack_sizes.text._Z3foov or possibly simply .stack_sizes._Z3foov. That way dumping tools can more easily dump the specific individual stack_sizes sections.

Maybe, but renaming is a subject for a different patch.

Okay, that's fine.

Also, they linked with a sh_link field to their parents already. Dumping tools can use that.

Only when they want to dump an associated group. It doesn't allow easy dumping of just the single stack sizes section (e.g. via -j in objdump).

And renaming to something like .stack_sizes.XXX would require linker side change to place them into the single output section I think,
as currently they are merged by name, just like other regular sections.

This surprises me. I thought that default grouping would match up to the first '.', like it does for e.g. .text or .data grouping (i.e. .text.foo and .text.bar end up in .text). Or are some sections special-cased for this?

test/CodeGen/X86/stack-size-section.ll
16 ↗	(On Diff #147013)	Probably that would be ideal. I am doing unification unconditionally. I think just everyone use -ffunction-sections. So I am not sure if it worth to add code to avoid doing unification for that particular case right now. (It just a few lines probably though) I'm not sure I'm willing to generalise that much. However, it may not matter if performance impact is minimal. I don't think having multiple section declarations for the same section is a big deal. It's already the case for some other sections, at least to a small extent.

! In D46874#1102740, @jhenderson wrote:

! In D46874#1102734, @grimar wrote:
And renaming to something like .stack_sizes.XXX would require linker side change to place them into the single output section I think,
as currently they are merged by name, just like other regular sections.

This surprises me. I thought that default grouping would match up to the first '.', like it does for e.g. .text or .data grouping (i.e. .text.foo and .text.bar end up in .text). Or are some sections special-cased for this?

Yes, .text.*, .data.* and few others are a special case. See LLD code for that:

https://github.com/llvm-mirror/lld/blob/master/ELF/Writer.cpp#L124

for (StringRef V :
     {".text.", ".rodata.", ".data.rel.ro.", ".data.", ".bss.rel.ro.",
      ".bss.", ".init_array.", ".fini_array.", ".ctors.", ".dtors.", ".tbss.",
      ".gcc_except_table.", ".tdata.", ".ARM.exidx.", ".ARM.extab."}) {
  if (isSectionPrefix(V, S->Name))
    return V.drop_back();
}

The default behavior is to group by name.

In D46874#1102755, @grimar wrote:
! In D46874#1102740, @jhenderson wrote:

! In D46874#1102734, @grimar wrote:
And renaming to something like .stack_sizes.XXX would require linker side change to place them into the single output section I think,
as currently they are merged by name, just like other regular sections.

This surprises me. I thought that default grouping would match up to the first '.', like it does for e.g. .text or .data grouping (i.e. .text.foo and .text.bar end up in .text). Or are some sections special-cased for this?

Yes, .text.*, .data.* and few others are a special case. See LLD code for that:

https://github.com/llvm-mirror/lld/blob/master/ELF/Writer.cpp#L124
for (StringRef V :
     {".text.", ".rodata.", ".data.rel.ro.", ".data.", ".bss.rel.ro.",
      ".bss.", ".init_array.", ".fini_array.", ".ctors.", ".dtors.", ".tbss.",
      ".gcc_except_table.", ".tdata.", ".ARM.exidx.", ".ARM.extab."}) {
  if (isSectionPrefix(V, S->Name))
    return V.drop_back();
}
The default behavior is to group by name.

Ah okay, I haven't really worked with that area. I assume it's the same with other linkers? Because otherwise, I'd say it makes more sense to not have the special case.

In D46874#1102806, @jhenderson wrote:

The default behavior is to group by name.

Ah okay, I haven't really worked with that area. I assume it's the same with other linkers?

Yes. They also will place .stack_sizes.XXX and .stack_sizes.YYY to different output sections.

I'll update the status of this later.

The problem of this patch is benchmark results I had.

I tested linking of clang and chromium (used LLD as a linker, linked with -gc-sections).
(both built with -ffunction-sections -fstack-size-section)

Clang:
Link time changes from 0,428s to 0.481s (~ +12%).
Output size changes from 87,216,640 to 86,983,720 bytes (~ -1%).
Total input objects size changes from 191.3mb to 204.2mb (~ +6,7%).

Chromium:
Link time changes from 6.136s to 6.75s (~ +10%)
Output size changes from 756,347,912 to 753,375,064 bytes (~ -0.4%)
Total input objects size changes from 1,920,795,546 to 2,011,722,858 bytes (~ +4.7%)

So for clang, it allows producing 1% smaller output for a cost of +12% to link time.
For chromium, the benefit is just 0.4%, for about the same link time penalty.

I've been thinking more about this since asking you to do the numbers. Our problem is that without doing this, we will need some other approach to handle .stack_sizes entries referring to removed functions (e.g. discarded COMDATs or GC'ed sections). This is essentially the same problem as we have with things like debug data - the stack size entry will have an address (probably zero) that on at least some platforms is a valid address, which would cause problems for consumers of the section. Either they have to special case it (which may not be possible), or they get potentially misleading output.

Is .stack_sizes off by default? I assume so, and if it isn't, it probably should be, since it's not necessarily useful for everybody. If so, I'd suggest you go ahead with this change - it will only impact those who are using it, and they probably want it done properly if they are using -ffunction-sections. Maybe @ruiu has some suggestions from the linker's point of view too?

In D46874#1109196, @jhenderson wrote:

Is .stack_sizes off by default? I assume so, and if it isn't, it probably should be, since it's not necessarily useful for everybody. If so, I'd suggest you go ahead with this change - it will only impact those who are using it, and they probably want it done properly if they are using -ffunction-sections. Maybe @ruiu has some suggestions from the linker's point of view too?

Yes, it is off by default.

And observing the results, I think it is worth to stop doing unification when -ffunction-sections is off
(to reduce the number of sections and possible linker slowdown).

I'll update the patch.

In D46874#1109198, @grimar wrote:

And observing the results, I think it is worth to stop doing unification when -ffunction-sections is off
(to reduce the number of sections and possible linker slowdown).

I'll update the patch.

I agree, although with one caveat: COMDAT sections should always have a separate .stack_sizes section (maybe in their group), as otherwise we'll have the same invalid problem for discarded COMDATs as I mentioned earlier - i.e. their entries will be preserved, even though they are no longer present.

In D46874#1109238, @jhenderson wrote:

In D46874#1109198, @grimar wrote:

And observing the results, I think it is worth to stop doing unification when -ffunction-sections is off
(to reduce the number of sections and possible linker slowdown).

I'll update the patch.

I agree, although with one caveat: COMDAT sections should always have a separate .stack_sizes section (maybe in their group), as otherwise we'll have the same invalid problem for discarded COMDATs as I mentioned earlier - i.e. their entries will be preserved, even though they are no longer present.

Sure, this is an optimization we want to keep. That does not require setting unique ID attribute.
It is enough to place .stack_sizes into COMDAT. For example the following code will produce 3 different sections in a object:

.section .stack_sizes,"aG",@progbits,foo,comdat
nop

.section .stack_sizes,"aG",@progbits,bar,comdat
nop

.section .stack_sizes,"",@progbits
nop

Do not do unification when no -ffunction-section is specified.

I think it would be good to have testing that shows that we fragment .stack_sizes for COMDATs even without -ffunction-sections. Otherwise, this change looks good to me, but you should probably get someone else to take a look too, as I'm still getting to grips with the MC code.

Thanks, James!

Updated test case (added no -ffunction-sections case).

I'm wondering whether it would make more sense to do the COMDAT group bit without function-sections in the other stack-size-section test, and then rename the new test to stack-size-section-function-sectons or something similar? What do you think?

I think that is fine. Updated the test cases as suggested.

jhenderson added inline comments.May 23 2018, 5:49 AM

test/CodeGen/X86/stack-size-section.ll
16 ↗	(On Diff #148193)	Should we still have SHF_LINK_ORDER in these cases? If we GC (or otherwise discard) the text section here, we'll still keep the stack_sizes section, which I don't think we want to do. I know -gc-sections without -function-sections is rare, but it is legal, and LLD does potentially strip some sections (imagine an object file with only one or two unused functions in, for example).

grimar added inline comments.May 23 2018, 6:52 AM

test/CodeGen/X86/stack-size-section.ll
16 ↗	(On Diff #148193)	Ok. Problem is that currently ELF section key is based on section name, group and unique id: https://github.com/llvm-mirror/llvm/blob/master/lib/MC/MCContext.cpp#L393 Because of that the following code currently produce single .stack_sizes section: .section .text,"ax",@progbits nop .section .stack_sizes,"o",@progbits,.text .byte 1 .section .text,"ax",@progbits nop .section .stack_sizes,"o",@progbits,.text .byte 2 It would produce 2 if stack_sizes would have `unique` attribute set. I can also imagine code when we have multiple .text sections without -ffunction-sections. Example: int foo() { return 0; } int main () __attribute__ ((section (".text.main"))); int main() { return 0; } In that case, we also would want to link .stack_sizes to its parent correctly. At the same time does not seem we really want to complicate the logic of finding the ideal unique ID for them right now. (As -fno-function-sections case is rare and no need to optimize it that early probably) Given that I think the simple logic that should be OK for start would be to have `unique` property unconditionally set (for both -ffunction-sections and -fno-function-sections cases), like in one of the previous iterations. What do you think?

grimar added inline comments.May 23 2018, 7:02 AM

test/CodeGen/X86/stack-size-section.ll
16 ↗	(On Diff #148193)	My first sample is a bit confusing sorry. What I wanted to say is that without unique attribute it generates single .stack_sizes which is linked to nowhere. And to set sh_link field correctly, I think we need `unique` attribute now.

Updated patch to match the behavior I suggested in latest comments.

(unconditionally add unique attribute)

Maybe I've been confusing things, because from what I'm seeing in this, there is now a unique stack sizes section for every function, even without -ffunction-sections, which I don't think we want. What I kind of expect to see is this:

Without function sections:
1. Non-COMDAT functions all share a stack_sizes section which refers via SHF_LINK_ORDER to .text.
2. COMDAT functions have their own stack_sizes section, which optionally refers via SHF_LINK_ORDER to .text.<symbol>, and is a member of the COMDAT group.
With function sections:
1. All functions have their own unique stack_sizes section, which refer to their corresponding section via SHF_LINK_ORDER.
2. COMDAT functions' .stack_sizes sections are members of their group.

Does that make sense?

I think it would be easy to do by only using ID in the elf section creation/fetching if we are for a group section or in ffunction-sections.

In D46874#1109528, @jhenderson wrote:

Maybe I've been confusing things, because from what I'm seeing in this, there is now a unique stack sizes section for every function, even without -ffunction-sections, which I don't think we want.

That is not ideal, but safe, simple and correct I believe.

What I kind of expect to see is this:

Without function sections:

Non-COMDAT functions all share a stack_sizes section which refers via SHF_LINK_ORDER to .text.

But what about my sample?

int foo() {
 return 0;
}

int main () __attribute__ ((section (".text.main")));
int main() {
  return 0;
}

This code compiled without -ffunction-sections will have 2 .text sections: .text and .text.main.
The current version of the patch would correctly create 2 unique stack sizes and would link them to .text and .text.main
respectively.

There is no way currently to create and link 2 different .stack_sizes to different .text sections without setting different unique ID to them I think.

What I can probably do is to compute ID based on the .text section name.

So that for no -ffunction-sections case it would emit several .stack_sizes with the same ID and so that final object would contain only a single section finally after merging them,
just like we would want.

It would work for my sample case too I think. Let me try to implement this.

In D46874#1109615, @grimar wrote:

What I can probably do is to compute ID based on the .text section name.

So that for no -ffunction-sections case it would emit several .stack_sizes with the same ID and so that final object would contain only a single section finally after merging them,
just like we would want.

It would work for my sample case too I think. Let me try to implement this.

Yes, I think this all makes sense. Here's a summary of what I think is best without -function-sections enabled:

// These two share a .stack_sizes section
void func1() {}
void func2() {}

// These two share a different .stack_sizes section.
void func3()  __attribute__ ((section (".text.other"))) {}
void func4()  __attribute__ ((section (".text.other"))) {}

// This has it's own .stack_sizes section in its group
template <int I> int func5() { return I; }

Of course, if .stack_sizes section names were based on their "parent" section, then this would probably become much simpler, but we can't do that with the way current linkers behave.

Generate unique ID basing on begin symbol.
Updated test case.

In D46874#1110730, @jhenderson wrote:
In D46874#1109615, @grimar wrote:

What I can probably do is to compute ID based on the .text section name.

So that for no -ffunction-sections case it would emit several .stack_sizes with the same ID and so that final object would contain only a single section finally after merging them,
just like we would want.

It would work for my sample case too I think. Let me try to implement this.

Yes, I think this all makes sense. Here's a summary of what I think is best without -function-sections enabled:
// These two share a .stack_sizes section
void func1() {}
void func2() {}

// These two share a different .stack_sizes section.
void func3()  __attribute__ ((section (".text.other"))) {}
void func4()  __attribute__ ((section (".text.other"))) {}

// This has it's own .stack_sizes section in its group
template <int I> int func5() { return I; }

Yep. The latest diff implements exactly this behavior I believe.

LGTM, but as mentioned earlier, you should probably get a second opinion.

test/CodeGen/X86/stack-size-section-function-sections.ll
11 ↗	(On Diff #148356)	Nit: "to COMDAT" -> "to a COMDAT group" and "if such COMDAT exists" -> "if such a COMDAT exists".
test/CodeGen/X86/stack-size-section.ll
33 ↗	(On Diff #148356)	Nit: "an unique" -> "a unique" x 2. English is weird!

This revision is now accepted and ready to land.May 24 2018, 3:14 AM

Thanks, James!

Ping.

Addressed grammar nits.

Ping.

LGTM again :)

If there's no other opinion forthcoming, I'm okay with this going in, so that it doesn't sit around any longer.

In D46874#1136349, @jhenderson wrote:

LGTM again :)

If there's no other opinion forthcoming, I'm okay with this going in, so that it doesn't sit around any longer.

If there will be no more objections, I am inclined to commit this on Friday, 22/06 :) I am always happy to address post-commit comments,
and this functionality is enough specific and isolated under the particular flag (-stack-size-section), so I believe it is OK.

Closed by commit rL335332: [MC] - Add .stack_size sections into groups and link them with .text (authored by grimar). · Explain WhyJun 22 2018, 3:15 AM

This revision was automatically updated to reflect the committed changes.

Herald added a reviewer: javed.absar. · View Herald TranscriptJun 22 2018, 3:15 AM

Diff 152446

llvm/trunk/include/llvm/MC/MCObjectFileInfo.h

//===-- llvm/MC/MCObjectFileInfo.h - Object File Info ------------ C++ --===//		//===-- llvm/MC/MCObjectFileInfo.h - Object File Info ------------ C++ --===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file describes common object file formats.		// This file describes common object file formats.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_MC_MCOBJECTFILEINFO_H		#ifndef LLVM_MC_MCOBJECTFILEINFO_H
#define LLVM_MC_MCOBJECTFILEINFO_H		#define LLVM_MC_MCOBJECTFILEINFO_H

		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/Triple.h"		#include "llvm/ADT/Triple.h"
#include "llvm/Support/CodeGen.h"		#include "llvm/Support/CodeGen.h"

namespace llvm {		namespace llvm {
class MCContext;		class MCContext;
class MCSection;		class MCSection;
		class MCSymbol;

class MCObjectFileInfo {		class MCObjectFileInfo {
protected:		protected:
/// True if .comm supports alignment. This is a hack for as long as we		/// True if .comm supports alignment. This is a hack for as long as we
/// support 10.4 Tiger, whose assembler doesn't support alignment on comm.		/// support 10.4 Tiger, whose assembler doesn't support alignment on comm.
bool CommDirectiveSupportsAlignment;		bool CommDirectiveSupportsAlignment;

/// True if target object file supports a weak_definition of constant 0 for an		/// True if target object file supports a weak_definition of constant 0 for an
▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines	protected:

/// EH frame section.		/// EH frame section.
///		///
/// It is initialized on demand so it can be overwritten (with uniquing).		/// It is initialized on demand so it can be overwritten (with uniquing).
MCSection *EHFrameSection;		MCSection *EHFrameSection;

/// Section containing metadata on function stack sizes.		/// Section containing metadata on function stack sizes.
MCSection *StackSizesSection;		MCSection *StackSizesSection;
		mutable DenseMap<const MCSymbol *, unsigned> StackSizesUniquing;

// ELF specific sections.		// ELF specific sections.
MCSection *DataRelROSection;		MCSection *DataRelROSection;
MCSection *MergeableConst4Section;		MCSection *MergeableConst4Section;
MCSection *MergeableConst8Section;		MCSection *MergeableConst8Section;
MCSection *MergeableConst16Section;		MCSection *MergeableConst16Section;
MCSection *MergeableConst32Section;		MCSection *MergeableConst32Section;

▲ Show 20 Lines • Show All 125 Lines • ▼ Show 20 Lines	public:

MCSection *getTLSExtraDataSection() const { return TLSExtraDataSection; }		MCSection *getTLSExtraDataSection() const { return TLSExtraDataSection; }
const MCSection *getTLSDataSection() const { return TLSDataSection; }		const MCSection *getTLSDataSection() const { return TLSDataSection; }
MCSection *getTLSBSSSection() const { return TLSBSSSection; }		MCSection *getTLSBSSSection() const { return TLSBSSSection; }

MCSection *getStackMapSection() const { return StackMapSection; }		MCSection *getStackMapSection() const { return StackMapSection; }
MCSection *getFaultMapSection() const { return FaultMapSection; }		MCSection *getFaultMapSection() const { return FaultMapSection; }

MCSection *getStackSizesSection() const { return StackSizesSection; }		MCSection *getStackSizesSection(const MCSection &TextSec) const;

// ELF specific sections.		// ELF specific sections.
MCSection *getDataRelROSection() const { return DataRelROSection; }		MCSection *getDataRelROSection() const { return DataRelROSection; }
const MCSection *getMergeableConst4Section() const {		const MCSection *getMergeableConst4Section() const {
return MergeableConst4Section;		return MergeableConst4Section;
}		}
const MCSection *getMergeableConst8Section() const {		const MCSection *getMergeableConst8Section() const {
return MergeableConst8Section;		return MergeableConst8Section;
▲ Show 20 Lines • Show All 79 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/AsmPrinter/AsmPrinter.cpp

Show First 20 Lines • Show All 983 Lines • ▼ Show 20 Lines	void AsmPrinter::emitFrameAlloc(const MachineInstr &MI) {
OutStreamer->EmitAssignment(FrameAllocSym,		OutStreamer->EmitAssignment(FrameAllocSym,
MCConstantExpr::create(FrameOffset, OutContext));		MCConstantExpr::create(FrameOffset, OutContext));
}		}

void AsmPrinter::emitStackSizeSection(const MachineFunction &MF) {		void AsmPrinter::emitStackSizeSection(const MachineFunction &MF) {
if (!MF.getTarget().Options.EmitStackSizeSection)		if (!MF.getTarget().Options.EmitStackSizeSection)
return;		return;

MCSection *StackSizeSection = getObjFileLowering().getStackSizesSection();		MCSection *StackSizeSection =
		getObjFileLowering().getStackSizesSection(*getCurrentSection());
if (!StackSizeSection)		if (!StackSizeSection)
return;		return;

const MachineFrameInfo &FrameInfo = MF.getFrameInfo();		const MachineFrameInfo &FrameInfo = MF.getFrameInfo();
// Don't emit functions with dynamic stack allocations.		// Don't emit functions with dynamic stack allocations.
if (FrameInfo.hasVarSizedObjects())		if (FrameInfo.hasVarSizedObjects())
return;		return;

▲ Show 20 Lines • Show All 2,025 Lines • Show Last 20 Lines

llvm/trunk/lib/MC/MCObjectFileInfo.cpp

Show First 20 Lines • Show All 942 Lines • ▼ Show 20 Lines	case Triple::UnknownObjectFormat:
break;		break;
}		}
}		}

MCSection *MCObjectFileInfo::getDwarfTypesSection(uint64_t Hash) const {		MCSection *MCObjectFileInfo::getDwarfTypesSection(uint64_t Hash) const {
return Ctx->getELFSection(".debug_types", ELF::SHT_PROGBITS, ELF::SHF_GROUP,		return Ctx->getELFSection(".debug_types", ELF::SHT_PROGBITS, ELF::SHF_GROUP,
0, utostr(Hash));		0, utostr(Hash));
}		}

		MCSection *
		MCObjectFileInfo::getStackSizesSection(const MCSection &TextSec) const {
		if (Env != IsELF)
		return StackSizesSection;

		const MCSectionELF &ElfSec = static_cast<const MCSectionELF &>(TextSec);
		unsigned Flags = ELF::SHF_LINK_ORDER;
		StringRef GroupName;
		if (const MCSymbol *Group = ElfSec.getGroup()) {
		GroupName = Group->getName();
		Flags \|= ELF::SHF_GROUP;
		}

		const MCSymbol *Link = TextSec.getBeginSymbol();
		auto It = StackSizesUniquing.insert({Link, StackSizesUniquing.size()});
		unsigned UniqueID = It.first->second;

		return Ctx->getELFSection(".stack_sizes", ELF::SHT_PROGBITS, Flags, 0,
		GroupName, UniqueID, cast<MCSymbolELF>(Link));
		}

llvm/trunk/test/CodeGen/ARM/stack-size-section.ll

	; RUN: llc < %s -mtriple=armv7-linux -stack-size-section \| FileCheck %s			; RUN: llc < %s -mtriple=armv7-linux -stack-size-section \| FileCheck %s

	; CHECK-LABEL: func1:			; CHECK-LABEL: func1:
	; CHECK-NEXT: .Lfunc_begin0:			; CHECK-NEXT: .Lfunc_begin0:
	; CHECK: .section .stack_sizes,"",%progbits			; CHECK: .section .stack_sizes,"o",%progbits,.text,unique,0
	; CHECK-NEXT: .long .Lfunc_begin0			; CHECK-NEXT: .long .Lfunc_begin0
	; CHECK-NEXT: .byte 8			; CHECK-NEXT: .byte 8
	define void @func1(i32, i32) #0 {			define void @func1(i32, i32) #0 {
	alloca i32, align 4			alloca i32, align 4
	alloca i32, align 4			alloca i32, align 4
	ret void			ret void
	}			}

	; CHECK-LABEL: func2:			; CHECK-LABEL: func2:
	; CHECK-NEXT: .Lfunc_begin1:			; CHECK-NEXT: .Lfunc_begin1:
	; CHECK: .section .stack_sizes,"",%progbits			; CHECK: .section .stack_sizes,"o",%progbits,.text,unique,0
	; CHECK-NEXT: .long .Lfunc_begin1			; CHECK-NEXT: .long .Lfunc_begin1
	; CHECK-NEXT: .byte 16			; CHECK-NEXT: .byte 16
	define void @func2() #0 {			define void @func2() #0 {
	alloca i32, align 4			alloca i32, align 4
	call void @func1(i32 1, i32 2)			call void @func1(i32 1, i32 2)
	ret void			ret void
	}			}

	; CHECK-LABEL: dynalloc:			; CHECK-LABEL: dynalloc:
	; CHECK-NOT: .section .stack_sizes			; CHECK-NOT: .section .stack_sizes
	define void @dynalloc(i32 %N) #0 {			define void @dynalloc(i32 %N) #0 {
	alloca i32, i32 %N			alloca i32, i32 %N
	ret void			ret void
	}			}

	attributes #0 = { "no-frame-pointer-elim"="true" }			attributes #0 = { "no-frame-pointer-elim"="true" }

llvm/trunk/test/CodeGen/SystemZ/stack-size-section.ll

	; RUN: llc < %s -mtriple=s390x-linux-gnu -stack-size-section \| FileCheck %s			; RUN: llc < %s -mtriple=s390x-linux-gnu -stack-size-section \| FileCheck %s

	; CHECK-LABEL: func1:			; CHECK-LABEL: func1:
	; CHECK-NEXT: .Lfunc_begin0:			; CHECK-NEXT: .Lfunc_begin0:
	; CHECK: .section .stack_sizes,"",@progbits			; CHECK: .section .stack_sizes,"o",@progbits,.text,unique,0
	; CHECK-NEXT: .quad .Lfunc_begin0			; CHECK-NEXT: .quad .Lfunc_begin0
	; CHECK-NEXT: .byte 0			; CHECK-NEXT: .byte 0
	define void @func1(i32, i32) #0 {			define void @func1(i32, i32) #0 {
	ret void			ret void
	}			}

	; CHECK-LABEL: func2:			; CHECK-LABEL: func2:
	; CHECK-NEXT: .Lfunc_begin1:			; CHECK-NEXT: .Lfunc_begin1:
	; CHECK: .section .stack_sizes,"",@progbits			; CHECK: .section .stack_sizes,"o",@progbits,.text,unique,0
	; CHECK-NEXT: .quad .Lfunc_begin1			; CHECK-NEXT: .quad .Lfunc_begin1
	; CHECK-NEXT: .ascii "\250\001"			; CHECK-NEXT: .ascii "\250\001"
	define void @func2(i32, i32) #0 {			define void @func2(i32, i32) #0 {
	alloca i32, align 4			alloca i32, align 4
	alloca i32, align 4			alloca i32, align 4
	ret void			ret void
	}			}

	; CHECK-LABEL: func3:			; CHECK-LABEL: func3:
	; CHECK-NEXT: .Lfunc_begin2:			; CHECK-NEXT: .Lfunc_begin2:
	; CHECK: .section .stack_sizes,"",@progbits			; CHECK: .section .stack_sizes,"o",@progbits,.text,unique,0
	; CHECK-NEXT: .quad .Lfunc_begin2			; CHECK-NEXT: .quad .Lfunc_begin2
	; CHECK-NEXT: .ascii "\250\001"			; CHECK-NEXT: .ascii "\250\001"
	define void @func3() #0 {			define void @func3() #0 {
	alloca i32, align 4			alloca i32, align 4
	call void @func1(i32 1, i32 2)			call void @func1(i32 1, i32 2)
	ret void			ret void
	}			}

	; CHECK-LABEL: dynalloc:			; CHECK-LABEL: dynalloc:
	; CHECK-NOT: .section .stack_sizes			; CHECK-NOT: .section .stack_sizes
	define void @dynalloc(i32 %N) #0 {			define void @dynalloc(i32 %N) #0 {
	alloca i32, i32 %N			alloca i32, i32 %N
	ret void			ret void
	}			}

	attributes #0 = { "no-frame-pointer-elim"="true" }			attributes #0 = { "no-frame-pointer-elim"="true" }

llvm/trunk/test/CodeGen/X86/stack-size-section-function-sections.ll

				; RUN: llc < %s -mtriple=x86_64-linux -stack-size-section -function-sections \| FileCheck %s

				; Check we add SHF_LINK_ORDER for .stack_sizes and link it with the corresponding .text sections.
				; CHECK: .section .text._Z3barv,"ax",@progbits
				; CHECK: .section .stack_sizes,"o",@progbits,.text._Z3barv,unique,0
				; CHECK: .section .text._Z3foov,"ax",@progbits
				; CHECK: .section .stack_sizes,"o",@progbits,.text._Z3foov,unique,1

				; Check we add .stack_size section to a COMDAT group with the corresponding .text section if such a COMDAT exists.
				; CHECK: .section .text._Z4fooTIiET_v,"axG",@progbits,_Z4fooTIiET_v,comdat
				; CHECK: .section .stack_sizes,"Go",@progbits,_Z4fooTIiET_v,comdat,.text._Z4fooTIiET_v,unique,2

				$_Z4fooTIiET_v = comdat any

				define dso_local i32 @_Z3barv() {
				ret i32 0
				}

				define dso_local i32 @_Z3foov() {
				%1 = call i32 @_Z4fooTIiET_v()
				ret i32 %1
				}

				define linkonce_odr dso_local i32 @_Z4fooTIiET_v() comdat {
				ret i32 0
				}

llvm/trunk/test/CodeGen/X86/stack-size-section.ll

	; RUN: llc < %s -mtriple=x86_64-linux -stack-size-section \| FileCheck %s			; RUN: llc < %s -mtriple=x86_64-linux -stack-size-section \| FileCheck %s

	; CHECK-LABEL: func1:			; CHECK-LABEL: func1:
	; CHECK-NEXT: .Lfunc_begin0:			; CHECK-NEXT: .Lfunc_begin0:
	; CHECK: .section .stack_sizes,"",@progbits			; CHECK: .section .stack_sizes,"o",@progbits
	; CHECK-NEXT: .quad .Lfunc_begin0			; CHECK-NEXT: .quad .Lfunc_begin0
	; CHECK-NEXT: .byte 8			; CHECK-NEXT: .byte 8
	define void @func1(i32, i32) #0 {			define void @func1(i32, i32) #0 {
	alloca i32, align 4			alloca i32, align 4
	alloca i32, align 4			alloca i32, align 4
	ret void			ret void
	}			}

	; CHECK-LABEL: func2:			; CHECK-LABEL: func2:
	; CHECK-NEXT: .Lfunc_begin1:			; CHECK-NEXT: .Lfunc_begin1:
	; CHECK: .section .stack_sizes,"",@progbits			; CHECK: .section .stack_sizes,"o",@progbits
	; CHECK-NEXT: .quad .Lfunc_begin1			; CHECK-NEXT: .quad .Lfunc_begin1
	; CHECK-NEXT: .byte 24			; CHECK-NEXT: .byte 24
	define void @func2() #0 {			define void @func2() #0 {
	alloca i32, align 4			alloca i32, align 4
	call void @func1(i32 1, i32 2)			call void @func1(i32 1, i32 2)
	ret void			ret void
	}			}

				; Check that we still put .stack_sizes into the corresponding COMDAT group if any.
				; CHECK: .section .text._Z4fooTIiET_v,"axG",@progbits,_Z4fooTIiET_v,comdat
				; CHECK: .section .stack_sizes,"Go",@progbits,_Z4fooTIiET_v,comdat,.text._Z4fooTIiET_v,unique,1
				$_Z4fooTIiET_v = comdat any
				define linkonce_odr dso_local i32 @_Z4fooTIiET_v() comdat {
				ret i32 0
				}

				; Check that we assign a unique ID to .stack_sizes if it is linked with a unique .text section.
				; CHECK: .section .text.func3,"ax",@progbits
				; CHECK: .section .stack_sizes,"o",@progbits,.text.func3,unique,2
				define dso_local i32 @func3() section ".text.func3" {
				%1 = alloca i32, align 4
				store i32 0, i32* %1, align 4
				ret i32 0
				}

	; CHECK-LABEL: dynalloc:			; CHECK-LABEL: dynalloc:
	; CHECK-NOT: .section .stack_sizes			; CHECK-NOT: .section .stack_sizes
	define void @dynalloc(i32 %N) #0 {			define void @dynalloc(i32 %N) #0 {
	alloca i32, i32 %N			alloca i32, i32 %N
	ret void			ret void
	}			}

	attributes #0 = { "no-frame-pointer-elim"="true" }			attributes #0 = { "no-frame-pointer-elim"="true" }

This is an archive of the discontinued LLVM Phabricator instance.

[MC] - Add .stack_size sections into groups and link them with .text
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 152446

llvm/trunk/include/llvm/MC/MCObjectFileInfo.h

llvm/trunk/lib/CodeGen/AsmPrinter/AsmPrinter.cpp

llvm/trunk/lib/MC/MCObjectFileInfo.cpp

llvm/trunk/test/CodeGen/ARM/stack-size-section.ll

llvm/trunk/test/CodeGen/SystemZ/stack-size-section.ll

llvm/trunk/test/CodeGen/X86/stack-size-section-function-sections.ll

llvm/trunk/test/CodeGen/X86/stack-size-section.ll

This is an archive of the discontinued LLVM Phabricator instance.

[MC] - Add .stack_size sections into groups and link them with .textClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 152446

llvm/trunk/include/llvm/MC/MCObjectFileInfo.h

llvm/trunk/lib/CodeGen/AsmPrinter/AsmPrinter.cpp

llvm/trunk/lib/MC/MCObjectFileInfo.cpp

llvm/trunk/test/CodeGen/ARM/stack-size-section.ll

llvm/trunk/test/CodeGen/SystemZ/stack-size-section.ll

llvm/trunk/test/CodeGen/X86/stack-size-section-function-sections.ll

llvm/trunk/test/CodeGen/X86/stack-size-section.ll

[MC] - Add .stack_size sections into groups and link them with .text
ClosedPublic