llvm/CodeGen at fa27f925be05edccc0b47c40dd808465fe25fca2 - llvm

mirror of https://github.com/RPCSX/llvm.git synced 2025-02-20 21:14:28 +00:00

Etienne Bergeron 70cf01c276 [stack-protection] Add support for MSVC buffer security check

Summary:
This patch adds support for the MSVC buffer security check implementation.

The buffer security check is turned on with the '/GS' compiler switch.
  * https://msdn.microsoft.com/en-us/library/8dbf701c.aspx
  * To be added to clang here: http://reviews.llvm.org/D20347

Some overview of buffer security check feature and implementation:
  * https://msdn.microsoft.com/en-us/library/aa290051(VS.71).aspx
  * http://www.ksyash.com/2011/01/buffer-overflow-protection-3/
  * http://blog.osom.info/2012/02/understanding-vs-c-compilers-buffer.html


For the following example:
```
#include <string.h>  /* for memset */

int example(int offset, int index) {
  char buffer[10];
  memset(buffer, 0xCC, index);
  return buffer[index];
}
```

The MSVC compiler adds these instructions to perform the stack integrity check:
```
        push        ebp  
        mov         ebp,esp  
        sub         esp,50h  
  [1]   mov         eax,dword ptr [__security_cookie (01068024h)]  
  [2]   xor         eax,ebp  
  [3]   mov         dword ptr [ebp-4],eax  
        push        ebx  
        push        esi  
        push        edi  
        mov         eax,dword ptr [index]  
        push        eax  
        push        0CCh  
        lea         ecx,[buffer]  
        push        ecx  
        call        _memset (010610B9h)  
        add         esp,0Ch  
        mov         eax,dword ptr [index]  
        movsx       eax,byte ptr buffer[eax]  
        pop         edi  
        pop         esi  
        pop         ebx  
  [4]   mov         ecx,dword ptr [ebp-4]  
  [5]   xor         ecx,ebp  
  [6]   call        @__security_check_cookie@4 (01061276h)  
        mov         esp,ebp  
        pop         ebp  
        ret  
```

The instrumentation above does the following:
  * [1] loads the global security canary,
  * [2] XORs it with the frame pointer and [3] stores the resulting local canary to the guard slot,
  * [4] reloads the guard slot and [5] XORs it with the frame pointer again to recover the global canary,
  * [6] validates the recovered canary by calling '__security_check_cookie', which performs the error handling.
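
The six marked steps can be modelled in a few lines. The Python sketch below is illustrative only: `SECURITY_COOKIE`, `prologue` and `epilogue_check` are hypothetical names, the cookie value is made up, and a real `__security_check_cookie` reports the failure and terminates the process rather than returning a boolean.

```python
MASK32 = 0xFFFFFFFF

# Hypothetical stand-in for the process-wide __security_cookie value.
SECURITY_COOKIE = 0x1234ABCD


def prologue(ebp):
    """Steps [1]-[3]: load the global cookie, XOR it with the frame pointer,
    and return the value that is stored in the guard slot at [ebp-4]."""
    return (SECURITY_COOKIE ^ ebp) & MASK32


def epilogue_check(guard_slot, ebp):
    """Steps [4]-[6]: reload the guard slot, XOR it with the frame pointer
    again and compare the result with the global cookie."""
    recovered = (guard_slot ^ ebp) & MASK32
    return recovered == SECURITY_COOKIE
```

An untouched guard slot passes the check, while a slot overwritten by the `memset` fill pattern (0xCC bytes) does not, because XORing it with the frame pointer no longer yields the cookie.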

Overview of the current stack-protection implementation:
  * lib/CodeGen/StackProtector.cpp
    * There is a default stack-protection implementation applied on intermediate representation.
    * A target can override the 'getIRStackGuard' method if it has a standard location for the stack protector cookie.
    * An intrinsic 'Intrinsic::stackprotector' is added to the prologue. It will be expanded by the instruction selection pass (DAG or Fast).
    * Basic blocks are added to every instrumented function to hold the code for stack guard validation and error handling.
    * Guard manipulation and comparison are added directly to the intermediate representation.

  * lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
  * lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
    * There is an implementation that adds instrumentation during instruction selection (for better handling of sibling calls).
      * see the long comment above the 'class StackProtectorDescriptor' declaration.
    * The target needs to override 'getSDagStackGuard' to activate SDAG stack protection generation. (note: 'getIRStackGuard' MUST return nullptr).
      * 'getSDagStackGuard' returns the appropriate stack guard (security cookie).
    * The code is generated by 'SelectionDAGBuilder.cpp' and 'SelectionDAGISel.cpp'.

  * include/llvm/Target/TargetLowering.h
    * Contains the function that retrieves the default Guard 'Value'; it should be overridden by each target to select which implementation is used and to provide the Guard 'Value'.

  * lib/Target/X86/X86ISelLowering.cpp
    * Contains the x86 specialisation, which provides the Guard 'Value' used by the SelectionDAG algorithm.

Function-based Instrumentation:
  * MSVC doesn't inline the stack guard comparison in every function. Instead, a call to '__security_check_cookie' is added to the epilogue before every return instruction.
  * To support function-based instrumentation, this patch is
    * adding a function to get the function-based check (llvm 'Value', see include/llvm/Target/TargetLowering.h),
      * If provided, the stack protection instrumentation won't be inlined and a call to that function will be added to the epilogue.
    * modifying SelectionDAGISel.cpp to avoid producing the basic blocks used for inline instrumentation,
    * generating the function-based instrumentation during the ISel pass (SelectionDAGBuilder.cpp),
    * if FastISel is used (not SelectionDAG), using the fallback, which relies on the same function-based instrumentation implemented over the intermediate representation (StackProtector.cpp).

Modifications
  * adding support for MSVC (lib/Target/X86/X86ISelLowering.cpp)
  * adding support for function-based instrumentation (lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp, .h)

Results

  * IR generated instrumentation:
```
clang-cl /GS test.cc /Od /c -mllvm -print-isel-input
```

```
*** Final LLVM Code input to ISel ***

; Function Attrs: nounwind sspstrong
define i32 @"\01?example@@YAHHH@Z"(i32 %offset, i32 %index) #0 {
entry:
  %StackGuardSlot = alloca i8*                                                  <<<-- Allocated guard slot
  %0 = call i8* @llvm.stackguard()                                              <<<-- Loading Stack Guard value
  call void @llvm.stackprotector(i8* %0, i8** %StackGuardSlot)                  <<<-- Prologue intrinsic call (store to Guard slot)
  %index.addr = alloca i32, align 4
  %offset.addr = alloca i32, align 4
  %buffer = alloca [10 x i8], align 1
  store i32 %index, i32* %index.addr, align 4
  store i32 %offset, i32* %offset.addr, align 4
  %arraydecay = getelementptr inbounds [10 x i8], [10 x i8]* %buffer, i32 0, i32 0
  %1 = load i32, i32* %index.addr, align 4
  call void @llvm.memset.p0i8.i32(i8* %arraydecay, i8 -52, i32 %1, i32 1, i1 false)
  %2 = load i32, i32* %index.addr, align 4
  %arrayidx = getelementptr inbounds [10 x i8], [10 x i8]* %buffer, i32 0, i32 %2
  %3 = load i8, i8* %arrayidx, align 1
  %conv = sext i8 %3 to i32
  %4 = load volatile i8*, i8** %StackGuardSlot                                  <<<-- Loading Guard slot
  call void @__security_check_cookie(i8* %4)                                    <<<-- Epilogue function-based check
  ret i32 %conv
}
```

  * SelectionDAG generated instrumentation:

```
clang-cl /GS test.cc /O1 /c /FA
```

```
"?example@@YAHHH@Z":                    # @"\01?example@@YAHHH@Z"
# BB#0:                                 # %entry
        pushl   %esi
        subl    $16, %esp
        movl    ___security_cookie, %eax                                        <<<-- Loading Stack Guard value
        movl    28(%esp), %esi
        movl    %eax, 12(%esp)                                                  <<<-- Store to Guard slot
        leal    2(%esp), %eax
        pushl   %esi
        pushl   $204
        pushl   %eax
        calll   _memset
        addl    $12, %esp
        movsbl  2(%esp,%esi), %esi
        movl    12(%esp), %ecx                                                  <<<-- Loading Guard slot
        calll   @__security_check_cookie@4                                      <<<-- Epilogue function-based check
        movl    %esi, %eax
        addl    $16, %esp
        popl    %esi
        retl
```

Reviewers: kcc, pcc, eugenis, rnk

Subscribers: majnemer, llvm-commits, hans, thakis, rnk

Differential Revision: http://reviews.llvm.org/D20346

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@272053 91177308-0d34-0410-b5e6-96231b3b80d8

2016-06-07 20:15:35 +00:00

AsmPrinter

Re-land "[codeview] Emit information about global variables"

2016-06-07 00:02:03 +00:00

GlobalISel

[RegBankSelect] Compute the repairing cost for copies.

2016-05-21 01:43:25 +00:00

MIRParser

MIR: Support MachineMemOperands without associated value

2016-06-04 00:06:31 +00:00

SelectionDAG

[stack-protection] Add support for MSVC buffer security check

2016-06-07 20:15:35 +00:00

AggressiveAntiDepBreaker.cpp

Do not rename registers that do not start an independent live range

2016-05-26 18:22:53 +00:00

AggressiveAntiDepBreaker.h

CodeGen: Use MachineInstr& in AntiDepBreaker API, NFC

2016-02-27 19:33:37 +00:00

AllocationOrder.cpp

TargetRegisterInfo: Provide a way to check assigned registers in getRegAllocationHints()

2015-07-15 22:16:00 +00:00

AllocationOrder.h

TargetRegisterInfo: Provide a way to check assigned registers in getRegAllocationHints()

2015-07-15 22:16:00 +00:00

Analysis.cpp

Use shouldAssumeDSOLocal on AArch64.

2016-05-26 12:42:55 +00:00

AntiDepBreaker.h

CodeGen: Use MachineInstr& in AntiDepBreaker API, NFC

2016-02-27 19:33:37 +00:00

AtomicExpandPass.cpp

Rename getLargestLegalIntTypeSize to getLargestLegalIntTypeSizeInBits(). NFC.

2016-05-13 18:38:35 +00:00

BasicTargetTransformInfo.cpp

constify the Function parameter to the TTI creation callback and

2015-09-16 23:38:13 +00:00

BranchFolding.cpp

Revert "[MBP] Reduce code size by running tail merging in MBP."

2016-06-07 15:17:21 +00:00

BranchFolding.h

Revert "[MBP] Reduce code size by running tail merging in MBP."

2016-06-07 15:17:21 +00:00

BuiltinGCs.cpp

[GC] Consolidate all built in GCs into a single file [NFC]

2016-01-19 03:57:18 +00:00

CalcSpillWeights.cpp

CodeGen: Update LiveIntervalAnalysis API to use MachineInstr&, NFC

2016-02-27 20:14:29 +00:00

CallingConvLower.cpp

CodeGen: Factor out code for tail call result compatibility check; NFC

2016-03-30 22:46:04 +00:00

CMakeLists.txt

CodeGen: Refactor renameDisconnectedComponents() as a pass

2016-05-31 22:38:06 +00:00

CodeGen.cpp

CodeGen: Refactor renameDisconnectedComponents() as a pass

2016-05-31 22:38:06 +00:00

CodeGenPrepare.cpp

Clarify that we match BSwap in InstCombine and BitReverse in CGP. NFC.

2016-05-25 16:22:14 +00:00

CriticalAntiDepBreaker.cpp

[CodeGen] Fix problem with X86 byte registers in CriticalAntiDepBreaker

2016-05-26 23:08:52 +00:00

CriticalAntiDepBreaker.h

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

DeadMachineInstructionElim.cpp

Re-commit optimization bisect support (r267022) without new pass manager support.

2016-04-22 22:06:11 +00:00

DetectDeadLanes.cpp

DetectDeadLanes: Increase precision when detecting undef inputs

2016-05-06 22:43:50 +00:00

DFAPacketizer.cpp

Apply clang-tidy's misc-static-assert where it makes sense.

2016-05-27 11:36:04 +00:00

DwarfEHPrepare.cpp

Move EH-specific helper functions to a more appropriate place

2015-12-02 23:06:39 +00:00

EarlyIfConversion.cpp

Add opt-bisect support to additional passes that can be skipped

2016-05-03 22:32:30 +00:00

EdgeBundles.cpp

Revert r240137 (Fixed/added namespace ending comments using clang-tidy. NFC)

2015-06-23 09:49:53 +00:00

ExecutionDepsFix.cpp

Add opt-bisect support to additional passes that can be skipped

2016-05-03 22:32:30 +00:00

ExpandISelPseudos.cpp

CodeGen: Remove a few more ilist iterator implicit conversions, NFC

2015-10-09 18:44:40 +00:00

ExpandPostRAPseudos.cpp

…

FaultMaps.cpp

Revert "[FaultMaps] Move FaultMapParser to Object/"

2015-06-23 20:09:03 +00:00

FuncletLayout.cpp

Introduce MachineFunctionProperties and the AllVRegsAllocated property

2016-03-28 17:05:30 +00:00

GCMetadata.cpp

Revert r240137 (Fixed/added namespace ending comments using clang-tidy. NFC)

2015-06-23 09:49:53 +00:00

GCMetadataPrinter.cpp

Revert 258157

2016-01-19 18:41:10 +00:00

GCRootLowering.cpp

[opaque pointer types] Alloca: use getAllocatedType() instead of getType()->getPointerElementType().

2016-01-18 00:10:01 +00:00

GCStrategy.cpp

Revert 258157

2016-01-19 18:41:10 +00:00

GlobalMerge.cpp

CodeGen: Make the global-merge pass independently testable, and add a test.

2016-05-19 04:38:56 +00:00

IfConversion.cpp

Revert "[MBP] Reduce code size by running tail merging in MBP."

2016-06-07 15:17:21 +00:00

ImplicitNullChecks.cpp

[ImplicitNullChecks] Account for implicit-defs as well when updating the liveness.

2016-05-03 18:09:06 +00:00

InlineSpiller.cpp

InsertPointAnalysis: Move current live interval from being a class member

2016-05-23 19:39:19 +00:00

InterferenceCache.cpp

CodeGen: Remove more ilist iterator implicit conversions, NFC

2015-10-09 19:13:58 +00:00

InterferenceCache.h

[CodeGen] Reduce visibility of implementation details

2015-07-01 14:47:39 +00:00

InterleavedAccessPass.cpp

[ARM, AArch64] Match additional patterns to ldN instructions

2016-05-19 21:39:00 +00:00

IntrinsicLowering.cpp

getParent() ^ 3 == getModule() ; NFCI

2015-12-14 17:24:23 +00:00

LatencyPriorityQueue.cpp

…

LexicalScopes.cpp

Fixed Dwarf debug info emission to skip DILexicalBlockFile entries.

2016-04-21 16:58:49 +00:00

LiveDebugValues.cpp

PR26055: Speed up LiveDebugValues::transferDebugValue()

2016-05-26 21:42:47 +00:00

LiveDebugVariables.cpp

Apply clang-tidy's misc-move-constructor-init throughout LLVM.

2016-05-27 14:27:24 +00:00

LiveDebugVariables.h

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

LiveInterval.cpp

CodeGen: Refactor renameDisconnectedComponents() as a pass

2016-05-31 22:38:06 +00:00

LiveIntervalAnalysis.cpp

CodeGen: Refactor renameDisconnectedComponents() as a pass

2016-05-31 22:38:06 +00:00

LiveIntervalUnion.cpp

…

LivePhysRegs.cpp

livePhysRegs: Pass MBB by reference in addLive{Ins|Outs}(); NFC

2016-05-03 00:24:32 +00:00

LiveRangeCalc.cpp

LiveIntervalAnalysis: Rework constructMainRangeFromSubranges()

2016-05-20 23:14:56 +00:00

LiveRangeCalc.h

LiveIntervalAnalysis: Rework constructMainRangeFromSubranges()

2016-05-20 23:14:56 +00:00

LiveRangeEdit.cpp

[foldMemoryOperand()] Pass LiveIntervals to enable liveness check.

2016-05-10 08:09:37 +00:00

LiveRangeUtils.h

CodeGen: Refactor renameDisconnectedComponents() as a pass

2016-05-31 22:38:06 +00:00

LiveRegMatrix.cpp

TargetRegisterInfo: Introduce PrintLaneMask.

2015-09-25 21:51:24 +00:00

LiveStackAnalysis.cpp

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

LiveVariables.cpp

LiveVariables: Fix typo and shorten comment

2016-03-29 19:07:40 +00:00

LLVMBuild.txt

Include ProfileData as CodeGen's required library.

2016-02-22 22:54:14 +00:00

LLVMTargetMachine.cpp

X86: permit using SjLj EH on x86 targets as an option

2016-05-31 01:48:07 +00:00

LocalStackSlotAllocation.cpp

CodeGen: Remove more ilist iterator implicit conversions, NFC

2015-10-09 19:13:58 +00:00

LowerEmuTLS.cpp

Re-commit optimization bisect support (r267022) without new pass manager support.

2016-04-22 22:06:11 +00:00

MachineBasicBlock.cpp

[MBB] Early exit to reduce indentation, per coding guidelines. NFC.

2016-05-25 21:53:46 +00:00

MachineBlockFrequencyInfo.cpp

CodeGen: Avoid ilist iterator implicit conversions in a few more places, NFC

2015-10-09 19:23:20 +00:00

MachineBlockPlacement.cpp

Revert "[MBP] Reduce code size by running tail merging in MBP."

2016-06-07 15:17:21 +00:00

MachineBranchProbabilityInfo.cpp

Replace hard coded probability threshold with parameter /NFC

2016-06-03 23:48:36 +00:00

MachineCombiner.cpp

[MachineCombiner] Support for floating-point FMA on ARM64 (re-commit r267098)

2016-04-24 05:14:01 +00:00

MachineCopyPropagation.cpp

Make MachineCopyPropagation preserve CFG

2016-06-02 00:04:26 +00:00

MachineCSE.cpp

Re-commit optimization bisect support (r267022) without new pass manager support.

2016-04-22 22:06:11 +00:00

MachineDominanceFrontier.cpp

…

MachineDominators.cpp

Remove macro guards for extern template instantiations.

2015-07-13 17:21:31 +00:00

MachineFunction.cpp

MachineFunction: Add a const modifier to print() parameter

2016-05-05 18:14:43 +00:00

MachineFunctionAnalysis.cpp

MIR Serialization: Connect the machine function analysis pass to the MIR parser.

2015-06-15 20:30:22 +00:00

MachineFunctionPass.cpp

Improve error message reporting for MachineFunctionProperties

2016-04-21 22:19:24 +00:00

MachineFunctionPrinterPass.cpp

Recommit r256952 "Filtering IR printing for print-after-all/print-before-all"

2016-01-06 22:55:03 +00:00

MachineInstr.cpp

Fix PR26655: Bail out if all regs of an inst BUNDLE have the correct kill flag

2016-05-10 17:57:27 +00:00

MachineInstrBundle.cpp

Apply clang-tidy's misc-move-constructor-init throughout LLVM.

2016-05-27 14:27:24 +00:00

MachineLICM.cpp

Fix DEBUG logs in MachineLICM.

2016-05-23 18:56:07 +00:00

MachineLoopInfo.cpp

ADT: Remove == and != comparisons between ilist iterators and pointers

2016-02-21 20:39:50 +00:00

MachineModuleInfo.cpp

Remove uses of builtin comma operator.

2016-02-18 22:09:30 +00:00

MachineModuleInfoImpls.cpp

…

MachinePassRegistry.cpp

…

MachinePostDominators.cpp

…

MachineRegionInfo.cpp

Introduce DominanceFrontierAnalysis to the new PassManager to compute DominanceFrontier. NFC

2016-02-25 17:54:15 +00:00

MachineRegisterInfo.cpp

Replace MachineRegisterInfo::TracksLiveness with a MachineFunctionProperty

2016-04-11 23:32:13 +00:00

MachineScheduler.cpp

CodeGen: Refactor renameDisconnectedComponents() as a pass

2016-05-31 22:38:06 +00:00

MachineSink.cpp

Re-commit optimization bisect support (r267022) without new pass manager support.

2016-04-22 22:06:11 +00:00

MachineSSAUpdater.cpp

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

MachineTraceMetrics.cpp

Reapply "CodeGen: Use references in MachineTraceMetrics::Trace, NFC"

2016-02-22 03:33:28 +00:00

MachineVerifier.cpp

MachineVerifier: subregs so not require defs/valnos on every path

2016-05-20 23:02:13 +00:00

MIRPrinter.cpp

MIR: Support MachineMemOperands without associated value

2016-06-04 00:06:31 +00:00

MIRPrinter.h

MIR Serialization: move the MIR printer out of the MIR printing pass.

2015-06-15 23:52:35 +00:00

MIRPrintingPass.cpp

Re-commit r247216: "Fix Clang-tidy misc-use-override warnings, other minor fixes"

2015-09-10 16:49:58 +00:00

OptimizePHIs.cpp

Re-commit optimization bisect support (r267022) without new pass manager support.

2016-04-22 22:06:11 +00:00

ParallelCG.cpp

[ParallelCG] SmallVector<char> -> SmallString.

2016-04-17 19:38:57 +00:00

PatchableFunction.cpp

Add a description for the PatchableFunction pass; NFC

2016-04-19 06:25:02 +00:00

PeepholeOptimizer.cpp

Re-commit optimization bisect support (r267022) without new pass manager support.

2016-04-22 22:06:11 +00:00

PHIElimination.cpp

LiveIntervalAnalysis: Remove LiveVariables requirement

2016-04-28 23:42:51 +00:00

PHIEliminationUtils.cpp

[WinEH] Add some support for code generating catchpad

2015-08-27 23:27:47 +00:00

PHIEliminationUtils.h

…

PostRAHazardRecognizer.cpp

PostRAHazardRecocgnizer: Fix unused-private-field warning

2016-04-22 15:11:08 +00:00

PostRASchedulerList.cpp

CodeGen: Move check of EnablePostRAScheduler to avoid disabling antidependency breaker

2016-05-19 16:40:49 +00:00

PreISelIntrinsicLowering.cpp

Introduce llvm.load.relative intrinsic.

2016-04-22 21:18:02 +00:00

ProcessImplicitDefs.cpp

Revert "CodeGen: MachineInstr::getIterator() => getInstrIterator(), NFC"

2016-02-22 20:49:58 +00:00

PrologEpilogInserter.cpp

[PEI, AArch64] Use empty spaces in stack area for local stack slot allocation.

2016-06-02 16:22:07 +00:00

PseudoSourceValue.cpp

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

README.txt

…

RegAllocBase.cpp

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

RegAllocBase.h

Recommit r265547, and r265610,r265639,r265657 on top of it, plus

2016-04-13 03:08:27 +00:00

RegAllocBasic.cpp

Recommit r265547, and r265610,r265639,r265657 on top of it, plus

2016-04-13 03:08:27 +00:00

RegAllocFast.cpp

Re-commit r269828 "X86: Avoid using _chkstk when lowering WIN_ALLOCA instructions"

2016-05-18 16:10:17 +00:00

RegAllocGreedy.cpp

Recommit r265547, and r265610,r265639,r265657 on top of it, plus

2016-04-13 03:08:27 +00:00

RegAllocPBQP.cpp

Recommit r265547, and r265610,r265639,r265657 on top of it, plus

2016-04-13 03:08:27 +00:00

RegisterClassInfo.cpp

…

RegisterCoalescer.cpp

CodeGen: Don't iterate over operands after we've erased an MI

2016-03-25 20:03:28 +00:00

RegisterCoalescer.h

Revert r240137 (Fixed/added namespace ending comments using clang-tidy. NFC)

2015-06-23 09:49:53 +00:00

RegisterPressure.cpp

RegisterPressure: Fix default lanemask for missing regunit intervals

2016-04-29 02:44:54 +00:00

RegisterScavenging.cpp

Use report_fatal_error after all

2016-05-20 19:46:42 +00:00

RenameIndependentSubregs.cpp

CodeGen: Refactor renameDisconnectedComponents() as a pass

2016-05-31 22:38:06 +00:00

SafeStack.cpp

[safestack] Add canary to unsafe stack frames

2016-04-11 22:27:48 +00:00

ScheduleDAG.cpp

MachineScheduler: Add regpressure information to debug dump

2015-11-06 20:59:02 +00:00

ScheduleDAGInstrs.cpp

ScheduleDAGInstrs: Fix memory corruption

2016-05-25 01:18:00 +00:00

ScheduleDAGPrinter.cpp

[NFC] Header cleanup

2016-04-18 09:17:29 +00:00

ScoreboardHazardRecognizer.cpp

ScoreboardHazardRecognizer: unbreak TSAN by moving a static mutated variable to a member

2016-04-20 00:21:24 +00:00

ShadowStackGCLowering.cpp

[GC] Consolidate all built in GCs into a single file [NFC]

2016-01-19 03:57:18 +00:00

ShrinkWrap.cpp

[ShrinkWrapping] Give up on irreducible CFGs.

2016-01-07 01:23:49 +00:00

SjLjEHPrepare.cpp

ADT: Remove == and != comparisons between ilist iterators and pointers

2016-02-21 20:39:50 +00:00

SlotIndexes.cpp

CodeGen: Take MachineInstr& in SlotIndexes and LiveIntervals, NFC

2016-02-27 06:40:41 +00:00

Spiller.h

Recommit r265547, and r265610,r265639,r265657 on top of it, plus

2016-04-13 03:08:27 +00:00

SpillPlacement.cpp

Reapply r263460: [SpillPlacement] Fix a quadratic behavior in spill placement.

2016-05-19 22:40:37 +00:00

SpillPlacement.h

Reapply r263460: [SpillPlacement] Fix a quadratic behavior in spill placement.

2016-05-19 22:40:37 +00:00

SplitKit.cpp

InsertPointAnalysis: Move current live interval from being a class member

2016-05-23 19:39:19 +00:00

SplitKit.h

InsertPointAnalysis: Move current live interval from being a class member

2016-05-23 19:39:19 +00:00

StackColoring.cpp

Better fix for PR27903.

2016-06-01 17:55:10 +00:00

StackMapLivenessAnalysis.cpp

livePhysRegs: Pass MBB by reference in addLive{Ins|Outs}(); NFC

2016-05-03 00:24:32 +00:00

StackMaps.cpp

Fix a couple of redundant conditional expressions (PR27283, PR28282)

2016-04-11 20:35:01 +00:00

StackProtector.cpp

[stack-protection] Add support for MSVC buffer security check

2016-06-07 20:15:35 +00:00

StackSlotColoring.cpp

CodeGen: Update LiveIntervalAnalysis API to use MachineInstr&, NFC

2016-02-27 20:14:29 +00:00

TailDuplication.cpp

Re-commit optimization bisect support (r267022) without new pass manager support.

2016-04-22 22:06:11 +00:00

TailDuplicator.cpp

[Tail duplication] Handle source registers with subregisters

2016-04-26 18:36:34 +00:00

TargetFrameLoweringImpl.cpp

CXX_FAST_TLS calling convention: performance improvement for PPC64

2016-04-08 12:04:32 +00:00

TargetInstrInfo.cpp

[foldMemoryOperand()] Pass LiveIntervals to enable liveness check.

2016-05-10 08:09:37 +00:00

TargetLoweringBase.cpp

[stack-protection] Add support for MSVC buffer security check

2016-06-07 20:15:35 +00:00

TargetLoweringObjectFileImpl.cpp

Simplify handling of hidden stub.

2016-05-17 16:01:32 +00:00

TargetOptionsImpl.cpp

Use function attribute "trap-func-name" and remove TargetOptions::TrapFuncName.

2015-07-02 22:13:27 +00:00

TargetPassConfig.cpp

CodeGen: Refactor renameDisconnectedComponents() as a pass

2016-05-31 22:38:06 +00:00

TargetRegisterInfo.cpp

[TargetRegisterInfo] Re-apply r265734.

2016-04-08 00:51:00 +00:00

TargetSchedule.cpp

CodeGen: TII: Take MachineInstr& in predicate API, NFC

2016-02-23 02:46:52 +00:00

TwoAddressInstructionPass.cpp

LiveIntervalAnalysis: Remove LiveVariables requirement

2016-04-28 23:42:51 +00:00

UnreachableBlockElim.cpp

CodeGen: Remove implicit ilist iterator conversions, NFC

2015-10-09 22:56:24 +00:00

VirtRegMap.cpp

Use false for bool instead of 0

2016-06-02 18:37:21 +00:00

WinEHPrepare.cpp

IR: RF_IgnoreMissingValues => RF_IgnoreMissingLocals, NFC

2016-04-07 00:26:43 +00:00

README.txt

//===---------------------------------------------------------------------===//

Common register allocation / spilling problem:

        mul lr, r4, lr
        str lr, [sp, #+52]
        ldr lr, [r1, #+32]
        sxth r3, r3
        ldr r4, [sp, #+52]
        mla r4, r3, lr, r4

can be:

        mul lr, r4, lr
        mov r4, lr
        str lr, [sp, #+52]
        ldr lr, [r1, #+32]
        sxth r3, r3
        mla r4, r3, lr, r4

and then "merge" mul and mov:

        mul r4, r4, lr
        str r4, [sp, #+52]
        ldr lr, [r1, #+32]
        sxth r3, r3
        mla r4, r3, lr, r4

This also increases the likelihood that the store becomes dead.

//===---------------------------------------------------------------------===//

bb27 ...
        ...
        %reg1037 = ADDri %reg1039, 1
        %reg1038 = ADDrs %reg1032, %reg1039, %NOREG, 10
    Successors according to CFG: 0x8b03bf0 (#5)

bb76 (0x8b03bf0, LLVM BB @0x8b032d0, ID#5):
    Predecessors according to CFG: 0x8b0c5f0 (#3) 0x8b0a7c0 (#4)
        %reg1039 = PHI %reg1070, mbb<bb76.outer,0x8b0c5f0>, %reg1037, mbb<bb27,0x8b0a7c0>

Note ADDri is not a two-address instruction. However, its result %reg1037 is an
operand of the PHI node in bb76 and its operand %reg1039 is the result of the
PHI node. We should treat it as a two-address code and make sure the ADDri is
scheduled after any node that reads %reg1039.

//===---------------------------------------------------------------------===//

Use local info (i.e. the register scavenger) to assign a spilled value a free
register so that it can be reused instead of reloaded:
        ldr r3, [sp, #+4]
        add r3, r3, #3
        ldr r2, [sp, #+8]
        add r2, r2, #2
        ldr r1, [sp, #+4]  <==
        add r1, r1, #1
        ldr r0, [sp, #+4]
        add r0, r0, #2

//===---------------------------------------------------------------------===//

LLVM aggressively lifts CSEs out of loops. Sometimes this can have negative
side effects:

R1 = X + 4
R2 = X + 7
R3 = X + 15

loop:
load [i + R1]
...
load [i + R2]
...
load [i + R3]

Suppose there is high register pressure: R1, R2 and R3 can be spilled. We need
to implement proper re-materialization to handle this:

R1 = X + 4
R2 = X + 7
R3 = X + 15

loop:
R1 = X + 4  @ re-materialized
load [i + R1]
...
R2 = X + 7 @ re-materialized
load [i + R2]
...
R3 = X + 15 @ re-materialized
load [i + R3]

Furthermore, with re-association, we can enable sharing:

R1 = X + 4
R2 = X + 7
R3 = X + 15

loop:
T = i + X
load [T + 4]
...
load [T + 7]
...
load [T + 15]
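
The re-association step relies on `i + (X + c)` and `(i + X) + c` naming the same address. The Python sketch below checks that equivalence on made-up values; the function names are hypothetical, and it assumes the target can fold a small constant into the addressing mode, as the `load [T + 4]` notation above suggests.

```python
OFFSETS = (4, 7, 15)


def addresses_hoisted(x, i):
    """Addresses as computed with R1, R2 and R3 hoisted out of the loop."""
    hoisted = [x + c for c in OFFSETS]   # R1, R2, R3 stay live across the loop
    return [i + r for r in hoisted]


def addresses_reassociated(x, i):
    """Addresses as computed from the single shared temporary T = i + X."""
    t = i + x                            # one register instead of three
    return [t + c for c in OFFSETS]      # constants fold into the address
```

Both functions produce the same three addresses for any `x` and `i`, but the second keeps only one value live inside the loop body.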

//===---------------------------------------------------------------------===//

It's not always a good idea to choose rematerialization over spilling. If all
the load / store instructions would be folded then spilling is cheaper because
it won't require new live intervals / registers. See 2003-05-31-LongShifts for
an example.

//===---------------------------------------------------------------------===//

With a copying garbage collector, derived pointers must not be retained across
collector safe points; the collector could move the objects and invalidate the
derived pointer. This is bad enough in the first place, but safe points can
crop up unpredictably. Consider:

        %array = load { i32, [0 x %obj] }** %array_addr
        %nth_el = getelementptr { i32, [0 x %obj] }* %array, i32 0, i32 %n
        %old = load %obj** %nth_el
        %z = div i64 %x, %y
        store %obj* %new, %obj** %nth_el

If the i64 division is lowered to a libcall, then a safe point will (must)
appear for the call site. If a collection occurs, %array and %nth_el no longer
point into the correct object.

The fix for this is to copy address calculations so that dependent pointers
are never live across safe point boundaries. But the loads cannot be copied
like this if there was an intervening store, so this may be hard to get right.

Only a concurrent mutator can trigger a collection at the libcall safe point.
So single-threaded programs do not have this requirement, even with a copying
collector. Still, LLVM optimizations would probably undo a front-end's careful
work.
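
The hazard can be reproduced with a toy model. In the Python sketch below, `Heap` and both helper functions are hypothetical and stand in for a copying collector and for the IR sequence above. In a real program the stale store would silently corrupt memory instead of returning `False`.

```python
class Heap:
    """Toy copying heap: objects are lists stored at integer base addresses."""

    def __init__(self):
        self.memory = {}          # base address -> list of slots
        self.next_free = 0x1000

    def allocate(self, slots):
        base = self.next_free
        self.memory[base] = list(slots)
        self.next_free += 0x100
        return base

    def collect(self, root_base):
        """Copy the rooted object to a fresh address and drop the old copy."""
        return self.allocate(self.memory.pop(root_base))

    def store(self, address, value):
        """Store through a raw address; report whether it hit a live object."""
        for base, slots in self.memory.items():
            if base <= address < base + len(slots):
                slots[address - base] = value
                return True
        return False


def store_with_stale_pointer(heap, array, n, new):
    nth_el = array + n              # derived pointer computed before the safe point
    array = heap.collect(array)     # libcall safe point: the object moves
    return heap.store(nth_el, new), array


def store_with_recomputed_pointer(heap, array, n, new):
    array = heap.collect(array)     # safe point first; only the base is a root
    nth_el = array + n              # address calculation copied after the safe point
    return heap.store(nth_el, new), array
```

The first helper keeps the derived pointer live across the safe point and misses the moved object; the second recomputes it from the updated base, which is the "copy address calculations" fix described above.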

//===---------------------------------------------------------------------===//

The OCaml frametable structure supports liveness information. It would be good
to support it.

//===---------------------------------------------------------------------===//

The FIXME in ComputeCommonTailLength in BranchFolding.cpp needs to be
revisited. The check is there to work around a misuse of directives in inline
assembly.

//===---------------------------------------------------------------------===//

It would be good to detect collector/target compatibility instead of silently
doing the wrong thing.

//===---------------------------------------------------------------------===//

It would be really nice to be able to write patterns in .td files for copies,
which would eliminate a bunch of explicit predicates on them (e.g. no side 
effects).  Once this is in place, it would be even better to have tblgen 
synthesize the various copy insertion/inspection methods in TargetInstrInfo.

//===---------------------------------------------------------------------===//

Stack coloring improvements:

1. Do proper LiveStackAnalysis on all stack objects including those which are
   not spill slots.
2. Reorder objects to fill in gaps between objects.
   e.g. 4, 1, <gap>, 4, 1, 1, 1, <gap>, 4 => 4, 1, 1, 1, 1, 4, 4
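
The effect of the reordering in item 2 can be checked with a small layout calculator. This Python sketch is only an illustration: `frame_size` is a hypothetical helper, and it assumes every object is aligned to its own size, which is what produces the two gaps in the example.

```python
def frame_size(sizes):
    """Lay objects out in the given order, aligning each one to its own size,
    and return (total bytes used, bytes lost to padding)."""
    offset = 0
    padding = 0
    for size in sizes:
        gap = -offset % size      # bytes needed to reach the next aligned offset
        padding += gap
        offset += gap + size
    return offset, padding
```

Under that assumption, the original order `4, 1, 4, 1, 1, 1, 4` occupies 20 bytes with 4 bytes of padding, while the reordered `4, 1, 1, 1, 1, 4, 4` occupies 16 bytes with none.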

//===---------------------------------------------------------------------===//

The scheduler should be able to sort nearby instructions by their address. For
example, in an expanded memset sequence it's not uncommon to see code like this:

  movl $0, 4(%rdi)
  movl $0, 8(%rdi)
  movl $0, 12(%rdi)
  movl $0, 0(%rdi)

Each of the stores is independent, and the scheduler is currently making an
arbitrary decision about the order.
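
As a rough illustration of the desired ordering, the Python sketch below sorts textual stores by base register and displacement. It is not scheduler code: `sort_stores_by_address` is a hypothetical helper, it only understands the simple `movl $imm, disp(%reg)` form shown above, and it assumes the stores are independent, as stated.

```python
import re

# Matches e.g. "movl $0, 12(%rdi)" and captures immediate, displacement, base.
STORE = re.compile(r"movl\s+\$(-?\d+),\s*(-?\d+)\((%\w+)\)")


def sort_stores_by_address(lines):
    """Order independent immediate stores by (base register, displacement)."""
    def key(line):
        match = STORE.fullmatch(line.strip())
        if match is None:
            raise ValueError(f"not a simple immediate store: {line!r}")
        _imm, disp, base = match.groups()
        return (base, int(disp))

    return sorted(lines, key=key)
```

Applied to the four stores above, this moves the store to `0(%rdi)` to the front and leaves the others in ascending order.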

//===---------------------------------------------------------------------===//

Another opportunity in this code is that the $0 could be moved to a register:

  movl $0, 4(%rdi)
  movl $0, 8(%rdi)
  movl $0, 12(%rdi)
  movl $0, 0(%rdi)

This would save substantial code size, especially for longer sequences like
this. It would be easy to have a rule telling isel to avoid matching MOV32mi
if the immediate has more than some fixed number of uses. It's more involved
to teach the register allocator how to do late folding to recover from
excessive register pressure.
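
The size difference can be estimated by counting encoding bytes. The Python sketch below is a back-of-the-envelope model, not an assembler: `store_size` and `sequence_sizes` are hypothetical helpers, and the byte counts assume 64-bit mode with `%rdi` as the base register (no REX prefix or SIB byte) and `xorl %eax, %eax` to materialize the zero.

```python
def store_size(displacement, immediate):
    """Bytes for `movl <src>, disp(%rdi)`, where <src> is either a 32-bit
    immediate (MOV32mi) or a register such as %eax (MOV32mr)."""
    size = 2                                   # opcode byte + ModRM byte
    if displacement != 0:
        size += 1 if -128 <= displacement <= 127 else 4
    if immediate:
        size += 4                              # imm32 always takes four bytes
    return size


XOR_EAX_EAX = 2                                # `xorl %eax, %eax`


def sequence_sizes(displacements):
    """Return (bytes with immediate stores, bytes with a zeroed register)."""
    with_immediates = sum(store_size(d, True) for d in displacements)
    with_register = XOR_EAX_EAX + sum(store_size(d, False) for d in displacements)
    return with_immediates, with_register
```

For the displacements 4, 8, 12 and 0 used above, this model gives 27 bytes for the immediate form and 13 bytes for the register form, and the gap widens with every additional store.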