IR: New representation for CFI and virtual call optimization pass metadata.
The bitset metadata currently used in LLVM has a few problems:
1. It has the wrong name. The name "bitset" refers to an implementation
detail of one use of the metadata (i.e. its original use case, CFI).
This makes it harder to understand, as the name makes no sense in the
context of virtual call optimization.
2. It is represented using a global named metadata node, rather than
being directly associated with a global. This makes it harder to
manipulate the metadata when rebuilding global variables, summarise it
as part of ThinLTO and drop unused metadata when associated globals are
dropped. For this reason, CFI does not currently work correctly when
both CFI and vcall opt are enabled, as vcall opt needs to rebuild vtable
globals, and fails to associate metadata with the rebuilt globals. As I
understand it, the same problem could also affect ASan, which rebuilds
globals with a red zone.
This patch solves both of those problems in the following way:
1. Rename the metadata to "type metadata". This new name reflects how
the metadata is currently being used (i.e. to represent type information
for CFI and vtable opt). The new name is reflected in the name for the
associated intrinsic (llvm.type.test) and pass (LowerTypeTests).
2. Attach metadata directly to the globals that it pertains to, rather
than using the "llvm.bitsets" global metadata node as we are doing now.
This is done using the newly introduced capability to attach
metadata to global variables (r271348 and r271358).
See also: http://lists.llvm.org/pipermail/llvm-dev/2016-June/100462.html
Differential Revision: http://reviews.llvm.org/D21053
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@273729 91177308-0d34-0410-b5e6-96231b3b80d8
2016-06-24 21:21:32 +00:00
|
|
|
//===- TypeMetadataUtils.cpp - Utilities related to type metadata ---------===//
|
2016-05-10 18:07:21 +00:00
|
|
|
//
|
|
|
|
// The LLVM Compiler Infrastructure
|
|
|
|
//
|
|
|
|
// This file is distributed under the University of Illinois Open Source
|
|
|
|
// License. See LICENSE.TXT for details.
|
|
|
|
//
|
|
|
|
//===----------------------------------------------------------------------===//
|
|
|
|
//
|
IR: New representation for CFI and virtual call optimization pass metadata.
The bitset metadata currently used in LLVM has a few problems:
1. It has the wrong name. The name "bitset" refers to an implementation
detail of one use of the metadata (i.e. its original use case, CFI).
This makes it harder to understand, as the name makes no sense in the
context of virtual call optimization.
2. It is represented using a global named metadata node, rather than
being directly associated with a global. This makes it harder to
manipulate the metadata when rebuilding global variables, summarise it
as part of ThinLTO and drop unused metadata when associated globals are
dropped. For this reason, CFI does not currently work correctly when
both CFI and vcall opt are enabled, as vcall opt needs to rebuild vtable
globals, and fails to associate metadata with the rebuilt globals. As I
understand it, the same problem could also affect ASan, which rebuilds
globals with a red zone.
This patch solves both of those problems in the following way:
1. Rename the metadata to "type metadata". This new name reflects how
the metadata is currently being used (i.e. to represent type information
for CFI and vtable opt). The new name is reflected in the name for the
associated intrinsic (llvm.type.test) and pass (LowerTypeTests).
2. Attach metadata directly to the globals that it pertains to, rather
than using the "llvm.bitsets" global metadata node as we are doing now.
This is done using the newly introduced capability to attach
metadata to global variables (r271348 and r271358).
See also: http://lists.llvm.org/pipermail/llvm-dev/2016-June/100462.html
Differential Revision: http://reviews.llvm.org/D21053
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@273729 91177308-0d34-0410-b5e6-96231b3b80d8
2016-06-24 21:21:32 +00:00
|
|
|
// This file contains functions that make it easier to manipulate type metadata
|
|
|
|
// for devirtualization.
|
2016-05-10 18:07:21 +00:00
|
|
|
//
|
|
|
|
//===----------------------------------------------------------------------===//
|
|
|
|
|
IR: New representation for CFI and virtual call optimization pass metadata.
The bitset metadata currently used in LLVM has a few problems:
1. It has the wrong name. The name "bitset" refers to an implementation
detail of one use of the metadata (i.e. its original use case, CFI).
This makes it harder to understand, as the name makes no sense in the
context of virtual call optimization.
2. It is represented using a global named metadata node, rather than
being directly associated with a global. This makes it harder to
manipulate the metadata when rebuilding global variables, summarise it
as part of ThinLTO and drop unused metadata when associated globals are
dropped. For this reason, CFI does not currently work correctly when
both CFI and vcall opt are enabled, as vcall opt needs to rebuild vtable
globals, and fails to associate metadata with the rebuilt globals. As I
understand it, the same problem could also affect ASan, which rebuilds
globals with a red zone.
This patch solves both of those problems in the following way:
1. Rename the metadata to "type metadata". This new name reflects how
the metadata is currently being used (i.e. to represent type information
for CFI and vtable opt). The new name is reflected in the name for the
associated intrinsic (llvm.type.test) and pass (LowerTypeTests).
2. Attach metadata directly to the globals that it pertains to, rather
than using the "llvm.bitsets" global metadata node as we are doing now.
This is done using the newly introduced capability to attach
metadata to global variables (r271348 and r271358).
See also: http://lists.llvm.org/pipermail/llvm-dev/2016-June/100462.html
Differential Revision: http://reviews.llvm.org/D21053
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@273729 91177308-0d34-0410-b5e6-96231b3b80d8
2016-06-24 21:21:32 +00:00
|
|
|
#include "llvm/Analysis/TypeMetadataUtils.h"
|
2016-06-25 00:23:04 +00:00
|
|
|
#include "llvm/IR/Constants.h"
|
2016-05-10 18:07:21 +00:00
|
|
|
#include "llvm/IR/Intrinsics.h"
|
|
|
|
#include "llvm/IR/Module.h"
|
|
|
|
|
|
|
|
using namespace llvm;
|
|
|
|
|
|
|
|
// Search for virtual calls that call FPtr and add them to DevirtCalls.
|
|
|
|
static void
|
|
|
|
findCallsAtConstantOffset(SmallVectorImpl<DevirtCallSite> &DevirtCalls,
|
2016-06-25 00:23:04 +00:00
|
|
|
bool *HasNonCallUses, Value *FPtr, uint64_t Offset) {
|
2016-05-10 18:07:21 +00:00
|
|
|
for (const Use &U : FPtr->uses()) {
|
|
|
|
Value *User = U.getUser();
|
|
|
|
if (isa<BitCastInst>(User)) {
|
2016-06-25 00:23:04 +00:00
|
|
|
findCallsAtConstantOffset(DevirtCalls, HasNonCallUses, User, Offset);
|
2016-05-10 18:07:21 +00:00
|
|
|
} else if (auto CI = dyn_cast<CallInst>(User)) {
|
|
|
|
DevirtCalls.push_back({Offset, CI});
|
|
|
|
} else if (auto II = dyn_cast<InvokeInst>(User)) {
|
|
|
|
DevirtCalls.push_back({Offset, II});
|
2016-06-25 00:23:04 +00:00
|
|
|
} else if (HasNonCallUses) {
|
|
|
|
*HasNonCallUses = true;
|
2016-05-10 18:07:21 +00:00
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
// Search for virtual calls that load from VPtr and add them to DevirtCalls.
|
|
|
|
static void
|
|
|
|
findLoadCallsAtConstantOffset(Module *M,
|
|
|
|
SmallVectorImpl<DevirtCallSite> &DevirtCalls,
|
2016-07-13 03:42:38 +00:00
|
|
|
Value *VPtr, int64_t Offset) {
|
2016-05-10 18:07:21 +00:00
|
|
|
for (const Use &U : VPtr->uses()) {
|
|
|
|
Value *User = U.getUser();
|
|
|
|
if (isa<BitCastInst>(User)) {
|
|
|
|
findLoadCallsAtConstantOffset(M, DevirtCalls, User, Offset);
|
|
|
|
} else if (isa<LoadInst>(User)) {
|
2016-06-25 00:23:04 +00:00
|
|
|
findCallsAtConstantOffset(DevirtCalls, nullptr, User, Offset);
|
2016-05-10 18:07:21 +00:00
|
|
|
} else if (auto GEP = dyn_cast<GetElementPtrInst>(User)) {
|
|
|
|
// Take into account the GEP offset.
|
|
|
|
if (VPtr == GEP->getPointerOperand() && GEP->hasAllConstantIndices()) {
|
|
|
|
SmallVector<Value *, 8> Indices(GEP->op_begin() + 1, GEP->op_end());
|
2016-07-13 03:42:38 +00:00
|
|
|
int64_t GEPOffset = M->getDataLayout().getIndexedOffsetInType(
|
2016-05-10 18:07:21 +00:00
|
|
|
GEP->getSourceElementType(), Indices);
|
|
|
|
findLoadCallsAtConstantOffset(M, DevirtCalls, User, Offset + GEPOffset);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2016-06-25 00:23:04 +00:00
|
|
|
void llvm::findDevirtualizableCallsForTypeTest(
|
2016-05-10 18:07:21 +00:00
|
|
|
SmallVectorImpl<DevirtCallSite> &DevirtCalls,
|
|
|
|
SmallVectorImpl<CallInst *> &Assumes, CallInst *CI) {
|
IR: New representation for CFI and virtual call optimization pass metadata.
The bitset metadata currently used in LLVM has a few problems:
1. It has the wrong name. The name "bitset" refers to an implementation
detail of one use of the metadata (i.e. its original use case, CFI).
This makes it harder to understand, as the name makes no sense in the
context of virtual call optimization.
2. It is represented using a global named metadata node, rather than
being directly associated with a global. This makes it harder to
manipulate the metadata when rebuilding global variables, summarise it
as part of ThinLTO and drop unused metadata when associated globals are
dropped. For this reason, CFI does not currently work correctly when
both CFI and vcall opt are enabled, as vcall opt needs to rebuild vtable
globals, and fails to associate metadata with the rebuilt globals. As I
understand it, the same problem could also affect ASan, which rebuilds
globals with a red zone.
This patch solves both of those problems in the following way:
1. Rename the metadata to "type metadata". This new name reflects how
the metadata is currently being used (i.e. to represent type information
for CFI and vtable opt). The new name is reflected in the name for the
associated intrinsic (llvm.type.test) and pass (LowerTypeTests).
2. Attach metadata directly to the globals that it pertains to, rather
than using the "llvm.bitsets" global metadata node as we are doing now.
This is done using the newly introduced capability to attach
metadata to global variables (r271348 and r271358).
See also: http://lists.llvm.org/pipermail/llvm-dev/2016-June/100462.html
Differential Revision: http://reviews.llvm.org/D21053
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@273729 91177308-0d34-0410-b5e6-96231b3b80d8
2016-06-24 21:21:32 +00:00
|
|
|
assert(CI->getCalledFunction()->getIntrinsicID() == Intrinsic::type_test);
|
2016-05-10 18:07:21 +00:00
|
|
|
|
|
|
|
Module *M = CI->getParent()->getParent()->getParent();
|
|
|
|
|
IR: New representation for CFI and virtual call optimization pass metadata.
The bitset metadata currently used in LLVM has a few problems:
1. It has the wrong name. The name "bitset" refers to an implementation
detail of one use of the metadata (i.e. its original use case, CFI).
This makes it harder to understand, as the name makes no sense in the
context of virtual call optimization.
2. It is represented using a global named metadata node, rather than
being directly associated with a global. This makes it harder to
manipulate the metadata when rebuilding global variables, summarise it
as part of ThinLTO and drop unused metadata when associated globals are
dropped. For this reason, CFI does not currently work correctly when
both CFI and vcall opt are enabled, as vcall opt needs to rebuild vtable
globals, and fails to associate metadata with the rebuilt globals. As I
understand it, the same problem could also affect ASan, which rebuilds
globals with a red zone.
This patch solves both of those problems in the following way:
1. Rename the metadata to "type metadata". This new name reflects how
the metadata is currently being used (i.e. to represent type information
for CFI and vtable opt). The new name is reflected in the name for the
associated intrinsic (llvm.type.test) and pass (LowerTypeTests).
2. Attach metadata directly to the globals that it pertains to, rather
than using the "llvm.bitsets" global metadata node as we are doing now.
This is done using the newly introduced capability to attach
metadata to global variables (r271348 and r271358).
See also: http://lists.llvm.org/pipermail/llvm-dev/2016-June/100462.html
Differential Revision: http://reviews.llvm.org/D21053
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@273729 91177308-0d34-0410-b5e6-96231b3b80d8
2016-06-24 21:21:32 +00:00
|
|
|
// Find llvm.assume intrinsics for this llvm.type.test call.
|
2016-05-10 18:07:21 +00:00
|
|
|
for (const Use &CIU : CI->uses()) {
|
|
|
|
auto AssumeCI = dyn_cast<CallInst>(CIU.getUser());
|
|
|
|
if (AssumeCI) {
|
|
|
|
Function *F = AssumeCI->getCalledFunction();
|
|
|
|
if (F && F->getIntrinsicID() == Intrinsic::assume)
|
|
|
|
Assumes.push_back(AssumeCI);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
// If we found any, search for virtual calls based on %p and add them to
|
|
|
|
// DevirtCalls.
|
|
|
|
if (!Assumes.empty())
|
|
|
|
findLoadCallsAtConstantOffset(M, DevirtCalls,
|
|
|
|
CI->getArgOperand(0)->stripPointerCasts(), 0);
|
|
|
|
}
|
2016-06-25 00:23:04 +00:00
|
|
|
|
|
|
|
void llvm::findDevirtualizableCallsForTypeCheckedLoad(
|
|
|
|
SmallVectorImpl<DevirtCallSite> &DevirtCalls,
|
|
|
|
SmallVectorImpl<Instruction *> &LoadedPtrs,
|
|
|
|
SmallVectorImpl<Instruction *> &Preds, bool &HasNonCallUses, CallInst *CI) {
|
|
|
|
assert(CI->getCalledFunction()->getIntrinsicID() ==
|
|
|
|
Intrinsic::type_checked_load);
|
|
|
|
|
|
|
|
auto *Offset = dyn_cast<ConstantInt>(CI->getArgOperand(1));
|
|
|
|
if (!Offset) {
|
|
|
|
HasNonCallUses = true;
|
|
|
|
return;
|
|
|
|
}
|
|
|
|
|
|
|
|
for (Use &U : CI->uses()) {
|
|
|
|
auto CIU = U.getUser();
|
|
|
|
if (auto EVI = dyn_cast<ExtractValueInst>(CIU)) {
|
|
|
|
if (EVI->getNumIndices() == 1 && EVI->getIndices()[0] == 0) {
|
|
|
|
LoadedPtrs.push_back(EVI);
|
|
|
|
continue;
|
|
|
|
}
|
|
|
|
if (EVI->getNumIndices() == 1 && EVI->getIndices()[0] == 1) {
|
|
|
|
Preds.push_back(EVI);
|
|
|
|
continue;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
HasNonCallUses = true;
|
|
|
|
}
|
|
|
|
|
|
|
|
for (Value *LoadedPtr : LoadedPtrs)
|
|
|
|
findCallsAtConstantOffset(DevirtCalls, &HasNonCallUses, LoadedPtr,
|
|
|
|
Offset->getZExtValue());
|
|
|
|
}
|