Mirror of https://github.com/capstone-engine/llvm-capstone.git, synced 2024-11-23 22:00:10 +00:00.
Latest commit a66f7769a3:

Add applyStaticChunkedWorkshareLoop method implementing the static schedule when a chunk-size is specified. Unlike a static schedule without chunk-size (where the chunk-size is chosen by the runtime such that each thread receives one chunk), we need two nested loops: one for looping over the iterations of a chunk, and a second for looping over all chunks assigned to the thread.

This patch includes the following related changes:
* Adapt applyWorkshareLoop to triage between the schedule types, now possible since all schedules have been implemented. The default schedule is assumed to be non-chunked static, as without OpenMPIRBuilder.
* Remove the chunk parameter from applyStaticWorkshareLoop; it is ignored by the runtime. Change the chunk value passed to the init function to 0, as without OpenMPIRBuilder.
* Refactor CanonicalLoopInfo::setTripCount and CanonicalLoopInfo::mapIndVar, as both are used by applyStaticWorkshareLoop and applyStaticChunkedWorkshareLoop.
* Enable Clang to use the OpenMPIRBuilder in the presence of the schedule clause.

Differential Revision: https://reviews.llvm.org/D114413
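For orientation, a chunked static schedule hands out chunks of `chunk` iterations round-robin over the team, so each thread executes every num_threads-th chunk; that is why two nested loops are needed. Below is a minimal C sketch of the per-thread iteration structure under that reading; the function and parameter names are placeholders, and this is not the literal code the OpenMPIRBuilder emits.

```c
#include <stdio.h>

/* Placeholder loop body; stands in for the body of the worksharing loop. */
static void body(long iv) { printf("%ld\n", iv); }

/* Per-thread iteration structure of schedule(static, chunk): the thread with
 * id `thread_id` executes every num_threads-th chunk of `chunk` iterations. */
static void chunked_static_thread(long trip_count, long chunk,
                                  long num_threads, long thread_id) {
  long stride = chunk * num_threads;       /* distance between this thread's chunks */
  /* Outer loop: over all chunks assigned to this thread. */
  for (long chunk_start = thread_id * chunk;
       chunk_start < trip_count; chunk_start += stride) {
    long chunk_end = chunk_start + chunk;
    if (chunk_end > trip_count)            /* the last chunk may be partial */
      chunk_end = trip_count;
    /* Inner loop: over the iterations of one chunk. */
    for (long iv = chunk_start; iv < chunk_end; ++iv)
      body(iv);
  }
}

int main(void) {
  /* Example: 10 iterations, chunk size 3, simulated team of 2 threads. */
  for (long tid = 0; tid < 2; ++tid)
    chunked_static_thread(10, 3, 2, tid);
  return 0;
}
```

With the default non-chunked static schedule, by contrast, the runtime picks the chunk size so that each thread receives a single contiguous range, so only one loop is needed.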
Directory contents:
ABIInfo.h
Address.h
BackendUtil.cpp
CGAtomic.cpp
CGBlocks.cpp
CGBlocks.h
CGBuilder.h
CGBuiltin.cpp
CGCall.cpp
CGCall.h
CGClass.cpp
CGCleanup.cpp
CGCleanup.h
CGCoroutine.cpp
CGCUDANV.cpp
CGCUDARuntime.cpp
CGCUDARuntime.h
CGCXX.cpp
CGCXXABI.cpp
CGCXXABI.h
CGDebugInfo.cpp
CGDebugInfo.h
CGDecl.cpp
CGDeclCXX.cpp
CGException.cpp
CGExpr.cpp
CGExprAgg.cpp
CGExprComplex.cpp
CGExprConstant.cpp
CGExprCXX.cpp
CGExprScalar.cpp
CGGPUBuiltin.cpp
CGLoopInfo.cpp
CGLoopInfo.h
CGNonTrivialStruct.cpp
CGObjC.cpp
CGObjCGNU.cpp
CGObjCMac.cpp
CGObjCRuntime.cpp
CGObjCRuntime.h
CGOpenCLRuntime.cpp
CGOpenCLRuntime.h
CGOpenMPRuntime.cpp
CGOpenMPRuntime.h
CGOpenMPRuntimeGPU.cpp
CGOpenMPRuntimeGPU.h
CGRecordLayout.h
CGRecordLayoutBuilder.cpp
CGStmt.cpp
CGStmtOpenMP.cpp
CGValue.h
CGVTables.cpp
CGVTables.h
CGVTT.cpp
CMakeLists.txt
CodeGenABITypes.cpp
CodeGenAction.cpp
CodeGenFunction.cpp
CodeGenFunction.h
CodeGenModule.cpp
CodeGenModule.h
CodeGenPGO.cpp
CodeGenPGO.h
CodeGenTBAA.cpp
CodeGenTBAA.h
CodeGenTypeCache.h
CodeGenTypes.cpp
CodeGenTypes.h
ConstantEmitter.h
ConstantInitBuilder.cpp
CoverageMappingGen.cpp
CoverageMappingGen.h
EHScopeStack.h
ItaniumCXXABI.cpp
MacroPPCallbacks.cpp
MacroPPCallbacks.h
MicrosoftCXXABI.cpp
ModuleBuilder.cpp
ObjectFilePCHContainerOperations.cpp
PatternInit.cpp
PatternInit.h
README.txt
SanitizerMetadata.cpp
SanitizerMetadata.h
SwiftCallingConv.cpp
TargetInfo.cpp
TargetInfo.h
VarBypassDetector.cpp
VarBypassDetector.h
README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of

  short x;  // or char, etc
  (x == 10)

generates a zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfield accesses can be shifted to simplify masking and sign extension. For example, if the bitfield width is 8 and it is appropriately aligned, then it is a lot shorter to just load the char directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of allocas for formal arguments for the common situation where the argument is never written to or has its address taken. The idea would be to begin generating code by using the argument directly, and if its address is taken or it is stored to, then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block-local variables as long as the declaration dominates all statements in the block.

NOTE: The main case we care about here is -O0 -g compile-time performance, and in that scenario we currently need to emit the alloca anyway to emit proper debug info. So this is blocked on being able to emit debug information which refers to an LLVM temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try to avoid generating basic blocks which only contain jumps. At -O0, this penalizes us all the way from IRgen (malloc & instruction overhead) down through code generation and assembly time. On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just direct branches!

//===---------------------------------------------------------------------===//
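To make the first note above concrete, here is a minimal C example; the LLVM IR shown in the comments illustrates the extra widening and is not an exact compiler dump.

```c
/* Comparing a short (or char) against a constant goes through the C integer
 * promotions, so straightforward IRgen widens x before comparing, roughly:
 *
 *   %conv = sext i16 %x to i32
 *   %cmp  = icmp eq i32 %conv, 10
 *
 * The same result could be computed without the widening:
 *
 *   %cmp = icmp eq i16 %x, 10
 */
int is_ten(short x) {
  return x == 10;
}
```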