llvm-capstone/clang/lib/CodeGen
Michael Kruse a66f7769a3 [OpenMPIRBuilder] Implement static-chunked workshare-loop schedules.
Add applyStaticChunkedWorkshareLoop method implementing static schedule when chunk-size is specified. Unlike a static schedule without chunk-size (where chunk-size is chosen by the runtime such that each thread receives one chunk), we need two nested loops: one for looping over the iterations of a chunk, and a second for looping over all chunks assigned to the threads.

This patch includes the following related changes:
 * Adapt applyWorkshareLoop to triage between the schedule types, now possible since all schedules have been implemented. The default schedule is assumed to be non-chunked static, as without OpenMPIRBuilder.
 * Remove the chunk parameter from applyStaticWorkshareLoop, it is ignored by the runtime. Change the value for the value passed to the init function to 0, as without OpenMPIRBuilder.
 * Refactor CanonicalLoopInfo::setTripCount and CanonicalLoopInfo::mapIndVar as used by both, applyStaticWorkshareLoop and applyStaticChunkedWorkshareLoop.
 * Enable Clang to use the OpenMPIRBuilder in the presence of the schedule clause.

Differential Revision: https://reviews.llvm.org/D114413
2022-02-28 18:18:33 -06:00
..
ABIInfo.h Introduce _BitInt, deprecate _ExtInt 2021-12-06 12:52:01 -05:00
Address.h [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
BackendUtil.cpp [OpenMP] Make section variable external to prevent collisions 2022-02-24 10:57:09 -05:00
CGAtomic.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGBlocks.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGBlocks.h [CodeGen] Treat ObjC __unsafe_unretained and class types as trivial 2022-01-11 11:18:24 -08:00
CGBuilder.h [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGBuiltin.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGCall.cpp [Clang] noinline call site attribute 2022-02-28 21:21:17 +01:00
CGCall.h [clang] Remove some getPointerElementType() uses 2022-01-25 12:09:06 +01:00
CGClass.cpp [clang] Remove Address::deprecated() from CGClass.cpp 2022-02-23 13:31:56 -08:00
CGCleanup.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGCleanup.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
CGCoroutine.cpp [C++20] [Coroutines] Implement return value optimization for get_return_object 2022-02-16 13:38:00 +08:00
CGCUDANV.cpp [HIP] Support -fgpu-default-stream 2022-02-23 22:28:29 -05:00
CGCUDARuntime.cpp
CGCUDARuntime.h
CGCXX.cpp
CGCXXABI.cpp [clang] CGCXXABI::EmitLoadOfMemberFunctionPointer - use castAs<> instead of getAs<> to avoid dereference of nullptr 2022-02-17 13:18:23 +00:00
CGCXXABI.h [clang] Remove Address::deprecated() in CGCXXABI.h 2022-02-17 14:23:02 -08:00
CGDebugInfo.cpp Revert "Don't append the working directory to absolute paths" 2022-02-25 17:00:10 -08:00
CGDebugInfo.h [DEBUGINFO] [LLDB] Add support for generating debug-info for structured bindings of structs and arrays 2022-02-17 11:14:14 -08:00
CGDecl.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGDeclCXX.cpp [CodeGen] Make element type in emitArrayDestroy() predictable 2022-01-11 09:25:29 +01:00
CGException.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGExpr.cpp [clang] Remove Address::deprecated() from CGClass.cpp 2022-02-23 13:31:56 -08:00
CGExprAgg.cpp [NFC][Clang][OpaquePtr] Move away from deprecated Address constructor in VisitArrayInitLoopExpr 2022-01-27 14:44:53 -08:00
CGExprComplex.cpp
CGExprConstant.cpp Remove reference type when checking const structs 2022-01-28 13:08:58 -08:00
CGExprCXX.cpp [clang] Remove Address::deprecated() call in CGExprCXX.cpp 2022-02-17 13:58:26 -08:00
CGExprScalar.cpp [clang] Remove Address::deprecated() from CGClass.cpp 2022-02-23 13:31:56 -08:00
CGGPUBuiltin.cpp [OpenMP] Lower printf to __llvm_omp_vprintf 2021-11-10 15:30:56 +00:00
CGLoopInfo.cpp
CGLoopInfo.h
CGNonTrivialStruct.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGObjC.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGObjCGNU.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGObjCMac.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGObjCRuntime.cpp [CodeGen] Rename deprecated Address constructor 2022-02-17 11:26:42 +01:00
CGObjCRuntime.h Fix a variety of bugs with nil-receiver checks when targeting 2021-10-08 05:44:06 -04:00
CGOpenCLRuntime.cpp [clang] Cache OpenCL types 2022-02-07 09:23:04 -08:00
CGOpenCLRuntime.h [clang] Cache OpenCL types 2022-02-07 09:23:04 -08:00
CGOpenMPRuntime.cpp [OPENMP]Fix PR50347: Mapping of global scope deep object fails. 2022-02-25 10:54:24 -08:00
CGOpenMPRuntime.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
CGOpenMPRuntimeGPU.cpp [OpenMP] Add flag for disabling thread state in runtime 2022-02-18 08:35:05 -05:00
CGOpenMPRuntimeGPU.h [OpenMP][FIX] Pass the num_threads value directly to parallel_51 2021-12-09 16:30:29 -05:00
CGRecordLayout.h [clang] Remove redundant member initialization (NFC) 2022-01-02 10:20:23 -08:00
CGRecordLayoutBuilder.cpp [clang] Fix bugprone argument comments (NFC) 2022-01-09 00:19:49 -08:00
CGStmt.cpp [Clang] noinline call site attribute 2022-02-28 21:21:17 +01:00
CGStmtOpenMP.cpp [OpenMPIRBuilder] Implement static-chunked workshare-loop schedules. 2022-02-28 18:18:33 -06:00
CGValue.h [CodeGen] Store element type in RValue 2021-12-17 09:05:59 +01:00
CGVTables.cpp [clang] Remove Address::deprecated() calls in CGVTables.cpp 2022-02-22 16:54:28 -08:00
CGVTables.h
CGVTT.cpp
CMakeLists.txt [clang][openmp][NFC] Remove arch-specific CGOpenMPRuntimeGPU files 2021-11-09 15:11:05 -05:00
CodeGenABITypes.cpp
CodeGenAction.cpp [OpenMP] Add a flag for embedding a file into the module 2022-01-31 15:56:00 -05:00
CodeGenFunction.cpp [clang] Remove Address::deprecated() calls in CodeGenFunction.cpp 2022-02-22 18:28:49 -08:00
CodeGenFunction.h [Clang] noinline call site attribute 2022-02-28 21:21:17 +01:00
CodeGenModule.cpp [clang][CodeGen] Avoid emitting ifuncs with undefined resolvers 2022-02-26 11:17:49 +02:00
CodeGenModule.h [clang][CodeGen][NFC] Remove unused CodeGenModule fields 2022-01-31 23:45:53 +02:00
CodeGenPGO.cpp [clang] Remove redundant member initialization (NFC) 2022-01-02 10:20:23 -08:00
CodeGenPGO.h
CodeGenTBAA.cpp Introduce _BitInt, deprecate _ExtInt 2021-12-06 12:52:01 -05:00
CodeGenTBAA.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
CodeGenTypeCache.h Fix __attribute__((annotate("")) with non-zero globals AS 2021-08-26 10:09:40 +01:00
CodeGenTypes.cpp [clang] Properly cache member pointer LLVM types 2022-02-08 13:22:24 -08:00
CodeGenTypes.h [clang] Properly cache member pointer LLVM types 2022-02-08 13:22:24 -08:00
ConstantEmitter.h
ConstantInitBuilder.cpp
CoverageMappingGen.cpp [clang] Use isa instead of dyn_cast (NFC) 2021-11-14 09:32:40 -08:00
CoverageMappingGen.h
EHScopeStack.h
ItaniumCXXABI.cpp [clang] Remove getPointerElementType() in EmitVTableTypeCheckedLoad() 2022-02-23 09:38:33 -08:00
MacroPPCallbacks.cpp
MacroPPCallbacks.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
MicrosoftCXXABI.cpp [clang] Remove getPointerElementType() in EmitVTableTypeCheckedLoad() 2022-02-23 09:38:33 -08:00
ModuleBuilder.cpp [clang][macho] add clang frontend support for emitting macho files with two build version load commands 2022-02-02 08:30:39 -08:00
ObjectFilePCHContainerOperations.cpp Emit the C++ dialect in -gmodules .pcm files. 2022-01-10 16:13:40 -08:00
PatternInit.cpp
PatternInit.h
README.txt
SanitizerMetadata.cpp [asan] Add support for disable_sanitizer_instrumentation attribute 2022-02-18 09:51:54 +01:00
SanitizerMetadata.h
SwiftCallingConv.cpp
TargetInfo.cpp [CUDA][SPIRV] Assign global address space to CUDA kernel arguments 2022-02-24 20:51:43 -08:00
TargetInfo.h [clang] Remove unused forward declarations (NFC) 2022-01-08 11:56:40 -08:00
VarBypassDetector.cpp
VarBypassDetector.h Use {DenseSet,SmallPtrSet}::contains (NFC) 2021-10-29 20:26:07 -07:00

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//