llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-11-23 13:50:11 +00:00

Author	SHA1	Message	Date
John Ericson	4a678f8072	[cmake] Use `GNUInstallDirs` to support custom installation dirs. This is the original patch in my GNUInstallDirs series, now last to merge as the final piece! It arose as a new draft of D28234. I initially did the unorthodox thing of pushing to that when I wasn't the original author, but since I ended up - Using `GNUInstallDirs`, rather than mimicking it, as the original author was hesitant to do but others requested. - Converting all the packages, not just LLVM, effecting many more projects than LLVM itself. I figured it was time to make a new revision. I have used this patch series (and many back-ports) as the basis of https://github.com/NixOS/nixpkgs/pull/111487 for my distro (NixOS), which was merged last spring (2021). It looked like people were generally on board in D28234, but I make note of this here in case extra motivation is useful. --- As pointed out in the original issue, a central tension is that LLVM already has some partial support for these sorts of things. Variables like `COMPILER_RT_INSTALL_PATH` have already been dealt with. Variables like `LLVM_LIBDIR_SUFFIX` however, will require further work, so that we may use `CMAKE_INSTALL_LIBDIR`. These remaining items will be addressed in further patches. What is here is now rote and so we should get it out of the way before dealing more intricately with the remainder. Reviewed By: #libunwind, #libc, #libc_abi, compnerd Differential Revision: https://reviews.llvm.org/D99484	2022-01-16 05:33:07 +00:00
John Ericson	6e52bfe09d	Revert "[cmake] Use `GNUInstallDirs` to support custom installation dirs." Sorry for the disruption, I will try again later. This reverts commit `efeb501970`.	2022-01-15 07:35:02 +00:00
John Ericson	efeb501970	[cmake] Use `GNUInstallDirs` to support custom installation dirs. This is the original patch in my GNUInstallDirs series, now last to merge as the final piece! It arose as a new draft of D28234. I initially did the unorthodox thing of pushing to that when I wasn't the original author, but since I ended up - Using `GNUInstallDirs`, rather than mimicking it, as the original author was hesitant to do but others requested. - Converting all the packages, not just LLVM, effecting many more projects than LLVM itself. I figured it was time to make a new revision. I have used this patch series (and many back-ports) as the basis of https://github.com/NixOS/nixpkgs/pull/111487 for my distro (NixOS), which was merged last spring (2021). It looked like people were generally on board in D28234, but I make note of this here in case extra motivation is useful. --- As pointed out in the original issue, a central tension is that LLVM already has some partial support for these sorts of things. Variables like `COMPILER_RT_INSTALL_PATH` have already been dealt with. Variables like `LLVM_LIBDIR_SUFFIX` however, will require further work, so that we may use `CMAKE_INSTALL_LIBDIR`. These remaining items will be addressed in further patches. What is here is now rote and so we should get it out of the way before dealing more intricately with the remainder. Reviewed By: #libunwind, #libc, #libc_abi, compnerd Differential Revision: https://reviews.llvm.org/D99484	2022-01-15 01:08:35 +00:00
Roman Lebedev	82fb4f4b22	[SCEV] Sequential/in-order `UMin` expression As discussed in https://github.com/llvm/llvm-project/issues/53020 / https://reviews.llvm.org/D116692, SCEV is forbidden from reasoning about 'backedge taken count' if the branch condition is a poison-safe logical operation, which is conservatively correct, but is severely limiting. Instead, we should have a way to express those poison blocking properties in SCEV expressions. The proposed semantics is: ``` Sequential/in-order min/max SCEV expressions are non-commutative variants of commutative min/max SCEV expressions. If none of their operands are poison, then they are functionally equivalent, otherwise, if the operand that represents the saturation point* of given expression, comes before the first poison operand, then the whole expression is not poison, but is said saturation point. ``` * saturation point - the maximal/minimal possible integer value for the given type The lowering is straight-forward: ``` compare each operand to the saturation point, perform sequential in-order logical-or (poison-safe!) ordered reduction over those checks, and if reduction returned true then return saturation point else return the naive min/max reduction over the operands ``` https://alive2.llvm.org/ce/z/Q7jxvH (2 ops) https://alive2.llvm.org/ce/z/QCRrhk (3 ops) Note that we don't need to check the last operand: https://alive2.llvm.org/ce/z/abvHQS Note that this is not commutative: https://alive2.llvm.org/ce/z/FK9e97 That allows us to handle the patterns in question. Reviewed By: nikic, reames Differential Revision: https://reviews.llvm.org/D116766	2022-01-10 20:51:26 +03:00
Kazu Hirata	8afcfbfb8f	Use true/false instead of 1/0 (NFC) Identified by modernize-use-bool-literals.	2022-01-09 12:21:06 -08:00
Kazu Hirata	fb7cf90071	Use nullptr instead of 0 or NULL (NFC) Identified with modernize-use-nullptr.	2022-01-07 10:17:29 -08:00
Kazu Hirata	42a4f5103b	[Transform] Remove redundant declaration PollyAllowFullFunction (NFC) The variable is declared in ScopDetection.h, which ScopInliner.cpp includes. Identified by readability-redundant-declaration.	2022-01-02 23:08:40 -08:00
Kazu Hirata	e7774f499b	Use static_assert instead of assert (NFC) Identified with misc-static-assert.	2021-12-26 14:26:44 -08:00
Kazu Hirata	2d303e6781	Remove redundant return and continue statements (NFC) Identified with readability-redundant-control-flow.	2021-12-24 23:17:54 -08:00
Kazu Hirata	76f0f1cc5c	Use {DenseSet,SetVector,SmallPtrSet}::contains (NFC)	2021-12-24 21:43:06 -08:00
Kazu Hirata	a8cbddc994	[CodeGen] Fix a memory leak	2021-12-24 19:51:10 -08:00
Michael Kruse	937b00ab2c	[Polly][SchedOpt] Account for prevectorization of multiple statements. A prevectorized loop may contain multiple statements, in which case isl_schedule_node_band_sink will sink the vector band to multiple leaves. Instead of statically assuming a specific tree structure after sinking, add a SIMD marker to all inner bands. Fixes llvm.org/PR52637	2021-12-23 14:06:41 -06:00
Fangrui Song	0b39ec835d	[Polly][Isl] Fix -Wsign-compare after D113101	2021-11-11 00:17:52 -08:00
Arthur Eubanks	7f62759697	[polly] Properly create and initialize new PM analysis managers If we don't properly initialize all the analysis managers, we may be missing analyses that other analyses depend on. Fixes broken polly test, e.g. https://lab.llvm.org/buildbot/#/builders/10/builds/7501.	2021-11-05 09:32:54 -07:00
Riccardo Mori	44596fe6a9	[Polly][Isl] Use the function unsignedFromIslSize to manage a isl::size object. NFCI This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in lib/External/isl/include/isl/isl-noxceptions.h and the official isl C++ interface. In the official interface the type `isl::size` cannot be casted to an unsigned without previously having checked if it contains a valid value with the function `isl::size::is_error()`. For this reason two helping functions have been added: - `IslAssert`: assert that no errors are present in debug builds and just disables the mandatory error check in non-debug builds - `unisgnedFromIslSIze`: cast the `isl::size` object to `unsigned` Changes made: - Add the functions `IslAssert` and `unsignedFromIslSize` - Add the utility function `rangeIslSize()` - Retype `MaxDisjunctsInDomain` from `int` to `unsigned` - Retype `RunTimeChecksMaxAccessDisjuncts` from `int` to `unsigned` - Retype `MaxDimensionsInAccessRange` from `int` to `unsigned` - Replaced some usages of `isl_size` to `unsigned` since we aim not to use `isl_size` anymore - `isl-noexceptions.h` has been generated by `e704f73c88` No functional change intended. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113101	2021-11-05 11:15:22 +01:00
Kazu Hirata	8568ca789e	Use llvm::erase_if (NFC)	2021-10-18 09:33:42 -07:00
Max Fan	5bf32a0e48	[Polly] Remove checkIslAstExprInt and use RAII instead of manually freeing Expr. NFC. Polly is trying to move towards using isl::ast_expr / isl-noexceptions.h (which implements RAII) where possible instead of manually managing memory. checkIslAstExprInt manually frees Expr, so it has been removed to be more idiomatic and consistent. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D111769	2021-10-15 09:17:00 -05:00
Michael Kruse	19db33c06e	[Polly] Remove support for code generated by gfortran+DragonEgg. DragonEgg is not maintained anymore, hence there is no need for this functionality. Fixes llvm.org/PR52173	2021-10-14 14:12:06 -05:00
Michael Kruse	fa6c5107c3	[Polly] Fix unused variable in non-assert builds. NFC.	2021-10-13 00:20:22 -05:00
Michael Kruse	ec2029f986	[Polly] Do not inline dumpIslObj methods. NFC. Instead of being inline and having a neverCalled() workaround to make it work in the debugger, define it as a regular exported function. Also add overloads for the C API types isl_* so it works with managed as well as unmanaged ISL objects.	2021-10-12 23:52:36 -05:00
Michael Kruse	64489255be	[Polly] Add greedy fusion algorithm. When the option -polly-loopfusion-greedy is set, the ScheduleOptimizer tries to aggressively fuse any band it can and does not violate any dependences. As part if the implementation, the functionalty for copying a band into an new schedule was extracted out of the ScheduleTreeRewriter.	2021-10-08 20:33:30 -05:00
Reid Kleckner	89b57061f7	Move TargetRegistry.(h\|cpp) from Support to MC This moves the registry higher in the LLVM library dependency stack. Every client of the target registry needs to link against MC anyway to actually use the target, so we might as well move this out of Support. This allows us to ensure that Support doesn't have includes from MC/*. Differential Revision: https://reviews.llvm.org/D111454	2021-10-08 14:51:48 -07:00
Reid Kleckner	46ef2e0bf9	Update polly for removal of CallInst::arg_operands/getNumArgOperands Fixes polly build	2021-10-08 10:46:05 -07:00
Michael Kruse	cb879d00d8	[Polly] Completely remove -polly-opt-fusion. This was missing from `07e7cb9433`. The switch did nothing since then.	2021-10-08 02:10:34 -05:00
Simon Pilgrim	f1be391bed	[polly] Replace report_fatal_error(std::string) uses with report_fatal_error(Twine) As described on D111049, we're trying to remove the <string> dependency from error handling and replace uses of report_fatal_error(const std::string&) with the Twine() variant which can be forward declared.	2021-10-06 13:32:57 +01:00
Christopher Tetreault	67acc772d0	[NFC] Fix build failure in ScopDetection In some build environments, the C++ compiler is unable to infer the correct type for the DenseMap::insert in isErrorBlock. Typing out std::make_pair helps.	2021-10-04 09:19:27 -07:00
Roman Gareev	113fa82c3c	[Polly] Check the properties of accesses to operands of a matrix-matrix multiplication The following code modifies elements of the array D. for (i = 0; i < _PB_NI; i++) for (j = 0; j < _PB_NJ; j++) { for (k = 0; k < _PB_NK; k++) { double Mul = A[i][k] * B[k][j]; D[i][j][k] += Mul; C[i][j] += Mul; } } Nevertheless, the code is recognised as a matrix-matrix multiplication, since the second and third dimensions of D are accessed with non-zero strides. This fixes the typo, which was made during the translation to C++ bindings (https://reviews.llvm.org/D35845). Reviewed By: Michael Kruse <llvm@meinersbur.de> Differential Revision: https://reviews.llvm.org/D110491	2021-09-28 22:58:57 +05:00
Michael Kruse	027c036663	[Polly] Reject regions entered by an indirectbr/callbr. SplitBlockPredecessors is unable to insert an additional BasicBlock between an indirectbr/callbr terminator and the successor blocks. This is needed by Polly to normalize the control flow before emitting its optimzed code. This patches rejects regions entered by an indirectbr/callbr to not fail later at code generation. This fixes llvm.org/PR51964 Recommit with "REQUIRES: asserts" in test that uses statistics.	2021-09-27 18:49:11 -05:00
Haowei Wu	283ed7de32	Revert "[Polly] Reject reject regions entered by an indirectbr/callbr." This reverts commit `91f46bb77e` which causes test failures when assertions are off.	2021-09-27 16:05:33 -07:00
Michael Kruse	91f46bb77e	[Polly] Reject reject regions entered by an indirectbr/callbr. SplitBlockPredecessors is unable to insert an additional BasicBlock between an indirectbr/callbr terminator and the successor blocks. This is needed by Polly to normalize the control flow before emitting its optimzed code. This patches rejects regions entered by an indirectbr/callbr to not fail later at code generation. This fixes llvm.org/PR51964	2021-09-26 21:21:50 -05:00
Michael Kruse	9820dd970c	[Polly] Support for InlineAsm. Inline assembly was not handled at all and treated like a llvm::Value. In particular, it tried to create a pointer it which is not allowed. Fix by handling like a llvm::Constant such that it is just reused when required, instead of trying to marshall it in memory. Fixes llvm.org/PR51960	2021-09-26 03:26:43 -05:00
Michael Kruse	d5c87162db	[Polly] Use VirtualUse to determine references. VirtualUse ensures consistency over different source of values with Polly. In particular, this enables its use of instructions moved between Statement. Before the patch, the code wrongly assumed that the BB's instructions are also the ScopStmt's instructions. Reference are determined for OpenMP outlining and GPGPU kernel extraction. GPGPU CodeGen had some problems. For one, it generated GPU kernel parameters for constants. Second, it emitted GPU-side invariant loads which have already been loaded by the host. This has been partially fixed, it still generates a store for the invariant load result, but using the value that the host has already written. WARNING: I did not test the generated PollyACC code on an actual GPU. The improved consistency will be made use of in the next patch.	2021-09-26 03:26:43 -05:00
Michael Kruse	1cea25eec9	[Polly] Remove isConstCall. The function was intended to catch OpenMP functions such as get_thread_id(). If matched, the call would be considered synthesizable. There were a few problems with this: * get_thread_id() is not 'const' in the sense of have the gcc manual defines it: "do not examine any values except their arguments". get_thread_id() reads OpenCL runtime libreary global state. What was inteded was probably 'speculable'. * isConstCall was implemented using mayReadOrWriteMemory(). 'const' is stricter than that, mayReadOrWriteMemory is e.g. true for malloc(), since it may only read/write addresses that are considered inaccessible fro the application. However, malloc is certainly not speculable. * Values that are isConstCall were not handled consistently throughout Polly. In particular, it was not considered for referenced values (OpenMP outlining and PollyACC). Fix by removing special handling for isConstCall entirely.	2021-09-26 03:26:43 -05:00
Michael Kruse	e470f9268a	[Polly] Implement user-directed loop distribution/fission. This is a simple version without the possibility to define distribute points or followup-transformations. However, it is the first transformation that has to check whether the transformation is correct. It interprets the same metadata as the LoopDistribute pass. Re-apply after revert in `c7bcd72a38` with fix: Take isBand out of #ifndef NDEBUG since it now is used unconditionally.	2021-09-23 21:11:01 -05:00
Petr Hosek	c7bcd72a38	Revert "[Polly] Implement user-directed loop distribution/fission." This reverts commit `52c30adc7d` which breaks the build when NDEBUG is defined.	2021-09-23 14:04:25 -07:00
Michael Kruse	07e7cb9433	[Polly] Remove -polly-opt-fusion option. The name of the option is misleading and has been renamed by isl to "serialize-sccs". Instead of also renaming the option, remove it. The option is still accessible using -polly-isl-arg=--no-schedule-serialize-sccs	2021-09-23 15:43:08 -05:00
Michael Kruse	52c30adc7d	[Polly] Implement user-directed loop distribution/fission. This is a simple version without the possibility to define distribute points or followup-transformations. However, it is the first transformation that has to check whether the transformation is correct. It interprets the same metadata as the LoopDistribute pass.	2021-09-22 17:28:25 -05:00
Michael Kruse	ced20c6672	[Polly] Add -polly-reschedule and -polly-postopts options. This command line options allow to off parts of the schedule tree optimization pipeline.	2021-09-22 00:18:19 -05:00
Michael Kruse	cad9f98a2a	[Polly] Don't generate inter-iteration noalias metadata. This metadata was intended to mark all accesses within an iteration to be pairwise non-aliasing, in this case because every memory of a base pointer is touched (read or write) at most once. This is typical for 'sweeps' over all data. The stated motivation from D30606 is to ensure that unrolled iterations are considered non-aliasing. Rhe implemention had multiple issues: * The structure of the noalias metadata was malformed. D110026 added check in the verifier for this metadata, and the tests were failing since then. * This is not true for the outer loops of the BLIS matrix multiplication, where it was being inserted. Each element of A, B, C is accessed multiple times, as often as the loop not used as an index is iterating. * Scopes were added to SecondLevelOtherAliasScopeList (used for the !noalias scop list) on-the-fly when another SCEV was seen. This meant that previously visited instructions would not be updated with alias scopes that are only seen later, missing out those SCEVs they should not be aliasing with. * Since the !noalias scope list would ideally consists of all other SCEV for this base pointer, we might run quickly into scalability issues. Especially after unrolling there would probably at least once SCEV per instruction and unroll instance. * The inter-iteration noalias base pointer was not removed after leaving the loop marked with it, effectively marking everything after it to noalias as well. A solution I considered was to mark each instruction as non-aliasing with its own scope. The instruction itself would obviously alias itself, but such construction might also be considered invalid. Duplicating the instruction (e.g. due to speculation) would mark the instruction non-aliasing with its clone. I don't want to go into this territory, especially since the original motivation of determining unrolled instances as noalias based on SCEV is the what scev-aa does as well. This effectively reverts D30606 and D35761.	2021-09-20 22:20:17 -05:00
Nikita Popov	53720f74e4	[Polly] Partially fix scoped alias metadata This partially addresses the verifier failures caused by D110026. In particular, it does not fix the "second level" alias metadata.	2021-09-20 22:51:31 +02:00
Nikita Popov	0fc624f029	[IR] Return AAMDNodes from Instruction::getMetadata() (NFC) getMetadata() currently uses a weird API where it populates a structure passed to it, and optionally merges into it. Instead, we can return the AAMDNodes and provide a separate merge() API. This makes usages more compact. Differential Revision: https://reviews.llvm.org/D109852	2021-09-16 21:06:57 +02:00
Michael Kruse	658eb9e142	[Polly] Remove autotools build systems from Externals. NFC. Building a source distribution using autotools adds GPL-licenced files into the the sources. Although redistribution of theses files is explicitly allowed with an exception, these are not used by Polly which uses a CMake replacement. Use the direct source checkout instead (replacing the output of 'make dist'). Some m4 scripts with the same licence are also included in isl/ppcg repository. Removing them renders the autotools-based build scipts inoperable, so remove the autotools build system altogether.	2021-09-15 17:11:15 -05:00
Leonard Chan	9da62d3ed9	[polly] Fix "no member named 'getIndexExpressionsFromGEP'" As of 741fabc222f226d34d806056b804244b012853b, polly builders are failing from this error. The signiature is slightly different and accepts a ScalarEvolution reference instead. This should fix the polly builders.	2021-09-08 20:04:56 -07:00
Michael Kruse	8ae6933881	[Polly] Compile fix after Delinearization move. by commit `585c594d74`	2021-09-08 15:30:19 -05:00
Michael Kruse	c62d9a5ca0	[Polly] Use subtyped isl::schedule_nodes for ScheduleTreeVisitor. NFC. Change pass-by-const-ref to pass-by-value as objects are recreated due to custom up-/down-casting anwyway.	2021-08-31 20:54:12 -05:00
Michael Kruse	c6913905d1	[Polly] Mention correct flag in debug output. NFCI.	2021-08-31 20:54:12 -05:00
Michael Kruse	e4f3f2c0c5	[Polly] Don't prune non-external function itself from dump.	2021-08-28 17:06:53 -05:00
Michael Kruse	955b91c19c	[Polly] Never consider non-SCoP blocks as error blocks. Code outside the SCoP will be executed recardless of the code versioning runtime check introduced by CodeGeneration. Assumption made based on that these are never executed in Polly-optimized code does not hold. This fixes the miscompilation of MultiSource/Applications/lambda-0.1.3	2021-08-23 01:04:01 -05:00
Michael Kruse	9cfab5e249	[Polly] Add support for -polly-dump-before/after with NPM. The new pass manager does not allow adding module passes at the -polly-position=before-vectorizer extension point. Introduce a DumpFunctionPass that dumps only current function. In contrast to the legacy pass manager's -polly-dump-before, each function will be dumped into its own file. -polly-dump-before-file is still not supported. The DumpFunctionPass uses llvm::CloneModule to copy the current function into a new module and then write it into a file.	2021-08-22 20:43:35 -05:00
Michael Kruse	58e4e71fc8	[Polly] Introduce caching for the isErrorBlock function. NFC. Compilation of the file insn-attrtab.c of the SPEC CPU 2017 502.gcc_r benchmark takes excessive time (> 30min) with Polly enabled. Most time is spent in the isErrorBlock function querying the DominatorTree. The isErrorBlock is invoked redundantly over the course of ScopDetection and ScopBuilder. This patch introduces a caching mechanism for its result. Instead of a free function, isErrorBlock is moved to ScopDetection where its cache map resides. This also means that many functions directly or indirectly calling isErrorBlock are not "const" anymore. The DetectionContextMap was marked as "mutable", but IMHO it never should have been since it stores the detection result. 502.gcc_r only takes excessive time with the new pass manager. The reason seeams to be that it invalidates the ScopDetection analysis more often than the legacy pass manager, for unknown reasons.	2021-08-18 14:05:50 -05:00
Michael Kruse	e8c8407aca	[Polly] Break early when the result is known. NFC.	2021-08-18 12:41:04 -05:00
Michael Kruse	0f1e67fac2	[Polly] Fix possibly infinite loop. The loop had no side-effect since first committed in `642594ae87`. While it is obvious what was intended, the code seems to never trigger.	2021-08-17 10:43:04 -05:00
Riccardo Mori	ce8272afb3	[Polly][Isl] Use isl::val::sub instead of isl::val::sub_ui. NFC This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Use `isl::val::sub` instead of `isl::val::sub_ui` - `isl-noexceptions.h` has been generated by `355e84163a` Depends on D107225 Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D107293	2021-08-17 09:34:52 +02:00
Riccardo Mori	d3fdbda6b0	[Polly][Isl] Move to the new-polly-generator branch version of isl-noexceptions.h. NFCI This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. With this commit we are moving from the `polly-generator` branch to the `new-polly-generator` branch that is more mantainable and is based on the official C++ interface `cpp-checked.h`. Changes made: - There are now many sublcasses for `isl::ast_node` representing different isl types. Use `isl::ast_node_for`, `isl::ast_node_user`, `isl::ast_node_block` and `isl::ast_node_mark` where needed. - There are now many sublcasses for `isl::schedule_node` representing different isl types. Use `isl::schedule_node_mark`, `isl::schedule_node_extension`, `isl::schedule_node_band` and `isl::schedule_node_filter` where needed. - Replace the `isl::*::dump` with `dumpIslObj` since the isl dump method is not exposed in the C++ interface. - `isl::schedule_node::get_child` has been renamed to `isl::schedule_node::child` - `isl::pw_multi_aff::get_pw_aff` has been renamed to `isl::pw_multi_aff::at` - The constructor `isl::union_map(isl::union_pw_multi_aff)` has been replaced with the static method `isl::union_map::from()` - Replace usages of `isl::val::add_ui` with `isl::val::add` - `isl::union_set_list::alloc` is now a constructor - All the `isl_size` values are now wrapped inside the class `isl::size` use `isl::size::release` to get the internal `isl_size` value where needed. - `isl-noexceptions.h` has been generated by `73f5ed1f4d` No functional change intended. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D107225	2021-08-16 15:53:26 +02:00
Michael Kruse	5eeaac22af	[Polly] Rename CodeGen -> generateCode. NFC. To conform to function naming convention: camelCase and start with a verb.	2021-08-13 12:46:07 -05:00
Michael Kruse	0232c1d10d	[Polly] Decompose object construction and detection algorithm. NFC. Avoid doing the detection work inside the constructor. In addition to polymorphism being unintuitive in constructors and other design problems such as if an exception is thrown, the ScopDetection class is usable without detection in the sense of "no Scop found" or "function skipped".	2021-08-13 12:44:37 -05:00
Michael Kruse	9069082785	[Polly] Simplify domains before gist. The compilation of the file 526.blender_r/src/blender/source/blender/editors/space_logic/logic_ops.c from the SPEC CPU 2017 benchmarks took excessive time to compute InvalidDomain.gist_params(Ctx) Simplifying beforehand, specifically using isl_set_detect_equalities, reduces the computation time to a negible level again.	2021-08-12 08:48:14 -05:00
Tarindu Jayatilaka	7a797b2902	Take OptimizationLevel class out of Pass Builder Pulled out the OptimizationLevel class from PassBuilder in order to be able to access it from within the PassManager and avoid include conflicts. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D107025	2021-07-29 21:57:23 -07:00
Riccardo Mori	ec3da1a43f	Update isl to isl-0.24-69-g54aac5ac This is needed for having the functions isl_{set,map}_n_basic_{set,map} exported to the C++ interface. Some tests have been modified to reflect the isl changes.	2021-07-27 17:38:12 +02:00
Riccardo Mori	2ae7d6d427	Revert "Update isl to isl-0.24-69-g54aac5ac" This reverts commit `13f95cc3d1`. The commit makes some polly tests failing	2021-07-23 19:07:19 +02:00
Riccardo Mori	13f95cc3d1	Update isl to isl-0.24-69-g54aac5ac This is needed for having the functions isl_{set,map}_n_basic_{set,map} exported to the C++ interface	2021-07-23 18:46:48 +02:00
Riccardo Mori	84dfbf5a77	[Polly][Isl] Stop using isl::set::lex_le_set. NFC This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Stop using `isl::set::lex_le_set`. The official way to do this is to use `isl::map::lex_le_at` - Removed `isl::set::lex_le_set` from `isl-noexceptions.h` - `isl-noexceptions.h` has been generated by this `266fea1d3d` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D106269	2021-07-19 17:23:56 +02:00
Riccardo Mori	159e072f81	[Polly] Use isl::set::tuple_dim instead of isl::set::dim. NFC This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Note that not all the usages of `isl::set::dim` were replaced	2021-07-19 12:43:23 +02:00
Riccardo Mori	d5ee355f89	[Polly][Isl] Use isl::union_map::unite() instead of isl::union_map::add_map(). NFC This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Use `isl::union_map::unite()` instead of `isl::union_map::add_map()` - `isl-noexceptions.h` has been generated by this `3f43ae29fa` Depends on D106059 Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D106061	2021-07-19 12:11:00 +02:00
Riccardo Mori	bad3ebbaae	[Polly][Isl] Stop generating isl::union_{set,map} from isl::space. NFC This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Stop generating `isl::union_set` and isl::union_map` from `isl::space` and instead generate them from `isl::ctx` - Disable clang-format on `isl-noexceptions.h` - Removed `isl::union_{set,map}` generator from `isl::space` from `isl-noexceptions.h` - `isl-noexceptions.h` has been generated by this `87c3413b6f` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D106059	2021-07-19 11:24:53 +02:00
Eli Friedman	0ce9acf61d	[polly] Fix uses of deprecated overload of IRBuilder::CreateGEP.	2021-07-18 18:43:01 -07:00
Eli Friedman	8422bfa3c3	[polly] Fix regression tests with POLLY_ENABLE_GPGPU_CODEGEN Apparently there was a latent bug here.	2021-07-18 18:43:01 -07:00
Eli Friedman	43705149ed	[polly] Get rid of a couple uses of PointerType::getElementType().	2021-07-18 13:51:38 -07:00
Nikita Popov	2c68ecccc9	[OpaquePtr] Remove uses of CreateGEP() without element type Remove uses of to-be-deprecated API. In cases where the correct element type was not immediately obvious to me, fall back to explicit getPointerElementType().	2021-07-17 22:56:27 +02:00
Nikita Popov	6225d0cc6e	[OpaquePtr] Remove uses of CreateInBoundsGEP() without element type Remove uses of to-be-deprecated API. Unfortunately this one mostly just makes the use of getPointerElementType() explicit, as the correct type to use wasn't immediately available (deriving it from QualType is left as an excercise to the reader).	2021-07-17 21:27:16 +02:00
Michael Kruse	7e29e57917	[Polly] Fix misleading debug message. NFC. The number of parameters can be the reason for aliasing checks not being generated, but most of the time it for other reasons.	2021-07-14 16:25:45 -05:00
Riccardo Mori	0813bd1696	[Polly][Isl] Use isl::::ctx instead of isl::::get_ctx. NFC This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Use `isl::::ctx()` instead of `isl::::get_ctx()` (for example `isl::space::ctx()` instead of `isl::space::get_ctx()`) - Add `isl::` namespace in front of isl types to avoid confusion (for example `isl::space::ctx` and `isl::ctx` - `isl-noexceptions.h` has been generated by this `b64e33c62d` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D105691	2021-07-09 21:14:14 +02:00
patacca	dddb225acf	[Polly][Isl] Update isl to isl-0.24-52-gd7be01f7 This is needed because the functions `isl_union_{set,map}_get_{set,map}_list` have been exposed to the C++ interface	2021-07-09 12:27:55 +02:00
patacca	b55aedd0b8	[Polly][Isl] Use isl::union_set::unite() instead of isl::union_set::add_set(). NFC This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Use `isl::union_set::unite()` instead of `isl::union_set::add_set()` - `isl-noexceptions.h` has been generated by this `390c44982b` Depends on D104994 Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D105444	2021-07-07 16:26:55 +02:00
patacca	f482497c38	[Polly][Isl] Use isl::set::tuple_dim, isl::map::domain_tuple_dim and isl::map::range_tuple_dim. NFC This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Use `isl::set::tuple_dim` instead of `isl::set::dim` and `isl::set::n_dim` - Use `isl::map::domain_tuple_dim` instead of `isl::map::dim` - Use `isl::map::range_tuple_dim` instead of `isl::map::dim` - isl-noexceptions.h has been generated by this `45576e1b42` Note that not all the usage of `isl::{set,map}::dim` where replaced Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D104994	2021-07-06 11:20:45 +02:00
patacca	3f9bf9f42a	[Polly][Isl] Update isl to isl-0.24-47-g8853f375 This is needed for the new functions exposed in the C++ interface as used in https://reviews.llvm.org/D104994 Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D105132	2021-07-04 19:50:39 +02:00
Michael Kruse	e2d4b02404	[Polly][ScopInliner] Indicate if the IR has changed. Return true to indicate that the IR has changed if the nested pass manager has changed it. Fixes the ScopInliner tests in the LLVM_ENABLE_EXPENSIVE_CHECKS=ON configuration. Thanks to Alexandre Ganea for reporting.	2021-06-24 15:44:39 -05:00
patacca	2a629efc74	[Polly][Isl] Refactoring IslAstInfo::getBuild() and IslAstInfo::IslAstUserPayload::Build to use isl++. NFC Polly uses algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: - Refactoring the method `IslAstInfo::getBuild()` - `IslAstInfo::IslAstUserPayload.Build` now uses C++ types instead of C types - Removing destructor of `IslAstInfo::IslAstUserPayload` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D104370	2021-06-18 09:53:51 +02:00
Bjorn Pettersson	6aac2773d8	[polly][GPGPU] Fixup related to overloading exponent type in llvm.powi Commit `4c7f820b2b` changed the llvm.powi intrinsic to support different 'int' sizes for the exponent. That happened to break the IntrinsicToLibdeviceFunc mapping in PPCGCodeGeneration, which obviously should have been updated as part of commit `4c7f820b2b` (https://reviews.llvm.org/D99439). The shortcoming was found by buildbots that use -DPOLLY_ENABLE_GPGPU_CODEGEN=ON This patch should fixup the problem.	2021-06-18 08:59:06 +02:00
patacca	cfe117def7	[Polly][Isl] Replacing isl method `to_str()` with `stringFromIslObj()`. NFC. This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Removing method `to_str()` from all the classes in the isl C++ bindings. - Overload method `stringFromIslObj()` so it accepts isl C++ objects. - To keep backward compatibility `stringFromIslObj()` now accepts a value that is returned if the isl C object is `null` or doesn't have a string representation (by default it's an empty string). In some cases it's better to have the string "null" instead of an empty string. - isl-noexceptions.h has been generated by this `d33ec3a3bb` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D104211	2021-06-15 14:22:10 +02:00
patacca	7c7978a122	[Polly][Isl] Removing explicit operator bool() from isl C++ bindings. NFC. This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Removing explicit operator bool() from all the classes in the isl C++ bindings. - Replace each call to operator bool() to method `is_null()`. - isl-noexceptions.h has been generated by this `27396daac5` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D103976	2021-06-11 14:44:24 +02:00
Michael Kruse	a56bd7dec8	[Polly][Matmul] Re-pack A in every iteration. Packed_A must be copied repeatedly, not just for the first iteration of the outer tile. This fixes llvm.org/PR50557	2021-06-09 15:19:52 -05:00
patacca	9b41d0958e	[Polly][Isl] Removing nullptr constructor from C++ bindings. NFC. [Polly][Isl] Removing nullptr constructor from C++ bindings. NFC. This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Removed `std::nullptr_t` constructor from all the classes in the isl C++ bindings. - `isl-noexceptions.h` has been generated by this `a7e00bea38` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D103751	2021-06-08 23:46:28 +02:00
patacca	f60ea691a9	Revert "[Polly][Isl] Removing nullptr constructor from C++ bindings. NFC." This reverts commit `be5e2fc7bf`. This introduced a building error for polly. https://lab.llvm.org/buildbot#builders/10/builds/4951	2021-06-08 17:12:10 +02:00
patacca	be5e2fc7bf	[Polly][Isl] Removing nullptr constructor from C++ bindings. NFC. [Polly][Isl] Removing nullptr constructor from C++ bindings. NFC. This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Removed `std::nullptr_t` constructor from all the classes in the isl C++ bindings. - `isl-noexceptions.h` has been generated by this `a7e00bea38` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D103751	2021-06-08 15:28:20 +02:00
Michael Kruse	d123e983b3	[Polly] Move MatMul optimization into its own file. NFC. Functions shared between generalized matrix-multiplication optimization and other post-reschedule optimizations (tiling, prevect) are moved into the schedule tree transformation utility ScheduleTreeTransform.	2021-06-04 23:22:30 -05:00
Michael Kruse	86008477a4	[Polly] Avoid compiler warning. NFC. Avoid the warning /polly/lib/Support/RegisterPasses.cpp:833:3: warning: default label in switch which covers all enumeration values [-Wcovered-switch-default] default: ^ since all cases are now handled. Thanks to Luke Benes for reporting.	2021-05-22 00:21:20 -05:00
Michael Kruse	ad568f4286	[Polly] Add support for -polly-dump-after(-file) with the NPM. For the same reason as with -polly-dump-before, it is only supported with -polly-position=early.	2021-05-17 22:20:47 -05:00
Michael Kruse	29bef8e4e3	[Polly] Add support for -polly-dump-before(-file) with the NPM. Only supported with -polly-position=early. Unfortunately, the extension point callpack for VectorizerStart only passes a FunctionPassManager, making it impossible to add a module pass.	2021-05-17 20:58:37 -05:00
Michael Kruse	5aafcb2b44	[Polly] Add support for -polly-position=early with the NPM. This required support for the canonicalization passes, inlcuding porting RewriteByReferenceParams to the NPM. For some reason, the legacy pass pipeline with -polly-position=early did not run the CodePreparation pass. This was fixed as well.	2021-05-14 12:55:03 -05:00
Arthur Eubanks	34a8a437bf	[NewPM] Hide pass manager debug logging behind -debug-pass-manager-verbose Printing pass manager invocations is fairly verbose and not super useful. This allows us to remove DebugLogging from pass managers and PassBuilder since all logging (aside from analysis managers) goes through instrumentation now. This has the downside of never being able to print the top level pass manager via instrumentation, but that seems like a minor downside. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D101797	2021-05-07 21:51:47 -07:00
Fangrui Song	d8aba75a76	Internalize some cl::opt global variables or move them under namespace llvm	2021-05-07 11:15:43 -07:00
Bjorn Pettersson	3ee826594a	Make dependency between certain analysis passes transitive (reapply) LazyBlockFrequenceInfoPass, LazyBranchProbabilityInfoPass and LoopAccessLegacyAnalysis all cache pointers to their nestled required analysis passes. One need to use addRequiredTransitive to describe that the nestled passes can't be freed until those analysis passes no longer are used themselves. There is still a bit of a mess considering the getLazyBPIAnalysisUsage and getLazyBFIAnalysisUsage functions. Those functions are used from both Transform, CodeGen and Analysis passes. I figure it is OK to use addRequiredTransitive also when being used from Transform and CodeGen passes. On the other hand, I figure we must do it when used from other Analysis passes. So using addRequiredTransitive should be more correct here. An alternative solution would be to add a bool option in those functions to let the user tell if it is a analysis pass or not. Since those lazy passes will be obsolete when new PM has conquered the world I figure we can leave it like this right now. Intention with the patch is to fix PR49950. It at least solves the problem for the reproducer in PR49950. However, that reproducer need five passes in a specific order, so there are lots of various "solutions" that could avoid the crash without actually fixing the root cause. This is a reapply of commit `3655f0757f`, that was reverted in `33ff3c2049` due to problems with assertions in the polly lit tests. That problem is supposed to be solved by also adjusting ScopPass to explicitly preserve LazyBlockFrequencyInfo and LazyBranchProbabilityInfo (it already preserved OptimizationRemarkEmitter which depends on those lazy passes). Differential Revision: https://reviews.llvm.org/D100958	2021-05-05 15:17:55 +02:00
Michael Kruse	286677870b	[Polly][ManualOpt] Match interpretation of unroll metadata to LoopUnrolls's. We previously had a different interpretation of unroll transformation attributes than how LoopUnroll interpreted it. In particular, llvm.loop.unroll.enable was needed explicitly to enable it and disabling metadata was ignored. Additionally, it required that either full unrolling or an unroll factor to be specified or fail otherwise. An unroll factor is still required, but the transformation is ignored with the hope that LoopUnroll is going to apply the unrolling, since Polly currently does not implement an heuristic. Fixes llvm.org/PR50109	2021-04-24 04:30:19 -05:00
Krzysztof Parzyszek	a7f33d720b	[Polly] Explicitly include all std headers There were some recent header changes, and now some std classes are no longer available in ScopDetection.cpp with the pre-existing includes.	2021-04-21 12:52:11 -05:00
Michael Kruse	89b59345ee	[PollyACC] Fix declaration/stub definition mismatch. NFC. external.c defines stub functions that are never used because of how Polly uses PPCG. Unfortunately, they are declared as functions without return values or parameters which does not match their declarations. Since they are never called, this was usually not a problem, but an LTO build gets confused with differently declared functions, or in case of pet_options_args, a global variable declaration that is defined as a function Resolve by including the declaring headers in external.c which forces the declaration and definition to match at compile-time. This fixes llvm.org/50021	2021-04-21 11:52:43 -05:00
Michael Kruse	6048d1d19c	[PollyACC] Configure PollyPPCG only if needed. The PollyPPCG library is only needed when POLLY_ENABLE_GPGPU_CODEGEN=ON. If disabled, the library target is still created, but not linked against anything. This change does not add create the PollyPPCG build target if not needed. Motivated by llvm.org/PR50021	2021-04-21 01:08:01 -05:00
Michael Kruse	90e5ce0b0d	[PollyACC] Fix implicit function definitions. NFC. The isl_id_* have been in used without including the correspodning isl/id.h header. According to rules in C, a function is defined implicitly when first used with an assumed int return type (32 bits on 64 bit systems). But the implementation returns a pointer (64 bits on 64 bit systems). Is usually has no consequence because the return value is stored in a registers that is 64 bits (RAX) and the optimizer does not truncate its value before using it again as a pointer value. However, LTO optimizers will be rightfull;y confused. Fix by including <isl/id.h> This fixes llvm.org/PR50021	2021-04-21 01:08:00 -05:00
patacca	4170d6cdd5	[Polly][Ast] Partial refactoring of IslAst and IslAstInfo to use isl++. NFC. Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: - Refactoring the following methods of class `IslAst` - `getAst()` `getRunCondition()` `buildRunCondition()` - Removed the destructor in favor of the default one - Change the type of the attribute `IslAst.RunCondition` to `isl::ast_expr` - Change the type of the attribute `IslAst.Root` to `isl::ast_node` - Change the order of attributes in class `IslAst` to reflect the data dependencies so that the destructor won't complain - Refactoring the following methods of class `IslAstInfo` - `getAst()` `getRunCondition()` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D100265	2021-04-16 00:40:26 -05:00
patacca	82fbc5d45b	[Polly] Partially refactoring of IslAstInfo and IslNodeBuilder to use isl++. NFC. Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: - Refactoring the following methods of class IslAstInfo - isParallel() isExecutedInParallel() isReductionParallel() getSchedule() getMinimalDependenceDistance() getBrokenReductions() - Refactoring the following methods of class IslNodeBuilder - getReferencesInSubtree() getScheduleForAstNode() - Refactoring function getBrokenReductionsStr() - Fixed the mismatching function declaration for getScheduleForAstNode() Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D99971	2021-04-10 21:28:02 -05:00
Abhina Sreeskantharajan	82b3e28e83	[SystemZ][z/OS][Windows] Add new OF_TextWithCRLF flag and use this flag instead of OF_Text Problem: On SystemZ we need to open text files in text mode. On Windows, files opened in text mode adds a CRLF '\r\n' which may not be desirable. Solution: This patch adds two new flags - OF_CRLF which indicates that CRLF translation is used. - OF_TextWithCRLF = OF_Text \| OF_CRLF indicates that the file is text and uses CRLF translation. Developers should now use either the OF_Text or OF_TextWithCRLF for text files and OF_None for binary files. If the developer doesn't want carriage returns on Windows, they should use OF_Text, if they do want carriage returns on Windows, they should use OF_TextWithCRLF. So this is the behaviour per platform with my patch: z/OS: OF_None: open in binary mode OF_Text : open in text mode OF_TextWithCRLF: open in text mode Windows: OF_None: open file with no carriage return OF_Text: open file with no carriage return OF_TextWithCRLF: open file with carriage return The Major change is in llvm/lib/Support/Windows/Path.inc to only set text mode if the OF_CRLF is set. ``` if (Flags & OF_CRLF) CrtOpenFlags \|= _O_TEXT; ``` These following files are the ones that still use OF_Text which I left unchanged. I modified all these except raw_ostream.cpp in recent patches so I know these were previously in Binary mode on Windows. ./llvm/lib/Support/raw_ostream.cpp ./llvm/lib/TableGen/Main.cpp ./llvm/tools/dsymutil/DwarfLinkerForBinary.cpp ./llvm/unittests/Support/Path.cpp ./clang/lib/StaticAnalyzer/Core/HTMLDiagnostics.cpp ./clang/lib/Frontend/CompilerInstance.cpp ./clang/lib/Driver/Driver.cpp ./clang/lib/Driver/ToolChains/Clang.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99426	2021-04-06 07:23:31 -04:00
patacca	812ce7f9be	[Polly] Refactoring isInnermost() from isl to use the C++ wrapper Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: - Refactoring isInnermost() to take C++ bindings instead of the plain isl C api. - Addition of manage_copy() when needed to get the reference for the isl_ast_node object Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D99841	2021-04-05 21:16:52 -05:00
David Blaikie	30df6d5d6a	Preprocessor conditionalize some assert-only functions to suppress -Wunused-function	2021-04-03 14:03:43 -07:00
Fangrui Song	927050af53	[Polly] Fix -Wunused-function in -DLLVM_ENABLE_ASSERTIONS=off builds	2021-03-24 19:56:43 -07:00
Nikita Popov	7d91d81c6b	[polly] Fix build This produced a compile error with GCC: llvm-project/polly/lib/Transform/ScheduleOptimizer.cpp:1220:49: error: cannot convert ‘bool’ to ‘llvm::TargetTransformInfo::RegisterKind’ 1220 \| RegisterBitwidth = TTI->getRegisterBitWidth(true);	2021-03-24 17:46:46 +01:00
Michael Kruse	8796451d6e	[Polly] Port DeadCodeElim to the NewPM.	2021-03-24 01:01:29 -05:00
Michael Kruse	5f58aae8f3	[Polly][CodeGen] Allow nesting of BandAttr mark without loop. BandAttr markers are added as parents of schedule tree bands. These also appear as markers its equivalent AST, but a band does not necessarily corresponds to a loop in this. Iterations may be peeled or the loop being unrolled (e.g. if it has just one iteration). In such cases it may happend that there is not loop between a BandAttr marker and the marker for a loop nested in the former parent band/loop. Handle the situation by giving priority to the inner marker over the outer. Fixes the polly-x86_64-linux-test-suite buildbot.	2021-03-16 16:17:07 -05:00
Michael Kruse	f51427afb5	[Polly][Unroll] Fix unroll_double test. We enumerated the cross product Domain x Scatter, but sorted only be the scatter key. In case there are are multiple statement instances per scatter value, the order between statement instances of the same loop iteration was undefined. Propertly enumerate and sort only by the scatter value, and group the domains using the scatter dimension again. Thanks to Leonard Chan for the report.	2021-03-16 09:00:42 -05:00
Michael Kruse	9c486eb348	[Polly] Fix deprecation warning. NFC. IRBuilder::CreateLoad without type parameter was deprecated in `6312c538` to prepare for opaque pointers.	2021-03-15 14:31:16 -05:00
Michael Kruse	3f170eb197	[Polly][Optimizer] Apply user-directed unrolling. Make Polly look for unrolling metadata (https://llvm.org/docs/TransformMetadata.html#loop-unrolling) that is usually only interpreted by the LoopUnroll pass and apply it to the SCoP's schedule. While not that useful by itself (there already is an unroll pass), it introduces mechanism to apply arbitrary loop transformation directives in arbitrary order to the schedule. Transformations are applied until no more directives are found. Since ISL's rescheduling would discard the manual transformations and it is assumed that when the user specifies the sequence of transformations, they do not want any other transformations to apply. Applying user-directed transformations can be controlled using the `-polly-pragma-based-opts` switch and is enabled by default. This does not influence the SCoP detection heuristic. As a consequence, loop that do not fulfill SCoP requirements or the initial profitability heuristic will be ignored. `-polly-process-unprofitable` can be used to disable the latter. Other than manually editing the IR, there is currently no way for the user to add loop transformations in an order other than the order in the default pipeline, or transformations other than the one supported by clang's LoopHint. See the `unroll_double.ll` test as example that clang currently is unable to emit. My own extension of `#pragma clang loop` allowing an arbitrary order and additional transformations is available here: https://github.com/meinersbur/llvm-project/tree/pragma-clang-loop. An effort to upstream this functionality as `#pragma clang transform` (because `#pragma clang loop` has an implicit transformation order defined by the loop pipeline) is D69088. Additional transformations from my downstream pragma-clang-loop branch are tiling, interchange, reversal, unroll-and-jam, thread-parallelization and array packing. Unroll was chosen because it uses already-defined metadata and does not require correctness checks. Reviewed By: sebastiankreutzer Differential Revision: https://reviews.llvm.org/D97977	2021-03-15 13:05:39 -05:00
Kevin Zhou	b7df372cdc	[Polly] Refactoring astScheduleDimIsParallel to take the C++ wrapper object. NFC Polly currently needs to be slowly refactor to use the C++ wrapper objects to handle the reference counters automatically. I took the function of astScheduleDimIsParallel and refactored it so that it uses the C++ wrapper function as much as possible. There are some problems with the IsParallel since it expects the C objects, so the C++ wrapper functions must be .release() and .get() first before they are able to be used with IsParallel. When checking the ReductionDependencies Parallelism with the Build's Schedule, I opted to keep the union map as a C object rather than a C++ object. Eventually, changes will need to be made to IsParallel to refactor it to the C++ wrappers. When this is done, this function will also need to be slightly refactored to not use the C object. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D98455	2021-03-15 12:08:50 -05:00
Roman Lebedev	78b8ce40ef	Reland [SCEV] Improve modelling for (null) pointer constants This reverts commit `329aeb5db4`, and relands commit `61f006ac65`. This is a continuation of D89456. As it was suggested there, now that SCEV models `PtrToInt`, we can try to improve SCEV's pointer handling. In particular, i believe, i will need this in the future to further fix `SCEVAddExpr`operation type handling. This removes special handling of `ConstantPointerNull` from `ScalarEvolution::createSCEV()`, and add constant folding into `ScalarEvolution::getPtrToIntExpr()`. This way, `null` constants stay as such in SCEV's, but gracefully become zero integers when asked. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D98147	2021-03-13 16:05:34 +03:00
Roman Lebedev	329aeb5db4	Temporairly evert "[SCEV] Improve modelling for (null) pointer constants" This appears to have broken ubsan bot: https://lab.llvm.org/buildbot/#/builders/85/builds/3062 https://reviews.llvm.org/D98147#2623549 It looks like LSR needs some kind of a change around insertion point handling. Reverting until i have a fix. This reverts commit `61f006ac65`.	2021-03-13 09:10:28 +03:00
Nikita Popov	42eb658f65	[OpaquePtrs] Remove some uses of type-less CreateGEP() (NFC) This removes some (but not all) uses of type-less CreateGEP() and CreateInBoundsGEP() APIs, which are incompatible with opaque pointers. There are a still a number of tricky uses left, as well as many more variation APIs for CreateGEP.	2021-03-12 21:01:16 +01:00
Roman Lebedev	61f006ac65	[SCEV] Improve modelling for (null) pointer constants This is a continuation of D89456. As it was suggested there, now that SCEV models `PtrToInt`, we can try to improve SCEV's pointer handling. In particular, i believe, i will need this in the future to further fix `SCEVAddExpr`operation type handling. This removes special handling of `ConstantPointerNull` from `ScalarEvolution::createSCEV()`, and add constant folding into `ScalarEvolution::getPtrToIntExpr()`. This way, `null` constants stay as such in SCEV's, but gracefully become zero integers when asked. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D98147	2021-03-12 22:11:58 +03:00
Nikita Popov	ff9b37e95f	[Polly] Remove uses of type-less CreateLoad() APIs (NFC) These are incompatible with opaque pointers and are going away. Explicitly specify the loaded type instead.	2021-03-11 18:39:20 +01:00
Nikita Popov	46354bac76	[OpaquePtrs] Remove some uses of type-less CreateLoad APIs (NFC) Explicitly pass loaded type when creating loads, in preparation for the deprecation of these APIs. There are still a couple of uses left.	2021-03-11 14:40:57 +01:00
Michael Kruse	b85c98b4c5	[Polly][Codegen] Emit access group metadata. Emit llvm.loop.parallel_accesses metadata instead of llvm.mem.parallel_loop_access. The latter is deprecated because it assumes that LoopIDs are persistent, which they are not. We also emit parallel access metadata for all surrounding parallel loops, not just the innermost parallel.	2021-03-04 03:58:03 -06:00
Prateek Pardeshi	50e34497ac	[Polly] Refabricating IsOutermostParallel() from Integer Set Libarary(ISL) to take the C++ wrapper Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: * Refabricating IsOutermostParallel() to take C++ bindings instead of reference-counting in C isl lib. * Addition of manage_copy() to be used as reference for C objects instead of IsOutermostParallel() Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D97751	2021-03-03 01:49:37 -06:00
Kevin Zhou	1ab2753d4c	[Polly] Refactoring IsInnermostParallel() in ISL to take the C++ wrapper object. NFC Currently, the IslAst library is a C library that would be incompatible with the rest of the LLVM because LLVM is written in C++. I took one function, IsInnermostParallel(), and refactored it so that it would take the C++ wrapper object instead of using reference counters with the C ISL library. As well, all the references that use IsInnermostParallel() will use manage_copy() since they are still expecting the C object. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D97425	2021-02-26 18:41:44 -06:00
Harmen Stoppels	a54f160b3a	Prefer /usr/bin/env xxx over /usr/bin/xxx where xxx = perl, python, awk Allow users to use a non-system version of perl, python and awk, which is useful in certain package managers. Reviewed By: JDevlieghere, MaskRay Differential Revision: https://reviews.llvm.org/D95119	2021-02-25 11:32:27 +01:00
Michael Kruse	ea37ee5bc4	[Polly] Update IslAstInfo::getNodePayload to use isl C++ interface. NFC.	2021-02-18 17:53:32 -06:00
Michael Kruse	ab0556bb20	[Polly] Regenerate isl-noexceptions.h. Regenerate the C++ wrapper header from the current isl version's headers. The most notable change is that some dimension sizes are represented by an isl_size (instead of unsigned), which is a signed int. Additionally, some function may return -1 in case of an error which already had been fixed in the past. The C++ may no return -1 instead of UINT_MAX which caused the problems. Some types in Polly had been changed from unsigned to isl_size (that were not already auto) and some loops/comparision had to be changed to avoid unsigned/signed comparison warnings.	2021-02-14 19:17:54 -06:00
Michael Kruse	95ef556bd1	[Polly] Preserve DetectionContext references. DetectionContext objects are stored as values in a DenseMap. When the DenseMap reaches its maximum load factor, it is resized and all its objects moved to a new memory allocation. Unfortunately Scop object have a reference to its DetectionContext. When the DenseMap resizes, all the DetectionContexts reference now point to invalid memory, even if caused by an unrelated DetectionContext. Even worse, NewPM's ScopPassManager called isMaxRegionInScop with the Verify=true parameter before each pass. This caused the old DetectionContext to be removed an a new on created and re-verified. Of course, the Scop object was already created pointing to the old DetectionContext. Because the new DetectionContext would usually be stored at the same position in the DenseMap, the reference would usually reference the new DetectionContext of the same Region. Usually. If not, the old position still points to memory in the DenseMap allocation (unless also a resizing occurs) such that tools like Valgrind and AddressSanitizer would not be able to diagnose this. Instead of storing the DetectionContext inside the DenseMap, use a std::unique_ptr to a DetectionContext allocation, i.e. it will not move around anymore. This also allows use to remove the very strange DetectionContext(const DetectionContext &&) copy/move(?) constructor. DetectionContext objects now are neither copied nor moved. As a result, every re-verification of a DetectionContext will use a new allocation. Therefore, once a Scop object has been created using a DetectionContext, it must not be re-verified (the Scop data structure requires its underlying Region to not change before code generation anyway). The NewPM may call isMaxRegionInScop only with Validate=false parameter.	2021-02-13 03:36:09 -06:00
Michael Kruse	f0f5afc4dd	[Polly] Remove unused declaration. NFC.	2021-02-12 02:20:31 -06:00
Michael Kruse	9b123cde63	[Polly] Sanitize optimization levels. The description of the -polly switch stated that it was only enabled with -O3. This was a lie, the optimization level was ignored. Only at -O0 Polly was not added to the pass pipeline because the pass builder, but only because the extension points were not triggered. In the NewPM, the VectorizerStart extensions point is actually trigger even with -O0 which leads to the following crash: Assertion `Level != OptimizationLevel::O0 && "Must request optimizations!"' failed. We sanitize the optimization levels using the following rules for both pass mangers: 1. Only enable Polly if optimizing at all (-O1, -O2 or -O3). 2. Do not enable Polly when optimizing for size. 3. Ignore the optimization level for diagnostic passes (printer, viewer or JScop-exporter). 4. If only diagnostic passes enabled, skip the code-generation. 5. Fix the description of the -polly command line option.	2021-02-11 23:07:48 -06:00
Michael Kruse	7387f33bfe	[Polly] Hide IslScheduleOptimizer implementation from header. NFC. These are implementation details of the IslScheduleOptimizer pass implementation and not use anywhere else. Hence, we can move them to the cpp file and into an anonymous namespace. Only getPartialTilePrefixes is, aside from the pass itself, used externally (by the ScheduleOptimizerTest) and moved into the polly namespace.	2021-02-11 21:02:29 -06:00
Michael Kruse	606aa622b2	Revert "[AssumptionCache] Avoid dangling llvm.assume calls in the cache" This reverts commit `b7d870eae7` and the subsequent fix "[Polly] Fix build after AssumptionCache change (D96168)" (commit `e6810cab09`). It caused indeterminism in the output, such that e.g. the polly-x86_64-linux buildbot failed accasionally.	2021-02-11 12:17:38 -06:00
Michael Kruse	23753c6088	[Polly] Hide Simplify implementation from header. NFC. Move SimplifiyVisitor from Simplify.h to Simplify.cpp. It is not relevant for applying the pass in either the NewPM or the legacyPM. Rename it to SimplifyImpl to account for that. This is possible due its state not being necessary to be preserved between runs and thefore SimplifyImpl not needed to be held in the pass object. Instead, SimplifyImpl is only instatiated for the current Scop. In the NewPM as a function-local variable, and in the legacy PM inside a llvm::Optional object because the state must be preserved between the printScop (invoked by opt -analyze) and the most recent runOnScop calls.	2021-02-10 22:11:52 -06:00
Michael Kruse	91ca9adc9e	[Polly] Avoid "using namespace llvm" in public headers. NFC. "using namespace" pollutes the namespace of every file that includes such a header and universally considered a bad thing. Even the variant namespace polly { using namespace llvm; } (previously used by LoopGenerators.h) imports more symbols than the file is in control of. The header may include a fixed set of files from LLVM, but the header itself may by be included together with other headers from LLVM. For instance, LLVM's MemorySSA.h and Polly's ScopInfo.h both declare a class 'MemoryAccess' which may conflict. Instead of prefixing everything in Polly's header files, this patch adds 'using' statements to import only the symbols that are actually referenced in Polly. This approach is also used by MLIR to import commonly used symbols into the mlir namespace. This patch also puts the symbols declared in IslNodeBuilder.h into the Polly namespace to also be able to use the imported symbols.	2021-02-10 20:58:33 -06:00
Fangrui Song	564788ddce	[Polly] Fix -Wunused-lambda-capture	2021-02-10 09:19:05 -08:00
Fangrui Song	89e257bd62	[Polly] Fix -DPOLLY_ENABLE_GPGPU_CODEGEN=off build after `222d380d2f`	2021-02-10 09:17:13 -08:00
Michael Kruse	222d380d2f	[Polly] Make the NewPM pass pipeline more similar to the legacy's. Even though it has some oddities, both pipelines should be as similar as possible. Also use report_fatal_error instead of assertions to ensure a proper failure in release builds for unsupported options. This finalizes the patch serious to make Polly run in the default configuration when using the NewPM by default.	2021-02-09 23:56:22 -06:00
Michael Kruse	08bab4b030	[Polly] Make NewPM's IslAstAnalysis more similar to the legacy IslAstInfoWrapperPass. In particular, print the ast with -debug-only=polly-ast, print a per-scop header with print<polly-ast> and force-add the analysis with -polly-code-generation=ast.	2021-02-09 23:56:21 -06:00
Michael Kruse	13f758a805	[Polly] Improve Simplify pass PM integration. 1. LegacyPM: Rename SimplifyLegacyPass to SimplifyWrapperPass. 2. LegacyPM: Complete create/init functions in LinkAllPasses.h 3. NewPM: Only invalidate non-Scop passes if changed. 4. NewPM: Add to default pass pipeline. 5. NewPM: Print -analyze header for each print<polly-simplify>	2021-02-09 23:56:21 -06:00
Michael Kruse	e200df952b	[Polly] Port IslScheduleOptimizer to the NewPM.	2021-02-09 23:56:21 -06:00
Michael Kruse	e7b9e43c9a	[Polly] Register pass-instrumentation for NewPM's Scop level. The pass-instrumentation pass is implicitly execute by the NewPM whenever a new analysis runs. Not registering it will cause the crash whenever a scop pass requests an analysis. For instance this is the case for the IstAstAnalysis requesting the DependenceAnalsis result.	2021-02-09 23:56:20 -06:00
Michael Kruse	b687fc9122	[Polly] Port PruneUnprofitable to the NewPM.	2021-02-09 23:56:20 -06:00
Michael Kruse	7903d594ea	[Polly] Port DeLICM to the NewPM.	2021-02-09 23:56:19 -06:00
Michael Kruse	4c64d8ee3a	[Polly] Port ForwardOpTree to the NewPM.	2021-02-09 23:56:19 -06:00
Michael Kruse	3dcb535115	[Polly] Remove use of -O3 in regression test. In addition to that regression tests should not test the intire pass pipeline (unless they are testing the pipeline itself), the Polly-ACC currently does not support the new pass manager. If enabled by default, such tests will therefore fail. Use the -polly-gpu-runtime and -polly-gpu-arch options also as default values for the PPCGCodeGeneration pass. This requires to move the option to be moved from the pipeline-building Register passes to the PPCGCodeGeneration implementation. Fixes the spir-typesize.ll buildbot fail.	2021-02-09 18:13:35 -06:00
Fangrui Song	e6810cab09	[Polly] Fix build after AssumptionCache change (D96168)	2021-02-06 11:56:15 -08:00
Michael Kruse	3b9677e1ec	[Polly] Track defined behavior for PHI predecessor computation. ZoneAlgorithms's computePHI relies on being provided with consistent a schedule to compute the statement prodecessors of a statement containing PHINodes. Otherwise unexpected results such as PHI nodes with multiple predecessors can occur which would result in problems in the algorithms expecting consistent data. In the added test case, statement instances are scrubbed from the SCoP their execution would result in undefined behavior (Due to a nsw overflow). As already being undefined behavior in LLVM-IR, neither AssumedContext nor InvalidContext are updated, giving computePHI no means to avoid these cases. Intoduce a new SCoP property, the DefinedBehaviorContext, that among the runtime-checked conditions, also tracks the assumptions not needing a runtime check, in particular those affecting the assumed control flow. This replaces the manual combination of the 3 other contexts that was already done in computePHI and setNewAccessRelation. Currently, the only additional assumption is that loop induction variables will nsw flag for not wrap, but potentially more can be added. Use in hasFeasibleRuntimeContext, isl::ast_build and gisting are other potential uses. To limit computational complexity, the DefinedBehaviorContext is not availabe if it grows too large (atm hardcoded to 8 disjuncts). Possible other fixes include bailing out in computePHI when inconsistencies are detected, choose an arbitrary value for inconsistent cases (since it is undefined behavior anyways), or make the code receiving the result from ComputePHI handle inconsistent data. All of them reduce the quality of implementation having to bail out more often and disabling the ability to assert on actually wrong results. This fixes llvm.org/PR48783.	2021-01-23 13:03:49 -06:00
Michael Kruse	02e8a5ad3c	[Polly] Allow param sets for dumpPw().	2021-01-23 13:03:48 -06:00
Michael Kruse	de0457a013	[Polly] Clean up hasFeasibleRuntimeContext.	2021-01-23 13:03:48 -06:00
Michael Kruse	a5b895110f	[Polly] Gist new access relations using the SCoP context. This simplifies the access relations.	2021-01-23 13:03:48 -06:00
Michael Kruse	842314b5f0	[Polly] Update isl to isl-0.23-61-g24e8cd12. This fixes llvm.org/PR48554 Some test cases had to be updated because the hash function for union_maps have been changed which affects the output order.	2021-01-19 12:01:31 -06:00
Wei Mi	86341247c4	[NFC] Rename ThinLTOPhase to ThinOrFullLTOPhase and move it from PassBuilder.h to Pass.h. In some compiler passes like SampleProfileLoaderPass, we want to know which LTO/ThinLTO phase the pass is in. Currently the phase is represented in enum class PassBuilder::ThinLTOPhase, so it is only available in PassBuilder and it also cannot represent phase in full LTO. The patch extends it to include full LTO phases and move it from PassBuilder.h to Pass.h, then it is much easier for PassBuilder to communiate with each pass about current LTO phase. Differential Revision: https://reviews.llvm.org/D94613	2021-01-13 15:55:40 -08:00
Michael Kruse	bc633fe46b	[Polly] Consider InvalidContext to determine partial READ. MemoryAccess::setNewAccessRelation() in assert-builds checks whether the access relation for a READ has a memory location for every instance of the domain. Otherwise, we would not have value to load from. That check already considered that instances outside the Scop's context do not matter since they are never executed (or would be undefined behavior). In this patch also take instances of the InvalidContext into account, as these can also be assumed to never occur. InvalidContext was introduced to avoid the computational complexity of subtracting restrictions from the AssumedContext. However, this additional check in setNewAccessRelation is only done in assert-builds. The assertion case with an InvalidContext may occur with DeLICM on a conditionally infinite loops, as it is the case in the following code: for (int i = 0; i < n; i+=b) vreg = ...; Dest = vreg; The loop is infinite when b=0, and [b] -> { : b = 0 } is part of the InvalidContext. When DeLICM tries to map the memory for %vreg to Dest, there is no store instance that uses the value of vreg when b = 0, hence no location to map it to. However, the case is irrelevant since Polly's runtime condition check ensures that this is never case. Fixes llvm.org/PR48445	2020-12-10 22:25:19 -06:00
Michael Kruse	6249bfeefe	[Polly][CodeGen] Remove use of ScalarEvolution. ScalarEvolution::getSCEV cannot be used during codegen. ScalarEvolution assumes a stable IR and control flow which is under construction during Polly's CodeGen. In particular, it uses DominatorTree for compute the backedge taken count. However the DominatorTree is not updated during codegen. In this case, SCEV was used to determine the base pointer of an array access. Replace it by our own function. Polly generates only GEP and BitCasts for array acceses, i.e. it is sufficient to handle these to to find the base pointer. Fixes llvm.org/PR48422	2020-12-07 15:21:51 -06:00

1 2 3 4 5 ...

3208 Commits