There is currently no upper cap on the amount of work the Schedule
Optimizer may spend computing a schedule. In some cases computing the
schedule takes so long that compilation appears to hang. This patch
introduces a flag 'polly-schedule-computeout' to pass the cap, which is
initialized to 300000, and handles the compute-out case by bailing out
and exiting gracefully.
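A rough sketch of the mechanism (the option name and default come from
this patch; the helper below and the exact isl calls are an
illustration, not the literal Polly code):

```cpp
#include "llvm/Support/CommandLine.h"
#include "isl/ctx.h"
#include "isl/options.h"
#include "isl/schedule.h"

static llvm::cl::opt<int> ScheduleComputeOut(
    "polly-schedule-computeout",
    llvm::cl::desc("Bound the scheduler by this maximal amount of "
                   "computational steps"),
    llvm::cl::init(300000));

// Illustrative helper: cap the number of isl operations spent computing
// the schedule and bail out gracefully if the quota is hit.
isl_schedule *computeScheduleWithCap(isl_ctx *Ctx,
                                     isl_schedule_constraints *SC) {
  isl_ctx_set_max_operations(Ctx, ScheduleComputeOut);
  isl_options_set_on_error(Ctx, ISL_ON_ERROR_CONTINUE);
  isl_schedule *Sched = isl_schedule_constraints_compute_schedule(SC);
  isl_ctx_reset_operations(Ctx);
  isl_ctx_set_max_operations(Ctx, 0); // Remove the cap again.
  if (isl_ctx_last_error(Ctx) == isl_error_quota) {
    // Compute-out: discard the partial result and exit gracefully
    // instead of appearing to hang.
    isl_schedule_free(Sched);
    return nullptr;
  }
  return Sched;
}
```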
Fixed the test that failed in the previous commit.
Fixes #69090
There is currently no upper cap on the amount of work the Schedule
Optimizer may spend computing a schedule. In some cases computing the
schedule takes so long that compilation appears to hang. This patch
introduces a flag 'polly-schedule-computeout' to pass the cap, which is
initialized to 300000, and handles the compute-out case by bailing out
and exiting gracefully.
Fixes #69090
zext nneg was recently added to the IR in #67982. Teaching SCEVExpander
to emit nneg when possible is valuable since SCEV may have proved
non-trivial facts about loop bounds which would otherwise be lost when
materializing the value.
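A minimal sketch of the idea (not the actual SCEVExpander change; the
helper name is made up and the setNonNeg call assumes the
Instruction-level flag added alongside #67982):

```cpp
#include "llvm/Analysis/ScalarEvolution.h"
#include "llvm/IR/IRBuilder.h"
using namespace llvm;

// Hypothetical helper: emit a zext for S's value V and tag it nneg when
// SCEV already knows the operand is non-negative (e.g. from loop-bound
// reasoning), so that fact is not lost in the materialized IR.
static Value *expandZExt(IRBuilder<> &Builder, ScalarEvolution &SE,
                         const SCEV *S, Value *V, Type *DestTy) {
  Value *ZExt = Builder.CreateZExt(V, DestTy);
  if (auto *I = dyn_cast<Instruction>(ZExt))
    if (SE.isKnownNonNegative(S))
      I->setNonNeg(); // Preserve the proven fact in the IR.
  return ZExt;
}
```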
After D154102 multi-line labels would get split incorrectly.
When the CFG is generated for a function with a basic block name longer
than 80 characters, the header separator is placed after the line break
within the label name instead of after the whole label name.
The fix is simple: move the insertion of the '|' character to before the
line splitting happens.
Differential Revision: https://reviews.llvm.org/D159207
This change adds separators for basic block names, which makes it
easier to find a basic block based on its name and separates it
from the code.
Currently there is also a chance that the basic block label will
be present twice, namely when the basic block has explicit
numbering; this change fixes that bug.
Differential Revision: https://reviews.llvm.org/D154102
Before this patch, we could only use the MaxBECount for an AddRec's range
computation if the MaxBECount's bit width was <= the bit width of the
AddRec. This patch reasons that if a MaxBECount has a greater bit width
but its value is <= the maximum value of the AddRec's bit width, we can
still use the MaxBECount.
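A small illustration of the check (my own sketch, not the actual
ScalarEvolution code; the helper name is made up):

```cpp
#include "llvm/ADT/APInt.h"
using namespace llvm;

// Returns true if MaxBECount may be used for range computation of an
// AddRec whose type has AddRecBitWidth bits.
static bool isUsableMaxBECount(const APInt &MaxBECount,
                               unsigned AddRecBitWidth) {
  if (MaxBECount.getBitWidth() <= AddRecBitWidth)
    return true; // The only case accepted before this patch.
  // New case: a wider count is fine as long as its value still fits in
  // the AddRec's type, e.g. an i16 count of 200 for an i8 AddRec
  // (200 <= 255).
  APInt MaxValue =
      APInt::getMaxValue(AddRecBitWidth).zext(MaxBECount.getBitWidth());
  return MaxBECount.ule(MaxValue);
}
```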
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D151698
This is an ongoing series of commits that are reformatting our
Python code. This catches the last of the Python files to
reformat. Since they were so few, I bunched them together.
Reformatting is done with `black`.
If you end up having problems merging this commit because you
have made changes to a Python file, the best way to handle that
is to run `git checkout --ours <yourfile>` and then reformat it
with `black`.
If you run into any problems, post to discourse about it and
we will try to help.
RFC Thread below:
https://discourse.llvm.org/t/rfc-document-and-standardize-python-code-style
Reviewed By: jhenderson, #libc, Mordante, sivachandra
Differential Revision: https://reviews.llvm.org/D150784
As long as the aliasee has `@llvm.used` or `@llvm.compiler.used` references, we cannot perform the related replace or delete operations. Even if it has local linkage, we cannot infer that there is no other use of it, such as from asm or other cases added in the future.
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D145293
Polly-ACC is unmaintained and has never been ported to the NPM pipeline; since D136621 it is not even accessible anymore without manually specifying the passes on the `opt` command line.
Since there is no plan to bring it into a maintainable state, remove it from Polly.
Reviewed By: grosser
Differential Revision: https://reviews.llvm.org/D142580
Polly's internal vectorizer is not well maintained and is known not to work in some cases, such as region ScopStmts. Unlike LLVM's LoopVectorize pass, it also has no target-dependent cost heuristics, and we recommend using LoopVectorize instead of -polly-vectorizer=polly.
In the future we hope that Polly can collaborate better with LoopVectorize, such as Polly marking a loop as safe to vectorize with a specific SIMD width, instead of replicating its functionality.
Reviewed By: grosser
Differential Revision: https://reviews.llvm.org/D142640
Two lit tests used overaligned i8, without the test cases actually
depending on i8 alignment.
Change the datalayout string to use naturally aligned i8,
preparing for the upcoming requirement of naturally aligned i8.
IR is now always parsed in opaque pointer mode, unless
-opaque-pointers=0 is explicitly given. There is no automatic
detection of typed pointers anymore.
The -opaque-pointers=0 option is added to any remaining IR tests
that haven't been migrated yet.
Differential Revision: https://reviews.llvm.org/D141912
These have been effectively disabled ever since 'nvptx' was added to
the REQUIRES clauses, because REQUIRES does not support triple checks.
The new 'target=<triple>' is supported, so switch to that scheme.
Fix up XFAIL annotations, now that these tests are actually run.
Part of the project to eliminate special handling for triples in lit
expressions.
Differential Revision: https://reviews.llvm.org/D139728
InstCombine prefers this canonical form (see getPreferredVectorIndex),
as does IRBuilder when passing the index as an integer, so we may as
well use the preferred form from creation.
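As a small illustration (my own sketch, not code from the patch),
IRBuilder's integer-index overload already emits the canonical i64 form:

```cpp
#include "llvm/IR/IRBuilder.h"
using namespace llvm;

// Extract lane 0 of a vector; the uint64_t overload builds the canonical
// "extractelement <N x T> %Vec, i64 0" rather than an i32 index.
static Value *extractLane0(IRBuilder<> &Builder, Value *Vec) {
  return Builder.CreateExtractElement(Vec, uint64_t(0));
}
```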
NOTE: All test changes are mechanical with nothing else expected
beyond a change of index type from i32 to i64.
Differential Revision: https://reviews.llvm.org/D140983
Clang's lit.cfg.py reads this to add an "enable-shared" feature that
three of clang's lit tests use. Nothing else reads enable_shared, so
remove it from most lit.site.cfg.py.in files.
Differential Revision: https://reviews.llvm.org/D138301
A simple sed doing these substitutions:
- `${LLVM_BINARY_DIR}/lib${LLVM_LIBDIR_SUFFIX}\>` -> `${LLVM_LIBRARY_DIR}`
- `${LLVM_BINARY_DIR}/bin\>` -> `${LLVM_TOOLS_BINARY_DIR}`
where `\>` means "word boundary".
The only manual modifications were reverting changes in
- `runtimes/CMakeLists.txt`
because these were "entry points" where we wanted to tread carefully so as not to introduce a "loop" which would end with an undefined variable being expanded to nothing.
There are some occurrences of `${LLVM_BINARY_DIR}/lib` without the `${LLVM_LIBDIR_SUFFIX}`, but these refer to the lib subdirectory of the source (`llvm/lib`). That `lib` is automatically appended to form the local `CMAKE_CURRENT_BINARY_DIR` value by `add_subdirectory`; since the directory name in the source tree is fixed without any suffix, the corresponding `CMAKE_CURRENT_BINARY_DIR` will be too. We therefore do not replace it but leave it as-is.
This picks up where D133828 left off, getting the occurrences with*out* `CMAKE_CFG_INTDIR`. But this is difficult to do correctly and so not done in the (retroactively) previous diff.
This hopefully increases readability overall, and also decreases the usages of `LLVM_LIBDIR_SUFFIX`, preparing us for D130586.
Reviewed By: sebastian-ne
Differential Revision: https://reviews.llvm.org/D132316
The unittests are already included in check-polly, so check-all was
running them twice. Running them twice causes a race on the output
files, which led to intermittent failures on the reverse-iteration
buildbot.
A simple sed doing these substitutions:
- `${LLVM_BINARY_DIR}/(\$\{CMAKE_CFG_INTDIR}/)?lib(${LLVM_LIBDIR_SUFFIX})?\>` -> `${LLVM_LIBRARY_DIR}`
- `${LLVM_BINARY_DIR}/(\$\{CMAKE_CFG_INTDIR}/)?bin\>` -> `${LLVM_TOOLS_BINARY_DIR}`
where `\>` means "word boundary".
The only manual modifications were reverting changes in
- `compiler-rt/cmake/Modules/CompilerRTUtils.cmake`
- `runtimes/CMakeLists.txt`
because these were "entry points" where we wanted to tread carefully so as not to introduce a "loop" which would end with an undefined variable being expanded to nothing.
This hopefully increases readability overall, and also decreases the usages of `LLVM_LIBDIR_SUFFIX`, preparing us for D130586.
Reviewed By: sebastian-ne
Differential Revision: https://reviews.llvm.org/D132316
We held off on this before as `LLVM_LIBDIR_SUFFIX` conflicted with it.
Now we return to this.
`LLVM_LIBDIR_SUFFIX` is kept as a deprecated way to set
`CMAKE_INSTALL_LIBDIR`. The other `*_LIBDIR_SUFFIX` variables are just
removed entirely.
I imagine this is too potentially-breaking to make LLVM 15. That's fine.
I have a more minimal version of this in the distro (NixOS) patches for
LLVM 15 (like previous versions). I will test this more expansive
version harder after the release is cut.
Reviewed By: sebastian-ne, ldionne, #libc, #libc_abi
Differential Revision: https://reviews.llvm.org/D130586
I went over the output of the following mess of a command:
`(ulimit -m 2000000; ulimit -v 2000000; git ls-files -z | parallel --xargs -0 cat | aspell list --mode=none --ignore-case | grep -E '^[A-Za-z][a-z]*$' | sort | uniq -c | sort -n | grep -vE '.{25}' | aspell pipe -W3 | grep : | cut -d' ' -f2 | less)`
and proceeded to spend a few days looking at it to find probable typos
and fixed a few hundred of them in all of the llvm project (note, the
ones I found are not anywhere near all of them, but it seems like a
good start).
Reviewed By: inclyc
Differential Revision: https://reviews.llvm.org/D131167
The pattern matching optimization of Polly detects and optimizes dense general
matrix-matrix multiplication. The generated code is close to high performance
implementations of matrix-matrix multiplications, which are contained in
manually tuned libraries. The described pattern matching optimization is
a particular case of tensor contraction optimization, which was
introduced in [1].
This patch generalizes the pattern matching to the case of tensor contractions
using the form of data dependencies and memory accesses produced by tensor
contractions [1].
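To illustrate the shape of computation being matched (an example of my
own, not taken from the patch or from [1]), a tensor contraction sums
over indices shared between its operands; matrix multiplication is the
special case with a single index in each index bundle:

```cpp
// Illustrative tensor contraction C[i][j][k] += A[i][l][j] * B[l][k],
// contracting over the index l. The sizes are arbitrary placeholders.
constexpr int I = 32, J = 32, K = 32, L = 32;
float A[I][L][J], B[L][K], C[I][J][K];

void contract() {
  for (int i = 0; i < I; ++i)
    for (int j = 0; j < J; ++j)
      for (int k = 0; k < K; ++k)
        for (int l = 0; l < L; ++l)
          C[i][j][k] += A[i][l][j] * B[l][k];
}
```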
Optimization of tensor contractions will be added in the next patch. Following
the ideas introduced in [2], it will logically represent tensor contraction
operands as matrix multiplication operands and use an approach for
optimization of matrix-matrix multiplications.
[1] - Gareev R., Grosser T., Kruse M. High-Performance Generalized Tensor
Operations: A Compiler-Oriented Approach // ACM Transactions on
Architecture and Code Optimization (TACO). 2018. Vol. 15, no. 3.
P. 34:1–34:27. DOI: 10.1145/3235029.
[2] - Matthews D. High-Performance Tensor Contraction without BLAS // SIAM
Journal on Scientific Computing. 2018. Vol. 40, no. 1. P. C1–C24.
DOI: 10.1137/16m108968x.
Reviewed By: Meinersbur
Differential Revision: https://reviews.llvm.org/D114336
The IR Verifier requires that every call instruction to an inlinable
function (among other things, its implementation must be visible in the
translation unit) have !dbg metadata attached to it. When parallelizing,
Polly emits calls to OpenMP runtime functions out of thin air, or at
least not directly derived from a bounded list of previous instructions.
While we could search for instructions in the SCoP that have some debug
info attached to them, there is no guarantee that we would find any.
Our solution is to generate a new DILocation that points to line 0 to
represent optimized code.
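Sketched in code (simplified; the helper name is made up and this is not
the literal Polly change), the fix attaches an artificial line-0
location scoped to the surrounding function's DISubprogram:

```cpp
#include "llvm/IR/DebugInfoMetadata.h"
#include "llvm/IR/Instructions.h"
using namespace llvm;

// Attach a "line 0" location to a runtime call that was created out of
// thin air, so the Verifier's rule for calls to inlinable functions is
// satisfied. Line 0 conventionally marks compiler-generated code.
static void annotateRuntimeCall(CallInst *Call, DISubprogram *SP) {
  if (!SP)
    return; // The enclosing function has no debug info; nothing to attach.
  LLVMContext &Ctx = Call->getContext();
  Call->setDebugLoc(DILocation::get(Ctx, /*Line=*/0, /*Column=*/0, SP));
}
```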
The OpenMP function implementation is usually not available in the
user's translation unit, but can become visible in an LTO build. For
the bug to appear, libomp must also be built with debug symbols.
IMHO, the IR verifier rule is too strict. Runtime functions can
also be inserted by other optimization passes, such as
LoopIdiomRecognize. When inserting a call to e.g. memset, it uses the
DebugLoc from a StoreInst in the unoptimized code, which is not
required to have !dbg metadata attached either.
Fixes #56692
Following some recent discussions, this changes the representation
of callbrs in IR. The current blockaddress arguments are replaced
with `!` label constraints that refer directly to callbr indirect
destinations:
; Before:
%res = callbr i8* asm "", "=r,r,i"(i8* %x, i8* blockaddress(@test8, %foo))
to label %asm.fallthrough [label %foo]
; After:
%res = callbr i8* asm "", "=r,r,!i"(i8* %x)
to label %asm.fallthrough [label %foo]
The benefit of this is that we can easily update the successors of
a callbr, without having to worry about also updating blockaddress
references. This should allow us to remove some limitations:
* Allow unrolling/peeling/rotation of callbr, or any other
clone-based optimizations
(https://github.com/llvm/llvm-project/issues/41834)
* Allow duplicate successors
(https://github.com/llvm/llvm-project/issues/45248)
This is just the IR representation change though; I will follow up
with patches to remove limitations in various transformation passes
that are no longer needed.
Differential Revision: https://reviews.llvm.org/D129288
The copy statements inserted by the matrix-multiplication optimization
introduce new dependencies between the copy statements and other
statements. As a result, the DependenceInfo must be recomputed.
Not recomputing them caused IslAstInfo to deduce that some loops are
parallel even though they would cause race conditions when accessing
the packed arrays.
As a result, matrix-matrix multiplication currently cannot be
parallelized.
Also see discussion at https://reviews.llvm.org/D125202
This enables opaque pointers by default in LLVM. The effect of this
is twofold:
* If IR that contains *neither* explicit ptr nor %T* types is passed
to tools, we will now use opaque pointer mode, unless
-opaque-pointers=0 has been explicitly passed.
* Users of LLVM as a library will now default to opaque pointers.
It is possible to opt-out by calling setOpaquePointers(false) on
LLVMContext.
A cmake option to toggle this default will not be provided. Frontends
or other tools that want to (temporarily) keep using typed pointers
should disable opaque pointers via LLVMContext.
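For library users, the opt-out mentioned above looks roughly like this
(minimal sketch using the LLVMContext API referred to in this message):

```cpp
#include "llvm/IR/LLVMContext.h"

int main() {
  llvm::LLVMContext Ctx;
  // Opt back out of the new default before creating or parsing any
  // modules in this context.
  Ctx.setOpaquePointers(false);
  // ... construct or parse typed-pointer IR using Ctx ...
  return 0;
}
```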
Differential Revision: https://reviews.llvm.org/D126689