llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2025-01-14 03:45:33 +00:00

Author	SHA1	Message	Date
Tres Popp	09d72ad399	Revert "[CGP] Enable CodeGenPrepares phi type convertion." This reverts commit 67121d7b82ed78a47ea32f0c87b7317e2b469ab2. This is causing compile times to be 2x slower on some large binaries.	2020-06-22 13:06:18 +02:00
Loïc Joly	cba56e026c	[ASTMatcher] Correct memoization bug ignoring direction (descendants or ancestors) Summary: In ASTMatcher, when we have `has(...)` and `hasParent(...)` called with the same internal matcher on the same node, the memoization process will mix-up the two calls because the direction of the traversal is not part of the memoization key. This patch adds this information. Reviewers: klimek Reviewed By: klimek Subscribers: Godin, njames93, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D80025	2020-06-22 12:56:29 +02:00
Serguei Katkov	eae0d2e9b2	Revert "[Peeling] Extend the scope of peeling a bit" This reverts commit 29b2c1ca72096ca06415b5e626e6728c42ef1e74. The patch causes the DT verifier failure like: DominatorTree is different than a freshly computed one! Not sure the patch itself it wrong but revert to investigate the failure.	2020-06-22 17:48:29 +07:00
Vitaly Buka	5d964e262f	[StackSafety] Check variable lifetime We can't consider variable safe if out-of-lifetime access is possible. So if StackLifetime can't prove that the instruction always uses the variable when it's still alive, we consider it unsafe.	2020-06-22 03:45:29 -07:00
Vitaly Buka	8f592ed333	[StackSafety] Ignore unreachable instructions Usually DominatorTree provides this info, but here we use StackLifetime. The reason is that in the next patch StackLifetime will be used for actual lifetime checks and we can avoid forwarding the DominatorTree into this code.	2020-06-22 03:45:29 -07:00
Denys Petrov	01f9388d95	[analyzer] Handle `\l` symbol in string literals in exploded-graph-rewriter Summary: Handle `\l` separately because a string literal can be in code like "string\\literal" with the `\l` inside. Also on Windows macros __FILE__ produces specific delimiters `\` and a directory or file may starts with the letter `l`. Fix: Use regex for replacing all `\l` (like `,\l`, `}\l`, `[\l`) except `\\l`, because a literal as a rule contains multiple `\` before `\l`. Differential Revision: https://reviews.llvm.org/D82092	2020-06-22 13:44:27 +03:00
Anton Korobeynikov	6cb80fbe40	Revert "[MSP430] Update register names" This reverts commit 8f6620f663031da2bb35b788239f4b607271af84.	2020-06-22 13:37:22 +03:00
David Zarzycki	2a9c3eb9ec	Make ninja smart console builds more pretty Summary: CMake's `find_package` outputs to the console on success, which confuses the smart console mode of the `ninja` build system. Let's quiet the success message and manually warn instead. Reviewers: tstellar, phosek, mehdi_amini Reviewed By: mehdi_amini Subscribers: mgorny, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82276	2020-06-22 06:35:13 -04:00
Anatoly Trosinenko	8f6620f663	[MSP430] Update register names When writing a unit test on replacing standard epilogue sequences with `BR __mspabi_func_epilog_<N>`, by manually asm-clobbering `rN` - `r10` for N = 4..10, everything worked well except for seeming inability to clobber r4. The problem was that MSP430 code generator of LLVM used an obsolete name FP for that register. Things were worse because when `llc` read an unknown register name, it silently ignored it. Differential Revision: https://reviews.llvm.org/D82184	2020-06-22 13:24:03 +03:00
Momchil Velikov	75b0bbca1d	[LTO] Use StringRef instead of C-style strings in setCodeGenDebugOptions Fixes an issue with missing nul-terminators and saves us some string copying, compared to a version which would insert nul-terminators. Differential Revision: https://reviews.llvm.org/D82033	2020-06-22 11:22:18 +01:00
Anatoly Trosinenko	a5bd75aab8	[MSP430] Enable some basic support for debug information This commit technically permits LLVM to emit the debug information for ELF files for MSP430 architecture. Aside from this, it only defines the register numbers as defined by part 10.1 of MSP430 EABI specification (assuming the 1-byte subregisters share the register numbers with corresponding full-size registers). This commit was basically tested by me with TI-provided GCC 8.3.1 toolchain by compiling an example program with `clang` (please note manual linking may be required due to upstream `clang` not yet handling the `-msim` option necessary to run binaries on the GDB-provided simulator) and then running it and single-stepping with `msp430-elf-gdb` like this: ``` $sysroot/bin/msp430-elf-gdb ./test -ex "target sim" -ex "load ./test" (gdb) ... traditional GDB commands follow ... ``` While this implementation is most probably far from completeness and is considered experimental, it can already help with debugging MSP430 programs as well as finding issues in LLVM debug info support for MSP430 itself. One of the use cases includes trying to find a point where UBSan check in a trap-on-error mode was triggered. The expected debug information format is described in the [MSP430 Embedded Application Binary Interface](http://www.ti.com/lit/an/slaa534/slaa534.pdf) specification, part 10. Differential Revision: https://reviews.llvm.org/D81488	2020-06-22 13:14:07 +03:00
Anatoly Trosinenko	359fae6eb0	[DebugInfo] Explicitly permit addr_size = 0x02 when parsing DWARF data Current LLVM implementation uses `MCAsmInfo::CodePointerSize` as addr_size when emitting the DWARF data. llvm-dwarfdump, on the other hand, handles `addr_size`s of 4 and 8 properly and considers all other sizes as an error. This works for most of mainline targets except for MSP430 and AVR. msp430-gcc v8.3.1 emits DWARF32 with addr_size = 4 (DWARF32 does not imply addr_size = 4, 32 refers to internal offset width of 4 bytes) that is handled by llvm-dwarfdump already. Still, emitting 2-byte target pointers on MSP430 seems correct as well (but not for MSP430X that is supported by msp430-gcc but not by LLVM and has 20-bit address space). This patch make it possible for MSP430 debug info support to be tested with llvm-dwarfdump. Differential Revision: https://reviews.llvm.org/D82055	2020-06-22 13:11:55 +03:00
Nathan James	23063296b5	[clang-tidy] Improved accuracy of check list updater script - Added `FixItHint` comments to Check files for the script to mark those checks as offering fix-its when the fix-its are generated in another file. - Case insensitive file searching when looking for the file a checker code resides in. Also regenerated the list, sphinx had no issue generating the docs after this. Reviewed By: sylvestre.ledru Differential Revision: https://reviews.llvm.org/D81932	2020-06-22 11:07:24 +01:00
Florian Hahn	0e19ff02d8	[DSE,MSSA] Remove unused arguments for isDSEBarrier (NFC).	2020-06-22 10:58:53 +01:00
Nathan James	c2b22c57fc	Fixed ASTMatchers registry and regen ast docs	2020-06-22 10:56:20 +01:00
Tobias Gysi	d10b1a38a7	[mlir] make the bitwidth of device side index computations configurable The patch makes the index type lowering of the GPU to NVVM/ROCDL conversion configurable. It introduces a pass option that controls the bitwidth used when lowering index computations. Differential Revision: https://reviews.llvm.org/D80285	2020-06-22 11:43:37 +02:00
Balázs Kéri	e935a540ea	[Analyzer][StreamChecker] Add note tags for file opening. Summary: Bug reports of resource leak are now improved. If there are multiple resource leak paths for the same stream, only one wil be reported. Reviewers: Szelethus, xazax.hun, baloghadamsoftware, NoQ Reviewed By: Szelethus, NoQ Subscribers: NoQ, rnkovacs, xazax.hun, baloghadamsoftware, szepet, a.sidorin, mikhail.ramalho, Szelethus, donat.nagy, dkrupp, gamesh411, Charusso, martong, ASDenysPetrov, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81407	2020-06-22 11:15:35 +02:00
Djordje Todorovic	792786e34d	[CSInfo][MIPS] Don't describe parameters loaded by sub/super reg copy When describing parameter value loaded by a COPY instruction, consider case where needed Reg value is a sub- or super- register of the COPY instruction's destination register. Without this patch, compile process will crash with the assertion "TargetInstrInfo::describeLoadedValue can't describe super- or sub-regs for copy instructions". Patch by Nikola Tesic Differential revision: https://reviews.llvm.org/D82000	2020-06-22 10:49:02 +02:00
David Spickett	028571d608	[clang][Driver] Correct tool search path priority Summary: As seen in: https://bugs.llvm.org/show_bug.cgi?id=45693 When clang looks for a tool it has a set of possible names for it, in priority order. Previously it would look for these names in the program path. Then look for all the names in the PATH. This means that aarch64-none-elf-gcc on the PATH would lose to gcc in the program path. (which was /usr/bin in the bug's case) This changes that logic to search each name in both possible locations, then move to the next name. Which is more what you would expect to happen when using a non default triple. (-B prefixes maybe should follow this logic too, but are not changed in this patch) Subscribers: kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D79988	2020-06-22 09:41:13 +01:00
Stephan Herhut	4bcd08eb1c	[mlir] Add for loop specialization Summary: We already had a parallel loop specialization pass that is used to enable unrolling and consecutive vectorization by rewriting loops whose bound is defined as a min of a constant and a dynamic value into a loop with static bound (the constant) and the minimum as bound, wrapped into a conditional to dispatch between the two. This adds the same rewriting for for loops. Differential Revision: https://reviews.llvm.org/D82189	2020-06-22 10:14:17 +02:00
Vassil Vassilev	46ea465b5b	Return false if the identifier is not in the global module index. This allows clients to use the idiom: if (GlobalIndex->lookupIdentifier(Name, FoundModules)) { // work on the FoundModules } This is also a minor performance improvent for clang. Differential Revision: https://reviews.llvm.org/D81077	2020-06-22 08:03:11 +00:00
Serguei Katkov	29b2c1ca72	[Peeling] Extend the scope of peeling a bit Currently we allow peeling of the loops if there is a exiting latch block and all other exits are blocks ending with deopt. Actually we want that exit would end up with deopt unconditionally but it is not required that exit itself ends with deopt. Reviewers: reames, ashlykov, fhahn, apilipenko, fedor.sergeev Reviewed By: apilipenko Subscribers: hiraditya, zzheng, dantrushin, llvm-commits Differential Revision: https://reviews.llvm.org/D81140	2020-06-22 12:17:44 +07:00
sameeran joshi	fa5d416efd	[flang]Fix individual tests with lit when building out of tree Summary: Fix individual check tests with lit when building out-of-tree `ninja check-flang-<folder>` was not working. The CMakeLists.txt was looking for the lit tests in the source directory instead of the build directory. This commit extends @CarolineConcatto previous patch[D81002] Reviewers: DavidTruby, sscalpone, tskeith, CarolineConcatto, jdoerfert Reviewed By: DavidTruby Subscribers: flang-commits, llvm-commits, CarolineConcatto Tags: #flang, #llvm Differential Revision: https://reviews.llvm.org/D82120	2020-06-22 10:30:33 +05:30
Craig Topper	d3c79d1953	[X86] Add an AVX check prefix to bitcast-vector-bool.ll to combine checks where AVX1/2/512 are all the same. NFC	2020-06-21 20:30:17 -07:00
Craig Topper	59d48eadd0	[X86] Add test file that was supposed to go with D81327. Must have forgotten to git add the file.	2020-06-21 20:23:44 -07:00
Michael Liao	20a1700293	[amdgpu] Fix REL32 relocations with negative offsets. Summary: - The offset should be treated as a signed one. Reviewers: rampitec, arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82234	2020-06-21 23:09:03 -04:00
Jez Ng	3646ee503d	[lld-macho] Refactor segment/section creation, sorting, and merging Summary: There were a few issues with the previous setup: 1. The section sorting comparator used a declarative map of section names to determine the correct order, but it turns out we need to match on more than just names -- in particular, an upcoming diff will sort based on whether the S_ZERO_FILL flag is set. This diff changes the sorter to a more imperative but flexible form. 2. We were sorting OutputSections stored in a MapVector, which left the MapVector in an inconsistent state -- the wrong keys map to the wrong values! In practice, we weren't doing key lookups (only container iteration) after the sort, so this was fine, but it was still a dubious state of affairs. This diff copies the OutputSections to a vector before sorting them. 3. We were adding unneeded OutputSections to OutputSegments and then filtering them out later, which meant that we had to remember whether an OutputSegment was in a pre- or post-filtered state. This diff only adds the sections to the segments if they are needed. In addition to those major changes, two minor ones worth noting: 1. I renamed all OutputSection variable names to `osec`, to parallel `isec`. Previously we were using some inconsistent combination of `osec`, `os`, and `section`. 2. I added a check (and a test) for InputSections with names that clashed with those of our synthetic OutputSections. Reviewers: #lld-macho Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81887	2020-06-21 17:13:59 -07:00
Craig Topper	90406d62e5	[X86] Add cooperlake and tigerlake to the enum in cpu_model.c I forgot to do this when I added then to _cpu_indicator_init.	2020-06-21 16:20:26 -07:00
Craig Topper	1d4c87335d	[X86] Assign a feature priority to 'tigerlake' so it won't assert when used with function multiversioning Also test cooperlake since it was also just added to function multiversioning when it was enabled for __builtin_cpu_is.	2020-06-21 13:24:58 -07:00
Sanjay Patel	6bdd531af5	[VectorCombine] create class for pass to hold analyses, etc; NFC This doesn't change anything currently, but it would make sense to create a class-level IRBuilder instead of recreating that everywhere. As we expand to more optimizations, we will probably also want to hold things like the DataLayout or other constant refs in here too.	2020-06-21 16:07:33 -04:00
Craig Topper	42c176c328	[X86] Add 'cooperlake' and 'tigerlake' to __builtin_cpu_is. Cooperlake can be detect by compiler-rt now, but not libgcc yet. Tigerlake can't be detected by either. Both names are accepted by gcc. Hopefully the detection code will be in place soon.	2020-06-21 13:03:18 -07:00
Craig Topper	0e6c9316d4	[X86] Add cooperlake detection to _cpu_indicator_init. libgcc has this enum encoding defined for a while, but their detection code is missing. I've raised a bug with them so that should get fixed soon.	2020-06-21 13:02:33 -07:00
Nathan James	db90d31570	[clang-tidy] Implement storeOptions for checks missing it. Just adds the storeOptions for Checks that weren't already storing their options. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D82223	2020-06-21 19:01:11 +01:00
Luboš Luňák	448bbc512f	fix clang/PCH/delayed-pch-instantiate test -target must match between PCH creation and use.	2020-06-21 19:00:42 +02:00
David Green	67121d7b82	[CGP] Enable CodeGenPrepares phi type convertion.	2020-06-21 16:46:16 +01:00
Florian Hahn	40569db7b3	[DSE,MSSA] Move reachability check to main loop. As we traverse the CFG backwards, we could end up reaching unreachable blocks. For unreachable blocks, we won't have computed post order numbers and because DomAccess is reachable, unreachable blocks cannot be on any path from it. This fixes a crash with unreachable blocks.	2020-06-21 16:38:10 +01:00
Luboš Luňák	a45f713c67	add option to instantiate templates already in the PCH Add -fpch-instantiate-templates which makes template instantiations be performed already in the PCH instead of it being done in every single file that uses the PCH (but every single file will still do it as well in order to handle its own instantiations). I can see 20-30% build time saved with the few tests I've tried. The change may reorder compiler output and also generated code, but should be generally safe and produce functionally identical code. There are some rare cases that do not compile with it, such as test/PCH/pch-instantiate-templates-forward-decl.cpp. If template instantiation bailed out instead of reporting the error, these instantiations could even be postponed, which would make them work. Enable this by default for clang-cl. MSVC creates PCHs by compiling them using an empty .cpp file, which means templates are instantiated while building the PCH and so the .h needs to be self-contained, making test/PCH/pch-instantiate-templates-forward-decl.cpp to fail with MSVC anyway. So the option being enabled for clang-cl matches this. Differential Revision: https://reviews.llvm.org/D69585	2020-06-21 17:05:52 +02:00
David Green	730ecb63ec	[CGP] Convert phi types If a collection of interconnected phi nodes is only ever loaded, stored or bitcast then we can convert the whole set to the bitcast type, potentially helping to reduce the number of register moves needed as the phi's are passed across basic block boundaries. This has to be done in CodegenPrepare as it naturally straddles basic blocks. The alorithm just looks from phi nodes, looking at uses and operands for a collection of nodes that all together are bitcast between float and integer types. We record visited phi nodes to not have to process them more than once. The whole subgraph is then replaced with a new type. Loads and Stores are bitcast to the correct type, which should then be folded into the load/store, changing it's type. This comes up in the biquad testcase due to the way MVE needs to keep values in integer registers. I have also seen it come up from aarch64 partner example code, where a complicated set of sroa/inlining produced integer phis, where float would have been a better choice. I also added undef and extract element handling which increased the potency in some cases. This adds it with an option that defaults to off, and disabled for 32bit X86 due to potential issues around canonicalizing NaNs. Differential Revision: https://reviews.llvm.org/D81827	2020-06-21 15:54:17 +01:00
David Green	0ee21cdb63	[CGP][AArch64] Convert Phi type tests. NFC	2020-06-21 15:35:52 +01:00
Nikita Popov	37d3030711	[ValueTracking, BasicAA] Don't simplify instructions GetUnderlyingObject() (and by required symmetry DecomposeGEPExpression()) will call SimplifyInstruction() on the passed value if other checks fail. This simplification is very expensive, but has little effect in practice. This patch removes the SimplifyInstruction call(), and replaces it with a check for single-argument phis (which can occur in canonical IR in LCSSA form), which is the only useful simplification case I was able to identify. At O3 the geomean CTMark improvement is -1.7%. The largest improvement is SPASS with ThinLTO at -6%. In test-suite, I see only two tests with a hash difference and no code size difference (PAQ8p, Ptrdist), which indicates that the simplification only ends up being useful very rarely. (I would have liked to figure out which simplification is responsible here, but wasn't able to spot it looking at transformation logs.) The AMDGPU test case that is update was using two selects with undef condition, in which case GetUnderlyingObject will return the first select operand as the underlying object. This will of course not happen with non-undef conditions, so this was not testing anything realistic. Additionally this illustrates potential unsoundness: While GetUnderlyingObject will pick the first operand, the select might be later replaced by the second operand, resulting in inconsistent assumptions about the undef value. Differential Revision: https://reviews.llvm.org/D82261	2020-06-21 16:31:07 +02:00
Bruno Ricci	5342dd6bf4	Revert "Add --hot-func-list to llvm-profdata show for sample profiles" This reverts commit 7348b951fe74f306970f6ac567fe5dddbb1c42d4. It is causing Asan failures.	2020-06-21 14:33:08 +01:00
Sanjay Patel	2ad42c2653	[ValueTracking] improve analysis for fdiv with same operands (The 'nnan' variant of this pattern is already tested to produce '1.0'.) https://alive2.llvm.org/ce/z/D4hPBy define i1 @src(float %x, i32 %y) { %0: %d = fdiv float %x, %x %uge = fcmp uge float %d, 0.000000 ret i1 %uge } => define i1 @tgt(float %x, i32 %y) { %0: ret i1 1 } Transformation seems to be correct!	2020-06-21 09:07:59 -04:00
Sanjay Patel	97c0232621	[InstSimplify] add test for fdiv signbit; NFC	2020-06-21 09:07:59 -04:00
Bruno Ricci	cddc9993ea	[clang][test][NFC] Also test for serialization in AST dump tests, part 3/n. The outputs between the direct ast-dump test and the ast-dump test after deserialization should match modulo a few differences. For hand-written tests, strip the "<undeserialized declarations>"s and the "imported"s with sed. For tests generated with "make-ast-dump-check.sh", regenerate the output. Part 3/n.	2020-06-21 13:59:11 +01:00
Bruno Ricci	ecbf2f5f3d	[clang][test][NFC] Also test for serialization in AST dump tests, part 2/n. The outputs between the direct ast-dump test and the ast-dump test after deserialization should match modulo a few differences. For hand-written tests, strip the "<undeserialized declarations>"s and the "imported"s with sed. For tests generated with "make-ast-dump-check.sh", regenerate the output. Part 2/n.	2020-06-21 13:59:11 +01:00
Bruno Ricci	e560280cd5	[clang][NFC] Regenerate test/AST/ast-dump-lambda.cpp with --match-full-lines.	2020-06-21 13:59:11 +01:00
Bruno Ricci	0dbeffddd1	[clang][utils] Minor tweak to make-ast-dump-check.sh Remove the space after the "CHECK:" on each line. This space makes the use of FileCheck --match-full-lines impossible.	2020-06-21 13:59:10 +01:00
Bruno Ricci	e7ce052820	[clang][Serialization] Fix the serialization of ConstantExpr. The serialization of ConstantExpr has currently a number of problems: - Some fields are just not serialized (ConstantExprBits.APValueKind and ConstantExprBits.IsImmediateInvocation). - ASTStmtReader::VisitConstantExpr forgets to add the trailing APValue to the list of objects to be destroyed when the APValue needs cleanup. While we are at it, bring the serialization of ConstantExpr more in-line with what is done with the other expressions by doing the following NFCs: - Get rid of ConstantExpr::DefaultInit. It is better to not initialize the fields of an empty ConstantExpr since this will allow msan to detect if a field was not deserialized. - Move the initialization of the fields of ConstantExpr to the constructor; ConstantExpr::Create allocates the memory and ConstantExpr::ConstantExpr is responsible for the initialization. Review after commit since this is a straightforward mechanical fix similar to the other serialization fixes.	2020-06-21 13:59:10 +01:00
Bruno Ricci	ef3adbfc70	[clang][NFC] Fix typos/wording in the comments of ConstantExpr. It is "trailing objects" and "tail-allocated storage".	2020-06-21 13:59:10 +01:00
Nikita Popov	93a0f0e4fe	[LangRef] Fix sphinx warnings	2020-06-21 13:51:07 +02:00

... 2 3 4 5 6 ...

358139 Commits