llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2025-02-26 05:18:46 +00:00

Author	SHA1	Message	Date
Matt Arsenault	4844bf0fe2	AMDGPU: Apply i16 add->sub pattern with zext to i32 This was only applying the deeper nested zext pattern, and missing the special case code size fold.	2020-01-07 16:36:31 -05:00
Mitchell Balan	73d93617d3	[clang-tidy] modernize-use-using uses AST and now supports struct defintions and multiple types in a typedef Summary: It now handles `typedef`s that include comma-separated multiple types, and handles embedded struct definitions, which previously could not be automatically converted. For example, with this patch `modernize-use-using` now can convert: typedef struct { int a; } R_t, R_p; to: using R_t = struct { int a; }; using R_p = R_t; `-ast-dump` showed that the `CXXRecordDecl` definitions and multiple `TypedefDecl`s come consecutively in the tree, so `check()` stores information between calls to determine when it is receiving a second or additional `TypedefDecl` within a single `typedef`, or when the current `TypedefDecl` refers to an embedded `CXXRecordDecl` like a `struct`. Reviewers: alexfh, aaron.ballman Patch by: poelmanc Subscribers: riccibruno, sammccall, cfe-commits, aaron.ballman Tags: clang-tools-extra, clang Differential Revision: https://reviews.llvm.org/D70270	2020-01-07 16:36:11 -05:00
Eric Fiselier	0c5102bd93	[libc++] Add additional benchmark functions to libcxx/benchmarks/string.bench This change adds the following benchmarks: - StringAssignStr Assign a const basic::string& value - StringAssignAsciiz Assign a const char* asciiz value StringAssignAsciizMix Assign mixed long/short const char* asciiz values - StringResizeDefaultInit Resize default init benchmark Patch by Martijn Vels (mvels@google.com) Reviewed as D72343	2020-01-07 16:31:40 -05:00
Craig Topper	9685cf709f	[X86] Enable v2i64->v2f32 uint_to_fp code in ReplaceNodeResults on SSE4.1 target Now that we generate decent code for (v2i64 (setlt zero, X)) on pre-sse4.2 targets I think we can use this now. Differential Revision: https://reviews.llvm.org/D72354	2020-01-07 13:25:29 -08:00
Jonas Devlieghere	512b2c7dc7	[lldb/Test] Remove old binary created by TestConvenienceVariables On a dirty build directory the new mkdir fails because the file already exists and is not a directory.	2020-01-07 13:21:47 -08:00
Jonas Devlieghere	4c935c66c4	[lldb/Test] Make TestConvenienceVariables more strict This test was passing even when the output of lldb.target was empty. I've made the test more strict by checking explicitly for the target name and by using CHECK-NEXT lines.	2020-01-07 13:06:13 -08:00
Daniel Sanders	39c05703a6	[gicombiner] Correct 64f1bb5cd2c to account for MSVC's %p format	2020-01-07 12:50:05 -08:00
Bill Wendling	c5877ec9b4	Remove extraneous semicolon.	2020-01-07 12:49:09 -08:00
Sanjay Patel	6d52edebc9	[x86] add tests for extract-of-concat; NFC	2020-01-07 15:48:54 -05:00
Jonas Devlieghere	fe27e8ee80	Revert "[lldb/Lua] Add string conversion operator for SBTarget." This reverts commit 640d0ba8760051afc002c672121c6989517fc94e.	2020-01-07 12:47:00 -08:00
Christopher Tetreault	76ee0de00c	[cmake] Use source-groups in Polly. Configure CMake to setup source-groups for Polly. Source groups describe how source files should be organized in IDEs. By default, all headers are dumped into one folder under PollyCore and all source files into another. On disk, these files are organized into folders, but this isn't reflected in the IDE. This change uses CMake source groups to have the IDE reflect the on disk layout. This will make it easier to visualize the project structure for users of Visual Studio and XCode Patch by Christopher Tetreault <ctetreau@quicinc.com> Reviewed By: Meinersbur, grosser Differential Revision: https://reviews.llvm.org/D72117	2020-01-07 14:20:06 -06:00
Matt Arsenault	449ab10509	AMDGPU: Add baseline test for missing pattern The optimization to turn an add into a sub isn't triggering when the pattern to use the zeroed high bits is used.	2020-01-07 15:10:08 -05:00
Matt Arsenault	c3a10faadc	AMDGPU: Remove VOP3Mods0Clamp0OMod Now that overridable default operands work, there's no reason to use complex patterns to just produce 0s.	2020-01-07 15:10:08 -05:00
Matt Arsenault	de46ab698b	AMDGPU: Fix misleading, misplaced end block comments	2020-01-07 15:10:08 -05:00
Matt Arsenault	bd8d696c14	AMDGPU: Use ImmLeaf	2020-01-07 15:10:07 -05:00
Matt Arsenault	68e70fb098	AMDGPU: Fix not using v_cvt_f16_[iu]16 We weren't treating i16->f16 casts as legal on targets with these instructions, and always using a pair of casts through i32.	2020-01-07 15:10:07 -05:00
Jonas Devlieghere	640d0ba876	[lldb/Lua] Add string conversion operator for SBTarget. Extend the SBTarget class with a string conversion operator and reuse the same code between Python and Lua. This should happen for all the SB classes, but I'm doing just this one as an example and for use in a test case.	2020-01-07 12:01:56 -08:00
Michael Kruse	b6503901f2	[cmake] Use relative cmake binary dir for processing pass plugins. https://reviews.llvm.org/D61446 introduced a new function to process pass plugins that used CMAKE_BINARY_DIR. This is problematic when LLVM is a subproject. Instead use LLVM_BINARY_DIR to get the right relative directory for cmake. Patch by Alan Baker <alanbaker@google.com> Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D72109	2020-01-07 20:42:35 +01:00
Fangrui Song	8edf759ca7	[PowerPC][Triple] Use elfv2 on freebsd>=13 and linux-musl Summary: Every powerpc64le platform uses elfv2. For powerpc64, the environments "elfv1" and "elfv2" were added for FreeBSD ELFv1->ELFv2 migration in D61950. FreeBSD developers have decided to use OS versions to select ABI, and no one is relying on the environments. Also use elfv2 on powerpc64-linux-musl. Users can always use -mabi=elfv1 and -mabi=elfv2 to override the default ABI. Reviewed By: adalava Differential Revision: https://reviews.llvm.org/D72352	2020-01-07 11:40:56 -08:00
Fangrui Song	fb6e80da44	[test] Move ppc64 tests from test/Preprocessor/init.c to init-ppc64.c	2020-01-07 11:32:52 -08:00
Alexey Bataev	4558842891	[OPENMP]Reduce calls for the mangled names. Use canonical decls instead of mangled names in the set of already emitted decls. This allows to reduce the number of function calls for getting declarations mangled names and speedup the compilation.	2020-01-07 14:28:17 -05:00
Jessica Paquette	acd2580824	[MachineOutliner][AArch64] Save + restore LR in noreturn functions Conservatively always save + restore LR in noreturn functions. These functions do not end in a RET, and so they aren't guaranteed to have an instruction which uses LR in any way. So, as a result, you can end up in unfortunate situations where you can't backtrace out of these functions in a debugger. Remove the old noreturn test, and add a new one which is more descriptive. Remove the restriction that we can't outline from noreturn functions as well since we now do the right thing.	2020-01-07 11:27:25 -08:00
Craig Topper	afa8211e97	[X86] Improve lowering of (v2i64 (setgt X, -1)) on pre-SSE2 targets. Enable v2i64 in foldVectorXorShiftIntoCmp. Similar to D72302 but for the canonical form for the opposite case. I've changed foldVectorXorShiftIntoCmp to form a target independent setcc node instead of PCMPGT now and enabled its for v2i64 on pre-SSE4.2 targets. The setcc should eventually get lowered to PCMPGT or the new v2i64 sequence. Differential Revision: https://reviews.llvm.org/D72318	2020-01-07 11:22:04 -08:00
Craig Topper	b9376690a0	[X86] Improve lowering of v2i64 sign bit tests on pre-sse4.2 targets Without sse4.2 a v2i64 setlt needs to expand into a pcmpgtd, pcmpeqd, 3 shuffles, and 2 logic ops. But if we're only interested in the sign bit of the i64 elements, we can just use one pcmpgtd and shuffle the odd elements to the even elements. Differential Revision: https://reviews.llvm.org/D72302	2020-01-07 11:22:03 -08:00
LLVM GN Syncbot	2f1e5d934e	[gn build] Port 1d94fb21118	2020-01-07 19:13:41 +00:00
Daniel Sanders	1d94fb2111	[gicombiner] Add GIMatchTree and use it for the code generation Summary: GIMatchTree's job is to build a decision tree by zipping all the GIMatchDag's together. Each DAG is added to the tree builder as a leaf and partitioners are used to subdivide each node until there are no more partitioners to apply. At this point, the code generator is responsible for testing any untested predicates and following any unvisited traversals (there shouldn't be any of the latter as the getVRegDef partitioner handles them all). Note that the leaves don't always fit into partitions cleanly and the partitions may overlap as a result. This is resolved by cloning the leaf into every partition it belongs to. One example of this is a rule that can match one of N opcodes. The leaf for this rule would end up in N partitions when processed by the opcode partitioner. A similar example is the getVRegDef partitioner where having rules (add $a, $b), and (add ($a, $b), $c) will result in the former being in the partition for successfully following the vreg-def and failing to do so as it doesn't care which happens. Depends on D69151 Fixed the issues with the windows bots which were caused by stdout/stderr interleaving. Reviewers: bogner, volkan Reviewed By: volkan Subscribers: lkail, mgorny, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69152	2020-01-07 11:12:53 -08:00
Alexandre Ganea	75eacbf1a9	Fix issues reported by -Wrange-loop-analysis when building with latest Clang (trunk). NFC. Fixes warning: loop variable 'E' of type 'const llvm::StringRef' creates a copy from type 'const llvm::StringRef' [-Wrange-loop-analysis]	2020-01-07 13:58:26 -05:00
Alexey Bataev	c972f6fd79	[OPENMP]Allow using of members in standalone declaration pragmas. If standalone OpenMP declaration pragma, like declare mapper or declare reduction, is declared in the class context, it may reference a member (data or function) in its internal expressions/statements. So, the parsing of such pragmas must be dalayed just like the parsing of the member initializers/definitions before the completion of the class declaration.	2020-01-07 13:44:10 -05:00
Nathan Ridge	16f47cf607	[clangd] Heuristically resolve dependent call through smart pointer type Summary: Fixes https://github.com/clangd/clangd/issues/227 Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D71644	2020-01-07 12:52:03 -05:00
Nathan Ridge	751d4dae32	[clangd] Assert that the testcases in LocateSymbol.All have no diagnostics Summary: Also fix some bugs in the testcases which this exposed. Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D72066	2020-01-07 12:13:32 -05:00
Pavel Labath	57835d0198	[lldb] Initialize some bitfields in FuncUnwinders.cpp This got flagged by msan.	2020-01-07 18:01:59 +01:00
Simon Pilgrim	55de6fc0b6	[ARM] Regenerate bfi.ll test cases	2020-01-07 16:51:11 +00:00
Simon Pilgrim	0e912e22b6	[X86] Pull out repeated SrcVT.getVectorNumElements() call. NFCI.	2020-01-07 16:51:10 +00:00
Gabor Horvath	46ac6a4dcd	[analyzer] Update help text to reflect sarif support Differential Revision: https://reviews.llvm.org/D72289	2020-01-07 08:37:49 -08:00
Gabor Horvath	247a603254	[LifetimeAnalysis] Do not forbid void deref type in gsl::Pointer/gsl::Owner annotations It turns out it is useful to be able to define the deref type as void. In case we have a type erased owner, we want to express that the pointee can be basically any type. It should not be unnatural to have a void deref type as we already familiar with "pointers to void". Differential Revision: https://reviews.llvm.org/D72097	2020-01-07 08:32:40 -08:00
diggerlin	a3832f33d9	[AIX][XCOFF]Implement mergeable const SUMMARY: In this patch, we map mergeable const objects to the read-only section in the same manner as const objects that are not mergeable. Reviewers: hubert.reinterpretcast,jasonliu Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D71551	2020-01-07 11:20:51 -05:00
Yaxun (Sam) Liu	9f2d8b5c0c	[HIP] Add option --gpu-max-threads-per-block=n Add this option to change the default launch bounds. Differential Revision: https://reviews.llvm.org/D71221	2020-01-07 11:18:00 -05:00
Sjoerd Meijer	ee811808a9	[ARM][MVE] Renamed VPT Block tests and files to something more informative. NFC	2020-01-07 16:16:54 +00:00
Matt Arsenault	78b30a54c9	AMDGPU/GlobalISel: Fix readfirstlane pattern import The imm folding optimization pattern failed to import. The instruction pattern was already working, but failing to fail on SGPR inputs.	2020-01-07 11:07:08 -05:00
Med Ismail Bennani	9daa44c993	Remove extraneous spaces Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2020-01-07 16:46:04 +01:00
Sanjay Patel	f8962571f7	[InstCombine] try to pull 'not' of select into compare operands not (select ?, (cmp TPred, ?, ?), (cmp FPred, ?, ?) --> select ?, (cmp TPred', ?, ?), (cmp FPred', ?, ?) If both sides of the select are cmps, we can remove an instruction. The case where only side is a cmp is deferred to a possible follow-on patch. We have a more general 'isFreeToInvert' analysis, but I'm not seeing a way to use that more widely without inducing infinite looping (opposing transforms). Here, we flip the compare predicates directly, so we should not have any danger by creating extra intermediate 'not' ops. Alive proofs: https://rise4fun.com/Alive/jKa Name: both select values are compares - invert predicates %tcmp = icmp sle i32 %x, %y %fcmp = icmp ugt i32 %z, %w %sel = select i1 %cond, i1 %tcmp, i1 %fcmp %not = xor i1 %sel, true => %tcmp_not = icmp sgt i32 %x, %y %fcmp_not = icmp ule i32 %z, %w %not = select i1 %cond, i1 %tcmp_not, i1 %fcmp_not Name: false val is compare - invert/not %fcmp = icmp ugt i32 %z, %w %sel = select i1 %cond, i1 %tcmp, i1 %fcmp %not = xor i1 %sel, true => %tcmp_not = xor i1 %tcmp, -1 %fcmp_not = icmp ule i32 %z, %w %not = select i1 %cond, i1 %tcmp_not, i1 %fcmp_not Differential Revision: https://reviews.llvm.org/D72007	2020-01-07 10:44:23 -05:00
Matt Arsenault	e699c03c9b	AMDGPU/GlobalISel: Fix import of s_abs_i32 pattern	2020-01-07 10:32:07 -05:00
Matt Arsenault	9150d6bd73	AMDGPU/GlobalISel: Select llvm.amdgcn.wqm.vote	2020-01-07 10:15:29 -05:00
Tim Northover	e130eef588	OpaquePtr: print byval types containing anonymous types correctly. Attribute::getAsString doesn't have enough information to print anonymous Module-level types correctly, so they come back as "%type 0xabcd". This results in broken IR when printing as text. Instead, print type-attributes (currently just byval) using the TypePrinting infrastructure available in AsmWriter. This only applies to function argument attributes.	2020-01-07 15:11:43 +00:00
Matt Arsenault	f26ed6e47c	llc: Change behavior of -mcpu with existing attribute Don't overwrite existing target-cpu attributes. I've often found the replacement behavior annoying, and this is inconsistent with how the fast math command line flags interact with the function attributes. Does not yet change target-features, since I think that should behave as a concatenation.	2020-01-07 10:10:25 -05:00
Matt Arsenault	a428386d4a	AMDGPU/GlobalISel: Partially fix llvm.amdgcn.kill pattern import Tests deferred since the existing DAG test depends on some other operations, but isn't far from working as-is.	2020-01-07 10:09:59 -05:00
Hans Wennborg	e334a3a60f	[docs] NFC: Fix typos in documents "the the" -> "the" "an" -> "a" Patch by Kazuaki Ishizaki <ishizaki@jp.ibm.com>! Differential revision: https://reviews.llvm.org/D72091	2020-01-07 16:06:14 +01:00
Sam Parker	3c7f740f28	[TypePromotion] Use SetVectors instead of PtrSets Remove the chance of non-deterministic insertion of zexts of the sources by using a SetVector instead of SmallPtrSet. Do the same for sinks for consistency and to negate the small issue from possibly happening. The SafeWrap instructions are now also stored in a SmallVector. The IRPromoter members of these structures have been changed to references. Differential Revision: https://reviews.llvm.org/D72322	2020-01-07 14:51:54 +00:00
Sanjay Patel	58e2e92a57	[DAGCombiner] reduce shuffle of concat of same vector This is possibly a small part towards solving PR42024: https://bugs.llvm.org/show_bug.cgi?id=42024 The vectorizer is creating shuffles of concat like this: %63 = shufflevector <4 x i64> %x, <4 x i64> undef, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 0, i32 1, i32 2, i32 3> %64 = shufflevector <8 x i64> %63, <8 x i64> undef, <8 x i32> <i32 0, i32 4, i32 1, i32 5, i32 2, i32 6, i32 3, i32 7> That might be fixable in the vectorizers, but we're not allowed to fold that into a single shuffle in instcombine, so we should have a backend backstop to convert that into the likely simpler form: %64 = shufflevector <4 x i64> %x, <4 x i64> undef, <8 x i32> <i32 0, i32 0, i32 1, i32 1, i32 2, i32 2, i32 3, i32 3> Differential Revision: https://reviews.llvm.org/D72300	2020-01-07 09:48:59 -05:00
Alexey Bataev	3f2e3dc44b	[OPENMP]Do not diagnose references to non-integral types for ref in declare simd. According to the standard, a list-item that appears in a linear clause without the ref modifier must be of integral or pointer type, or must be a reference to an integral or pointer type. Added check that this restriction is applied only to non-ref items.	2020-01-07 09:28:50 -05:00

... 5 6 7 8 9 ...

339056 Commits