llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-11-25 06:40:18 +00:00

Author	SHA1	Message	Date
Mehdi Amini	a9386bb0f9	Fix MLIR lit test configuration after cmake Python detection change `07f1047f41` changed the CMake detection to use find_package(Python3 ... but didn't update the lit configuration to use the expected Python3_EXECUTABLE cmake variable to point to the interpreter path. This resulted in an empty path on MacOS.	2020-11-13 04:44:45 +00:00
Philip Reames	d4e81cd9dd	[Tests][LoopVect] Exercise basic uniform memory operand logic	2020-11-12 20:34:31 -08:00
Shilei Tian	24d0ef0f50	[OpenMP] Fixed a bug when displaying affinity Currently the affinity format string has initial value. When users set the format via OMP_AFFINITY_FORMAT, it will overwrite the format string. However, when copying the format, the tailing null is missing. As a result, if the user format string is shorter than default value, the remaining part in the default value still makes effort. This bug is not exposed because the test case doesn't check the end of a string. It only checks whether given output "contains" the check string. Reviewed By: AndreyChurbanov Differential Revision: https://reviews.llvm.org/D91309	2020-11-12 22:27:32 -05:00
Michael Liao	8920ef06a1	[hip] Remove the coercion on aggregate kernel arguments. - If an aggregate argument is indirectly accessed within kernels, direct passing results in unpromotable `alloca`, which degrade performance significantly. InferAddrSpace pass is enhanced in [D91121](https://reviews.llvm.org/D91121) to take the assumption that generic pointers loaded from the constant memory could be regarded global ones. The need for the coercion on aggregate arguments is mitigated. Differential Revision: https://reviews.llvm.org/D89980	2020-11-12 21:19:30 -05:00
Michael Kruse	243511a24e	[Polly] Fix memory leak.	2020-11-12 20:04:17 -06:00
Sanjay Patel	0abde4bc92	[InstCombine] fold sub of low-bit masked value from offset of same value There might be some demanded/known bits way to generalize this, but I'm not seeing it right now. This came up as a regression when I was looking at a different demanded bits improvement. https://rise4fun.com/Alive/5fl Name: general Pre: ((-1 << countTrailingZeros(C1)) & C2) == 0 %a1 = add i8 %x, C1 %a2 = and i8 %x, C2 %r = sub i8 %a1, %a2 => %r = and i8 %a1, ~C2 Name: test 1 %a1 = add i8 %x, 192 %a2 = and i8 %x, 10 %r = sub i8 %a1, %a2 => %r = and i8 %a1, -11 Name: test 2 %a1 = add i8 %x, -108 %a2 = and i8 %x, 3 %r = sub i8 %a1, %a2 => %r = and i8 %a1, -4	2020-11-12 20:10:28 -05:00
Sanjay Patel	87e006bedd	[InstCombine] add tests for sub with masked bits; NFC	2020-11-12 20:10:28 -05:00
Rahul Joshi	5883c4b470	[MLIR] Fix standard -> LLVM conversion to fail for unsupported memref element type. - Move isSupportedMemRefType() to ConvertToLLVMPatterns and check if the memref element type is supported there. Differential Revision: https://reviews.llvm.org/D91374	2020-11-12 17:06:05 -08:00
peter klausler	c2bccd66f6	[flang] Document DO CONCURRENT's problems (NFC) Differential Revision: https://reviews.llvm.org/D86556	2020-11-12 15:30:43 -08:00
Jonas Devlieghere	406ad18748	[lldb/DataFormatters] Display null C++ pointers as nullptr Display null pointer as `nullptr`, `nil` and `NULL` for C++, Objective-C/Objective-C++ and C respectively. The original motivation for this patch was to display a null std::string pointer as nullptr instead of "", but the fix seemed generic enough to be done for all summary providers. Differential revision: https://reviews.llvm.org/D77153	2020-11-12 15:24:06 -08:00
Stanislav Mekhanoshin	5ab1702129	[AMDGPU] Remove scratch rsrc from spill pseudos Differential Revision: https://reviews.llvm.org/D91110	2020-11-12 15:23:37 -08:00
Nico Weber	fa9f41330d	[gn build] (manually) port `410626c9b5`	2020-11-12 18:21:22 -05:00
Sean Silva	796880288a	[mlir] Make tensor_to_memref op docs match reality The previous code defined it as allocating a new memref for its result. However, this is not how it is treated by the dialect conversion framework, that does the equivalent of inserting and folding it away internally (even independent of any canonicalization patterns that we have defined). The semantics as they were previously written were also very constraining: Nontrivial analysis is needed to prove that the new allocation isn't needed for correctness (e.g. to avoid aliasing). By removing those semantics, we avoid losing that information. Differential Revision: https://reviews.llvm.org/D91382	2020-11-12 14:56:10 -08:00
Sean Silva	faa66b1b2c	[mlir] Bufferize tensor constant ops We lower them to a std.global_memref (uniqued by constant value) + a std.get_global_memref to produce the corresponding memref value. This allows removing Linalg's somewhat hacky lowering of tensor constants, now that std properly supports this. Differential Revision: https://reviews.llvm.org/D91306	2020-11-12 14:56:10 -08:00
Sean Silva	ad2f9f6745	[mlir] Fix subtensor_insert bufferization. It was incorrect in the presence of a tensor argument with multiple uses. The bufferization of subtensor_insert was writing into a converted memref operand, but there is no guarantee that the converted memref for that operand is safe to write into. In this case, the same converted memref is written to in-place by the subtensor_insert bufferization, violating the tensor-level semantics. I left some comments in a TODO about ways forward on this. I will be working actively on this problem in the coming days. Differential Revision: https://reviews.llvm.org/D91371	2020-11-12 14:56:09 -08:00
Jessica Paquette	d0ba6c4002	[AArch64][GlobalISel] Select CSINC and CSINV for G_SELECT with constants Select the following: - G_SELECT cc, 0, 1 -> CSINC zreg, zreg, cc - G_SELECT cc 0, -1 -> CSINV zreg, zreg cc - G_SELECT cc, 1, f -> CSINC f, zreg, inv_cc - G_SELECT cc, -1, f -> CSINV f, zreg, inv_cc - G_SELECT cc, t, 1 -> CSINC t, zreg, cc - G_SELECT cc, t, -1 -> CSINC t, zreg, cc (IR example: https://godbolt.org/z/YfPna9) These correspond to a bunch of the AArch64csel patterns in AArch64InstrInfo.td. Unfortunately, it doesn't seem like we can import patterns that use NZCV like those ones do. E.g. ``` def : Pat<(AArch64csel GPR32:$tval, (i32 1), (i32 imm:$cc), NZCV), (CSINCWr GPR32:$tval, WZR, (i32 imm:$cc))>; ``` So we have to manually select these for now. This replaces `selectSelectOpc` with an `emitSelect` function, which performs these optimizations. Differential Revision: https://reviews.llvm.org/D90701	2020-11-12 14:44:01 -08:00
Kazushi (Jam) Marukawa	410626c9b5	[VE] Support vld intrinsics Add intrinsics for vector load instructions. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91332	2020-11-13 07:34:42 +09:00
Sanjay Patel	9e0c35655b	[LoopVectorize] regenerate test checks; NFC	2020-11-12 17:15:46 -05:00
shafik	bae9aedb34	[LLDB] Fix handling of bit-fields in a union When parsing DWARF and laying out bit-fields we don't properly take into account when they are in a union, they will all have a zero offset. Differential Revision: https://reviews.llvm.org/D91118	2020-11-12 14:09:27 -08:00
Sanjay Patel	d5e89e8fc1	[PhaseOrdering] regenerate test checks; NFC	2020-11-12 17:02:14 -05:00
Sanjay Patel	96f4aa6765	[InstCombine] add tests for low-mask-of-add; NFC	2020-11-12 17:02:14 -05:00
Amy Huang	0c80b542d3	Some updates/fixes to the creduce script. This was motivated by changes to llvm's `not --crash` disabling symbolization but I ended up removing `not` from the script entirely because it returns differently depending on whether clang "crashes" or exits for some other reason. The script had to choose between calling `not` and `not --crash` and sometimes it was wrong. The script also now disables symbolization when we don't read the stack trace because symbolizing is kind of slow. Differential Revision: https://reviews.llvm.org/D91372	2020-11-12 13:40:26 -08:00
Stanislav Mekhanoshin	cf6565f6d0	[AMDGPU] Enable multi-dword flat scratch load/stores Differential Revision: https://reviews.llvm.org/D91384	2020-11-12 13:38:56 -08:00
Stella Laurenzo	4726a402a3	[mlir][Python] Fix 'unreferenced local variable' warning on MSVC. Differential Revision: https://reviews.llvm.org/D91282	2020-11-12 13:34:57 -08:00
Florian Hahn	c1f6f30040	[PatternMatch] Add single index InsertValue matcher. This patch adds a new matcher for single index InsertValue instructions, similar to the existing matcher for ExtractValue. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D91352	2020-11-12 21:27:18 +00:00
Alexey Bataev	07b568a9c8	[OPENMP]Fix PR47790: segfault in frontend while parsing Objective-C with OpenMP. Need to check if the sema is actually finishing a function decl. Differential Revision: https://reviews.llvm.org/D91376	2020-11-12 13:26:11 -08:00
Kazushi (Jam) Marukawa	9c504ec06d	[VE] Disable -fsigaddr option for VE VE needs to support integrated assembler and "nas". This "nas" doesn't recognize ".sigaddr" pseudo mnemonics, so need to disable it. This patch disable it on VE by default. Also add a regression test for that. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91350	2020-11-13 06:22:12 +09:00
peter klausler	04a147987a	[flang] Include source information in an invalid file-unit-number message An io-unit that is an internal-file-variable is syntactically identical to a file-unit-number expression that is a variable reference. An ambiguous unit is initially parsed as an internal-file-variable. If semantic analysis determines that the unit is not of character type, it is rewritten as an internal-file-variable. This modification must retain source coordinate information. Differential revision: https://reviews.llvm.org/D91375	2020-11-12 13:12:17 -08:00
Joe Pletcher	f897e82bfd	[fuzzer] Add Windows Visual C++ exception intercept Adds a new option, `handle_winexcept` to try to intercept uncaught Visual C++ exceptions on Windows. On Linux, such exceptions are handled implicitly by `std::terminate()` raising `SIBABRT`. This option brings the Windows behavior in line with Linux. Unfortunately this exception code is intentionally undocumented, however has remained stable for the last decade. More information can be found here: https://devblogs.microsoft.com/oldnewthing/20100730-00/?p=13273 Reviewed By: morehouse, metzman Differential Revision: https://reviews.llvm.org/D89755	2020-11-12 13:11:14 -08:00
peter klausler	6c516cda39	[flang] Recognize END FILE as ENDFILE in free form source The ENDFILE statement may be spelled as two words. Differential revision: https://reviews.llvm.org/D91377	2020-11-12 12:46:04 -08:00
Arthur Eubanks	3a7b57b7ca	[NFC][NewPM] Reuse PassBuilder callbacks with -O0 This removes lots of duplicated code which was necessary before https://reviews.llvm.org/D89158. Now we can use PassBuilder::runRegisteredEPCallbacks(). This is mostly sanitizers. There is likely more that can be done to simplify, but let's start with this. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D90870	2020-11-12 12:42:59 -08:00
Alexey Bataev	3c6b457bee	[OPENMP]Fix PR48076: Check map types array before accessing its front. Need to check if there are map types for the components before trying to access them when trying to modify type mappings for combined partial mappings. Differential Revision: https://reviews.llvm.org/D91370	2020-11-12 12:00:29 -08:00
Jay Foad	6881a82e8c	[AMDGPU] Fix scheduling of exp pos4 Also fix a similar issue in SIInsertWaitcnts, but I don't think that fix has any effect in practice. Differential Revision: https://reviews.llvm.org/D91290	2020-11-12 19:57:14 +00:00
Jay Foad	d7d6ac5624	[AMDGPU] Define and use names for export targets. NFC. Differential Revision: https://reviews.llvm.org/D91289	2020-11-12 19:57:14 +00:00
Jianzhou Zhao	2d96859ea6	[msan] Break the getShadow loop after matching an argument Reviewed-by: eugenis Differential Revision: https://reviews.llvm.org/D91320	2020-11-12 19:48:59 +00:00
Zbigniew Sarbinowski	aa8a5b800d	[SystemZ][ZOS] libcxx - no posix memalign The unavailability of posix_memalign on z/OS forces us to define _LIBCPP_HAS_NO_LIBRARY_ALIGNED_ALLOCATION'. The use of posix_memalign is being used in libcxx/src/new.cpp. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D90178	2020-11-12 14:47:58 -05:00
Nikita Popov	c00545dc32	[BasicAA] Remove checks for GEP decomposition limit reached The GEP aliasing code currently checks for the GEP decomposition limit being reached (i.e., we did not reach the "final" underlying object). As far as I can see, these checks are not necessary. It is perfectly fine to work with a GEP whose base can still be further decomposed. Looking back through the commit history, these checks were originally introduced in `1a444489e9`. However, I believe that the problem this was intended to address was later properly fixed with `1726fc698c`, and the checks are no longer necessary since then (and were not the right fix in the first place). Differential Revision: https://reviews.llvm.org/D91010	2020-11-12 20:43:38 +01:00
Anatoliy Tomilov	e7c7a19826	[Frontend] Treat .cuh files as CUDA source files to synchronize with tools/clang-format/git-clang-format tra: Keeping them in sync does have a minor benefit of not raising a question why the two maps are different. Differential Revision: https://reviews.llvm.org/D91034	2020-11-12 11:42:53 -08:00
peter klausler	3ce5f95716	fix clang build	2020-11-12 11:38:20 -08:00
Louis Dionne	557e268425	[libc++] NFC: Remove symbol from ABI list changelog that was never added The `posix_memalign@GLIBC_2.2.5` symbol can't have been added by r284206, because it doesn't show up in the corresponding ABI list. It's also not defined in libc++, so that wouldn't make sense. It must have made it into that comment by mistake.	2020-11-12 14:35:56 -05:00
Rahul Joshi	dea24b422c	[NFC] Switch printFunctionLikeOp and parseFunctionLikeOp to only support "inline" visibility. - Remove the default valued arguments from these functions. - Besides FuncOp, looks like no other in-tree op is using these functions. Differential Revision: https://reviews.llvm.org/D91369	2020-11-12 11:29:01 -08:00
Nico Weber	9a8fad2a89	Revert "[gn build] (semi-manually) port 173b51169b8" This reverts commit `37a1336de7`. `173b51169b` was reverted in `777ca48`.	2020-11-12 13:54:24 -05:00
Louis Dionne	997d41cdec	[libc++] Instantiate additional <iostream> members in the dylib This commit adds new explicit instantiations for some classes in <iostream> in the library. This is done after noticing that many programs that use streams end up containing weak definitions of these classes, which has a negative impact on both code size and load times (due to the need to resolve weak symbols at load time). Note that we are just adding the additional explicit instantiations for the `char` specializations, since the `wchar_t` specializations are not used as often, and as a result there wouldn't be a clear benefit. This change is not an ABI break, since we are just adding additional symbols. Differential Revision: https://reviews.llvm.org/D90677	2020-11-12 13:52:47 -05:00
Louis Dionne	777ca48c9f	Revert "[SystemZ][ZOS] Porting the time functions within libc++ to z/OS" This reverts commit `173b51169b`. That commit was applied incorrectly, and undid previous changes. That was clearly not intended.	2020-11-12 13:36:18 -05:00
Craig Topper	4cdf1d2110	[MSP430] Remove unused MVT::Glue output from MSP430ISD::SELECT_CC nodes. Follow up from a similar patch on RISCV `637f19c36b` Nothing reads this Glue value that I could see. The SDNode def in the td file does not have the SDNPOutGlue flag so I don't think this glue would get properly propagated to MachineSDNodes if it was used.	2020-11-12 10:34:01 -08:00
peter klausler	8df28f0aa3	[flang] Implement runtime support for basic ALLOCATE/DEALLOCATE Add error reporting infrastructure and support for ALLOCATE and DEALLOCATE statements of intrinsic types without SOURCE= or MOLD=. Differential revision: https://reviews.llvm.org/D91215	2020-11-12 10:21:40 -08:00
Nathan James	06db8f984f	[clang-tidy] Merge options inplace instead of copying Changed `ClangTidyOptions::mergeWith` to operate on the instance instead of returning a copy. The old mergeWith method has been renamed to merge and marked as nodiscard, to aid in disambiguating which one is which. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D91184	2020-11-12 18:19:12 +00:00
Aaron Ballman	b336826c1d	Non-implicit attribute creation requires a source range; NFC There are two factory functions used to create a semantic attribute, Create() and CreateImplicit(). CreateImplicit() does not need to specify the source range of the attribute since it's an implicitly- generated attribute. The same logic does not apply to Create(), so this removes the default argument from those declarations to avoid accidentally creating a semantic attribute without source location information.	2020-11-12 10:06:30 -08:00
Fangrui Song	8df4e60945	[ELF] Don't consider SHF_ALLOC ".debug" sections debug sections Fixes PR48071 The Rust compiler produces SHF_ALLOC `.debug_gdb_scripts` (which normally does not have the flag) * `.debug_gdb_scripts` sections are removed from `inputSections` due to --strip-debug/--strip-all * When processing --gc-sections, pieces of a SHF_MERGE section can be marked live separately `=>` segfault when marking liveness of a `.debug_gdb_scripts` which is not split into pieces (because it is not in `inputSections`) This patch circumvents the problem by not treating SHF_ALLOC ".debug*" as debug sections (to prevent --strip-debug's stripping) (which is still useful on its own). Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D91291	2020-11-12 09:59:43 -08:00
Kevin P. Neal	ac523d2de5	[FPEnv][Clang][Driver] Use MarshallingInfoFlag for -fexperimental-strict-floating-point As of D80952 we are disabling strict floating point on all hosts except those that are explicitly listed as supported. Use of strict floating point on other hosts requires use of the -fexperimental-strict-floating-point flag. This is to avoid bugs like "https://bugs.llvm.org/show_bug.cgi?id=45329" (which has an incorrect link in the previous review). In the review for D80952 I was asked to mark the -fexperimental option as a MarshallingInfoFlag. This patch does exactly that. Differential Revision: https://reviews.llvm.org/D88987	2020-11-12 12:51:35 -05:00

1 2 3 4 5 ...

371992 Commits