llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2025-02-14 06:29:38 +00:00

Author	SHA1	Message	Date
gysit	d629645fcd	[mlir][OpDSL] Add support for adding canonicalization patterns. Extend OpDSL with a `defines` method that can set the `hasCanonicalizer` flag for an OpDSL operation. If the flag is set via `defines(Canonicalizer)` the operation needs to implement the `getCanonicalizationPatterns` method. The revision specifies the flag for linalg.fill_tensor and adds an empty `FillTensorOp::getCanonicalizationPatterns` implementation. This revision is a preparation step to replace linalg.fill by its OpDSL counterpart linalg.fill_tensor. The two are only functionally equivalent if both specify the same canonicalization patterns. The revision is thus a prerequisite for the linalg.fill replacement. Depends On D120725 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D120726	2022-03-08 15:56:59 +00:00
gysit	8d7850705c	[mlir][linalg] Add returning rewrite method to fusion pattern (NFC). Enhance `LinalgTileAndFuseTensorOpsPattern` with an additional rewrite signature that returns the result of the rewrite. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D121212	2022-03-08 15:50:28 +00:00
gysit	f4ae02afe7	[mlir][linalg] Add a FillOpInterface. Add a FillOpInterface similar to the contraction and convolution op interfaces. The FillOpInterface is a preparation step to replace linalg.fill by its OpDSL version linalg.fill_tensor. The interface implements the `value()`, `output()`, and `result()` methods that by default are not available on linalg.fill_tensor. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D120725	2022-03-08 15:48:02 +00:00
Louis Dionne	95c0f2d115	[libc++] Remove workarounds for re-defining _LIBCPP_ASSERT in the test suite As a fly-by fix, enable the complexity-changing assertions in __debug_less only when the full debug mode is enabled, since debugging level 0 is usually understood to only contain basic assertions that do not change the complexity of algorithms. Differential Revision: https://reviews.llvm.org/D121129	2022-03-08 10:41:38 -05:00
Aaron Ballman	1c55f05c6a	Properly diagnose constant evaluation issues at TU scope We were not creating an evaluation context for the TU scope, so we never popped an evaluation context for it. Popping the evaluation context triggers a number of diagnostics, including warnings about immediate invocations that we were previously missing. Note: I think we have an additional issue that we should solve, but not as part of this patch. I don't think Clang is properly modeling static initialization as happening before constant expression evaluation. I think structure members members are zero initialized per http://eel.is/c++draft/basic.start.static#1, https://eel.is/c++draft/basic.start.static#2.sentence-2, and http://eel.is/c++draft/dcl.init#general-6.2 and the new test case actually should be accepted. However, it's also worth noting that other compilers behave the way this patch makes Clang behave: https://godbolt.org/z/T7noqhdPr	2022-03-08 10:19:15 -05:00
Timm Bäder	5b7941ad7c	[clang][driver] Fix float128 diagnostics with glibc >= 2.32 Fix checking for an unsupported stdlib++. Differential Revision: https://reviews.llvm.org/D121209	2022-03-08 15:49:01 +01:00
Sanjay Patel	4fcaa0632d	[InstCombine] add tests for fcmp (fpext X), C; NFC	2022-03-08 09:45:55 -05:00
Nathan Sidwell	21e16ab6b8	[clang][ABI] New C++20 module mangling scheme The existing module symbol mangling scheme turns out to be undemangleable. It is also desirable to switch to the strong-ownership model as the hoped-for C++17 compatibility turns out to be fragile, and we also now have a better way of controlling that. The issue is captured on the ABI list at: https://github.com/itanium-cxx-abi/cxx-abi/issues/134 A document describing the issues and new mangling is at: https://drive.google.com/file/d/1qQjqptzOFT_lfXH8L6-iD9nCRi34wjft/view This patch is the code-generation part. I have a demangler too, but that patch is based on some to-be-landed refactoring of the demangler. The old mangling is unceremoniously dropped. No backwards compatibility, no deprectated old-mangling flag. It was always labelled experimental. (Old and new manglings cannot be confused.) Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D118352	2022-03-08 06:21:50 -08:00
Jez Ng	ce2ae38124	[lld-macho] Deduplicate the `__objc_classrefs` section contents ld64 breaks down `__objc_classrefs` on a per-word level and deduplicates them. This greatly reduces the number of bind entries emitted (and therefore the amount of work `dyld` has to do at runtime). For chromium_framework, this change to LLD cuts the number of (non-lazy) binds from 912 to 190, getting us to parity with ld64 in this aspect. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D121053	2022-03-08 08:34:04 -05:00
Jez Ng	8ec1033933	[lld-macho] Deduplicate CFStrings during ICF `__cfstring` has embedded addends that foil ICF's hashing / equality checks. (We can ignore embedded addends when doing ICF because the same information gets recorded in our Reloc structs.) Therefore, in order to properly dedup CFStrings, we create a mutable copy of the CFString and zero out the embedded addends before performing any hashing / equality checks. (We did in fact have a partial implementation of CFString deduplication already. However, it only worked when the cstrings they point to are at identical offsets in their object files.) I anticipate this approach can be extended to other similar statically-allocated struct sections in the future. In addition, we previously treated all references with differing addends as unequal. This is not true when the references are to literals: different addends may point to the same literal in the output binary. In particular, `__cfstring` has such references to `__cstring`. I've adjusted ICF's `equalsConstant` logic accordingly, and I've added a few more tests to make sure the addend-comparison code path is adequately covered. Fixes https://github.com/llvm/llvm-project/issues/51281. Reviewed By: #lld-macho, Roger Differential Revision: https://reviews.llvm.org/D120137	2022-03-08 08:34:03 -05:00
Simon Pilgrim	d0aa77440c	[X86] convertIntLogicToFPLogic - pull out condcodes. NFCI.	2022-03-08 13:31:17 +00:00
Nikolas Klauser	7aef4a0cae	[libc++] Enable more clang-tidy checks and list potential candidates These are some checks that make sense in libc++ IMO. The checks after `#TODO: investigate these checks` are candidates, but they can't be enabled without some cleanup. Reviewed By: ldionne, #libc Spies: aheejin, libcxx-commits Differential Revision: https://reviews.llvm.org/D120925	2022-03-08 14:14:55 +01:00
Daniel Cederman	1c235c3754	[Sparc] Add tail call support This patch adds tail call support to the 32-bit Sparc backend. Two new instructions are defined, TAIL_CALL and TAIL_CALLri. They are encoded the same as CALL and BINDri, but are marked with isReturn so that the epilogue gets emitted. In contrast to CALL, TAIL_CALL is not marked with isCall. This makes it possible to use the leaf function optimization when the only call a function makes is a tail call. TAIL_CALL modifies the return address in %o7, so for leaf functions the value in %o7 needs to be restored after the call. For normal functions which uses the restore instruction this is not necessary. Reviewed By: koakuma Differential Revision: https://reviews.llvm.org/D51206	2022-03-08 13:50:54 +01:00
Kirill Bobyrev	d5106c8f97	[clangd] NFC: Cleanup IncludeCleaner API Make a further improvement to decrease verbosity of the API: ASTContext provides SourceManager access. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D119842	2022-03-08 13:43:25 +01:00
Marek Kurdej	f537a40916	[clang-format] Correctly detect `!` as TT_NonNullAssertion after `default`. Fixes https://github.com/llvm/llvm-project/issues/53153. Depends on D121132. Reviewed By: HazardyKnusperkeks, owenpan Differential Revision: https://reviews.llvm.org/D121136	2022-03-08 13:35:26 +01:00
Marek Kurdej	7a54fceb25	[clang-format] Handle C# 9 `init` accessor specifier. Before, the code: ``` int Value { get; } = 0; int Value { init; } = 0; ``` was formatted incoherently: ``` int Value { get; } = 0; int Value { init; } = 0; ``` because `init` was not recognised as an accessor specifier. Reviewed By: MyDeveloperDay, HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D121132	2022-03-08 13:33:36 +01:00
Christian Sigg	dfaadf6b12	Update more `parseSourceString()` call sites. Change to non-deprecated function template (see D121075). Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D121102	2022-03-08 13:25:54 +01:00
Lorenzo Chelini	f3e1dcc5df	[MLIR][Presburger] Silence warnings in GCC (NFC) Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D121205	2022-03-08 12:38:59 +01:00
Simon Pilgrim	f0e3972f08	[X86] Add add / mul identity select tests for D119654	2022-03-08 11:31:35 +00:00
Adrian Kuegel	75aca24d2f	[clang] Fix reference to file that was moved.	2022-03-08 12:26:02 +01:00
Florian Hahn	3836003e87	[AArch64] Add test for D120481 with multiple uses.	2022-03-08 11:11:03 +00:00
Krystian Kuzniarek	481f681867	[AST] Fix typo in assert messages Differential Revision: https://reviews.llvm.org/D120221	2022-03-08 11:06:50 +00:00
Matthias Springer	d1d79920f7	[mlir][bufferize][NFC] Better error messages if analysis fails Differential Revision: https://reviews.llvm.org/D121203	2022-03-08 19:45:00 +09:00
Luo, Yuanke	5494769e32	[X86] Update avx512vbmi2 intrinsic tests to avoid adds As noticed in D119654, by adding the masked intrinsics results together we can end up with the selects being canonicalized away from the intrinsic - this isn't what we want to test here so replace with a insertvalue chain into a aggregate instead to retain all the results. Differential Revision: https://reviews.llvm.org/D121196	2022-03-08 17:49:44 +08:00
Dmitry Vyukov	f831d6fc80	tsan: fix false positive during fd close FdClose is a subjet to the same atomicity problem as MemoryRangeFreed (memory state is not "monotoic" wrt race detection). So we need to lock the thread slot in FdClose the same way we do in MemoryRangeFreed. This fixes the modified stress.cpp test. Reviewed By: vitalybuka, melver Differential Revision: https://reviews.llvm.org/D121143	2022-03-08 10:40:56 +01:00
Muhammad Omair Javaid	de0b4f4b86	[LLDB] Skip TestIOHandlerProcessSTDIO.py for Arm/AArch64 Linux This patch disables TestIOHandlerProcessSTDIO.py for Arm/AArch64 Linux to silence random test failures on buildbots. IO handler tests are known to randomly fail on arm/aarch64 linux buildbots due to pexpect timeouts.	2022-03-08 14:15:41 +05:00
Javier Setoain	f2b89c7ae0	[mlir][Vector] Use create_mask in transfer mask materializations Currently, the transfer mask is materialized by generating the vector comparison: [offset + 0, .., offset + length - 1] < [dim, .., dim] A better alternative is to materialize the transfer mask by using the operation: `vector.create_mask (dim - offset)`, which will generate simpler code and compose better with scalable vectors. Differential Revision: https://reviews.llvm.org/D120487	2022-03-08 09:02:50 +00:00
Argyrios Kyrtzidis	f2b24905bf	[Sema] Mark the referenced destructor during transformation of a `CXXBindTemporaryExpr` Otherwise we will fail to generate the definition of a defaulted destructor, if the only reference was in a templated temporary. rdar://89366678 Differential Revision: https://reviews.llvm.org/D120426	2022-03-08 01:00:07 -08:00
Groverkss	802fc8cb53	[MLIR][Presburger] Move PresburgerLocalSpace::isEqual from protected to public This change is done to match the interface of PresburgerSpace, which has PresburgerSpace::isEqual as public.	2022-03-08 14:22:35 +05:30
Luo, Yuanke	1a423831a6	[X86] Update avx512vbmi2 intrinsic tests to avoid adds As noticed in D119654, by adding the masked intrinsics results together we can end up with the selects being canonicalized away from the intrinsic - this isn't what we want to test here so replace with a insertvalue chain into a aggregate instead to retain all the results. Differential Revision: https://reviews.llvm.org/D121188	2022-03-08 16:40:33 +08:00
River Riddle	11a7635b30	[mlir][NFC] Properly include PassDetail.h for FoldSubViewOpsPass	2022-03-08 00:08:39 -08:00
River Riddle	043f4e3b33	[mlir][NFC] Remove dead forward declaration of FuncOp	2022-03-08 00:08:39 -08:00
Adrian Kuegel	49a29e5c9f	[mlir][Bazel] Adjust regarding StandardToLLVM -> FuncToLLVM rename. Differential Revision: https://reviews.llvm.org/D121185	2022-03-08 09:07:36 +01:00
Petr Hosek	601022fd96	Revert "[CMake][compiler-rt] Make CRT separately buildable" This reverts commit ed28461e2716fda004f4f55654ef33bc8c78f203 since it broke the VE bot.	2022-03-07 23:59:29 -08:00
Arthur Eubanks	53e5e58670	[NewPM][Inliner] Make inlined calls to functions in same SCC as callee exponentially expensive Introduce a new attribute "function-inline-cost-multiplier" which multiplies the inline cost of a call site (or all calls to a callee) by the multiplier. When processing the list of calls created by inlining, check each call to see if the new call's callee is in the same SCC as the original callee. If so, set the "function-inline-cost-multiplier" attribute of the new call site to double the original call site's attribute value. This does not happen when the original call site is intra-SCC. This is an alternative to D120584, which marks the call sites as noinline. Hopefully fixes PR45253. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D121084	2022-03-07 23:51:09 -08:00
Lorenzo Chelini	b204ce0ca8	[MLIR][OpDSL] Silence warning (NFC)	2022-03-08 08:33:17 +01:00
Stanislav Mekhanoshin	9eabea3968	[AMDGPU] Set noclobber metadata on loads instead of cast to constant A load via pointer cast to constant will return true from pointsToConstantMemory which is not necessarily so. Fixes: SWDEV-326463 Differential Revision: https://reviews.llvm.org/D121172	2022-03-07 23:13:02 -08:00
Petr Hosek	ed28461e27	[CMake][compiler-rt] Make CRT separately buildable This is useful when building a complete toolchain to ensure that CRT is built after builtins but before the rest of the compiler-rt. Differential Revision: https://reviews.llvm.org/D120682	2022-03-07 23:05:20 -08:00
Zi Xuan Wu	3297758901	[CSKY] Enable TargetAsmStreamer/ELFStreamer and support asm directive of csky_attribute Add support of parsing .csky_attribute directive and emit related target attributes in .csky.attribute section. It does not emit attribute directive in assembly code, so only emit target attributes in ELF streamer. In ELF streamer, it handles the header EFlag and the csky_attribute section which contains some attribute items. The EFlag and attribute items are calculated from feature bits based on Subtarget.	2022-03-08 14:00:38 +08:00
Yuanfang Chen	eddd94c27d	Reland "[clang][debug] port clang-cl /JMC flag to ELF" This relands commit 731347431976509823e38329a96fcbc69fe98cd2. It failed on Windows/Mac because `-fjmc` is only checked for ELF targets. Check the flag unconditionally instead and issue a warning for non-ELF targets.	2022-03-07 21:55:41 -08:00
Valentin Clement	4730582d5d	[flang] Remove unused variable in Allocatable.cpp Fix for buildbot failure: https://lab.llvm.org/buildbot/#/builders/180/builds/3532	2022-03-08 06:51:23 +01:00
Johannes Doerfert	5b4acb20ff	[OpenMP][FIX] Ensure flag to disable de-globalization works properly If the user disables de-globalization we did not seed the AAHeapToShared and AAHeapToStack but we still could end up with them through in-flight lookups. With this patch we disable AAHeapToShared completely if the user disabled de-globalization. Heap-2-stack is still run though. Differential Revision: https://reviews.llvm.org/D121059	2022-03-07 23:43:05 -06:00
Johannes Doerfert	1660288b28	[OpenMP][CUDA] Use one event pool per device An event pool, similar to the stream pool, needs to be kept per device. For one, events are associated with cuda contexts which means we cannot destroy the former after the latter. Also, CUDA documentation states streams and events need to be associated with the same context, which we did not ensure at all. Differential Revision: https://reviews.llvm.org/D120142	2022-03-07 23:43:05 -06:00
Johannes Doerfert	10aa83ff74	[OpenMP] Allow to explicitly deinitialize device resources There are two problems this patch tries to address: 1) We currently free resources in a random order wrt. plugin and libomptarget destruction. This patch should ensure the CUDA plugin is less fragile if something during the deinitialization goes wrong. 2) We need to support (hard) pause runtime calls eventually. This patch allows us to free all associated resources, though we cannot reinitialize the device yet. Follow up patch will associate one event pool per device/context. Differential Revision: https://reviews.llvm.org/D120089	2022-03-07 23:43:04 -06:00
Johannes Doerfert	307bbd3c82	[OpenMP][NFCI] Use RAII lock guards in libomptarget where possible Differential Revision: https://reviews.llvm.org/D121060	2022-03-07 23:43:04 -06:00
Uday Bondhugula	da93537bf8	NFC. Improve isInnermostAffineForOp - drop unnecessary check Rewrite isInnermostAffineForOp utility to make it more direct/efficient. Drop unnecessary check. NFC. Differential Revision: https://reviews.llvm.org/D121170	2022-03-08 10:44:54 +05:30
Shao-Ce SUN	365c858a5d	[RISCV] Share PatFprFpr classes for F, D, and Zfh Inspired by D115469 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D121066	2022-03-08 13:02:04 +08:00
Lang Hames	36f0dfd07a	[ORC][JITLink] Fix MachO absolute symbol handling, add test case. This patch removes the unintended resolution of locally scoped absolute symbols (which was causing unexpected definition errors). It stops using the JITSymbolFlags::Absolute flag (it isn't set or used elsewhere, and causes mismatch-flags asserts), and adds JITSymbolFlags::Exported to default scoped absolute symbols. Finally, we now set the scope of absolute symbols correctly in MachOLinkGraphBuilder.	2022-03-07 20:54:55 -08:00
Christudasan Devadasan	0d849b8249	AMDGPU: Skip folding REG_SEQUENCE if found unknown regclasses for its users Use TII::getRegClass to return a valid regclass or a nullptr if the RC is unknown for a given OpIdx. This fixes a potential crash occurred while getting the RC from a variadic instruction. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D120813	2022-03-08 10:11:57 +05:30
Phoebe Wang	4de9a752d6	[X86] Add helper enum for ternary intrinsics Reviewed By: RKSimon, LuoYuanke Differential Revision: https://reviews.llvm.org/D120307	2022-03-08 11:19:05 +08:00

1 2 3 4 5 ...

417238 Commits