llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2025-03-03 08:02:41 +00:00

Author	SHA1	Message	Date
Daniel Bertalan	73b659ff55	[lld-macho] Fix left shift of negative value UB I introduced this mistake in 573c7e6b3c79c7ce80a2221e000fab7dd20c0bb4. Fixes the failure on this UBSan bot: https://lab.llvm.org/buildbot/#/builders/5/builds/25537	2022-07-01 12:00:16 +02:00
Dmitry Preobrazhensky	3a4d9b6a68	[AMDGPU][GFX908][DOC][NFC] Update assembler syntax description Summary of changes: - Remove dst for global_atomic_add_f32, global_atomic_pk_add_f16. - Make vdata input-only for buffer_atomic_add_f32, buffer_atomic_pk_add_f16. - Other minor improvements.	2022-07-01 12:46:45 +03:00
Simon Pilgrim	26708fa166	Revert rG057db2002bb3: [X86] combineAndnp - constant fold ANDNP(C,X) -> AND(~C,X) If the LHS op has a single use then using the more general AND op is likely to allow commutation, load folding, generic folds etc. Reverted due to reports from @alexfh about it causing an infinite loop (repro still pending).	2022-07-01 10:36:09 +01:00
Dmitry Preobrazhensky	36c9e9968a	[AMDGPU][GFX940][DOC][NFC] Update assembler syntax description Summary of changes: - Update SMEM syntax (see https://reviews.llvm.org/D127314). - Minor improvements.	2022-07-01 12:22:57 +03:00
Florian Hahn	8704cfc744	[LV] Add test case for #56319 . Test case for PR56319.	2022-07-01 10:09:24 +01:00
Nico Weber	12109a3b40	[gn build] (manually) port fe66aebd7551 (PseudoCLI)	2022-07-01 10:45:35 +02:00
Nico Weber	bacb56cdc5	[gn build] (manually) port cd2292ef824 (PseudoCXX) This target will be used in the next commit.	2022-07-01 10:45:35 +02:00
Christian Kandeler	ac511fd439	[clangd] Also mark output arguments of array subscript expressions ... with the "usedAsMutableReference" semantic token modifier. It's quite unusual to declare the index parameter of a subscript operator as a non-const reference type, but arguably that makes it even more helpful to be aware of it when working with such code. Reviewed By: nridge Differential Revision: https://reviews.llvm.org/D128892	2022-07-01 04:45:20 -04:00
Serge Pavlov	dc34d8df4c	Revert "[FPEnv] Allow CompoundStmt to keep FP options" On some buildbots test `ast-print-fp-pragmas.c` fails, need to investigate it. This reverts commit 0401fd12d4aa0553347fe34d666fb236d8719173. This reverts commit b822efc7404bf09ccfdc1ab7657475026966c3b2.	2022-07-01 15:42:39 +07:00
Valentin Clement	0dd4fb0408	[flang] Fix for broken/degenerate forall case Fix for broken/degenerate forall case where there is no assignment to an array under the explicit iteration space. While this is a multiple assignment, semantics only raises a warning. The fix is to add a test that the explicit space has any sort of array to be updated, and if not then the do_loop nest will not require a terminator to forward array values to the next iteration. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D128973 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2022-07-01 10:37:15 +02:00
Mikhail Goncharov	de3fb0f29e	[fix/build] bazel rule for ParallelCombiningOpInterface	2022-07-01 10:34:15 +02:00
Muhammad Omair Javaid	3d477bbeee	[LLDB] Xfail TestStepNoDebug.py AArch64/Windows LLDB fails to step in/out/over code with missing debug information. This is only reproducible on AArch64/Windows. I have reported an issue upstream at llvm.org/pr56292. This patch xfails TestStepNoDebug.py for AArch64/Windows.	2022-07-01 12:25:43 +04:00
Chen Zheng	6a9434bb9c	[SCEV] pre-commit test case for D127835, NFC	2022-07-01 04:25:00 -04:00
Serge Pavlov	0401fd12d4	Fix warning on unhandled enumeration value	2022-07-01 15:17:29 +07:00
Valentin Clement	df417c372f	[flang] Add correct number of args for wait Add source coordinates to BeginWait and BeginWaitAll calls This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D128970 Co-authored-by: V Donaldson <vdonaldson@nvidia.com>	2022-07-01 10:16:36 +02:00
Chen Zheng	758de0e931	[InstructionSimplify] handle denormal input for fcmp Handle denormal constant input for fcmp instructions based on the denormal handling mode. Reviewed By: spatel, dcandler Differential Revision: https://reviews.llvm.org/D128647	2022-07-01 03:51:28 -04:00
Daniel Bertalan	573c7e6b3c	[lld-macho] Handle LOH_ARM64_ADRP_LDR linker optimization hints This linker optimization hint transforms a pair of adrp+ldr (immediate) instructions into an ldr (literal) load from a PC-relative address if it is 4-byte aligned and within +/- 1 MiB, as ldr can encode a signed 19-bit offset that gets multiplied by 4. In the wild, only a small number of these hints are applicable because not many loads end up close enough to the data segment. However, the added helper functions will be useful in implementing the rest of the LOH types. Differential Revision: https://reviews.llvm.org/D128942	2022-07-01 09:44:24 +02:00
Serge Pavlov	b822efc740	[FPEnv] Allow CompoundStmt to keep FP options AST does not have special nodes for pragmas. Instead a pragma modifies some state variables of Sema, which in turn results in modified attributes of AST nodes. This technique applies to floating point operations as well. Every AST node that can depend on FP options keeps the current set of them. This technique works well for options like exception behavior or fast math options. They represent instructions to the compiler how to modify code generation for the affected nodes. However treatment of FP control modes has problems with this technique. Modifying FP control mode (like rounding direction) usually requires operations on hardware, like writing to control registers. It must be done prior to the first operation that depends on the control mode. In particular, such operations are required for implementation of `pragma STDC FENV_ROUND`, compiler should set up necessary rounding direction at the beginning of compound statement where the pragma occurs. As there is no representation for pragmas in AST, the code generation becomes a complicated task in this case. To solve this issue FP options are kept inside CompoundStmt. Unlike FP options in expressions, these do not affect any operation on FP values, but only inform the codegen about the FP options that act in the body of the statement. As all pragmas that modify FP environment may occur only at the start of compound statement or at global level, such solution works for all relevant pragmas. The options are kept as a difference from the options in the enclosing compound statement or default options, it helps codegen to set only changed control modes. Differential Revision: https://reviews.llvm.org/D123952	2022-07-01 14:32:33 +07:00
Nikita Popov	9ac386495d	[ConstExpr] Don't create insertvalue expressions In preparation for the removal in D128719, this stops creating insertvalue constant expressions (well, unless they are directly used in LLVM IR). Differential Revision: https://reviews.llvm.org/D128792	2022-07-01 09:23:28 +02:00
Nicolas Vasilache	b994d388ae	[mlir][SCF] Add a ParallelCombiningOpInterface to decouple scf::PerformConcurrently from its contained operations This allows purging references of scf.ForeachThreadOp and scf.PerformConcurrentlyOp from ParallelInsertSliceOp. This will allow moving the op closer to tensor::InsertSliceOp with which it should share much more code. In the future, the decoupling will also allow extending the type of ops that can be used in the parallel combinator as well as semantics related to multiple concurrent inserts to the same result. Differential Revision: https://reviews.llvm.org/D128857	2022-07-01 00:16:02 -07:00
Nicolas Vasilache	6a57d8fba5	[mlir][vector] Untangle TransferWriteDistribution and avoid crashing in the 0-D case. This revision avoids a crash in the 0-D case of distributing vector.transfer ops out of vector.warp_execute_on_lane_0. Due to the code complexity and lack of documentation, it took untangling the implementation before realizing that the simple fix was to fail in the 0-D case. The rewrite is still very useful to understand this code better. Differential Revision: https://reviews.llvm.org/D128793	2022-07-01 00:15:34 -07:00
Nikita Popov	9b994593cc	[SCCP] Only handle unknown lattice values in resolvedUndefsIn() This is a minor refinement of resolvedUndefsIn(), mostly for clarity. If the value of an instruction is undef, then that's already a legal final result -- we can safely rauw such an instruction with undef. We only need to mark unknown values as overdefined, as that's the result we get for an instruction that has not been processed because it has an undef operand. Differential Revision: https://reviews.llvm.org/D128251	2022-07-01 09:14:37 +02:00
Piotr Sobczak	4a78225212	[AMDGPU] Add WMMA clang builtins Add WMMA clang builtins and tests. Extra changes in code are needed to handle function overloads. WavefrontSize 32: __builtin_amdgcn_wmma_f32_16x16x16_f16_w32 __builtin_amdgcn_wmma_f32_16x16x16_bf16_w32 __builtin_amdgcn_wmma_f16_16x16x16_f16_w32 __builtin_amdgcn_wmma_bf16_16x16x16_bf16_w32 __builtin_amdgcn_wmma_i32_16x16x16_iu8_w32 __builtin_amdgcn_wmma_i32_16x16x16_iu4_w32 WavefrontSize 64: __builtin_amdgcn_wmma_f32_16x16x16_f16_w64 __builtin_amdgcn_wmma_f32_16x16x16_bf16_w64 __builtin_amdgcn_wmma_f16_16x16x16_f16_w64 __builtin_amdgcn_wmma_bf16_16x16x16_bf16_w64 __builtin_amdgcn_wmma_i32_16x16x16_iu8_w64 __builtin_amdgcn_wmma_i32_16x16x16_iu4_w64 Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D128952	2022-07-01 08:55:25 +02:00
Piotr Sobczak	b6ef36a1c4	[AMDGPU] Update WMMA intrinsics with explicit f16 types Update intrinsics to use n x f16 and n x i16 instead of 32-bit types. This may avoid the need for a bitcast and is probably less confusing. Depends on making v16f16 and v16i16 types legal. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D128951	2022-07-01 08:55:25 +02:00
Chen Zheng	92a3e1b5c9	[NFC] add --match-full-lines to the RUN line	2022-07-01 02:37:35 -04:00
Haojian Wu	fe66aebd75	[pseudo] Define a clangPseudoCLI library. - define a common data structure Language which is a compiled result of the bnf grammar. It is defined in Language.h; - creates a clangPseudoCLI lib which defines a grammar commandline flag and exposes a function to get the Language. It supports --grammar=cxx, --grammar=/path/to/file.bnf; - use the clangPseudoCLI in clang-pseudo, fuzzer, and benchmark tools (simplify the code and use the prebuilt cxx grammar); Split out from https://reviews.llvm.org/D127448. Differential Revision: https://reviews.llvm.org/D128679	2022-07-01 08:31:34 +02:00
Valentin Clement	39377d5227	[flang] Fix APFloat conversion cases This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D128935 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: Peter Steinfeld <psteinfeld@nvidia.com>	2022-07-01 08:29:54 +02:00
Chen Zheng	39fe49aa57	[Inline] don't add noalias metadata for unknown objects. The unidentified objects recognized in `getUnderlyingObjects` may still alias to the noalias parameter because `getUnderlyingObjects` may not check deep enough to get the underlying object because of `MaxLookup`. The real underlying object for the unidentified object may still be the noalias parameter. Originally Patched By: tingwang Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D127202	2022-07-01 02:16:55 -04:00
Pavel Labath	b6b65403b3	[lldb] Add tests which simulate the various std::string layouts Checking whether a formatter change does not break some of the supported string layouts is difficult because it requires tracking down and/or building different versions and build configurations of the library. The purpose of this patch is to avoid that by providing an in-tree simulation of the string class. It is a reduced version of the real string class, obtained by eliminating all non-trivial code, leaving just the internal data structures used by the data formatter. Different versions of the class can be simulated through preprocessor defines. The test (ab)uses the fact that our formatters kick in for any double-underscore sub-namespace of `std`, so it avoids colliding with the real string class by declaring the test class in the std::__lldb namespace. I do not consider this to be a replacement for the existing data formatter tests, as producing this kind of a test is not trivial, and it is easy to make a mistake in the process. However, it's also not realistic to expect that every person changing the data formatter will test it against all versions of the real class, so I think it can be useful as a first line of defence. Adding support for new layouts can become particularly unwieldy, but this complexity will also be reflected in the actual code, so if we find ourselves needing to support too many variants, we may need to start dropping support for old ones, or come up with a completely different strategy. Differential Revision: https://reviews.llvm.org/D124155	2022-07-01 08:08:22 +02:00
Pavel Labath	553558292e	[lldb/dyld-posix] Avoid reading the module list in inconsistent states New glibc versions (since 2.34 or including this <`ed3ce71f5c`> patch) trigger the rendezvous breakpoint after they have already added some modules to the list. This did not play well with our dynamic loader plugin which was doing a diff of the reported modules in the before (RT_ADD) and after (RT_CONSISTENT) states. Specifically, it caused us to miss some of the modules. While I think the old behavior makes more sense, I don't think that lldb is doing the right thing either, as the documentation states that we should not be expecting a consistent view in the RT_ADD (and RT_DELETE) states. Therefore, this patch changes the lldb algorithm to compare the module list against the previous consistent snapshot. This fixes the previous issue, and I believe it is more correct in general. It also reduces the number of times we are fetching the module info, which should speed up the debugging of processes with many shared libraries. The change in RefreshModules ensures we don't broadcast the loaded notification for the dynamic loader (ld.so) module more than once. Differential Revision: https://reviews.llvm.org/D128264	2022-07-01 08:08:22 +02:00
Emre Kultursay	c0702ac07b	[PATCH] [lldb-server] Skip shared regions for memory allocation Differential Revision: https://reviews.llvm.org/D128832	2022-07-01 13:45:42 +08:00
Hubert Tong	6bd53df9b6	[clang][NFC][tests] dr208.c optional signext handling Fixes llvm/llvm-project#56325.	2022-07-01 00:03:58 -04:00
jacquesguan	8f45c5862f	[mlir][Vector] Fold InsertStridedSliceOp of ExtractStridedSliceOp. This patch adds support for folding InsertStridedSliceOp(ExtractStridedSliceOp(dst), dst) to dst. Differential Revision: https://reviews.llvm.org/D128903	2022-07-01 11:43:35 +08:00
jacquesguan	91ab4d4231	[mlir][Vector] Fold InsertStridedSliceOp of two splat with the same input to splat. This patch folds InsertStridedSliceOp(SplatOp(X):src_type, SplatOp(X):dst_type) to SplatOp(X):dst_type. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D128891	2022-07-01 10:46:47 +08:00
Volodymyr Sapsai	2ceb9c347f	[ODRHash diagnostics] Move common code for calculating diag locations in `DiagnoseODRMismatch` into a lambda. NFC. Differential Revision: https://reviews.llvm.org/D128489	2022-06-30 19:39:22 -07:00
Fangrui Song	27abff670b	Remove unneeded cl::ZeroOrMore. NFC	2022-06-30 19:11:27 -07:00
Fangrui Song	62a4e6ab15	[mlir] Remove unneeded cl::ZeroOrMore for ListOption variables. NFC	2022-06-30 19:04:44 -07:00
Volodymyr Sapsai	15cb180dcb	[ODRHash diagnostics] Split `err_module_odr_violation_mismatch_decl_diff` into per-entity diagnostics. NFC. We'll need to add more cases for Objective-C entities and adding everything to `err_module_odr_violation_mismatch_decl_diff` makes it harder to work with over time. Differential Revision: https://reviews.llvm.org/D128488	2022-06-30 18:40:46 -07:00
wren romano	46394861a3	[mlir][tblgen] Improving error messages This differential improves two error conditions, by detecting them earlier and by providing better messages to help users understand what went wrong. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D128555	2022-06-30 18:25:53 -07:00
Xiang1 Zhang	72a23cef7e	[ISel] Match all bits when merge undefs for DAG combine Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D128570	2022-07-01 09:09:43 +08:00
Xiang1 Zhang	64f44a90ef	Revert "[ISel] Match all bits when merge undef(s) for DAG combine" This reverts commit 5fe5aa284efed1ee1492e1f266351b35f0a8bb69.	2022-07-01 08:59:04 +08:00
Xiang1 Zhang	5fe5aa284e	[ISel] Match all bits when merge undef(s) for DAG combine	2022-07-01 08:58:00 +08:00
Alexey Bataev	4be3fc35aa	[SLP][NFC]Clean up operands of the removed insertelements, NFC. Replace all operands of the insertelement instruction, replaced by shuffles, by poisons to avoid false-positive reports about incorrect function.	2022-06-30 17:51:43 -07:00
Craig Topper	eec5d04002	[X86] Pre-commit tests for D128769. NFC	2022-06-30 17:24:28 -07:00
Craig Topper	058d521ea4	[RISCV] Avoid repeated code in SelectAddrRegImm. NFC	2022-06-30 17:22:04 -07:00
Paul Walker	43f8a6b749	[SVE] Use CPY to zero active lanes of a floating point vector. Patterns exist for the integer case that are trivially expandable to cover 0.0f. Differential Revision: https://reviews.llvm.org/D128669	2022-07-01 00:59:00 +01:00
Paul Walker	2be4a7a209	[SVE] Extend "and(ipg,cmp(x,y))" patterns to cover the case when y is an immediate. Differential Revision: https://reviews.llvm.org/D128479	2022-07-01 00:56:22 +01:00
Alexander Yermolovich	e159abdb04	[BOLT][DWARF] Support mix mode DWARF Added support for mixing monolithic DWARF5 with legacy DWARF, and monolithic legacy and DWARF5 split dwarf. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D128232	2022-06-30 16:53:15 -07:00
Christopher Di Bella	ef0b20d8e8	[runtimes] adds llvm-libgcc to the list of runtimes to be sorted llvm-libgcc is not a part of `LLVM_ALL_RUNTIMES` because llvm-libgcc is incompatible with an explicit libunwind and compiler-rt. This meant that it was being filtered out and not built. Differential Revision: https://reviews.llvm.org/D128568	2022-06-30 23:50:24 +00:00
Fangrui Song	dd58735a1d	[MC][Mips] Support .reloc , BFD_RELOC_{NONE,16,32,64}, ... to match most other common architectures which already support BFD_RELOC_*. BFD_RELOC_NONE provides a generic way indicating a dependency between two sections and is useful for some instrumentations which encode symbol index information (e.g. `.cg_profile`).	2022-06-30 16:39:23 -07:00

428612 Commits