llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-11-23 13:50:11 +00:00

Author	SHA1	Message	Date
Sam Clegg	5b0e45c8ce	[lld][WebAssembly] Fix use of undefined funcs under --warn-unresolved-symbols (#78643 ) When undefined functions exist in the final link we need to create stub functions (otherwise direct calls to those functions could not be generated). We were creating those stub when `--unresolved-symbols=ignore-all` was passed but overlooked the fact that `--warn-unresolved-symbols` essentially has the same effect (i.e. undefined function can exist in the final link). Fixes: #53987	2024-01-19 09:32:22 -08:00
Felipe de Azevedo Piovezan	b6677835fe	[AsmPrinter][DebugNames] Implement DW_IDX_parent entries (#77457 ) This implements the ideas discussed in [1]. To summarize, this commit changes AsmPrinter so that it outputs DW_IDX_parent information for debug_name entries. It will enable debuggers to speed up queries for fully qualified types (based on a DWARFDeclContext) significantly, as debuggers will no longer need to parse the entire CU in order to inspect the parent chain of a DIE. Instead, a debugger can simply take the parent DIE offset from the accelerator table and peek at its name in the debug_info/debug_str sections. The implementation uses two types of DW_FORM for the DW_IDX_parent attribute: 1. DW_FORM_ref4, which points to the accelerator table entry for the parent. 2. DW_FORM_flag_present, when the entry has a parent that is not in the table (that is, the parent doesn't have a name, or isn't allowed to be in the table as per the DWARF spec). This is space-efficient, since it takes 0 bytes. The implementation works by: 1. Changing how abbreviations are encoded (so that they encode which form, if any, was used to encode IDX_Parent) 2. Creating an MCLabel per accelerator table entry, so that they may be referred by IDX_parent references. When all patches related to this are merged, we are able to show that evaluating an expression such as: ``` lldb --batch -o 'b CodeGenFunction::GenerateCode' -o run -o 'expr Fn' -- \ clang++ -c -g test.cpp -o /dev/null ``` is far faster: from ~5000 ms to ~1500ms. Building llvm-project + clang with and without this patch, and looking at its impact on object file size: ``` ls -la $(find build_stage2_Debug_idx_parent_assert_dwarf5 -name \.cpp.o) \| awk '{s+=$5} END {printf "%\047d\n", s}' 11,507,327,592 -la $(find build_stage2_Debug_no_idx_parent_assert_dwarf5 -name \.cpp.o) \| awk '{s+=$5} END {printf "%\047d\n", s}' 11,436,446,616 ``` That is, an increase of 0.62% in total object file size. Looking only at debug_names: ``` $stage1_build/bin/llvm-objdump --section-headers $(find build_stage2_Debug_idx_parent_assert_dwarf5 -name \.cpp.o) \| grep __debug_names \| awk '{s+="0x"$3} END {printf "%\047d\n", s}' 440,772,348 $stage1_build/bin/llvm-objdump --section-headers $(find build_stage2_Debug_no_idx_parent_assert_dwarf5 -name \.cpp.o) \| grep __debug_names \| awk '{s+="0x"$3} END {printf "%\047d\n", s}' 369,867,920 ``` That is an increase of 19%. DWARF Linkers need to be changed in order to support this. This commit already brings support to "base" linker, but it does not attempt to modify the parallel linker. Accelerator entries refer to the corresponding DIE offset, and this patch also requires the parent DIE offset -- it's not clear how the parallel linker can access this. It may be obvious to someone familiar with it, but it would be nice to get help from its authors. [1]: https://discourse.llvm.org/t/rfc-improve-dwarf-5-debug-names-type-lookup-parsing-speed/74151/	2024-01-19 09:19:09 -08:00
lntue	c80d68a676	[libc] Add float.h header. (#78737 )	2024-01-19 12:04:34 -05:00
Jordan Rupprecht	d0d0727104	[lldb][test] Apply @expectedFailureAll/@skipIf early for debug_info tests (#73067 ) The @expectedFailureAll and @skipIf decorators will mark the test case as xfail/skip if _all_ conditions passed in match, including debug_info. * If debug_info is not one of the matching conditions, we can immediately evaluate the check and decide if it should be decorated. * If debug_info is present as a match condition, we need to defer whether or not to decorate until when the `LLDBTestCaseFactory` metaclass expands the test case into its potential variants. This is still early enough that the standard `unittest` framework will recognize the test as xfail/skip by the time the test actually runs. TestDecorators exhibits the edge cases more thoroughly. With the exception of `@expectedFailureIf` (added by this commit), all those test cases pass prior to this commit. This is a followup to `212a60ec37`.	2024-01-19 10:50:05 -06:00
Joseph Huber	cebe4de66f	[libc] Fix test failing on GPU using deprecated 'add_unittest' Summary: We use `add_libc_test' now because it works for both hermetic and unit tests. If the test needs to be unit test only you use `UNIT_TEST_ONLY` as an argument.	2024-01-19 10:38:41 -06:00
Marius Brehler	205e15c176	[mlir][docs] Fix broken link	2024-01-19 17:38:27 +01:00
Sander de Smalen	5f41cef58f	[AArch64] NFC: Simplify discombobulating 'requiresSMChange' interface (#78703 ) Having it return a `std::optional<bool>` is unnecessarily confusing. This patch changes it to a simple 'bool'. This patch also removes the 'BodyOverridesInterface' operand because there is only a single use for this which is easily rewritten.	2024-01-19 16:15:38 +00:00
Sander de Smalen	40a631f452	[Clang] Refactor diagnostics for SME builtins. (#78258 ) The arm_sme.td file was still using `IsSharedZA` and `IsPreservesZA`, which should be changed to match the new state attributes added in #76971. This patch adds `IsInZA`, `IsOutZA` and `IsInOutZA` as the state for the Clang builtins and fixes up the code in SemaChecking and SveEmitter to match. Note that the code is written in such a way that it can be easily extended with ZT0 state (to follow in a future patch).	2024-01-19 16:02:24 +00:00
Jay Foad	e89a7c41ba	[AMDGPU] Update comment on SIInstrInfo::isLegalFLATOffset for GFX12	2024-01-19 15:53:06 +00:00
Jay Foad	1abf2570b3	[AMDGPU] Make use of CPol::SWZ_* in SelectionDAG. NFC. For GlobalISel this was already done in AMDGPUInstructionSelector::selectBufferLoadLds.	2024-01-19 15:48:45 +00:00
Kareem Ergawy	5dbb30d950	[MLIR][OpenMP] Better error reporting for unsupported `nowait` (#78551 ) Provides some context for failing to generate LLVM IR for `target enter\|exit\|update` directives when `nowait` is provided. This is directly helpful for flang users since they would get this error message if they tried to use `nowait`. Before that we had a very generic message. This is a follow-up to https://github.com/llvm/llvm-project/pull/78269, please only review the latest commit (the one with the same commit message as the PR title).	2024-01-19 16:47:24 +01:00
Jay Foad	e21b0b083e	[AMDGPU] Remove gws feature from GFX12 (#78711 ) This was already done for LLVM. This patch just updates the Clang builtin handling to match.	2024-01-19 15:45:53 +00:00
Jay Foad	97747467f1	[AMDGPU] Update hazard recognition for new GFX12 wait counters (#78722 ) In most cases the hazards no longer apply, so just assert that we are not on GFX12.	2024-01-19 15:30:41 +00:00
Jay Foad	89226ecbb9	[AMDGPU] Do not widen scalar loads on GFX12 (#78724 ) GFX12 has subword scalar loads so there is no need to do this.	2024-01-19 15:30:07 +00:00
Kiran Chandramohan	aac1d9710b	[Flang][OpenMP] Consider renames when processing reduction intrinsics (#70822 ) Fixes #68654 Depends on https://github.com/llvm/llvm-project/pull/70790	2024-01-19 15:17:21 +00:00
Vinayak Dev	497a8604b3	[FileCheck]: Fix diagnostics for NOT prefixes (#78412 ) Fixes #70221 Fix a bug in FileCheck that corrects the error message when multiple prefixes are provided through --check-prefixes and one of them is a PREFIX-NOT. Earlier, only the first of the provided prefixes was displayed as the erroneous prefix, while the actual error might be on the prefix that occurred at the end of the prefix list in the input file. Now, the right NOT prefix is shown in the error message.	2024-01-19 15:08:24 +00:00
Nikita Popov	9350860824	[AsmParser] Add support for reading incomplete IR (part 1) (#78421 ) Add an `-allow-incomplete-ir` flag to the IR parser, which allows reading IR with missing declarations. This is intended to produce a best-effort interpretation of the IR, along the same lines of what we would manually do when taking, for example, a function from `-print-after-all` output and fixing it up to be valid IR. This patch only supports dropping references to undeclared metadata, either by dropping metadata attachments from instructions/functions, or by dropping calls to certain intrinsics (like debug intrinsics). I will implement support for inserting missing function/global declarations in a followup patch. We don't have real use lists for metadata, so the approach here is to iterate over the whole IR and identify metadata that needs to be dropped. This does not support all possible cases, but should handle anything that's relevant for the function-only IR use case.	2024-01-19 16:08:16 +01:00
LLVM GN Syncbot	535b197b8e	[gn build] Port `9ff4be640f`	2024-01-19 14:42:19 +00:00
Jie Fu	0d51c8704c	[X86] Fix -Wsign-compare in X86MCInstLower.cpp (NFC) llvm-project/llvm/lib/Target/X86/X86MCInstLower.cpp:1867:20: error: comparison of integers of different signs: 'int' and 'unsigned int' [-Werror,-Wsign-compare] if (SclWidth == C->getType()->getScalarSizeInBits()) { ~~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1 error generated.	2024-01-19 22:38:47 +08:00
Jay Foad	ed12388082	[AMDGPU] Do not emit `V_DOT2C_F32_F16_e32` on GFX12 (#78709 ) That instruction is not supported on GFX12. Added a testcase which previously crashed without this change. Co-authored-by: pvanhout <pierre.vanhoutryve@amd.com>	2024-01-19 14:36:27 +00:00
Manish Kausik H	a0b9117454	LoopDeletion: Move EH pad check before the isLoopNeverExecuted Check (#78189 ) This commit modifies `LoopDeletion::deleteLoopIfDead` to check if the exit block of a loop is an EH pad before checking if the loop gets executed. This handles the case where an unreachable loop has a landingpad as an Exit block, and the loop gets deleted, leaving leaving the landingpad without an edge from an unwind clause. Fixes #76852.	2024-01-19 15:30:20 +01:00
Alexey Bataev	4d11f04b20	[InstCombine] Try to fold trunc(shuffle(zext)) to just a shuffle (#78636 ) Tries to remove extra trunc/ext instruction for shufflevector instructions.	2024-01-19 09:29:01 -05:00
Simon Pilgrim	a2a0089ac3	[X86] movsd/movss/movd/movq - add support for constant comments (#78601 ) If we're loading a constant value, print the constant (and the zero upper elements) instead of just the shuffle mask. This did require me to move the shuffle mask handling into addConstantComments as we can't handle this in the MC layer.	2024-01-19 14:21:26 +00:00
Nikita Popov	d54dfdd1b5	[Clang] Fix build with GCC 14 on ARM (#78704 ) GCC 14 defines `__arm_streaming` as a macro expanding to `[[arm::streaming]]`. Due to the nested macro use, this gets expanded prior to concatenation. It doesn't look like C++ has a really clean way to prevent macro expansion. The best I have found is to use `EMPTY ## X` where `EMPTY` is an empty macro argument, so this is the hack I'm implementing here. Fixes https://github.com/llvm/llvm-project/issues/78691.	2024-01-19 15:19:58 +01:00
Yi Wu	5a7f9a5a9c	[flang] use setsid to assign the child to prevent zombie as it will be clean up by init process (#77944 ) When using `setsid()` in a child process created by `fork()`, a new session is created, and the child becomes a session leader. If the parent process terminates before the child, the child becomes an orphan and is adopted by the `init` process. The `init` process will eventually clean up the child process once it exits. However, killing the parent does not automatically kill the child; the child will continue running until it exits. Proper cleanup involves waiting for the child process to exit using `wait()` or `waitpid()` in the parent process to avoid zombie processes, but this approach is not valid for `EXECUTE_COMMAND_LINE` with async mode. Fix: https://github.com/llvm/llvm-project/issues/77803	2024-01-19 14:18:57 +00:00
jeanPerier	eaa8def929	[flang] Expand parent component in procedure pointer component ref (#78593 ) For simplicity, lowering relies on semantics expansion of parent components in designators. This was not done in `call x%p()` where `p` is a procedure component pointer of a parent component of `x`. Do it and turn lowering TODO into a new lowering TODO for `call bar(x%type_bound_procedure)` (passing a tybe bound procedure is allowed as an extension, but lowering does not handle this extension yet. This is a lowering issue, will do in different patch).	2024-01-19 15:09:25 +01:00
Simon Pilgrim	836dcdb84a	[llvm-jitlink] Fix MSVC "not all control paths return a value" warning. NFC.	2024-01-19 14:04:10 +00:00
Simon Pilgrim	aac1a41c11	[llvm-exegesis] Fix MSVC "not all control paths return a value" warning. NFC.	2024-01-19 14:04:09 +00:00
Nikita Popov	0f20d5a8b1	[AsmParser] Deduplicate argument list parsing code (NFC) The handling for parsing the first argument and all later arguments was duplicated. Consolidate them into a single loop.	2024-01-19 15:03:12 +01:00
trevyn	d1a2f11feb	[builtins] Mark `int_lib.h` builtins as `static` (#69305 ) Mark the following symbols as `static` to prevent duplicate definitions: `__builtin_ctz` `__builtin_clz` `__builtin_clzll` `__builtin_sadd_overflow` >Without these then all of these functions show up in all object files which include int_lib.h on Windows. This'll help prevent duplicate symbols by ensuring they're not exported. See: https://github.com/rust-lang/compiler-builtins/issues/167 https://reviews.llvm.org/D34599	2024-01-19 06:02:00 -08:00
Alexey Lapshin	9ff4be640f	[DWARFLinker][NFC] Decrease DWARFLinker dependence on DwarfStreamer. (#77932 ) This patch is extracted from #74725. The DwarfStreamer interface looks overcomplicated and has unnecessary dependencies. This patch avoids creation of DwarfStreamer by DWARFLinker and simplifies interface.	2024-01-19 16:57:09 +03:00
Nikita Popov	6f371149c1	[AsmParser] Don't require value numbers to be consecutive (#78171 ) Currently, the IR parser requires that %n style numbered values are consecutive. This means that the IR becomes invalid as soon as you remove an instruction, argument or block. This makes it very annoying to modify IR without running it through instnamer first. I don't think there is any good reason to impose this requirement. This PR relaxes it to allow value IDs to be non-consecutive, but it still keeps the requirement that they're increasing (i.e. you can't skip a value number and then assign it later). This only implements support for skipping numbers for local values. We should extend this to global values in the future as well.	2024-01-19 14:55:31 +01:00
Jay Foad	ea9d75aa2a	[AMDGPU] Misc formatting fixes. NFC.	2024-01-19 13:50:26 +00:00
Jay Foad	7017efa1a1	Fix typo "widended"	2024-01-19 13:50:26 +00:00
Sander de Smalen	340054e561	[AArch64][SME] Remove combination of private-ZA and preserves_za. (#78563 ) The new Clang attributes no longer support the combination of having a private-ZA function that preserves ZA. The use of __arm_preserves("za") means that ZA is shared and preserved. There wasn't that much benefit to the special handling of this, because in practice it only meant that we'd avoid restoring the lazy-save afterwards, but it still needed setting up a lazy-save (with the possibility of using a 0-sized buffer). Perhaps a new attribute will be added in the future to support this case, at which point we can revert back some of the changes removed in this patch. But for now removing this code simplifies things.	2024-01-19 13:48:44 +00:00
Florian Hahn	42fb1fac9e	[VPlan] Use DebugLoc from recipe in VPWidenCallRecipe (NFCI). Instead of using the debug location of the underlying instruction, use the debug location from the recipe. This removes an unneeded dependency of the underlying instruction.	2024-01-19 13:33:03 +00:00
Sirraide	061eb62a90	[Clang] [NFC] Remove default argument in ASTUnit.h (#78566 ) This removes a default argument that is currently broken in C++23 mode due to `std::default_delete` now being `constexpr`. This is a known problem (see #74963, #59966, #69996, and a couple more), fixing which will probably take some time, so this at least makes it possible to compile `ASTUnit.h` in C++23 mode. Note that we can’t simply include the header that provides the definition of the class causing the problem either, as that would create a circular dependency.	2024-01-19 08:29:23 -05:00
Danila Malyutin	9ad7d8f0e4	[Statepoint] Optimize Location structure size (#78600 ) Reduce its size from 24 to 12 bytes. Improves memory consumption when dealing with statepoint-heavy code.	2024-01-19 17:15:36 +04:00
Sam McCall	1ab418beb3	[Tooling] Fix FixedCompilationDatabase with header compile flags (#73913 ) Summary: The logic to strip positional args feels very fragile, but it's terribly useful when you want to use a tool on a file and have the exact argv. Today doesn't work with header-parsing actions because these are "precompile" rather than "compile", from tooling's perspective it's all the same. Reviewers: kadircet Subscribers:	2024-01-19 14:02:04 +01:00
Graham Hunter	689da340ed	[NFC][LV] Test precommit for interleaved linear args	2024-01-19 12:59:09 +00:00
Daniil Dudkin	24e5229230	[clang-apply-replacements] Deduplicate Implementation of `collectReplacementsFromDirectory` (NFC) (#78630 ) * Convert `collectReplacementsFromDirectory` into a function template. * Employ explicit specialization to maintain implementation in the source file. * Utilize the function template in the source file to eliminate code duplication. * Update the documentation for the function.	2024-01-19 15:57:02 +03:00
Matthias Springer	b4f24be7ef	[mlir][bufferization] Simplify helper `potentiallyAliasesMemref` (#78690 ) This commit simplifies a helper function in the ownership-based buffer deallocation pass. Fixes a potential double-free (depending on the scheduling of patterns).	2024-01-19 13:22:02 +01:00
Jay Foad	80ccc72ec7	[AMDGPU] Remove GFX12 encoding hack (#78702 ) This is no longer needed now that we have implemented GFX12 encoding for all instructions.	2024-01-19 12:19:29 +00:00
David Spickett	955417ade2	Revert "[llvm][AArch64] Copy all operands when expanding BLR_BTI bundle (#78267 )" This reverts commit `228aecbcf1`. Failing expensive checks: https://lab.llvm.org/buildbot/#/builders/16/builds/59798	2024-01-19 12:06:30 +00:00
Jay Foad	879cbe06ed	[AMDGPU] Fix predicates for BUFFER_ATOMIC_CSUB pattern (#78701 ) Use OtherPredicates to avoid interfering with other uses of SubtargetPredicate for GFX12.	2024-01-19 12:01:31 +00:00
Florian Hahn	74f6ae9f24	[LTO] Require asserts for discard-value-names.ll test. The test requires asserts, as it depends on the default value for -lto-discard-value-names at the moment.	2024-01-19 12:00:16 +00:00
Stefan Gränitz	01ba627f43	[llvm-jitlink] Refactor GOT and stubs registration (NFC) (#78365 ) Add methods `registerGOTEntry()` and `registerStubEntry()` in `Session::FileInfo` to factor out generic code from the individual object type implementations.	2024-01-19 12:38:22 +01:00
Benjamin Chetioui	35121add2e	[mlir][NFC] Remove unused variable.	2024-01-19 11:32:19 +00:00
David Spickett	228aecbcf1	[llvm][AArch64] Copy all operands when expanding BLR_BTI bundle (#78267 ) Fixes #77915 Previously I based the operand copying on expandCALL_RVMARKER but did not understand it properly at the time. This lead to me dropping the arguments of the function being branched to. This fixes that by copying all operands from the BLR_BTI to the BL/BLR without skipping anything. I've updated the existing test by adding function arguments.	2024-01-19 11:22:43 +00:00
Han-Chung Wang	12b676de72	[mlir][vector] Drop innermost unit dims on transfer_write. (#78554 )	2024-01-19 03:15:13 -08:00

1 2 3 4 5 ...

486830 Commits