llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-11-27 23:51:56 +00:00

Author	SHA1	Message	Date
Jason Molenda	394e52a0bb	[lldb] NFC reflow comments in WatchpointLocations Reading through this header, many of the comment line breaks are a mess after various code reformats.	2023-08-30 17:58:44 -07:00
Aart Bik	733edde005	[mlir][sparse] reorganized method order in sparse support lib header The header was split in memref/no-memref section, but various methods slipped in at the bottom at the wrong place Reviewed By: Peiming, yinying-lisa-li Differential Revision: https://reviews.llvm.org/D159218	2023-08-30 16:39:33 -07:00
Joseph Huber	ca10bc4f41	[libc] Implement the 'nanosleep' function on the GPU The GPU has the ability to sleep for very short periods of time. We can map this to the existing `nanosleep` utility. This patch maps the nanosleep utility to the existing hardware instructions as best as possible. Depends on D159118 Reviewed By: JonChesterfield, sivachandra Differential Revision: https://reviews.llvm.org/D159225	2023-08-30 18:34:59 -05:00
Reid Kleckner	cacdf3b8c4	[VE] Remove CodeGen deps from MCTargetDesc, NFC See issue #64166 for more information about the layering issue.	2023-08-30 16:26:44 -07:00
Reid Kleckner	52abac26ef	[bazel] Add missing dep from DwarfLinkerParallel to Object llvm-dwarfutil now builds successfully with Bazel.	2023-08-30 16:26:44 -07:00
Qiongsi Wu	a6e6abd76c	[AIX] Fix Link Issue when `-fprofile-update=[atomic|prefer-atomic]` is in Effect https://reviews.llvm.org/D157280 enabled `-fprofile-update` for `-fprofile-generate`, but omitted adding `-latomic` to the linker command on AIX. This omission causes linking to fail due to an undefined symbol. This patch fixes the link error. Reviewed By: w2yehia Differential Revision: https://reviews.llvm.org/D159137	2023-08-30 19:20:17 -04:00
Reid Kleckner	9f72e3f70f	[Lanai] Fix MC CodeGen layering, NFC See issue #64166 for more information about the layering issue.	2023-08-30 16:17:31 -07:00
Reid Kleckner	cda23c0732	[PPC] Fix layering issues between MCTargetDesc and CodeGen See issue #64166 for more information about the layering issue. The PPCMCTargetDesc library was including CodeGen headers such as PPCInstrInfo.h and calling inline functions in them. This doesn't work in the Bazel build, and is error-prone. If the inline function moves to a cpp file, it will result in linker errors. To address the issue, I moved several inline functions to PPCMCTargetDesc.cpp, and declared them in the PPC namespace in PPCMCTargetDesc.h, which seemed like the most straightforward fix. Differential Revision: https://reviews.llvm.org/D156488	2023-08-30 16:09:01 -07:00
Alex Brachet	1d1f230c92	[libc] Define long long limits if not defined Some older gcc toolchains don't define these on 32 bit platforms. This is a problem for pigweed which uses an older gcc toolchain and targets 32 bit. Differential Revision: https://reviews.llvm.org/D157112	2023-08-30 23:03:36 +00:00
Jacques Pienaar	7c800a41b0	[mlir] Relax requirement on memory buffer creation. parseSourceString does not require null-terminated string, hence requirement on memory buffer can be relaxed. Differential Revision: https://reviews.llvm.org/D159214	2023-08-30 15:29:53 -07:00
Jason Molenda	3621f83804	Don't enable stdin/out with a no-output test program A follow-on to https://reviews.llvm.org/D158237 , where this test can print stdout text when run under address-sanitizer, and the test harness does not expect any output, resulting in a test failure on a sanitizer CI bot.	2023-08-30 15:12:31 -07:00
Joseph Huber	20f4f295da	[libc] Fix 'clock()' testing on the GPU Summary: We should check for the GPU architectures first, since `__linux__` can be set potentially during these compilations. Also the test needs to be a hermetic test.	2023-08-30 16:58:11 -05:00
Owen Pan	14feca5d14	[clang-format][NFC] Replace EXPECT_EQ() with verifyFormat() Replaces the remaining ones not covered in `8b131d2e14`.	2023-08-30 14:55:26 -07:00
Shoaib Meenai	f41cd47736	[Driver] Enable stdlibxx-isystem test on Windows With `%/t`, we can now compare `-###` output against the installation dir without backslash escaping getting in the way. Having to check against the doubled-up backslashes is ugly, but it's the pattern used in lots of tests (including for things like debug info where an `-###` alternative that didn't escape backslashes wouldn't help).	2023-08-30 14:39:00 -07:00
Joseph Huber	30307a7bb7	[libc] Implement the 'clock()' function on the GPU This patch implements the `clock()` function on the GPU. This function is supposed to return a timestamp that can be converted into seconds using the `CLOCKS_PER_SEC` macro. The GPU has a fixed frequency timer that can be used for this purpose. However, there are some considerations. First is that AMDGPU does not have a statically known fixed frequency. I know internally that the gfx10xx and gfx11xx series use a 100 MHz clock which will probably remain for the future. Gfx9xx typically uses a 25 MHz clock except for the Vega 10 GPU. The only way to know for sure is to look it up from the runtime. For this purpose, I elected to default it to some known values and assign these to an externally visible symbol that can be initialized if needed. If we do not have a good guess we just return zero. Second is that the `CLOCKS_PER_SEC` macro only gives about a microsecond of resolution. POSIX demands that it's 1,000,000 so it's best that we keep with this tradition as almost all targets seem to respect this. The reason this is important is because on the GPU we will almost assuredly be copying the host's macro value (see the wrapper header) so we should go with the POSIX version that's most likely to be set. (We could probably make a warning if the included header doesn't match the expected value). Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D159118	2023-08-30 16:16:34 -05:00
David Green	58a2f839fd	[AArch64][GISel] Expand coverage of FDiv and move into place. This adds some more extensive test coverage for fdiv through global isel, switching the opcodes to use the more complete ActionDefinitions to handle more cases and moving it into the position of the existing code which is no longer needed.	2023-08-30 22:09:53 +01:00
Leonard Chan	1abcf58402	[lsan][Fuchsia] Fix bounds checking for thread_local allocator cache when scanning TLS regions When scanning over TLS regions, we attempt to check if one of the regions is one of the thread_local allocator caches which would be located in one of the TLS blocks pointed to by the DTV. This is to prevent marking a pointer that was allocated by the primary allocator (from a thread_local cache) as reachable. The check is a simple bounds check to see if the allocator cache is within the bounds of one of the TLS blocks we're iterating over, but it looks like the check for the end of the cache is slightly incorrect. Differential Revision: https://reviews.llvm.org/D156015	2023-08-30 20:49:21 +00:00
Snehasish Kumar	37fd3c96b9	[memprof] Add a MemProfReader base class. Add a MemProfReader base class which can be used directly where symbolization and processing a raw profile is unnecessary. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D159141	2023-08-30 20:20:55 +00:00
Owen Pan	8b131d2e14	[clang-format][NFC] Change EXPECT_EQ to verifyFormat or verifyNoChange Replaces some 600 EXPECT_EQ() to verifyFormat() or verifyNoChange() in FormatTest.cpp because the former neither checks stability of formatting nor tests formatting C++ as Objective C. Also marks dozens of unstable test cases with FIXME comments. Differential Revision: https://reviews.llvm.org/D159051	2023-08-30 13:07:12 -07:00
Peiming Liu	22e8d5b428	[mlir][sparse] Support strided convolution on dense level. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D159020	2023-08-30 20:00:50 +00:00
LLVM GN Syncbot	88dbb5a94c	[gn build] Port `c8bf93dba0`	2023-08-30 19:46:15 +00:00
zhijian	adc317d369	llvm-nm ignore the Import symbol file for the --export-symbol option. Summary: On AIX OS, clang may use llvm-nm to export the symbols from all input files (see `515c435e37/clang/lib/Driver/ToolChains/AIX.cpp (L236)`). However, the clang command-line may include import files (identified by them starting with #!). llvm-nm previously reported "invalid object file" errors for import files, meaning that the clang driver would fail to link when import files are included this way. In this patch, llvm-nm is changed to ignore import files when the --export-symbol option is used, meaning that clang will now succeed in this case. For more information about AIX import files, see https://www.ibm.com/docs/en/aix/7.3?topic=l-ld-command Reviewers: Hubert Tong, James Henderson, MaskRay, Stephen Peckham Differential Revision: https://reviews.llvm.org/D158004	2023-08-30 15:45:37 -04:00
Philip Reames	079c968eb9	[RISCV] Form vmv.s.f/x from single element splats via DAG combine This re-implements the special casing we had in lowerScalarSplat as a DAG combine. As can be seen in the tests, this ends up triggering in a bunch more cases. The semantically interesting bit of this change is the use of the implicit truncate semantics for when XLEN > SEW. We'd already been doing this for vmv.v.x, but this change extends e.g. the constant matching to make the same assumption about vmv.s.x. Per my reading of the specification, this should be fine, and if anything, is more obviously true of vmv.s.x than vmv.v.x. Differential Revision: https://reviews.llvm.org/D158874	2023-08-30 12:44:36 -07:00
Peiming Liu	07bd5f20bc	[mlir][sparse] Support strided convolution on compressed level. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158912	2023-08-30 19:37:50 +00:00
Christopher Ferris	c8bf93dba0	[scudo] Remove RSS checking code. The RSS code is not very useful and can be replicated by using ulimit. Remove it and remove the options associated with it. Reviewed By: Chia-hungDuan Differential Revision: https://reviews.llvm.org/D159155	2023-08-30 12:35:14 -07:00
Pranav Kant	d26dd681f9	[Bazel] Fix build	2023-08-30 19:30:12 +00:00
Philip Reames	fd465f377c	[RISCV] Move vmv_s_x and vfmv_s_f special casing to DAG combine We'd discussed this in the original set of patches months ago, but decided against it. I think we should reverse ourselves here as the code is significantly more readable, and we do pick up cases we'd missed by not calling the appropriate helper routine. Differential Revision: https://reviews.llvm.org/D158854	2023-08-30 12:04:48 -07:00
Peiming Liu	e015d385c9	[mlir][sparse] Pass down constant coefficients of affine index expressions to LoopEmitter. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158914	2023-08-30 18:44:50 +00:00
Ian Anderson	680da4b5d7	[Headers][Modules] Make separate headers for the stdarg.h and stddef.h pieces so that they can be modularized stdarg.h and stddef.h have to be textual headers in their upcoming modules to support their `__needs_xxx` macros. That means that they won't get precompiled into their modules' pcm, and instead their declarations will go into every other pcm that uses them. For now that's ok since the type merger can handle the declarations in these headers, but it's suboptimal at best. Make separate headers for all of the pieces so that they can be properly modularized. Reviewed By: aaron.ballman, ChuanqiXu Differential Revision: https://reviews.llvm.org/D158709	2023-08-30 11:41:12 -07:00
walter erquinigo	79d5d9a082	[lldb] Allow synthetic providers in C++ and fix linking problems - Allow the definition of synthetic formatters in C++ even when LLDB is built without python scripting support. - Fix linking problems with the CXXSyntheticChildren Differential Revision: https://reviews.llvm.org/D158010	2023-08-30 14:14:28 -04:00
Matt Arsenault	5f8ee45d5a	AMDGPU: Implement llvm.get.rounding There are really two rounding modes, so only return the standard values if both modes are the same. Otherwise, return a bitmask representing the two modes. Annoyingly the register doesn't use the same values as FLT_ROUNDS. Use a simple integer table we can shift into to convert. https://reviews.llvm.org/D153158	2023-08-30 14:06:13 -04:00
Piotr Zegar	c6fa07ca96	[clang-tidy][NFC][DOC] Update links to HICPP standard Old links pointed to the old domain and were then redirected to the correct one, but to the wrong pages. Changed links from the old to the new domain and page. Fixes: #65064	2023-08-30 18:02:34 +00:00
Dhruv Chawla	4ea8212775	[NFC][LoopVectorize] Regenerate test checks	2023-08-30 23:22:57 +05:30
Peiming Liu	96e1914aa2	[mlir][sparse] fix crash when generating convolution kernel with sparse input in DCCD format. Reviewed By: aartbik, anlunx Differential Revision: https://reviews.llvm.org/D159170	2023-08-30 17:49:36 +00:00
Piotr Zegar	26f230fe52	[clang-tidy][NFC] Fix llvm-else-after-return findings Fix issues found by clang-tidy in clang-tidy source directory.	2023-08-30 17:41:41 +00:00
Philip Reames	aea452841b	[RISCV] Improve cost model test coverage for insert/extract element In particular, high LMULs, constant offsets within high LMUL, and types which require splitting. Note that most of these are way off with current lowering.	2023-08-30 10:34:02 -07:00
Simon Pilgrim	967d95382d	[X86] lowerShuffleAsVALIGN - extend to recognize basic shifted element masks Try to use VALIGN as a cross-lane version of VSHLDQ/VSRLDQ	2023-08-30 18:32:55 +01:00
Simon Pilgrim	d3d71b8d5b	[X86] Add shuffle tests cases showing missed opportunity to use VALIGN	2023-08-30 18:32:55 +01:00
Aiden Grossman	d944cab731	[IR] Make detailed structural hash test look at diffs This patch makes structural-hash-detailed.ll look at diffs rather than specific values to make the test more robust against the underlying hashing implementation while still testing that the hash value is deterministic across multiple runs. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D159154	2023-08-30 10:31:58 -07:00
David Tenty	ef3e27385b	Revert "[libc++] Comment out the AIX jobs that are failing" This reverts commits `014830193b` and `ce12d6563e`. Differential Revision: https://reviews.llvm.org/D158754	2023-08-30 13:28:30 -04:00
Philip Reames	eb5fe55b81	[RISCV] Expand codegen test coverage extract/insert element In particular, at mixed LMULS, high LMULS, and types which require splitting.	2023-08-30 10:10:40 -07:00
Mikhail Goncharov	0a0aff2d24	fix unused variable warnings in conditionals warning was updated in `92023b1509`	2023-08-30 19:09:27 +02:00
Ben Langmuir	aca23d8ac3	[clang] Fix assertion failure using -MJ with -fsyntax-only If there is no output filename we should not assert when writing output for -MJ. Differential Revision: https://reviews.llvm.org/D159016	2023-08-30 10:04:34 -07:00
Andrzej Warzynski	acc572f490	[mlir][nfc] Avoid implicit casting VectorType -> Type Just a small follow-up for https://reviews.llvm.org/D159122	2023-08-30 18:01:00 +01:00
Arthur Eubanks	f93aa1c657	[gn build] Manually port 9ef536	2023-08-30 09:59:15 -07:00
Craig Topper	c9db799dda	[RISCV] Don't add -unaligned-scalar-mem to target features by default. Only pass it +unaligned-scalar-mem/-unaligned-scalar-mem if the user has passed one of the alignment options. This allows us to add unaligned-scalar-mem as a feature on CPUs that support it. Reviewed By: wangpc Differential Revision: https://reviews.llvm.org/D159145	2023-08-30 09:52:16 -07:00
Fangrui Song	651b2fbc1c	[CodeGen] Function multi-versioning: don't set comdat for internal linkage resolvers For function multi-versioning using the target or target_clones function attributes, currently we incorrectly set comdat for internal linkage resolvers. This is problematic for ELF linkers as GRP_COMDAT deduplication will kick in even with STB_LOCAL signature (https://groups.google.com/g/generic-abi/c/2X6mR-s2zoc "GRP_COMDAT group with STB_LOCAL signature"). In short, two `__attribute((target_clones(...))) static void foo()` in two translation units will be deduplicated. Fix this. Fix #65114 Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D158963	2023-08-30 09:46:48 -07:00
Anton Rydahl	3c9988f85d	[OpenMP] Allow exceptions in target regions when offloading to GPUs The motivation for this patch is that many code bases use exception handling. As GPUs are not expected to support exception handling in the near future, we can experiment with compiling the code for GPU targets anyway. This will allow us to run the code, as long as no exception is thrown. The overall idea is very simple: - If a throw expression is compiled to AMDGCN or NVPTX, it is replaced with a trap during code generation. - If a try/catch statement is compiled to AMDGCN or NVPTX, we generate code for the try statement as if it were a basic block. With this patch, the compilation of the following example ``` int gaussian_sum(int a,int b){ if ((a + b) % 2 == 0) {throw -1;}; return (a+b) * ((a+b)/2); } int main(void) { int gauss = 0; #pragma omp target map(from:gauss) { try { gauss = gaussian_sum(1,100); } catch (int e){ gauss = e; } } std::cout << "GaussianSum(1,100)="<<gauss<<std::endl; #pragma omp target map(from:gauss) { try { gauss = gaussian_sum(1,101); } catch (int e){ gauss = e; } } std::cout << "GaussianSum(1,101)="<<gauss<<std::endl; return (gauss > 1) ? 0 : 1; } ``` with offloading to `gfx906` results in ``` ./bin/target_try_minimal_fail GaussianSum(1,100)=5050 AMDGPU fatal error 1: Received error in queue 0x155555506000: HSA_STATUS_ERROR_EXCEPTION: An HSAIL operation resulted in a hardware exception. zsh: abort (core dumped) ``` Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D153924	2023-08-30 09:36:22 -07:00
Ying Yi	9ef536a12e	[UBSan] Disable the function and kcfi sanitizers on an execute-only target. An execute-only target disallows data access to code sections. -fsanitize=function and -fsanitize=kcfi instrument indirect function calls to load a type hash before the function label. This results in a non-execute access to the code section and a runtime error. To solve the issue, -fsanitize=function should not be included in any check group (e.g. undefined) on an execute-only target. If a user passes -fsanitize=undefined, there is no error and no warning. However, if the user explicitly passes -fsanitize=function or -fsanitize=kcfi on an execute-only target, an error will be emitted. Fixes: https://github.com/llvm/llvm-project/issues/64931. Reviewed By: MaskRay, probinson, simon_tatham Differential Revision: https://reviews.llvm.org/D158614	2023-08-30 17:17:37 +01:00
Philip Reames	aada8f2e54	[slp] Tweak debug costing output to include VL This makes it much easier to understand which vector length is being considered when the same set of nodes are evaluated at multiple vector lengths.	2023-08-30 09:13:19 -07:00

473101 Commits