llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-10-07 19:03:57 +00:00

Author	SHA1	Message	Date
Joseph Huber	70c08dbcfb	[Libomptarget] Remove the remote and ve plugins from libomptarget These plugins are unmaintained and are not in a workable state. The VE plugin has not been touched for years and has never had any running tests. The remote plugin is in an unfinished state and is not production ready upstream. These will need to be ported to the new nextgen interface in the future if they are needed. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D154548	2023-07-05 17:39:46 -05:00
Joseph Huber	33859fb962	[Libomptarget][Obvious] Missing comma on enum	2023-07-04 22:01:03 -05:00
Joseph Huber	ec39b35178	[Libomptarget] Add missing HSA agent info enumeration Summary: This was not added to dynamic_hsa.h	2023-07-04 21:55:49 -05:00
Mark de Wever	cbaa3597aa	Reland "[CMake] Bumps minimum version to 3.20.0. This reverts commit `d763c6e5e2`. Adds the patch by @hans from https://github.com/llvm/llvm-project/issues/62719 This patch fixes the Windows build. `d763c6e5e2` reverted the reviews D144509 [CMake] Bumps minimum version to 3.20.0. This partly undoes D137724. This change has been discussed on discourse https://discourse.llvm.org/t/rfc-upgrading-llvms-minimum-required-cmake-version/66193 Note this does not remove work-arounds for older CMake versions, that will be done in followup patches. D150532 [OpenMP] Compile assembly files as ASM, not C Since CMake 3.20, CMake explicitly passes "-x c" (or equivalent) when compiling a file which has been set as having the language C. This behaviour change only takes place if "cmake_minimum_required" is set to 3.20 or newer, or if the policy CMP0119 is set to new. Attempting to compile assembly files with "-x c" fails, however this is workarounded in many cases, as OpenMP overrides this with "-x assembler-with-cpp", however this is only added for non-Windows targets. Thus, after increasing cmake_minimum_required to 3.20, this breaks compiling the GNU assembly for Windows targets; the GNU assembly is used for ARM and AArch64 Windows targets when building with Clang. This patch unbreaks that. D150688 [cmake] Set CMP0091 to fix Windows builds after the cmake_minimum_required bump The build uses other mechanism to select the runtime. Fixes #62719 Reviewed By: #libc, Mordante Differential Revision: https://reviews.llvm.org/D151344	2023-05-27 12:51:21 +02:00
Nico Weber	d763c6e5e2	Revert "Reland "[CMake] Bumps minimum version to 3.20.0."" This reverts commit `65429b9af6`. Broke several projects, see https://reviews.llvm.org/D144509#4347562 onwards. Also reverts follow-up commit "[OpenMP] Compile assembly files as ASM, not C" This reverts commit `4072c8aee4`. Also reverts fix attempt "[cmake] Set CMP0091 to fix Windows builds after the cmake_minimum_required bump" This reverts commit `7d47dac5f8`.	2023-05-17 10:53:33 -04:00
Mark de Wever	65429b9af6	Reland "[CMake] Bumps minimum version to 3.20.0." The owner of the last two failing buildbots updated CMake. This reverts commit `e8e8707b4a`.	2023-05-13 11:42:25 +02:00
Joseph Huber	b09953a4a3	[Libomptarget] Fix AMDGPU Note handling after D150022 Summary: The changes in https://reviews.llvm.org/D150022 changed the API for this function that we query. Simply pass in the alignment from the associated header to fix.	2023-05-10 14:12:39 -05:00
Mark de Wever	e8e8707b4a	Revert "Reland "[CMake] Bumps minimum version to 3.20.0."" Unfortunatly not all buildbots are updated. This reverts commit `ffb807ab53`.	2023-05-06 17:03:56 +02:00
Mark de Wever	ffb807ab53	Reland "[CMake] Bumps minimum version to 3.20.0." All build bots should be updated now. This reverts commit `44d38022ab`.	2023-05-06 11:43:02 +02:00
Shilei Tian	d4ecd1241c	Revert "[OpenMP] Introduce kernel environment" This reverts commit `35cfadfbe2`. It makes a couple of buildbots unhappy because of the following test failures: - `Transforms/OpenMP/add_attributes.ll'` - `mapping/declare_mapper_target_data.cpp` on AMDGPU	2023-04-22 20:56:35 -04:00
Shilei Tian	35cfadfbe2	[OpenMP] Introduce kernel environment This patch introduces per kernel environment. Previously, flags such as execution mode are set through global variables with name like `__kernel_name_exec_mode`. They are accessible on the host by reading the corresponding global variable, but not from the device. Besides, some assumptions, such as no nested parallelism, are not per kernel basis, preventing us applying per kernel optimization in the device runtime. This is a combination and refinement of patch series D116908, D116909, and D116910. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D142569	2023-04-22 20:46:38 -04:00
Mark de Wever	44d38022ab	Revert "Revert "Revert "[CMake] Bumps minimum version to 3.20.0.""" This reverts commit `1ef4c3c859`. Two buildbots still haven't been updated.	2023-04-15 20:12:24 +02:00
Mark de Wever	1ef4c3c859	Revert "Revert "[CMake] Bumps minimum version to 3.20.0."" This reverts commit `92523a35a8`. Reland to see whether CIs are updated.	2023-04-15 13:12:04 +02:00
Joseph Huber	1c43be0276	[Libomptarget] Update CMake messages if the tests aren't build Summary: These messages have been wrong for quite some time. Update them to be more descriptive of why the tests weren't built.	2023-03-24 14:26:23 -05:00
Mark de Wever	d0398d3593	Revert "Reland "[CMake] Bumps minimum version to 3.20.0."" This reverts commit `a72165e5df`. Some buildbots have not been updated yet.	2023-03-18 20:32:43 +01:00
Mark de Wever	a72165e5df	Reland "[CMake] Bumps minimum version to 3.20.0." This reverts commit `92523a35a8`. Test whether all CI runners are updated.	2023-03-18 13:33:42 +01:00
Shao-Ce SUN	420d2fcac9	[OpenMP][CUDA] Get rid of redundant macro def Resolve warning of `TARGET_NAME` macro redefinition. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D145307	2023-03-05 02:01:59 +08:00
Mark de Wever	92523a35a8	Revert "[CMake] Bumps minimum version to 3.20.0." Some build bots have not been updated to the new minimal CMake version. Reverting for now and ping the buildbot owners. This reverts commit `44c6b905f8`.	2023-03-04 18:28:13 +01:00
Mark de Wever	44c6b905f8	[CMake] Bumps minimum version to 3.20.0. This partly undoes D137724. This change has been discussed on discourse https://discourse.llvm.org/t/rfc-upgrading-llvms-minimum-required-cmake-version/66193 Note this does not remove work-arounds for older CMake versions, that will be done in followup patches. Reviewed By: mehdi_amini, MaskRay, ChuanqiXu, to268, thieta, tschuett, phosek, #libunwind, #libc_vendors, #libc, #libc_abi, sivachandra, philnik, zibi Differential Revision: https://reviews.llvm.org/D144509	2023-03-04 12:40:57 +01:00
Joseph Huber	9f650ae779	[Libomptarget] Remove dependency on the DeviceRTL from the GPU plugins The GPU plugins have a dependency on the device libraries. Sometimes we cannot build the device libraries because the user does not have a valid `clang` to use or it was explicitly disabled. Currently this leads to a transitive failure because we cannot meet this dependency. This patch simply removes that dependency. Fixes https://github.com/llvm/llvm-project/issues/60457 Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D143196	2023-02-13 07:01:52 -06:00
Samuel Thibault	cc72df2b7b	[Libomptarget] Add the same to the other AMD plugin Summary: The previous patch also needed to apply this to the other AMDGPU plugin, this will be removed soon but it should be correct while it's here at least.	2023-02-04 07:46:25 -06:00
Samuel Thibault	71fb11ff34	[Libomptarget] Fix disabling amdgpu on non-Linux. Previously, on non-Linux, amdgpu would get enabled whatever the CPU architecture. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D143017	2023-02-04 07:45:03 -06:00
Joseph Huber	70ff191900	[Libomptarget] Add new enum to the dynamically opened HSA implementation Summary: We added a new agent information enum in a previous commit. This was not added to the dynamic HSA implementation so it failed to compile without a local HSA install to use.	2023-02-02 15:15:09 -06:00
Joseph Huber	1bde4ccae6	[Libomptarget] Fix building AMDGPU tests Summary: Accidentally deleted this.	2023-01-30 17:56:48 -06:00
Shilei Tian	516ae48170	[OpenMP][NVPTX] Guard the target name macro definition	2023-01-30 14:02:22 -05:00
Joseph Huber	292eca41d9	[Libomptarget] Fix tests after previous patch Summary: The previous patch didn't remove these tests correctly.	2023-01-30 07:18:51 -06:00
Joseph Huber	9b1d0ee10c	[Libomptarget] Remove unused test targets in libomptaget Summary: These don't need to be set.	2023-01-30 06:34:15 -06:00
Shilei Tian	ad95b0e977	[OpenMP][NVPTX] Added `__tgt_rtl_launch_kernel` in old CUDA plugin Fix #60248. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D142819	2023-01-28 18:56:07 -05:00
Joseph Huber	84d0243d21	[Libomptarget] Clean up CUDA plugin CMake files Clean up this file after changing it in D142568. Depends on D142568 Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D142573	2023-01-25 13:58:02 -06:00
Joseph Huber	c568622046	[Libomptarget] Remove find_package(CUDA) as it has been deprecated Since D137724 and the LLVM 17 release we have updated to CMake version 3.20. This means that `find_package(CUDA)` is officially deprecated and can be replaced with `find_package(CUDAToolkit)` instead. This patch does this and also cleans up a bit of the CMake. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D142568	2023-01-25 13:58:01 -06:00
Joseph Huber	b280e12a3d	[Libomptarget][NFC] Address a few warnings in libomptarget Summary: Fix a few minor warnings that show up in `libomptarget`.	2023-01-23 08:56:03 -06:00
Joseph Huber	716bae0b48	[Libomptarget] Include "hsa/hsa.h" instead Summary: Recently AMD moved the "hsa.h" include to "hsa/hsa.h". This causes several warning. This patch checks to see if we can include that one instead. This should hopefully keep things backwards compatible while silencing the warnings.	2023-01-23 08:56:03 -06:00
Johannes Doerfert	16a385ba21	[OpenMP] Modernize the kernel launching interface and APIs We already created a versioned `__tgt_kernel_arguments` struct but it was only briefly used and its content was passed in isolation anyway. This makes it hard to add more information in the future. With this patch we fully embrace the struct as means to pass information from the compiler to the plugin as part of a kernel launch. The patch also extends and renames the struct, bumping the version number to 2. Version 1 entries are auto-upgraded. This is in preparation for "bare" kernel launches, per kernel dynamic shared memory, CUDA/HIP lowering, etc. The `__tgt_target_kernel_nowait` interface was deprecated as it was unused. Once we actually implement support for something like that, we can add an appropriate API. Note: Only plugins with the `launch_kernel` interface are now supported. That means that a new clang won't be able to use an old runtime. An old clang can still use the new runtime since the libomptarget interface did not change. Differential Revision: https://reviews.llvm.org/D141232	2023-01-21 11:16:21 -08:00
Jon Chesterfield	2257e3d2e5	[openmp] Workaround for HSA in issue 60119 Move plugin initialization to libomptarget initialization. Removes the call_once control, probably fractionally faster overall. Fixes issue 60119 because the plugin initialization, which might try to dlopen unrelated shared libraries, is no longer nested within a call from application code. Fixes #60119 Reviewed By: Maetveis, jhuber6 Differential Revision: https://reviews.llvm.org/D142249	2023-01-21 12:01:14 +00:00
Shilei Tian	3ff1726cf8	[OpenMP][AMDGPU] Get rid of redundant macro def The next gen plugin adds the def of `DEBUG_PREFIX` in CMake, causing compiler warning that `DEBUG_PREFIX` is defined multiple times. This patch simply guards the macro def. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D142064	2023-01-18 20:08:18 -05:00
Joseph Huber	ea0eee80d8	[Libomptarget] Only build GPU tests if a GPU is found on the system Currently we build tests as long as the libraries are found on the machine. This doesn't necessarily mean there is a GPU to use though. This patch changes it to where we only will build the tests if we found a compatible GPU via `nvptx-arch` or `amdgpu-arch`. The only downside to this I could see if someone were to build LLVM on a home node of a cluster and then wished to run the tests after switching to a compute node. For this I think we should allow it to be overridden. I think that's better than allowing us to run tests that will fail by default. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D142018	2023-01-18 10:35:37 -06:00
Joseph Huber	566ecc2231	[Libomptarget][NFC] Rename device environment variable This variable is used by the runtime. Before kernel launch we set it to indicate several configuration options from the host. This patch renames it to be more in-line with the rest of the named exported from the runtime. This is better because this is the only symbol visible to the host from the runtime, so it should have a reserved name. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D141960	2023-01-17 14:28:04 -06:00
Carlo Bertolli	7928d9e12d	[OpenMP][libomptarget][AMDGPU] Add missing declarations to fix non amdgpu builds Fix after commit of https://reviews.llvm.org/D139208	2023-01-13 15:04:14 -06:00
Carlo Bertolli	b215932e69	[OpenMP][libomptarget][AMDGPU] lock/unlock (pin/unpin) mechanism in libomptarget amdgpu plugin (API and implementation) The current only way to obtain pinned memory with libomptarget is to use a custom allocator llvm_omp_target_alloc_host. This reflects well the CUDA implementation of libomptarget, but it does not correctly expose the AMDGPU runtime API, where any system allocated page can be locked/unlocked through a call to hsa_amd_memory_lock/unlock. This patch enables users to allocate memory through malloc (mmap, sbreak) and then pin the related memory pages with a libomptarget special call. It is a base support in the amdgpu libomptarget plugin to enable users to prelock their host memory pages so that the runtime doesn't need to lock them itself for asynchronous memory transfers. Reviewed By: jdoerfert, ye-luo Differential Revision: https://reviews.llvm.org/D139208	2023-01-13 12:18:49 -06:00
Kevin Sala	7b97941721	[OpenMP][libomptarget] Add missing symbols in dynamic_hsa This patch prepares for the new AMDGPU NextGen plugin. Differential Revision: https://reviews.llvm.org/D140213	2022-12-17 00:01:24 +01:00
Joseph Huber	d8b0f007cb	[libomptarget] Add HSA definitions for memory faults to dynamic_hsa Summary: We use the dynamic HSA file to forward declare needed definitions from the HSA runtime if not present at build time. These definitions were not included so using them caused problems on systems without it if used. Just add them.	2022-12-16 07:06:44 -06:00
Kevin Sala	a66826a233	Revert "[OpenMP][libomptarget] Add AMDGPU NextGen plugin with asynchronous behavior" This reverts commit `87e6b96b00`.	2022-12-16 11:53:45 +01:00
Kevin Sala	87e6b96b00	[OpenMP][libomptarget] Add AMDGPU NextGen plugin with asynchronous behavior This commit adds the AMDGPU NextGen plugin inheriting from PluginInterface's classes. It also implements the asynchronous behavior in the plugin operations: kernel launches and memory transfers. To this end, it implements the concept of streams of asynchronous operations. The streams are implemented using the HSA signals to define input and output dependencies between asynchronous operations. Missing features: - Retrieve the maximum number of threads per group that a kernel can run. This requires reading the image. - Implement __tgt_rtl_sync_event, not used on the libomptarget side. Differential Revision: https://reviews.llvm.org/D138389	2022-12-16 00:30:43 +01:00
Kevin Sala	39fe657b66	[OpenMP][libomptarget] Add utility header for AMDGPU plugins This patch prepares the PluginInterface for the new AMDGPU NextGen plugin. The original and the NextGen plugin will share some structures and functionalities. We use this header for defining them and avoiding code duplication. Differential Revision: https://reviews.llvm.org/D139792	2022-12-15 21:06:04 +01:00
Guilherme Valarini	89c82c8394	[OpenMP] Add non-blocking support for target nowait regions This patch better integrates the target nowait functions with the tasking runtime. It splits the nowait execution into two stages: a dispatch stage, which triggers all the necessary asynchronous device operations and stores a set of post-processing procedures that must be executed after said ops; and a synchronization stage, responsible for synchronizing the previous operations in a non-blocking manner and running the appropriate post-processing functions. Suppose during the synchronization stage the operations are not completed. In that case, the attached hidden helper task is re-enqueued to any hidden helper thread to be later synchronized, allowing other target nowait regions to be concurrently dispatched. Reviewed By: jdoerfert, tianshilei1992 Differential Revision: https://reviews.llvm.org/D132005	2022-12-14 14:03:32 -03:00
Jon Chesterfield	56ec7ce80d	[openmp][amdgpu] Let fine grain and kernarg pools differ	2022-12-14 02:04:21 +00:00
Shilei Tian	59ae452983	[OpenMP] Refactor CMake files related to `PluginInterface` in `plugins-nextgen` This patch uses refactors CMake files related to `PluginInterface` in `plugins-nextgen` to handle LLVM dependences in a better way. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D139371	2022-12-06 17:39:41 -05:00
Roman Lebedev	aa6ea6009f	Revert "[OpenMP] Use `add_llvm_library` to build the target `PluginInterface` in `plugins-nextgen`" This is still not working for me: ``` -- Configuring done CMake Error: install(EXPORT "LLVMExports" ...) includes target "omptarget.rtl.amdgpu" which requires target "elf_common" that is not in any export set. CMake Error: install(EXPORT "LLVMExports" ...) includes target "omptarget.rtl.cuda" which requires target "elf_common" that is not in any export set. CMake Error: install(EXPORT "LLVMExports" ...) includes target "omptarget.rtl.x86_64" which requires target "elf_common" that is not in any export set. CMake Error: install(EXPORT "LLVMExports" ...) includes target "omptarget.rtl.cuda.nextgen" which requires target "elf_common" that is not in any export set. CMake Error: install(EXPORT "LLVMExports" ...) includes target "omptarget.rtl.cuda.nextgen" which requires target "PluginInterface" that is not in any export set. CMake Error: install(EXPORT "LLVMExports" ...) includes target "omptarget.rtl.x86_64.nextgen" which requires target "elf_common" that is not in any export set. CMake Error: install(EXPORT "LLVMExports" ...) includes target "omptarget.rtl.x86_64.nextgen" which requires target "PluginInterface" that is not in any export set. -- Generating done ``` This reverts commit `e682a76c3b`.	2022-12-06 20:47:20 +03:00
Shilei Tian	e682a76c3b	[OpenMP] Use `add_llvm_library` to build the target `PluginInterface` in `plugins-nextgen` This patch uses `add_llvm_library` to build the target `PluginInterface` since it can handle LLVM dependences much better. One temporary drawback of using this is that currently LLVM CMake macro doesn't support object libraries very well (there was a try a couple years ago but it was reverted later `29e5722949`). After switching to that, `CXX_VISIBILITY_PRESET` can not be set correctly, which can cause runtime error that a function call from one plugin could go to another. As a consequence, `PluginInterface` is built as a static library for now. I have asked the question in CMake community (https://discourse.cmake.org/t/set-target-properties-doesnt-work-properly/7016). Once that issue is solved, I'll switch it back to object library. It is not necessarily too bad to use static library, especially `BUILDTREE_ONLY` is already set such that `PluginInterface.a` will not be installed. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D139371	2022-12-06 11:37:37 -05:00
Ron Lieberman	b09a5e5cb3	Revert "Add mean_anyway to hpc config" my bad, wrong repo ,so sorry. This reverts commit `0b9350f3da`.	2022-11-29 15:20:23 -06:00

1 2 3 4 5 ...

367 Commits