llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-12-01 01:31:26 +00:00

Author	SHA1	Message	Date
Matt Morehouse	b7d1ab75cf	[HWASan] Add aliasing flag and enable HWASan to use it. -fsanitize-hwaddress-experimental-aliasing is intended to distinguish aliasing mode from LAM mode on x86_64. check-hwasan is configured to use aliasing mode while check-hwasan-lam is configured to use LAM mode. The current patch doesn't actually do anything differently in the two modes. A subsequent patch will actually build the separate runtimes and use them in each mode. Currently LAM mode tests must be run in an emulator that has LAM support. To ensure LAM mode isn't broken by future patches, I will next set up a QEMU buildbot to run the HWASan tests in LAM. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102288	2021-05-14 09:47:20 -07:00
Anastasia Stulova	769cc335e6	[OpenCL] Simplify use of C11 atomic types. Remove requirements on extension pragma in atomic types because it has not respected the spec wrt disabling types and hasn't been useful either. With this change, the developers can use atomic types from the extensions if they are supported without enabling the pragma just like the builtin functions This patch does not break backward compatibility since the extension pragma is still supported and it makes the behavior of the compiler less strict by accepting code without needless and inconsistent pragma statements. Differential Revision: https://reviews.llvm.org/D100976	2021-05-14 17:43:00 +01:00
Benjamin Kramer	4dd546131a	Bump googletest to 1.8.1 We've accumulated a scary amount of local patches to this directory. I tried to merge them all, but if your favorite change is missing please reapply it manually (and send it upstream).	2021-05-14 17:20:06 +02:00
David Candler	3d59f9d224	[ARM][AArch64] Correct __ARM_FEATURE_CRYPTO macro and crypto feature This patch contains a couple of minor corrections to my previous crypto patch: Since both AArch32 and AArch64 are now correctly setting the aes and sha2 features individually, it is not necessary to continue to check the crypto feature when defining feature macros. In the AArch32 driver, the feature vector is only modified when the crypto feature is actually in the vector. If crypto is not present, there is no need to split it and explicitly define crypto/sha2/aes. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D102406	2021-05-14 14:19:46 +01:00
Nemanja Ivanovic	7cd2833311	[PowerPC] Add vec_vupkhpx and vec_vupklpx for XL compatibility These are old names for these functions that XL still supports.	2021-05-14 08:02:00 -05:00
Nathan Sidwell	0566f97961	[clang][NFC] remove unused return value In working on p0388 (ary[N] -> ary[] conversion), I discovered neither use of UnwrapSimilarArrayTypes used the return value. So let's nuke it. Differential Revision: https://reviews.llvm.org/D102480	2021-05-14 05:25:47 -07:00
David Spickett	2db090a2eb	[llvm][AsmPrinter] Restore source location to register clobber warning Since `5de2d189e6` this particular warning hasn't had the location of the source file containing the inline assembly. Fix this by reporting via LLVMContext. Which means that we no longer have the "instantiated into assembly here" lines but they were going to point to the start of the inline asm string anyway. This message is already tested via IR in llvm. However we won't have the required location info there so I've added a C file test in clang to cover it. (though strictly, this is testing llvm code) Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D102244	2021-05-14 08:22:57 +00:00
Reid Kleckner	d2f4b7d778	Use enum comparison instead of generated switch/case, NFC Clang's coverage data for auto-generated switch cases is really, really large. Before this change, when I enable code coverage, SemaDeclAttr.obj is 4.0GB. Naturally, this fails to link. Replacing the RISCV builtin id check with a comparison reduces object file size from 4.0GB to 330MB. Replacing the AArch64 SVE range check reduces the size again down to 17MB, which is reasonable. I think the RISCV switch is larger in coverage data because it uses more levels of macro expansion, while the SVE intrinsics only use one. In any case, please try to avoid switches with 1000+ cases, they usually don't optimize well.	2021-05-13 20:26:50 -07:00
Weston Carvalho	be5c7c5d82	Widen `name` stencil to support `TypeLoc` nodes. Differential Revision: https://reviews.llvm.org/D102185	2021-05-13 23:23:12 +01:00
Michael Kruse	83ff0ff463	[Clang][OpenMP] Allow unified_shared_memory for Pascal-generation GPUs. The Pascal architecture supports the page migration engine required for unified_shared_memory, as indicated by NVIDIA: * https://developer.nvidia.com/blog/unified-memory-cuda-beginners/ * https://developer.nvidia.com/blog/beyond-gpu-memory-limits-unified-memory-pascal/ * https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#um-requirements The limitation was introduced in D54493 which justified the cut-off by the requirement for unified addressing. However, Unified Virtual Addressing (UVA) is already available with sm20 (Fermi, Kepler, Maxwell): * https://docs.nvidia.com/cuda/gpudirect-rdma/index.html#basics-of-uva-cuda-memory-management Unified shared memory might even be possible with these, but with migration of entire allocations on kernel startup. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D101595	2021-05-13 17:15:34 -05:00
Lang Hames	71a0609a2b	[clang-repl] Temporarily disable the execute.cpp test on ppc64. This test is failing on some builders (see [1]) with the following error: error: Added modules have incompatible data layouts: e-m:e-i64:64-n32:64-S128-v256:256:256-v512:512:512 (module) vs E-m:a-i64:64-n32:64-S128-v256:256:256-v512:512:512 (jit) The JIT layout is correct, but some IR module added to the JIT is using a little-endian layout instead. This commit disables the test on ppc64 until we can investigate further and fix the bug. [1] https://lab.llvm.org/staging/#/builders/126/builds/371	2021-05-13 14:39:12 -07:00
Fangrui Song	4f05f4c8e6	[CMake][ELF] Link libLLVM.so and libclang-cpp.so with -Bsymbolic-functions llvm-dev message: https://lists.llvm.org/pipermail/llvm-dev/2021-May/150465.html In an ELF shared object, a default visibility defined symbol is preemptible by default. This creates some missed optimization opportunities. -Bsymbolic-functions is more aggressive than our current -fvisibility-inlines-hidden (present since 2012) as it applies to all function definitions. It can * avoid PLT for cross-TU function calls && reduce dynamic symbol lookup * reduce dynamic symbol lookup for taking function addresses and optimize out GOT/TOC on x86-64/ppc64 In a -DLLVM_TARGETS_TO_BUILD=X86 build, the number of JUMP_SLOT decreases from 12716 to 1628, and the number of GLOB_DAT decreases from 1918 to 1313 The built clang with `-DLLVM_LINK_LLVM_DYLIB=on -DCLANG_LINK_CLANG_DYLIB=on` is significantly faster. See the Linux kernel build result https://bugs.archlinux.org/task/70697 Note: the performance of -fno-semantic-interposition -Bsymbolic-functions libLLVM.so and libclang-cpp.so is close to a PIE binary linking against `libLLVM.a` and `libclang.a`. When the host compiler is Clang, -Bsymbolic-functions is the major contributor. On x86-64 (with GOTPCRELX) and ppc64 ELFv2, the GOT/TOC relocations can be optimized. Some implication: Interposing a subset of functions is no longer supported. (This is fragile on ELF and unsupported on Mach-O at all. For Mach-O we don't use `ld -interpose` or `-flat_namespace`) Compiling a program which takes the address of any LLVM function with `{gcc,clang} -fno-pic` and expects the address to equal to the address taken from libLLVM.so or libclang-cpp.so is unsupported. I am fairly confident that llvm-project shouldn't have different behaviors depending on such pointer equality (as we've been using -fvisibility-inlines-hidden which applies to inline functions for a long time), but if we accidentally do, users should be aware that they should not make assumption on pointer equality in `-fno-pic` mode. See more on https://maskray.me/blog/2021-05-09-fno-semantic-interposition Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D102090	2021-05-13 13:44:57 -07:00
Aaron En Ye Shi	a249ffa421	[HIP] Clean up llvm intrinsics using __asm Instead of using inline asm, use clang builtins for llvm intrinsics. Differential Revision: https://reviews.llvm.org/D102427	2021-05-13 18:55:51 +00:00
Aakanksha Patil	464e4dc50f	[AMDGPU] Add gfx1034 target Differential Revision: https://reviews.llvm.org/D102306	2021-05-13 14:25:18 -04:00
Artem Dergachev	6a079dfdc9	[ASTMatchers] Add forCallable(), a generalization of forFunction(). The new matcher additionally covers blocks and Objective-C methods. This matcher actually makes sure that the statement truly belongs to that declaration's body. forFunction() incorrectly reported that a statement in a nested block belonged to the surrounding function. forFunction() is now deprecated due to the above footgun, in favor of forCallable(functionDecl()) when only functions need to be considered. Differential Revision: https://reviews.llvm.org/D102213	2021-05-13 11:25:00 -07:00
Artem Dergachev	dd98ea528c	[ASTMatchers] NFC: Fix formatting around forFunction(). Differential Revision: https://reviews.llvm.org/D102303	2021-05-13 11:25:00 -07:00
Roman Lebedev	0d8f91d2a9	[NFC] Delete two newly-added test cases Failing on bots in unobvious ways.	2021-05-13 21:23:01 +03:00
Roman Lebedev	ecc4e9e8f4	[NFC] Try to fix CodeGenCXX/thunk-wrong-return-type.cpp test	2021-05-13 21:17:31 +03:00
cynecx	8ec9fd4839	Support unwinding from inline assembly I've taken the following steps to add unwinding support from inline assembly: 1) Add a new `unwind` "attribute" (like `sideeffect`) to the asm syntax: ``` invoke void asm sideeffect unwind "call thrower", "~{dirflag},~{fpsr},~{flags}"() to label %exit unwind label %uexit ``` 2.) Add Bitcode writing/reading support + LLVM-IR parsing. 3.) Emit EHLabels around inline assembly lowering (SelectionDAGBuilder + GlobalISel) when `InlineAsm::canThrow` is enabled. 4.) Tweak InstCombineCalls/InlineFunction pass to not mark inline assembly "calls" as nounwind. 5.) Add clang support by introducing a new clobber: "unwind", which lower to the `canThrow` being enabled. 6.) Don't allow unwinding callbr. Reviewed By: Amanieu Differential Revision: https://reviews.llvm.org/D95745	2021-05-13 19:13:03 +01:00
Roman Lebedev	9d3eb7885d	[NFC] Try to fix CodeGenCXX/thunk-wrong-this.cpp test	2021-05-13 21:10:14 +03:00
Duncan P. N. Exon Smith	7c57a9bd7d	Modules: Simplify how DisableGeneratingGlobalModuleIndex is set, likely NFC DisableGeneratingGlobalModuleIndex was being set by CompilerInstance::findOrCompileModuleAndReadAST most of (but not all of) the times it returned `nullptr` as a "normal" failure. Pull that up to the caller, CompilerInstance::loadModule, to simplify the code. This resolves a number of FIXMEs added during the refactoring in `5cca622310`. The extra cases where this is set are all some version of a fatal error, and the only client of the field, shouldBuildGlobalModuleIndex, seems to be unreachable in that case. Even if there is some corner case where this has an effect, it seems like the right/consistent behaviour. Differential Revision: https://reviews.llvm.org/D101672	2021-05-13 10:39:40 -07:00
Roman Lebedev	16d0381841	Return "[CGCall] Annotate `this` argument with alignment" The original change was reverted because it was discovered that clang mishandles thunks, and they receive wrong attributes for their this/return types - the ones for the function they will call, not the ones they have. While i have tried to fix this in https://reviews.llvm.org/D100388 that patch has been up and stuck for a month now, with little signs of progress. So while it will be good to solve this for real, for now we can simply avoid introducing the bug, by not annotating this/return for thunks. This reverts commit `6270b3a1ea`, relanding `0aa0458f14`.	2021-05-13 20:33:14 +03:00
Roman Lebedev	a624cec56d	[Clang][Codegen] Do not annotate thunk's this/return types with align/deref/nonnull attrs As it was discovered in post-commit feedback for `0aa0458f14`, we handle thunks incorrectly, and end up annotating their this/return with attributes that are valid for their callees, not for thunks themselves. While it would be good to fix this properly, and keep annotating them on thunks, i've tried doing that in https://reviews.llvm.org/D100388 with little success, and the patch is stuck for a month now. So for now, as a stopgap measure, subj.	2021-05-13 20:33:08 +03:00
Roman Lebedev	70aa4623de	[NFC][Clang][Codegen] Add tests with wrong attributes on this/return of thunks From https://reviews.llvm.org/D100388	2021-05-13 20:32:40 +03:00
Duncan P. N. Exon Smith	23e9146fba	Modules: Rename ModuleBuildFailed => DisableGeneratingGlobalModuleIndex, NFC Rename CompilerInstance's ModuleBuildFailed field to DisableGeneratingGlobalModuleIndex, which more precisely describes its role. Otherwise, it's hard to suss out how it's different from ModuleLoader::HadFatalFailure, and what sort of code simplifications are safe. Differential Revision: https://reviews.llvm.org/D101670	2021-05-13 10:22:40 -07:00
Valeriy Savchenko	45212dec01	[analyzer][solver] Prevent use of a null state rdar://77686137 Differential Revision: https://reviews.llvm.org/D102240	2021-05-13 20:16:29 +03:00
Duncan P. N. Exon Smith	7c2afd5899	Modules: Remove ModuleLoader::OtherUncachedFailure, NFC `5cca622310` refactored CompilerInstance::loadModule, splitting out findOrCompileModuleAndReadAST, but was careful to avoid making any functional changes. It added ModuleLoader::OtherUncachedFailure to facilitate this and left behind FIXMEs asking why certain failures weren't cached. After a closer look, I think we can just remove this and simplify the code. This changes the behaviour of the following (simplified) code from CompilerInstance::loadModule, causing a failure to be cached more often: ``` if (auto MaybeModule = MM.getCachedModuleLoad(Path[0].first)) return MaybeModule; if (ModuleName == getLangOpts().CurrentModule) return MM.cacheModuleLoad(PP.lookupModule(...)); ModuleLoadResult Result = findOrCompileModuleAndReadAST(...); if (Result.isNormal()) // This will be 'true' more often. return MM.cacheModuleLoad(..., Module); return Result; ``` `MM` here is a ModuleMap owned by the Preprocessor. Here are the cases where `findOrCompileModuleAndReadAST` starts returning a "normal" failed result: - Emitted `diag::err_module_not_found`, where there's no module map found. - Emitted `diag::err_module_build_disabled`, where implicitly building modules is disabled. - Emitted `diag::err_module_cycle`, which detects module cycles in the implicit modules build system. - Emitted `diag::err_module_not_built`, which avoids building a module in this CompilerInstance if another one tried and failed already. - `compileModuleAndReadAST()` was called and failed to build. The four errors are all fatal, and last item also reports a fatal error, so it this extra caching has no functionality change... but even if it did, it seems fine to cache these failed results within a ModuleMap instance (note that each CompilerInstance has its own Preprocessor and ModuleMap). Differential Revision: https://reviews.llvm.org/D101667	2021-05-13 10:10:46 -07:00
Lei Huang	9469ff15b7	[PowerPC] Add clang option -m[no-]prefixed Add user-facing front end option to turn off power10 prefixed instructions. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D102191	2021-05-13 12:02:10 -05:00
Joe Ellis	2ed7db0d20	[InstSimplify] Remove redundant {insert,extract}_vector intrinsic chains This commit removes some redundant {insert,extract}_vector intrinsic chains by implementing the following patterns as instsimplifies: (insert_vector _, (extract_vector X, 0), 0) -> X (extract_vector (insert_vector _, X, 0), 0) -> X Reviewed By: peterwaller-arm Differential Revision: https://reviews.llvm.org/D101986	2021-05-13 16:09:50 +00:00
Aaron En Ye Shi	6a67e05a26	[HIP] Add __builtin_amdgcn_groupstaticsize Differential Revision: https://reviews.llvm.org/D102403	2021-05-13 15:50:08 +00:00
Zarko Todorovski	8fa168fc50	Parse vector bool when stdbool.h and altivec.h are included Currently when including stdbool.h and altivec.h declaration of `vector bool` leads to errors due to `bool` being expanded to '_Bool`. This patch allows the parser to recognize `_Bool`. Reviewed By: hubert.reinterpretcast, Everybody0523 Differential Revision: https://reviews.llvm.org/D102064	2021-05-13 11:48:32 -04:00
Juneyoung Lee	395607af3c	Reapply [ConstantFold] Fold more operations to poison This was reverted to mitigate mitigate miscompiles caused by the logical and/or to bitwise and/or fold. Reapply it now that the underlying issue has been fixed by D101191. ----- This patch folds more operations to poison. Alive2 proof: https://alive2.llvm.org/ce/z/mxcb9G (it does not contain tests about div/rem because they fold to poison when raising UB) Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D92270	2021-05-13 16:04:12 +02:00
Oliver Stannard	92260d7a18	Revert "[CMake][ELF] Add -fno-semantic-interposition and -Bsymbolic-functions" This reverts commit `3bf1acab5b`. This is causing the test `gcov-shared-flush.c' to fail on the 2-stage aarch64 buildbots (https://lab.llvm.org/buildbot/#/builders/7/builds/2720).	2021-05-13 14:31:17 +01:00
Vassil Vassilev	4624412367	[clang-repl] Fix ClangReplInterpreterTests unittest dependency.	2021-05-13 10:32:08 +00:00
Nemanja Ivanovic	39e4676ca7	[PowerPC] Provide doubleword vector predicate form comparisons on Power7 There are two reasons this shouldn't be restricted to Power8 and up: 1. For XL compatibility 2. Because clang will expand comparison operators to these intrinsics* *Without this patch, the following causes a selection error: int test(vector signed long a, vector signed long b) { return a < b; } This patch provides the handling for the intrinsics in the back end and removes the Power8 guards from the predicate functions (vec_{all\|any}_{eq\|ne\|gt\|ge\|lt\|le}).	2021-05-13 04:56:56 -05:00
Vassil Vassilev	b2186a69c1	[clang-repl] Add final set of missing library dependencies.	2021-05-13 08:06:58 +00:00
serge-sans-paille	6045cb89e5	Use an allow list on reserved macro identifiers The allow list is based on various official sources (see in-code comment). This fixes https://bugs.llvm.org/show_bug.cgi?id=50248 Differential Revision: https://reviews.llvm.org/D102168	2021-05-13 09:23:47 +02:00
Vassil Vassilev	3f4c518592	[clang-repl] Add exhaustive list of libInterpreter dependencies. This patch should appease the bots building with -DBUILD_SHARED_LIBS=On, resolving the regression introduced in `92f9852fc9`.	2021-05-13 07:18:01 +00:00
Vassil Vassilev	92f9852fc9	[clang-repl] Recommit "Land initial infrastructure for incremental parsing" Original commit message: In http://lists.llvm.org/pipermail/llvm-dev/2020-July/143257.html we have mentioned our plans to make some of the incremental compilation facilities available in llvm mainline. This patch proposes a minimal version of a repl, clang-repl, which enables interpreter-like interaction for C++. For instance: ./bin/clang-repl clang-repl> int i = 42; clang-repl> extern "C" int printf(const char*,...); clang-repl> auto r1 = printf("i=%d\n", i); i=42 clang-repl> quit The patch allows very limited functionality, for example, it crashes on invalid C++. The design of the proposed patch follows closely the design of cling. The idea is to gather feedback and gradually evolve both clang-repl and cling to what the community agrees upon. The IncrementalParser class is responsible for driving the clang parser and codegen and allows the compiler infrastructure to process more than one input. Every input adds to the “ever-growing” translation unit. That model is enabled by an IncrementalAction which prevents teardown when HandleTranslationUnit. The IncrementalExecutor class hides some of the underlying implementation details of the concrete JIT infrastructure. It exposes the minimal set of functionality required by our incremental compiler/interpreter. The Transaction class keeps track of the AST and the LLVM IR for each incremental input. That tracking information will be later used to implement error recovery. The Interpreter class orchestrates the IncrementalParser and the IncrementalExecutor to model interpreter-like behavior. It provides the public API which can be used (in future) when using the interpreter library. Differential revision: https://reviews.llvm.org/D96033	2021-05-13 06:30:29 +00:00
Vassil Vassilev	f6907152db	Revert "[clang-repl] Land initial infrastructure for incremental parsing" This reverts commit `44a4000181`. We are seeing build failures due to missing dependency to libSupport and CMake Error at tools/clang/tools/clang-repl/cmake_install.cmake file INSTALL cannot find	2021-05-13 04:44:19 +00:00
Vassil Vassilev	44a4000181	[clang-repl] Land initial infrastructure for incremental parsing In http://lists.llvm.org/pipermail/llvm-dev/2020-July/143257.html we have mentioned our plans to make some of the incremental compilation facilities available in llvm mainline. This patch proposes a minimal version of a repl, clang-repl, which enables interpreter-like interaction for C++. For instance: ./bin/clang-repl clang-repl> int i = 42; clang-repl> extern "C" int printf(const char*,...); clang-repl> auto r1 = printf("i=%d\n", i); i=42 clang-repl> quit The patch allows very limited functionality, for example, it crashes on invalid C++. The design of the proposed patch follows closely the design of cling. The idea is to gather feedback and gradually evolve both clang-repl and cling to what the community agrees upon. The IncrementalParser class is responsible for driving the clang parser and codegen and allows the compiler infrastructure to process more than one input. Every input adds to the “ever-growing” translation unit. That model is enabled by an IncrementalAction which prevents teardown when HandleTranslationUnit. The IncrementalExecutor class hides some of the underlying implementation details of the concrete JIT infrastructure. It exposes the minimal set of functionality required by our incremental compiler/interpreter. The Transaction class keeps track of the AST and the LLVM IR for each incremental input. That tracking information will be later used to implement error recovery. The Interpreter class orchestrates the IncrementalParser and the IncrementalExecutor to model interpreter-like behavior. It provides the public API which can be used (in future) when using the interpreter library. Differential revision: https://reviews.llvm.org/D96033	2021-05-13 04:23:24 +00:00
Chen Zheng	a0ca4c46ca	[Debug-Info] add -gstrict-dwarf support in backend Reviewed By: dblaikie, probinson Differential Revision: https://reviews.llvm.org/D100826	2021-05-12 23:00:52 -04:00
Yaxun (Sam) Liu	ce6cc87ce9	[clang] Minor fix for MarkVarDeclODRUsed Merge two if as follow up of https://reviews.llvm.org/D102270	2021-05-12 22:32:10 -04:00
Richard Smith	e1aa528d3a	Handle unexpanded packs appearing in type-constraints. For a type-constraint in a lambda signature, this makes the lambda contain an unexpanded pack; for requirements in a requires-expressions it makes the requires-expression contain an unexpanded pack; otherwise it's invalid.	2021-05-12 18:45:34 -07:00
Richard Smith	2f9d8b08ea	PR50306: When instantiating a generic lambda with a constrained 'auto', properly track that it has constraints. Previously an instantiation of a constrained generic lambda would behave as if unconstrained because we incorrectly cached a "has no constraints" value that we computed before the constraints from 'auto' parameters were attached.	2021-05-12 18:45:33 -07:00
Richard Smith	e0acfed7ed	Clean up handling of constrained parameters in lambdas. No functionality change intended.	2021-05-12 18:45:33 -07:00
Richard Smith	4c88cfb1dc	Add test for substitutability of variable templates in closure type mangling.	2021-05-12 18:45:33 -07:00
Pushpinder Singh	10c779d206	[AMDGPU][OpenMP] Emit textual IR for -emit-llvm -S Previously clang would print a binary blob into the bundled file for amdgcn. With this patch, it will instead print textual IR as expected. Reviewed By: JonChesterfield, ronlieb Differential Revision: https://reviews.llvm.org/D102065 Change-Id: I10c0127ab7357787769fdf9a2edd4b3071e790a1	2021-05-13 01:34:03 +00:00
Richard Smith	5bb7e81c64	Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope. This implements the direction proposed in https://github.com/itanium-cxx-abi/cxx-abi/pull/126. Differential Revision: https://reviews.llvm.org/D101968	2021-05-12 13:13:21 -07:00
Erich Keane	08ba9ce1ef	Suppress Deferred Diagnostics in discarded statements. It doesn't really make sense to emit language specific diagnostics in a discarded statement, and suppressing these diagnostics results in a programming pattern that many users will feel is quite useful. Basically, this makes sure we only emit errors from the 'true' side of a 'constexpr if'. It does this by making the ExprEvaluatorBase type have an opt-in option as to whether it should visit discarded cases. Differential Revision: https://reviews.llvm.org/D102251	2021-05-12 12:48:47 -07:00
Pratyush Das	99d63ccff0	Add type information to integral template argument if required. Non-comprehensive list of cases: * Dumping template arguments; * Corresponding parameter contains a deduced type; * Template arguments are for a DeclRefExpr that hadMultipleCandidates() Type information is added in the form of prefixes (u8, u, U, L), suffixes (U, L, UL, LL, ULL) or explicit casts to printed integral template argument, if MSVC codeview mode is disabled. Differential revision: https://reviews.llvm.org/D77598	2021-05-12 19:00:08 +00:00
Nico Weber	d8c227ba05	Revert "Produce warning for performing pointer arithmetic on a null pointer." This reverts commit `dfc1e31d49`. See discussion on https://reviews.llvm.org/D98798	2021-05-12 14:53:50 -04:00
Shoaib Meenai	56f7e5a822	[cmake] Add support for multiple distributions LLVM's build system contains support for configuring a distribution, but it can often be useful to be able to configure multiple distributions (e.g. if you want separate distributions for the tools and the libraries). Add this support to the build system, along with documentation and usage examples. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D89177	2021-05-12 11:13:18 -07:00
Benjamin Kramer	1470b8587f	Remove AST inclusion from Basic include That's a cyclic dependency. NFC.	2021-05-12 19:51:21 +02:00
Roman Lebedev	81f56a2eb3	[NFC][clang][Codegen] Split ThunkInfo into it's own header Otherwise we'll have issues with forward definition of GlobalDecl. Split off from https://reviews.llvm.org/D100388	2021-05-12 20:39:54 +03:00
Roman Lebedev	2d84195d60	[NFCI][clang][Codegen] CodeGenVTables::addVTableComponent(): use getGlobalDecl It does the same thing. Split off from https://reviews.llvm.org/D100388	2021-05-12 20:39:54 +03:00
Fangrui Song	3bf1acab5b	[CMake][ELF] Add -fno-semantic-interposition and -Bsymbolic-functions llvm-dev message: https://lists.llvm.org/pipermail/llvm-dev/2021-May/150465.html In an ELF shared object, a default visibility defined symbol is preemptible by default. This creates some missed optimization opportunities. -fno-semantic-interposition can optimize -fPIC: * in Clang: avoid GOT/PLT cost for variable access/function calls to external linkage definition in the same TU * in GCC: enable interprocedural optimizations (including inlining) and avoid PLT See https://gist.github.com/MaskRay/2d4dfcfc897341163f734afb59f689c6 for more information. -Bsymbolic-functions is more aggressive than -fvisibility-inlines-hidden (present since 2012) as it applies to all function definitions. It can * avoid PLT for cross-TU function calls && reduce dynamic symbol lookup * reduce dynamic symbol lookup for taking function addresses and optimize out GOT/TOC on x86-64/ppc64 With both options, the libLLVM.so and libclang-cpp.so performance should be closer to PIE binary linking against `libLLVM.a` and `libclang.a` (In a -DLLVM_TARGETS_TO_BUILD=X86 build, the number of JUMP_SLOT decreases from 12716 to 1628, and the number of GLOB_DAT decreases from 1918 to 1313 The built clang with `-DLLVM_LINK_LLVM_DYLIB=on -DCLANG_LINK_CLANG_DYLIB=on` is significantly faster. See the Linux kernel build result https://bugs.archlinux.org/task/70697 ) Some implication: Interposing a subset of functions is no longer supported. (This is fragile anyway and cannot really be supported. For Mach-O we don't use `ld -interpose`, so interposition is not supported on Mach-O at all.) Compiling a program which takes the address of any LLVM function with `{gcc,clang} -fno-pic` and expects the address to equal to the address taken from libLLVM.so or libclang-cpp.so is unsupported. I am fairly confident that llvm-project shouldn't have different behaviors depending on such pointer equality (as we've been using -fvisibility-inlines-hidden which applies to inline functions for a long time), but if we accidentally do, users should be aware that they should not make assumption on pointer equality in `-fno-pic` mode. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D102090	2021-05-12 10:34:31 -07:00
Anastasia Stulova	58d18dde5c	[OpenCL] Remove pragma requirement from Arm dot extension. This removed the pointless need for extension pragma since it doesn't disable anything properly and it doesn't need to enable anything that is not possible to disable. The change doesn't break existing kernels since it allows to compile more cases i.e. without pragma statements but the pragma continues to be accepted. Differential Revision: https://reviews.llvm.org/D100985	2021-05-12 16:25:33 +01:00
Yaxun (Sam) Liu	98575708da	[CUDA][HIP] Fix device template variables Currently clang does not emit device template variables instantiated only in host functions, however, nvcc is able to do that: https://godbolt.org/z/fneEfferY This patch fixes this issue by refactoring and extending the existing mechanism for emitting static device var ODR-used by host only. Basically clang records device variables ODR-used by host code and force them to be emitted in device compilation. The existing mechanism makes sure these device variables ODR-used by host code are added to llvm.compiler-used, therefore they are guaranteed not to be deleted. It also fixes non-ODR-use of static device variable by host code causing static device variable to be emitted and registered, which should not. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D102237	2021-05-12 11:13:29 -04:00
Ben Shi	892c56eabe	[clang][AVR] Redefine some types to be compatible with avr-gcc Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D100701	2021-05-12 22:05:26 +08:00
Qiu Chaofan	cbd93cee9b	Revert "[PowerPC] [Clang] Enable float128 feature on VSX targets" This commit brought build break in some f128 related tests. But that's not the root cause. There exists some differences between Clang and GCC's definition for 128-bit float types on PPC, so macros/functions in glibc may not work with clang -mfloat128 well. We need to handle this carefully and reland it.	2021-05-12 16:51:52 +08:00
Qiu Chaofan	febbe4b5a0	[PowerPC] [Clang] Enable float128 feature on VSX targets Reviewed By: nemanjai, steven.zhang Differential Revision: https://reviews.llvm.org/D92815	2021-05-12 14:33:41 +08:00
Richard Smith	bb726383ac	Revert "Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope." This reverts commit `697ac15a0f`, for which review was not complete. That change was accidentally pushed when an unrelated change was pushed.	2021-05-11 17:46:18 -07:00
Richard Smith	3978333b71	Add test for PR50039. I believe Clang's behavior is correct according to the standard here, but this is an unusual situation for which we had no test coverage, so I'm adding some.	2021-05-11 17:35:34 -07:00
Richard Smith	697ac15a0f	Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope. This implements the direction proposed in https://github.com/itanium-cxx-abi/cxx-abi/pull/126. Differential Revision: https://reviews.llvm.org/D101968	2021-05-11 17:35:33 -07:00
Leonard Chan	5cb17728d1	[clang][Fuchsia] Introduce compat multilibs These are GCC-compatible multilibs that use the generic Itanium C++ ABI instead of the Fuchsia C++ ABI. Differential Revision: https://reviews.llvm.org/D102030	2021-05-11 15:45:38 -07:00
Victor Huang	46475a79f8	[AIX][TLS] Diagnose use of unimplemented TLS models Add front end diagnostics to report error for unimplemented TLS models set by - compiler option `-ftls-model` - attributes like `__thread int __attribute__((tls_model("local-exec"))) var_name;` Reviewed by: aaron.ballman, nemanjai, PowerPC Differential Revision: https://reviews.llvm.org/D102070	2021-05-11 17:21:08 -05:00
Mike Rice	f90abac6ca	[OpenMP] Use compound operators for reduction combiner if available. The OpenMP spec seems to require the compound operators be used for +, *, &, \|, and ^ reduction. So use these if a class has those operators. If not try the simple operators as we did previously to limit the impact to existing code. Fixes: https://bugs.llvm.org/show_bug.cgi?id=48584 Differential Revision: https://reviews.llvm.org/D101941	2021-05-11 11:39:12 -07:00
Fangrui Song	2075f2b296	[clang] Support -fpic -fno-semantic-interposition for RISCV -fno-semantic-interposition (only effective with -fpic) can optimize default visibility external linkage (non-ifunc-non-COMDAT) variable access and function calls to avoid GOT/PLT, by using local aliases, e.g. ``` int var; __attribute__((optnone)) int fun(int x) { return x * x; } int test() { return fun(var); } ``` -fpic (var and fun are dso_preemptable) ``` test: .LBB1_1: auipc a0, %got_pcrel_hi(var) ld a0, %pcrel_lo(.LBB1_1)(a0) lw a0, 0(a0) // fun is preemptible by default in ld -shared mode. ld will create a PLT. tail fun@plt ``` vs -fpic -fno-semantic-interposition (var and fun are dso_local) ``` test: .Ltest$local: .LBB1_1: auipc a0, %pcrel_hi(.Lvar$local) addi a0, a0, %pcrel_lo(.LBB1_1) lw a0, 0(a0) // The assembler either resolves .Lfun$local at assembly time (-mno-relax // -fno-function-sections), or produces a relocation referencing a non-preemptible // local symbol (which can avoid PLT). tail .Lfun$local ``` Note: Clang's default -fpic is more aggressive than GCC -fpic: interprocedural optimizations (including inlining) are available but local aliases are not used. -fpic -fsemantic-interposition can disable interprocedural optimizations. Depends on D101875 Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D101876	2021-05-11 11:38:32 -07:00
Pirama Arumuga Nainar	0fd0a010a1	[git-clang-format] Do not apply clang-format to symlinks This fixes PR46992. Git stores symlinks as text files and we should not format them even if they have one of the requested extensions. (Move the call to `cd_to_toplevel()` up a few lines so we can also print the skipped symlinks during verbose output.) Differential Revision: https://reviews.llvm.org/D101878	2021-05-11 10:34:40 -07:00
Jamie Schmeiser	dfc1e31d49	Produce warning for performing pointer arithmetic on a null pointer. Summary: Test and produce warning for subtracting a pointer from null or subtracting null from a pointer. Reuse existing warning that this is undefined behaviour. Also add unit test for both warnings. Reformat to satisfy clang-format. Respond to review comments: add additional test. Respond to review comments: Do not issue warning for nullptr - nullptr in C++. Fix indenting to satisfy clang-format. Respond to review comments: Add C++ tests. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: efriedma (Eli Friedman), nickdesaulniers (Nick Desaulniers) Differential Revision: https://reviews.llvm.org/D98798	2021-05-11 11:29:50 -04:00
Pushpinder Singh	eca3d68399	Revert "[AMDGPU][OpenMP] Emit textual IR for -emit-llvm -S" This reverts commit `7f78e409d0`.	2021-05-11 10:07:13 -05:00
Anastasia Stulova	13ea238b1e	[OpenCL] Allow use of double type without extension pragma. Simply use of extensions by allowing the use of supported double types without the pragma. Since earlier standards instructed that the pragma is used explicitly a new warning is introduced in pedantic mode to indicate that use of type without extension pragma enable can be non-portable. This patch does not break backward compatibility since the extension pragma is still supported and it makes the behavior of the compiler less strict by accepting code without extra pragma statements. Differential Revision: https://reviews.llvm.org/D100980	2021-05-11 12:54:38 +01:00
Paulo Matos	d7086af214	[WebAssembly] Support for WebAssembly globals in LLVM IR This patch adds support for WebAssembly globals in LLVM IR, representing them as pointers to global values, in a non-default, non-integral address space. Instruction selection legalizes loads and stores to these pointers to new WebAssemblyISD nodes GLOBAL_GET and GLOBAL_SET. Once the lowering creates the new nodes, tablegen pattern matches those and converts them to Wasm global.get/set of the appropriate type. Based on work by Paulo Matos in https://reviews.llvm.org/D95425. Reviewed By: pmatos Differential Revision: https://reviews.llvm.org/D101608	2021-05-11 11:19:29 +02:00
Ole Strohm	7d20f709ea	[OpenCL] [NFC] Fixed underline being too short in rst	2021-05-11 09:45:28 +01:00
Craig Topper	18f3a14e13	[RISCV] Validate the SEW and LMUL operands to __builtin_rvv_vsetvli(max) These are required to be constants, this patch makes sure they are in the accepted range of values. These are usually created by wrappers in the riscv_vector.h header which should always be correct. This patch protects against a user using the builtin directly. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D102086	2021-05-10 12:11:13 -07:00
Fangrui Song	68a20c7f36	[clang] Support -fpic -fno-semantic-interposition for AArch64 -fno-semantic-interposition (only effective with -fpic) can optimize default visibility external linkage (non-ifunc-non-COMDAT) variable access and function calls to avoid GOT/PLT, by using local aliases, e.g. ``` int var; __attribute__((optnone)) int fun(int x) { return x * x; } int test() { return fun(var); } ``` -fpic (var and fun are dso_preemptable) ``` test: // @test adrp x8, :got:var ldr x8, [x8, :got_lo12:var] ldr w0, [x8] // fun is preemptible by default in ld -shared mode. ld will create a PLT. b fun ``` vs -fpic -fno-semantic-interposition (var and fun are dso_local) ``` test: // @test .Ltest$local: adrp x8, .Lvar$local ldr w0, [x8, :lo12:.Lvar$local] // The assembler either resolves .Lfun$local at assembly time, or produces a // relocation referencing a non-preemptible section symbol (which can avoid PLT). b .Lfun$local ``` Note: Clang's default -fpic is more aggressive than GCC -fpic: interprocedural optimizations (including inlining) are available but local aliases are not used. -fpic -fsemantic-interposition can disable interprocedural optimizations. Depends on D101872 Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D101873	2021-05-10 09:43:33 -07:00
serge-sans-paille	91a919e899	[NFC] Synchronize reserved identifier code between macro and variables / symbols Differential Revision: https://reviews.llvm.org/D102164	2021-05-10 17:46:51 +02:00
Momchil Velikov	5c7b43aa82	[clang][AArch32] Correctly align HA arguments when passed on the stack Analogously to https://reviews.llvm.org/D98794 this patch uses the `alignstack` attribute to fix incorrect passing of homogeneous aggregate (HA) arguments on AArch32. The EABI/AAPCS was recently updated to clarify how VFP co-processor candidates are aligned: `4488e34998` Differential Revision: https://reviews.llvm.org/D100853	2021-05-10 16:28:46 +01:00
Alexey Bataev	230953d577	[OPENMP]Fix PR48851: the locals are not globalized in SPMD mode. Follow the more general patch for now, do not try to SPMDize the kernel if the variable is used and local. Differential Revision: https://reviews.llvm.org/D101911	2021-05-10 06:34:11 -07:00
Nico Weber	08de6e3ada	clang: Fix tests after `7f78e409d0` if clang is not called clang-13 We might release a new version at some point after all. In fact, use the same pattern the other CHECK lines in this test use, for consistency.	2021-05-10 08:49:26 -04:00
Kadir Cetinkaya	761f3d1675	[clang][PreProcessor] Cutoff parsing after hitting completion point This fixes a crash caused by Lexers being invalidated at code completion points in https://github.com/llvm/llvm-project/blob/main/clang/lib/Lex/PPLexerChange.cpp#L520. Differential Revision: https://reviews.llvm.org/D102069	2021-05-10 11:24:27 +02:00
Pushpinder Singh	7f78e409d0	[AMDGPU][OpenMP] Emit textual IR for -emit-llvm -S Previously clang would print a binary blob into the bundled file for amdgcn. With this patch, it will instead print textual IR as expected. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D102065	2021-05-10 07:54:23 +00:00
Pushpinder Singh	c711aa0f6f	[amdgpu-arch] Guard hsa.h with __has_include This patch is suppose to fix the issue of hsa.h not found. Issue was reported in D99949 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D102067	2021-05-10 07:33:30 +00:00
Yuanfang Chen	9ffd4924e8	[NFC][Coroutines] Fix two tests by removing hardcoded SSA value.	2021-05-09 19:06:16 -07:00
Xiang1 Zhang	d4bdeca576	[X86] Support AMX fast register allocation Differential Revision: https://reviews.llvm.org/D100026	2021-05-08 14:21:11 +08:00
Xiang1 Zhang	bebafe01a7	Revert "[X86] Support AMX fast register allocation" This reverts commit `77e2e5e07d`.	2021-05-08 13:43:32 +08:00
Xiang1 Zhang	77e2e5e07d	[X86] Support AMX fast register allocation	2021-05-08 13:27:21 +08:00
Arthur Eubanks	34a8a437bf	[NewPM] Hide pass manager debug logging behind -debug-pass-manager-verbose Printing pass manager invocations is fairly verbose and not super useful. This allows us to remove DebugLogging from pass managers and PassBuilder since all logging (aside from analysis managers) goes through instrumentation now. This has the downside of never being able to print the top level pass manager via instrumentation, but that seems like a minor downside. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D101797	2021-05-07 21:51:47 -07:00
Weston Carvalho	1f65f42dd3	Make `hasTypeLoc` matcher support more node types. Differential Revision: https://reviews.llvm.org/D101572	2021-05-08 00:35:22 +01:00
Weston Carvalho	0ad494838b	NFC: Move TypeList implementation up the file This will make it possible for more code to use it.	2021-05-08 00:35:13 +01:00
Arthur Eubanks	6f7131002b	[NewPM] Move analysis invalidation/clearing logging to instrumentation We're trying to move DebugLogging into instrumentation, rather than being part of PassManagers/AnalysisManagers. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D102093	2021-05-07 15:25:31 -07:00
Petr Hosek	167906c109	[BareMetal] Ensure that sysroot always comes after library paths This addresses an issue introduced in D91559. We would invoke the compiler with -Lpath/to/lib --sysroot=path/to/sysroot where both locations contain libraries with the same name, but we expect linker to pick up the library in path/to/lib since that version is more specialized. This was the case before D91559 where the sysroot path would be ignored, but after that change linker would now pick up the library from the sysroot which resulted in unexpected behavior. The sysroot path should always come after any user provided library paths, followed by compiler runtime paths. We want for libraries in user provided library paths to always take precedence over sysroot libraries. This matches the behavior of other toolchains used with other targets. Differential Revision: https://reviews.llvm.org/D102049	2021-05-07 14:42:02 -07:00
Petr Hosek	f97ada27aa	Revert "[BareMetal] Ensure that sysroot always comes after library paths" This reverts commit `6b00b34b8a`.	2021-05-07 13:38:04 -07:00
Olivier Goffart	c4adc49a1c	[SEH] Fix regression with SEH in noexpect functions Commit `5baea05601` set the CurCodeDecl because it was needed to pass the assert in CodeGenFunction::EmitLValueForLambdaField, But this was not right to do as CodeGenFunction::FinishFunction passes it to EmitEndEHSpec and cause corruption of the EHStack. Revert the part of the commit that changes the CurCodeDecl, and instead adjust the assert to check for a null CurCodeDecl. Differential Revision: https://reviews.llvm.org/D102027	2021-05-07 13:27:59 -07:00
Petr Hosek	6b00b34b8a	[BareMetal] Ensure that sysroot always comes after library paths This addresses an issue introduced in D91559. We would invoke the compiler with -Lpath/to/lib --sysroot=path/to/sysroot where both locations contain libraries with the same name, but we expect linker to pick up the library in path/to/lib since that version is more specialized. This was the case before D91559 where the sysroot path would be ignored, but after that change linker would now pick up the library from the sysroot which resulted in unexpected behavior. The sysroot path should always come after any user provided library paths, followed by compiler runtime paths. We want for libraries in user provided library paths to always take precedence over sysroot libraries. This matches the behavior of other toolchains used with other targets. Differential Revision: https://reviews.llvm.org/D102049	2021-05-07 13:21:07 -07:00
Hsiangkai Wang	c04c66d705	[RISCV] Consider scalar types for required extensions. We have vector operations on double vector and float scalar. For example, vfwadd.wf is such a instruction. vfloat64m1_t vfwadd_wf(vfloat64m1_t op0, float op1, size_t op2); We should specify F and D extensions for it. Differential Revision: https://reviews.llvm.org/D102051	2021-05-08 04:06:45 +08:00
Thomas Lively	1e9c39a3f9	[WebAssembly] Use functions instead of macros for const SIMD intrinsics To improve hygiene, consistency, and usability, it would be good to replace all the macro intrinsics in wasm_simd128.h with functions. The reason for using macros in the first place was to enforce the use of constants for some arguments using `_Static_assert` with `__builtin_constant_p`. This commit switches to using functions and uses the `__diagnose_if__` attribute rather than `_Static_assert` to enforce constantness. The remaining macro intrinsics cannot be made into functions until the builtin functions they are implemented with can be replaced with normal code patterns because the builtin functions themselves require that their arguments are constants. This commit also fixes a bug with the const_splat intrinsics in which the f32x4 and f64x2 variants were incorrectly producing integer vectors. Differential Revision: https://reviews.llvm.org/D102018	2021-05-07 11:50:19 -07:00
Ahsan Saghir	25bbff632d	[PowerPC] Provide MMA builtins for compatibility Vector pair intrinsics and builtins were renamed in https://reviews.llvm.org/D91974 to replace the _mma_ prefix by _vsx_. However, some projects used the _mma_ version, so this patch adds these intrinsics to provide compatibility. Fixes Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=50159 Reviewed By: nemanjai, amyk Differential Revision: https://reviews.llvm.org/D100482	2021-05-07 09:10:16 -05:00
Anastasia Stulova	76f1de10f4	[OpenCL] Fix optional image types. This change allows the use of identifiers for image types from `cl_khr_gl_msaa_sharing` freely in the kernel code if the extension is not supported since they are not in the list of the reserved identifiers. This change also removed the need for pragma for the types in the extensions since the spec does not require the pragma uses. Differential Revision: https://reviews.llvm.org/D100983	2021-05-07 13:29:28 +01:00
Ole Strohm	f372ff17f7	[NFC] (test commit) Changed example invocation of C++ for OpenCL	2021-05-07 12:31:37 +01:00
Bruno Cardoso Lopes	819e0d105e	[CGAtomic] Lift strong requirement for remaining compare_exchange combinations Follow up on `431e3138a` and complete the other possible combinations. Besides enforcing the new behavior, it also mitigates TSAN false positives when combining orders that used to be stronger.	2021-05-06 21:05:20 -07:00
Stanislav Mekhanoshin	c714d03785	[AMDGPU] Expose __builtin_amdgcn_perm for v_perm_b32 Differential Revision: https://reviews.llvm.org/D102022	2021-05-06 16:17:33 -07:00
Aaron Puchert	d21e1b79ff	Thread safety analysis: Eliminate parameter from intersectAndWarn (NFC) We were modifying precisely when intersecting the lock sets of multiple predecessors without back edge. That's no coincidence: we can't modify on back edges, it doesn't make sense to modify at the end of a function, and otherwise we always want to intersect on forward edges, because we can build a new lock set for those. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D101755	2021-05-06 23:07:42 +02:00
Alex Hoppen	a3a8a1a15b	[Index] Ignore nullptr decls for indexing We can end up with a call to `indexTopLevelDecl(D)` with `D == nullptr` in non-assert builds e.g. when indexing a module in `indexModule` and - `ASTReader::GetDecl` returns `nullptr` if `Index >= DeclsLoaded.size()`, thus returning `nullptr` => `ModuleDeclIterator::operator*` returns `nullptr` => we call `IndexCtx.indexTopLevelDecl` with `nullptr` Be resilient and just ignore the `nullptr` decls during indexing. Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D102001	2021-05-06 13:12:26 -07:00
Petr Hosek	8cb191b724	[Fuchsia][CMake] Update OSX deployment target Use correct spelling of CMAKE_OSX_DEPLOYMENT_TARGET and bump the minimum version to 10.13 which matches what we use for host tools in Fuchsia. Differential Revision: https://reviews.llvm.org/D102013	2021-05-06 12:06:16 -07:00
Thomas Lively	b198b9b897	[WebAssembly] Fix argument types in SIMD narrowing intrinsics The builtins were updated to take signed parameters in `627a526955`, but the intrinsics that use those builtins were not updated as well. The intrinsic test did not catch this sign mismatch because it is only reported as an error under -fno-lax-vector-conversions. This commit fixes the type mismatch and adds -fno-lax-vector-conversions to the test to catch similar problems in the future. Differential Revision: https://reviews.llvm.org/D101979	2021-05-06 10:07:45 -07:00
Nemanja Ivanovic	1faf3b195e	[PowerPC] Re-commit `ed87f512bb` This was reverted in `3761b9a234` just as I was about to commit the fix. This patch inlcudes the necessary fix.	2021-05-06 09:50:12 -05:00
David Spickett	e4b790c5e3	[OpenMP] Temporarily require X86 target for parallel_for_codegen.cpp test Since https://reviews.llvm.org/D101849 this test has been failing on bots that only enable either Arm or AArch64 targets. See: https://lab.llvm.org/buildbot/#/builders/107/builds/7601 Temporarily requires X86 for this test while the difference is figured out.	2021-05-06 14:16:43 +00:00
Nico Weber	3761b9a234	Revert "[PowerPC] Provide some P8-specific altivec overloads for P7" This reverts commit `ed87f512bb`. Breaks check-clang, see e.g. https://lab.llvm.org/buildbot/#/builders/139/builds/3818	2021-05-06 10:01:16 -04:00
Nemanja Ivanovic	ed87f512bb	[PowerPC] Provide some P8-specific altivec overloads for P7 This adds additional support for XL compatibility. There are a number of functions in altivec.h that produce a single instruction (or a very short sequence) for Power8 but can be done on Power7 without scalarization. XL provides these implementations. This patch adds the following overloads for doubleword vectors: vec_add vec_cmpeq vec_cmpgt vec_cmpge vec_cmplt vec_cmple vec_sl vec_sr vec_sra	2021-05-06 08:37:36 -05:00
Paul C. Anagnostopoulos	d40a0b8af7	[TableGen] [Clang] Clean up Options.td and add asserts. Differential Revision: https://reviews.llvm.org/D101766	2021-05-06 09:32:25 -04:00
Anastasia Stulova	c28a602329	[OpenCL] Remove subgroups pragma in enqueue kernel and pipe builtins. This patch simplifies the parser and makes the language semantics consistent. There is no extension pragma requirement in the spec for the subgroup functions in enqueue kernel or pipes and all other builtin functions are available without the pragama. Differential Revision: https://reviews.llvm.org/D100984	2021-05-06 13:59:38 +01:00
Jon Chesterfield	b24e9f82b7	[amdgpu-arch] Fix rpath to run from build dir [amdgpu-arch] Fix rpath to run from build dir Prior to this, amdgpu-arch has RUNPATH set to $ORIGIN/../lib which works for some installs, but not from the build directory where clang executes the tool from when running tests. This cmake option adds the location of the rocr runtime to the RUNPATH (note, it amends RUNPATH here, despite the cmake option referring to RPATH) to create a binary that runs from build or install location. Before: RUNPATH [$ORIGIN/../lib] After: RUNPATH [$ORIGIN/../lib:$HOME/llvm-install/lib] Credit to Greg for knowing this trick and pointing to examples of it in use for the aomp build scripts. Reviewed By: pdhaliwal Differential Revision: https://reviews.llvm.org/D101926	2021-05-06 13:07:00 +01:00
Eliza Velasquez	cdf33962d9	[clang-format] Rename common types between C#/JS Reviewed By: curdeius Differential Revision: https://reviews.llvm.org/D101862	2021-05-06 12:12:58 +02:00
Eliza Velasquez	ec725b307f	[clang-format] Fix C# nullable-related errors This fixes two errors: Previously, clang-format was splitting up type identifiers from the nullable ?. This changes this behavior so that the type name sticks with the operator. Additionally, nullable operators attached to return types in interface functions were not parsed correctly. Digging deeper, it looks like interface bodies were being parsed differently than classes and structs, causing MustBeDeclaration to be incorrect for interface members. They now share the same logic. One other change is reintroducing the CSharpNullable type independent of JsTypeOptionalQuestion. Despite having a similar semantic purpose, their actual syntax differs quite a bit. Reviewed By: MyDeveloperDay, curdeius Differential Revision: https://reviews.llvm.org/D101860	2021-05-06 12:11:15 +02:00
Eliza Velasquez	a437befa8f	[clang-format] Add more support for C# 8 nullables This adds support for the null-coalescing assignment and null-forgiving operators. https://docs.microsoft.com/en-us/dotnet/csharp/language-reference/operators/null-coalescing-operator https://docs.microsoft.com/en-us/dotnet/csharp/language-reference/operators/null-forgiving Reviewed By: krasimir, curdeius Differential Revision: https://reviews.llvm.org/D101702	2021-05-06 11:58:38 +02:00
Johannes Doerfert	df729e2b82	[OpenMP] Overhaul `declare target` handling This patch fixes various issues with our prior `declare target` handling and extends it to support `omp begin declare target` as well. This started with PR49649 in mind, trying to provide a way for users to avoid the "ref" global use introduced for globals with internal linkage. From there it went down the rabbit hole, e.g., all variables, even `nohost` ones, were emitted into the device code so it was impossible to determine if "ref" was needed late in the game (based on the name only). To make it really useful, `begin declare target` was needed as it can carry the `device_type`. Not emitting variables eagerly had a ripple effect. Finally, the precedence of the (explicit) declare target list items needed to be taken into account, that meant we cannot just look for any declare target attribute to make a decision. This caused the handling of functions to require fixup as well. I tried to clean up things while I was at it, e.g., we should not "parse declarations and defintions" as part of OpenMP parsing, this will always break at some point. Instead, we keep track what region we are in and act on definitions and declarations instead, this is what we do for declare variant and other begin/end directives already. Highlights: - new diagnosis for restrictions specificed in the standard, - delayed emission of globals not mentioned in an explicit list of a declare target, - omission of `nohost` globals on the host and `host` globals on the device, - no explicit parsing of declarations in-between `omp [begin] declare variant` and the corresponding end anymore, regular parsing instead, - precedence for explicit mentions in `declare target` lists over implicit mentions in the declaration-definition-seq, and - `omp allocate` declarations will now replace an earlier emitted global, if necessary. --- Notes: The patch is larger than I hoped but it turns out that most changes do on their own lead to "inconsistent states", which seem less desirable overall. After working through this I feel the standard should remove the explicit declare target forms as the delayed emission is horrible. That said, while we delay things anyway, it seems to me we check too often for the current status even though that is often not sufficient to act upon. There seems to be a lot of duplication that can probably be trimmed down. Eagerly emitting some things seems pretty weak as an argument to keep so much logic around. --- Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D101030	2021-05-06 02:10:41 -05:00
Johannes Doerfert	3f14596700	[OpenMP] Ensure the DefaultMapperId has a location A user reported an assertion (below) but without a reproducer. I failed to create a test myself but from the assertion one can derive the problem. I set the DefaultMapperId location now to make sure this doesn't cause trouble. ``` clang-13: .../DeclTemplate.h:1940: void clang::ClassTemplateSpecializationDecl::setPointOfInstantiation(clang::SourceLocation): Assertion `Loc.isValid() && "point of instantiation must be valid!"' failed. ``` Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D100621	2021-05-06 02:10:36 -05:00
Johannes Doerfert	5d8d994dfb	[OpenMP] Make sure classes work on the device as they do on the host We do provide `operator delete(void*)` in `<new>` but it should be available by default. This is mostly boilerplate to test it and the unconditional include of `<new>` in the header we always in include on the device. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D100620	2021-05-06 02:10:30 -05:00
Giorgis Georgakoudis	207b08a913	[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks This patch refactors a subset of Clang OpenMP tests, generating checklines using the update_cc_test_checks script. This refactoring facilitates updating the Clang OpenMP code generation codebase by automating test generation. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101849	2021-05-05 20:08:38 -07:00
Giorgis Georgakoudis	f97b843d88	[OpenMP] Fix non-determinism in clang copyin codegen Codegen for OpeMP copyin has non-deterministic IR output due to the unspecified evaluation order in a codegen conditional branch, which makes automatic test generation unreliable. This patch refactors codegen code to avoid this non-determinism. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101952	2021-05-05 19:24:03 -07:00
Juneyoung Lee	8a156d1c27	[InstCombine] Fully disable select to and/or i1 folding This is a patch that disables the poison-unsafe select -> and/or i1 folding. It has been blocking D72396 and also has been the source of a few miscompilations described in llvm.org/pr49688 . D99674 conditionally blocked this folding and successfully fixed the latter one. The former one was still blocked, and this patch addresses it. Note that a few test functions that has `_logical` suffix are now deoptimized. These are created by @nikic to check the impact of disabling this optimization by copying existing original functions and replacing and/or with select. I can see that most of these are poison-unsafe; they can be revived by introducing freeze instruction. I left comments at fcmp + select optimizations (or-fcmp.ll, and-fcmp.ll) because I think they are good targets for freeze fix. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D101191	2021-05-06 09:29:52 +09:00
Petr Hosek	9d3dbcd24c	[Driver] Move -print-runtime-dir and -print-resource-dir tests Put these into a separate files to match other -print-* options tests. Differential Revision: https://reviews.llvm.org/D101813	2021-05-05 15:23:49 -07:00
Richard Smith	6bbfa0fd40	When performing template argument deduction to select a partial specialization while substituting a partial template parameter pack, don't try to extend the existing deduction. This caused us to select the wrong partial specialization in some rare cases. A recent change to libc++ caused this to happen in practice for code using std::conjunction.	2021-05-05 14:47:18 -07:00
Giorgis Georgakoudis	78a7d8c4dd	[Utils][NFC] Rename replace-function-regex in update_cc_test_checks This patch renames the replace-function-regex to replace-value-regex to indicate that the existing regex replacement functionality can replace any IR value besides functions. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101934	2021-05-05 14:19:30 -07:00
Thomas Lively	81fce29d6e	[WebAssembly] Add SIMD const_splat intrinsics These intrinsics do not correspond to their own underlying instruction, but are a convenience for the common case of materializing a constant vector that has the same value in each lane. Differential Revision: https://reviews.llvm.org/D101885	2021-05-05 13:46:45 -07:00
Thomas Lively	602f318cfd	[WebAssembly] Fix constness of pointer params to load intrinsics Update the SIMD builtin load functions to take pointers to const data and update the intrinsics themselves to not cast away constness. Differential Revision: https://reviews.llvm.org/D101884	2021-05-05 13:16:56 -07:00
Thomas Lively	627a526955	[WebAssembly] Update narrowing builtin function operand types Make the inputs to all narrowing builtins signed, which is how they are interpreted by the underlying instructions (only the result changes sign between instructions). Differential Revision: https://reviews.llvm.org/D101883	2021-05-05 13:04:04 -07:00
Nick Desaulniers	aefbfbcbd7	[Clang] remove text extension from diag::err_drv_invalid_value_with_suggestion This hinders translations, as per: https://clang.llvm.org/docs/InternalsManual.html#the-format-string Reviewed By: MaskRay, xbolva00 Differential Revision: https://reviews.llvm.org/D101387	2021-05-05 11:01:43 -07:00
Philipp Krones	632ebc4ab4	[MC] Untangle MCContext and MCObjectFileInfo This untangles the MCContext and the MCObjectFileInfo. There is a circular dependency between MCContext and MCObjectFileInfo. Currently this dependency also exists during construction: You can't contruct a MOFI without a MCContext without constructing the MCContext with a dummy version of that MOFI first. This removes this dependency during construction. In a perfect world, MCObjectFileInfo wouldn't depend on MCContext at all, but only be stored in the MCContext, like other MC information. This is future work. This also shifts/adds more information to the MCContext making it more available to the different targets. Namely: - TargetTriple - ObjectFileType - SubtargetInfo Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101462	2021-05-05 10:03:02 -07:00
Nico Weber	f16afcd9b5	[clang] remove an incremental build workaround This cleaned up an oversight over a year ago. Should no longer be needed.	2021-05-05 12:21:56 -04:00
Jinsong Ji	20d0aca430	[clang][Driver] Add -fintegrate-as to debug-pass-structure test CGProfilePass is not always on, it will be disabled when using non-intergrated assemblers. // Only enable CGProfilePass when using integrated assembler, since // non-integrated assemblers don't recognize .cgprofile section. PMBuilder.CallGraphProfile = !CodeGenOpts.DisableIntegratedAS; Add -fintegrate-as to make sure the output don't rely on the platform default. Reviewed By: evgeny777 Differential Revision: https://reviews.llvm.org/D101918	2021-05-05 16:10:57 +00:00
Sushma Unnibhavi	67ee2f870d	Added a faster method to clone llvm project [DOCS] Reviewed By: xgupta, amccarth Differential Revision: https://reviews.llvm.org/D101433	2021-05-05 21:37:53 +05:30
Pushpinder Singh	1f5cacfcb8	[AMDGPU][OpenMP] Fix clang driver crash when provided -c The offload action is used in four different ways as explained in Driver.cpp:4495. When -c is present, the final phase will be assemble (linker when -c is not present). However, this phase is skipped according to D96769 for amdgcn. So, offload action arrives into following situation, compile (device) ---> offload ---> offload without -c the chain looks like, compile (device) ---> offload ---> linker (device) ---> offload The former situation creates an unhandled case which causes problem. The solution presented in this patch delays the D96769 logic until job creation time. This keeps the offload action in the 1 of the 4 specified situations. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D101901	2021-05-05 14:26:58 +00:00
Anastasia Stulova	e994e74bca	[OpenCL] Add clang extension for non-portable kernel parameters. Added __cl_clang_non_portable_kernel_param_types extension that allows using non-portable types as kernel parameters. This allows bypassing the portability guarantees from the restrictions specified in C++ for OpenCL v1.0 s2.4. Currently this only disables the restrictions related to the data layout. The programmer should ensure the compiler generates the same layout for host and device or otherwise the argument should only be accessed on the device side. This extension could be extended to other case (e.g. permitting size_t) if desired in the future. Patch by olestrohm (Ole Strohm)! https://reviews.llvm.org/D101168	2021-05-05 14:58:23 +01:00
Hans Wennborg	4f4aa7b78d	Require asserts for clang/test/Headers/wasm.c The test doesn't pass in no-asserts builds, see comment on https://reviews.llvm.org/D101805	2021-05-05 11:42:18 +02:00
Yang Fan	cab3c6c6c4	[clang][TargetCXXABI] Fix -Wreturn-type warning (NFC) GCC warning: ``` In file included from /llvm-project/clang/include/clang/Basic/LangOptions.h:22, from /llvm-project/clang/include/clang/Frontend/CompilerInvocation.h:16, from /llvm-project/clang/lib/Frontend/CompilerInvocation.cpp:9: /llvm-project/clang/include/clang/Basic/TargetCXXABI.h: In static member function ‘static bool clang::TargetCXXABI::isSupportedCXXABI(const llvm::Triple&, clang::TargetCXXABI::Kind)’: /llvm-project/clang/include/clang/Basic/TargetCXXABI.h:114:3: warning: control reaches end of non-void function [-Wreturn-type] 114 \| }; \| ^ ```	2021-05-05 14:44:48 +08:00
Giorgis Georgakoudis	f016c06abb	Revert "[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks" This reverts commit `956cae2f09`.	2021-05-04 17:12:32 -07:00
Giorgis Georgakoudis	956cae2f09	[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks This patch refactors a subset of Clang OpenMP tests, generating checklines using the update_cc_test_checks script. This refactoring facilitates updating the Clang OpenMP code generation codebase by automating test generation. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101849	2021-05-04 16:58:45 -07:00
Thomas Lively	f3b769e82f	[WebAssembly] Add codegen test for wasm_simd128.h We previously did not have tests demonstrating that the intrinsics in wasm_simd128.h lower to reasonable LLVM IR. This commit adds such a test. Differential Revision: https://reviews.llvm.org/D101805	2021-05-04 16:11:00 -07:00
Leonard Chan	0277a24f4b	[clang][test] Update -fc++-abi tests This attempts to move driver tests out of Frontend and to Driver, separates RUNs that should fail from RUNs that should succeed, and prevent creating output files or dumping output. Differential Revision: https://reviews.llvm.org/D101867	2021-05-04 15:53:00 -07:00
Adrian Prantl	6c3a10760d	Mark Basic/TargetCXXABI.def as textual in the module map.	2021-05-04 12:52:52 -07:00
Giorgis Georgakoudis	92f2c39f91	[Utils] Run non-filecheck runlines in-order in update_cc_test_checks The script update_cc_test_checks runs all non-filechecked runlines before the filechecked ones. This creates problems since outputs of those non-filechecked runlines may conflict and that will fail the execution of update_cc_test_checks. This patch executes non-filechecked in the order specified in the test file to avoid this issue. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101683	2021-05-04 12:06:03 -07:00
Giorgis Georgakoudis	313ee609e1	[OpenMP] Fix non-determinism in clang task codegen (lastprivates) Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101800	2021-05-04 11:56:31 -07:00
Leonard Chan	9c72a210b5	Fix for test failure caused by `84c4754372`. Reduces the number of targets/triples for this test since not all cmake invocations will build for those targets.	2021-05-04 11:45:32 -07:00
Dan Liew	1971823ecb	[Driver] Fix `ToolChain::getCompilerRTPath()` to return the correct path on Apple platforms. When the target triple was an Apple platform `ToolChain::getOSLibName()` (called by `getCompilerRTPath()`) would return the full OS name including the version number (e.g. `darwin20.3.0`). This is not correct because the library directory for all Apple platforms is `darwin`. This in turn caused * `-print-runtime-dir` to return a non-existant path. * `-print-file-name=<any compiler-rt library>` to return the filename instead of the full path to the library. Two regression tests are included. rdar://77417317 Differential Revision: https://reviews.llvm.org/D101682	2021-05-04 11:28:26 -07:00
Nathan James	61dc0f2b59	[Format] Don't sort includes if DisableFormat is true Fixes https://llvm.org/PR35099. I'm not sure if this decision was intentional but its definitely confusing for users. Reviewed By: MyDeveloperDay, HazardyKnusperkeks, curdeius Differential Revision: https://reviews.llvm.org/D101628	2021-05-04 19:04:12 +01:00
Leonard Chan	84c4754372	[clang] Add -fc++-abi= flag for specifying which C++ ABI to use This implements the flag proposed in RFC http://lists.llvm.org/pipermail/cfe-dev/2020-August/066437.html. The goal is to add a way to override the default target C++ ABI through a compiler flag. This makes it easier to test and transition between different C++ ABIs through compile flags rather than build flags. In this patch: - Store -fc++-abi= in a LangOpt. This isn't stored in a CodeGenOpt because there are instances outside of codegen where Clang needs to know what the ABI is (particularly through ASTContext::createCXXABI), and we should be able to override the target default if the flag is provided at that point. - Expose the existing ABIs in TargetCXXABI as values that can be passed through this flag. - Create a .def file for these ABIs to make it easier to check flag values. - Add an error for diagnosing bad ABI flag values. Differential Revision: https://reviews.llvm.org/D85802	2021-05-04 10:52:13 -07:00
Andrew Savonichev	b451ecd86e	[Clang][AArch64] Disable rounding of return values for AArch64 If a return value is explicitly rounded to 64 bits, an additional zext instruction is emitted, and in some cases it prevents tail call optimization. As discussed in D100225, this rounding is not necessary and can be disabled. Differential Revision: https://reviews.llvm.org/D100591	2021-05-04 20:29:01 +03:00

1 2 3 4 5 ...

89241 Commits