llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-29 14:20:29 +00:00

Author	SHA1	Message	Date
Matt Arsenault	f48b6ba0a8	GlobalISel: Use Register	2020-08-19 13:45:31 -04:00
Mehdi Amini	db235b2187	Revert "Revert "[NFC][llvm] Make the contructors of `ElementCount` private."" Was reverted because MLIR/Flang builds were broken, these APIs have been fixed in the meantime.	2020-08-19 17:26:36 +00:00
Mehdi Amini	4386b1823a	Revert "[NFC][llvm] Make the contructors of `ElementCount` private." This reverts commit 264afb9e6aebc98c353644dd0700bec808501cab. (and dependent 6b742cc48 and fc53bd610f) MLIR/Flang are broken.	2020-08-19 17:21:37 +00:00
Jessica Paquette	153c17604a	[GlobalISel] Add combine for (x & mask) -> x when (x & mask) == x If we have a mask, and a value x, where (x & mask) == x, we can drop the AND and just use x. This is about a 0.4% geomean code size improvement on CTMark at -O3 for AArch64. In AArch64, this is most useful post-legalization. Patterns like this often show up when legalizing s1s, which must be extended to larger types. e.g. ``` %cmp:_(s32) = G_ICMP ... %and:_(s32) = G_AND %cmp, 1 ``` Since G_ICMP only produces a single bit, there's no reason to mask it with the G_AND. Differential Revision: https://reviews.llvm.org/D85463	2020-08-19 10:20:57 -07:00
Francesco Petrogalli	d75808bc7f	[NFC][llvm] Make the contructors of `ElementCount` private. Differential Revision: https://reviews.llvm.org/D86120	2020-08-19 16:26:44 +00:00
Bjorn Pettersson	1813b6efab	[GlobalISel] Untabify InstructionSelectorImpl.h. NFC	2020-08-19 12:00:00 +02:00
sstefan1	de2379255b	[IR] Intrinsics default attributes and opt-out flag Intrinsic properties can now be set to default and applied to all intrinsics. If the attributes are not needed, the user can opt-out by setting the DisableDefaultAttributes flag to true. Differential Revision: https://reviews.llvm.org/D70365	2020-08-19 10:50:46 +02:00
Ronak Chauhan	4697f34ed6	Revert "[AMDGPU] Support disassembly for AMDGPU kernel descriptors" This reverts commit cacfb02d28a3cabd4e45d2535cb0686cef48a2c9. Reverting due to buildbot failures.	2020-08-19 13:12:29 +05:30
David Sherwood	f7a1832d69	[SVE][CodeGen] Fix scalable vector issues in DAGTypeLegalizer::GenWidenVectorLoads In DAGTypeLegalizer::GenWidenVectorLoads the algorithm assumes it only ever deals with fixed width types, hence the offsets for each individual store never take 'vscale' into account. I've changed the code in that function to use TypeSize instead of unsigned for tracking the remaining load amount. In addition, I've changed the load loop to use the new IncrementPointer helper function for updating the addresses in each iteration, since this handles scalable vector types. Also, I've added report_fatal_errors in GenWidenVectorExtLoads, TargetLowering::scalarizeVectorLoad and TargetLowering::scalarizeVectorStores, since these functions currently use a sequence of element-by-element scalar loads/stores. In a similar vein, I've also added a fatal error report in FindMemType for the case when we decide to return the element type for a scalable vector type. I've added new tests in CodeGen/AArch64/sve-split-load.ll CodeGen/AArch64/sve-ld-addressing-mode-reg-imm.ll for the changes in GenWidenVectorLoads. Differential Revision: https://reviews.llvm.org/D85909	2020-08-19 07:54:32 +01:00
Yaxun (Sam) Liu	6660be7005	[HIP] Support target id by --offload-arch This patch introduces support of target id by --offload-arch. Differential Revision: https://reviews.llvm.org/D60620	2020-08-18 23:43:53 -04:00
Ronak Chauhan	142f4dd209	[AMDGPU] Support disassembly for AMDGPU kernel descriptors Decode AMDGPU Kernel descriptors as assembler directives. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D80713	2020-08-19 08:49:07 +05:30
Elliott Hughes	8e3a33cacc	ld128 demangle: allow space for 'L' suffix. Summary: Caught by HWASAN on arm64 Android (which uses ld128 for long double). This was running the existing fuzzer. The specific minimized fuzz input to reproduce this is: __cxa_demangle("1\006ILeeeEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE", 0, 0, 0); Reviewers: eugenis, srhines, #libc_abi! Subscribers: kristof.beyls, danielkiss, libcxx-commits Tags: #libc_abi Differential Revision: https://reviews.llvm.org/D77924	2020-08-18 16:14:05 -07:00
Jessica Paquette	67ae683e5b	[GlobalISel][CallLowering] NFC: Unify flag-setting from CallBase + AttributeList It's annoying to have to maintain multiple, nearly identical chains of if statements which all set the same attributes. Add a helper function, `addFlagsUsingAttrFn` which performs the attribute setting. Then, use wrappers for that function in `lowerCall` and `setArgFlags`. (Note that the flag-setting code in `setArgFlags` was missing the returned attribute. There's no selection for this yet, so no test. It's an example of the kind of thing this lets us avoid, though.) Differential Revision: https://reviews.llvm.org/D86159	2020-08-18 11:07:33 -07:00
Matt Arsenault	418515b7d0	GlobalISel: Implement fewerElementsVector for G_INSERT_VECTOR_ELT Add unit tests since AMDGPU will only trigger this for gigantic vectors, and won't use the annoying odd sized breakdown case.	2020-08-18 13:51:19 -04:00
David Blaikie	01ab206194	[WIP][DebugInfo] Lazily parse debug_loclist offsets Parsing DWARFv5 debug_loclist offsets when a CU is parsed is weighing down memory usage of symbolizers that don't need to parse this data at all. There's not much benefit to caching these anyway - since they are O(1) lookup and reading once you know where the offset list starts (and can do bounds checking with the offset list size too). In general, I think it might be time to start paying down some of the technical debt of loc/loclist/range/rnglist parsing to try to unify it a bit more. eg: * Currently DWARFUnit has: RangeSection, RangeSectionBase, LocSection, LocSectionBase, LocTable, RngListTable, LoclistTableHeader (be nice if these were all wrapped up in two variables - one for loclists, one for rnglists) * rnglists and loclists are handled differently (see: LoclistTableHeader, but no RnglistTableHeader) * maybe all these types could be less stateful - lazily parse what they need to, even reparsing rather than caching because it doesn't seem too expensive, for instance. (though admittedly so long as it's constant cost/overhead per compilation that's probably adequate) * Maybe implementing and using a DWARFDataExtractor that can be sub-ranged (so we could slice it up to just the single contribution) - though maybe that's not so useful because loc/ranges need to refer to it by absolute, not contribution-relative mechanisms Differential Revision: https://reviews.llvm.org/D86110	2020-08-18 10:49:39 -07:00
Amara Emerson	f6bce1ffcd	[GlobalISel] Add a combine for sext_inreg(load x), c --> sextload x This is restricted to single use loads, which if we fold to sextloads we can find more optimal addressing modes on AArch64. This also fixes an overload of the MachineFunction::getMachineMemOperand() method which was incorrectly using the MF alignment instead of the MMO alignment. Differential Revision: https://reviews.llvm.org/D85966	2020-08-18 10:42:15 -07:00
Amara Emerson	d1d273ff1c	[GlobalISel] Add a combine for ashr(shl x, c), c --> sext_inreg x, c' By detecting this sign extend pattern early, we can uncover opportunities for more optimizations. Differential Revision: https://reviews.llvm.org/D85965	2020-08-18 10:42:15 -07:00
Jessica Paquette	7e08e6c7a3	[GlobalISel][CallLowering] Look through call parameters for flags We weren't looking through the parameters on calls at all. E.g., say you had ``` declare i32 @zext(i32 zeroext %x) ... %y = call i32 @zext(i32 %something) ... ``` At the point of the call, we wouldn't know that the %something should have the zeroext attribute. This sets flags in about the same way as TargetLoweringBase::ArgListEntry::setAttributes. Differential Revision: https://reviews.llvm.org/D86125	2020-08-18 08:48:56 -07:00
Ronak Chauhan	6e3663ae70	[ELF] Hide target specific methods as private Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D86136	2020-08-18 18:26:08 +05:30
Ronak Chauhan	0f95014e38	[llvm-objdump][AMDGPU] Detect CPU string AMDGPU ISA isn't backwards compatible and hence -mcpu must always be specified during disassembly. However, the AMDGPU target CPU is stored in e_flags in the ELF object. This patch allows targets to implement CPU string detection, and also implements it for AMDGPU by looking at e_flags. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D84519	2020-08-18 17:43:16 +05:30
Shinji Okumura	8816b1755f	[Attributor] Deduce noundef attribute This patch introduces a new abstract attribute `AANoUndef` which corresponds to `noundef` IR attribute and deduce them. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D85184	2020-08-18 18:05:54 +09:00
Johannes Doerfert	117fb9d08e	[Attributor][NFC] Directly return proper type to avoid casts	2020-08-17 23:36:36 -05:00
Harmen Stoppels	cca791c2df	Use find_library for ncurses Currently it is hard to avoid having LLVM link to the system install of ncurses, since it uses check_library_exists to find e.g. libtinfo and not find_library or find_package. With this change the ncurses lib is found with find_library, which also considers CMAKE_PREFIX_PATH. This solves an issue for the spack package manager, where we want to use the zlib installed by spack, and spack provides the CMAKE_PREFIX_PATH for it. This is a similar change as https://reviews.llvm.org/D79219, which just landed in master. Differential revision: https://reviews.llvm.org/D85820	2020-08-17 19:52:52 -07:00
Amy Kwan	c630ca1f5c	[PowerPC] Implement Vector Extract Mask builtins in LLVM/Clang This patch implements the vec_extractm function prototypes in altivec.h in order to utilize the vector extract with mask instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D82675	2020-08-17 21:14:17 -05:00
Hamilton Tobon Mosquera	59a4de0434	[OpenMPOpt][HideMemTransfersLatency] Split __tgt_target_data_begin_mapper into its "issue" and "wait" counterparts. WIP that tries to hide the latency of runtime calls that involve host to device memory transfers by splitting them into their "issue" and "wait" versions. The "issue" is moved upwards as much as possible. The "wait" is moved downwards as much as possible. The "issue" issues the memory transfer asynchronously, returning a handle. The "wait" waits on the returned handle for the memory transfer to finish. The movement itself is still missing.	2020-08-17 20:56:10 -05:00
Hongtao Yu	43bf988191	[llvm-objdump] Symbolize binary addresses for low-noisy asm diff. When diffing disassembly dump of two binaries, I see lots of noises from mismatched jump target addresses and global data references, which unnecessarily causes diffs on every function, making it impractical. I'm trying to symbolize the raw binary addresses to minimize the diff noise. In this change, a local branch target is modeled as a label and the branch target operand will simply be printed as a label. Local labels are collected by a separate pre-decoding pass beforehand. A global data memory operand will be printed as a global symbol instead of the raw data address. Unfortunately, due to the way the disassembler is set up and to be less intrusive, a global symbol is always printed as the last operand of a memory access instruction. This is less than ideal but is probably acceptable from checking code quality point of view since on most targets an instruction can have at most one memory operand. So far only the X86 disassemblers are supported. Test Plan: llvm-objdump -d --x86-asm-syntax=intel --no-show-raw-insn --no-leading-addr : ``` Disassembly of section .text: <_start>: push rax mov dword ptr [rsp + 4], 0 mov dword ptr [rsp], 0 mov eax, dword ptr [rsp] cmp eax, dword ptr [rip + 4112] # 202182 <g> jge 0x20117e <_start+0x25> call 0x201158 <foo> inc dword ptr [rsp] jmp 0x201169 <_start+0x10> xor eax, eax pop rcx ret ``` llvm-objdump -d --symbolize-operands --x86-asm-syntax=intel --no-show-raw-insn --no-leading-addr : ``` Disassembly of section .text: <_start>: push rax mov dword ptr [rsp + 4], 0 mov dword ptr [rsp], 0 <L1>: mov eax, dword ptr [rsp] cmp eax, dword ptr <g> jge <L0> call <foo> inc dword ptr [rsp] jmp <L1> <L0>: xor eax, eax pop rcx ret ``` Note that the jump instructions like `jge 0x20117e <_start+0x25>` without this work is printed as a real target address and an offset from the leading symbol. 
With a change in the optimizer that adds/deletes an instruction, the address and offset may shift for targets placed after the instruction. This will be a problem when diffing the disassembly from two optimizers where there are unnecessary false positives due to such branch target address changes. With `--symbolize-operands`, a label is printed for a branch target instead to reduce the false positives. Similarly, the disassembly of PC-relative global variable references is also prone to instruction insertion/deletion. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D84191	2020-08-17 16:55:12 -07:00
Matt Arsenault	4a4ddffaf1	GlobalISel: Make type for lower action more consistently optional Some of the lower implementations were relying on this, however the type was not set depending on which form of the .lower* helper you were using. For instance, if you used an unconditional lower(), the type was never set. Most of the lower actions do not benefit from a type parameter, and just expand in terms of the original operation's types. However, some lowerings could benefit from an additional type hint to combine a promotion and an expansion. An example of this is for add/sub sat. The DAG integer legalization tries to use smarter expansions directly when promoting the integer type, and doesn't always produce the same instruction with a wider type. Treat this as an optional hint argument, that only means something for specific lower actions. It may be useful to generalize this mechanism to pass a full list of type indexes and desired types, but I haven't run into a case like that yet.	2020-08-17 16:24:55 -04:00
diggerlin	78cee19819	[AIX][XCOFF][Patch1] Provide decoding trace back table information API for xcoff object file for llvm-objdump -d SUMMARY: 1. This patch provides an API for decoding the traceback table info, with unit tests for the API. 2. Follow-up patches will do the following: 2.1 add a new option --traceback-table to decode the traceback table information for an xcoff object file when using llvm-objdump to disassemble it. 2.2 print out the traceback table information for llvm-objdump. Reviewers: Jason liu, Hubert Tong, James Henderson Differential Revision: https://reviews.llvm.org/D81585	2020-08-17 16:23:47 -04:00
Dávid Bolvanský	26599cbe3f	Revert "[BPI] Improve static heuristics for integer comparisons" This reverts commit 50c743fa713002fe4e0c76d23043e6c1f9e9fe6f. Patch will be split to smaller ones.	2020-08-17 20:44:33 +02:00
Valentin Clement	a11d8d3dcc	[flang][directives] Use TableGen to generate clause unparsing Use the TableGen directive back-end to generate code for the clauses unparsing. Reviewed By: sscalpone, kiranchandramohan Differential Revision: https://reviews.llvm.org/D85851	2020-08-17 14:22:25 -04:00
Matt Arsenault	116b71f4c8	GlobalISel: Fix parameter name in doxygen comment	2020-08-17 13:57:10 -04:00
Matt Arsenault	a71dba11e9	GlobalISel: Revisit users of other merge opcodes in artifact combiner The artifact combiner searches for the uses of G_MERGE_VALUES for unmerge/trunc that need further combining. This also needs to handle the vector merge opcodes the same way. This fixes leaving behind some pairs I expected to be removed, which were removed only if the legalizer was run a second time.	2020-08-17 13:56:53 -04:00
Matt Arsenault	5bcccd9fb4	GlobalISel: Remove unnecessary check for copy type COPY isn't allowed to change the type, but can mix no type with type.	2020-08-17 09:19:25 -04:00
Alex Zinenko	544267f834	[llvm] support graceful failure of DataLayout parsing Existing implementation always aborts on syntax errors in a DataLayout description. While this is meaningful for consuming textual IR modules, it is inconvenient for users that may need fine-grained control over the layout from, e.g., command-line options. Propagate errors through the parsing functions and only abort in the top-level parsing function instead. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D85650	2020-08-17 15:10:37 +02:00
Simon Pilgrim	8366289c89	[DemandedBits] Improve accuracy of Add propagator The current demand propagator for addition will mark all input bits at and right of the alive output bit as alive. But carry won't propagate beyond a bit for which both operands are zero (or one/zero in the case of subtraction) so a more accurate answer is possible given known bits. I derived a propagator by working through truth tables and using a bit-reversed addition to make demand ripple to the right, but I'm not sure how to make a convincing argument for its correctness in the comments yet. Nevertheless, here's a minimal implementation and test to get feedback. This would help in a situation where, for example, four bytes (<128) packed into an int are added with four others SIMD-style but only one of the four results is actually read. Known A: 0_______0_______0_______0_______ Known B: 0_______0_______0_______0_______ AOut: 00000000001000000000000000000000 AB, current: 00000000001111111111111111111111 AB, patch: 00000000001111111000000000000000 Committed on behalf of: @rrika (Erika) Differential Revision: https://reviews.llvm.org/D72423	2020-08-17 12:54:09 +01:00
Vitaly Buka	6f71d99b21	[StackSafety] Skip ambiguous lifetime analysis If we can't identify the alloca used in a lifetime marker we need to assume the worst-case scenario. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D84630	2020-08-16 18:05:52 -07:00
Fady Ghanim	a11b92e493	[OpenMP][OMPBuilder] Adding support for `omp single` This adds support for generating `omp single`, and necessary calls for `copyprivate` clause. Differential Revision: https://reviews.llvm.org/D85617	2020-08-16 01:15:16 -04:00
Wenlei He	8c3d7a1d09	[InlineAdvisor] New inliner advisor to replay inlining from optimization remarks This change added a new inline advisor that takes optimization remarks from previous inlining as input, and provides the decision as advice so current inlining can replay inline decisions of a different compilation. Dwarf inline stack with line and discriminator is used as anchor for call sites including call context. The change can be useful for Inliner tuning as it provides a channel to allow external input for tweaking inline decisions. Existing alternatives like alwaysinline attribute is per-function, not per-callsite. Per-callsite inline intrinsic can be another solution (not yet existing), but it's intrusive to implement and also does not differentiate call context. A switch -sample-profile-inline-replay=<inline_remarks_file> is added to hook up the new inline advisor with SampleProfileLoader's inline decision for replay. Since SampleProfileLoader does top-down inlining, inline decision can be specialized for each call context, hence we should be able to replay inlining accurately. However with a bottom-up inliner like CGSCC inlining, the replay can be limited due to lack of specialization for different call context. Apart from that limitation, the new inline advisor can still be used by regular CGSCC inliner later if needed for tuning purpose. This is a resubmit of https://reviews.llvm.org/D83743	2020-08-15 20:17:21 -07:00
Aditya Kumar	d33455c31b	[NFC] Fix typo and variable names	2020-08-15 09:06:22 -07:00
Philip Reames	bc40b4f222	[Statepoint] Remove code related to inline operand bundles This code becomes dead for valid IR after 48f4312 and a96fc46. The reason for the test change is that the verifier reports the first verification error encountered, in some non-specified visit order. By removing the verification code in gc.relocates for a statepoint with inline gc operands, I change the error the verifier reports. And in one case, the checked for error is no longer possible with the bundle representation, so I simply delete the file.	2020-08-14 20:29:41 -07:00
Arthur Eubanks	47a36fca7f	[NewPM][optnone] Mark various passes as required This was done by turning on -enable-npm-optnone and fixing failures. That will be enabled in a follow-up change for ease of reverting. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D85457	2020-08-14 15:51:59 -07:00
Craig Topper	10839866a1	[X86][MC][Target] Initial backend support a tune CPU to support -mtune This patch implements initial backend support for a -mtune CPU controlled by a "tune-cpu" function attribute. If the attribute is not present X86 will use the resolved CPU from target-cpu attribute or command line. This patch adds MC layer support for a tune CPU. Each CPU now has two sets of features stored in their GenSubtargetInfo.inc tables. These feature lists are passed separately to the Processor and ProcessorModel classes in tablegen. The tune list defaults to an empty list to avoid changes to non-X86. This annoyingly increases the size of static tables on all targets as we now store 24 more bytes per CPU. I haven't quantified the overall impact, but I can if we're concerned. One new test is added to X86 to show a few tuning features with mismatched tune-cpu and target-cpu/target-feature attributes to demonstrate independent control. Another new test is added to demonstrate that the scheduler model follows the tune CPU. I have not added a -mtune to llc/opt or MC layer command line yet. With no attributes we'll just use the -mcpu for both. MC layer tools will always follow the normal CPU for tuning. Differential Revision: https://reviews.llvm.org/D85165	2020-08-14 15:31:50 -07:00
Jordan Rupprecht	e6a595a264	Temporarily revert "[SCEVExpander] Add helper to clean up instrs inserted while expanding." This reverts commit 7829c33084a7a5097533cf862daef521380c4e63. The assertion is triggering on some internal code. A reduced test case is in progress.	2020-08-14 14:52:37 -07:00
Vitaly Buka	320ac778a0	[StackSafety] Use ValueInfo in ParamAccess::Call This avoids GUID lookup in Index.findSummaryInModule. Follow up for D81242. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D85269	2020-08-14 12:42:44 -07:00
Greg McGary	f5a6a11410	[MachO] Add skeletal support for DriverKit platform Define the platform ID = 10, and simple mappings between platform ID & name. Reviewed By: MaskRay, cishida Differential Revision: https://reviews.llvm.org/D85594	2020-08-14 12:36:43 -07:00
Matt Arsenault	2348cd927a	TableGen/GlobalISel: Partially handle immAllOnesV/immAllZerosV These should really match either G_BUILD_VECTOR or G_BUILD_VECTOR_TRUNC, but there doesn't seem to be an existing mechanism for matching alternative opcodes. There is GIM_SwitchOpcode, but it seems to assume it's only used for matcher optimization. I could also omit any opcode check and rely on the matcher directly checking the opcode, but the table optimizer currently assumes there has to be an opcode check. Also doesn't try to handle undef elements like the DAG version.	2020-08-14 13:55:30 -04:00
Bjorn Pettersson	2da0c11e30	[Orc] Fix werror for unused variable in noasserts build	2020-08-14 15:58:04 +02:00
Stefan Gränitz	dbb4ff9754	[ORC] Fix missing include in OrcRemoteTargetClient.h	2020-08-14 12:00:18 +02:00
Stefan Gränitz	e03618a9ec	[ORC] In LLLazyJIT provide public access to the CompileOnDemandLayer This is analogous to how LLJIT provides public access to all its layers. Differential Revision: https://reviews.llvm.org/D85921	2020-08-14 11:34:44 +02:00
Stefan Gränitz	92757f17c9	[ORC] Add JITLink-compatible remote memory-manager and LLJITWithChildProcess example This adds RemoteJITLinkMemoryManager, a new subclass of OrcRemoteTargetClient. It implements jitlink::JITLinkMemoryManager and targets the OrcRemoteTargetRPCAPI. Behavior should be very similar to RemoteRTDyldMemoryManager. The essential difference with JITLink is that allocations work in isolation from its memory manager. Thus, the RemoteJITLinkMemoryManager might be seen as "JITLink allocation factory". RPCMMAlloc is another subclass of OrcRemoteTargetClient and implements the actual functionality. It allocates working memory on the host and target memory on the remote target. Upon finalization working memory is copied over to the target address space. Finalization can be asynchronous for JITLink allocations, but I don't see that it makes a difference here. Differential Revision: https://reviews.llvm.org/D85919	2020-08-14 11:34:44 +02:00
Yuanfang Chen	105eeb4e46	[NewPM][CodeGen] Add machine code verification callback D83608 needs this. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D85916	2020-08-13 16:13:01 -07:00
Stefan Gränitz	4f0d2712f5	[ORC] cloneToNewContext() can work with a const-ref to ThreadSafeModule	2020-08-13 21:01:21 +02:00
Stefan Gränitz	f500d846dc	[ORC] Fix SymbolLookupSet::containsDuplicates()	2020-08-13 21:01:21 +02:00
Haowei Wu	d8abdb26ca	[elfabi] Move llvm-elfabi related code to InterfaceStub library This change moves elfabi related code to llvm/InterfaceStub library so it can be shared by multiple llvm tools without causing cyclic dependencies. Differential Revision: https://reviews.llvm.org/D85678	2020-08-13 11:51:44 -07:00
Sameer Arora	110f791fb7	[llvm-libtool-darwin] Refactor ArchiveWriter Refactoring function `writeArchive` in ArchiveWriter. Added a new function `writeArchiveBuffer` that returns the archive in a memory buffer instead of writing it out to the disk. This refactor is necessary so as to allow `llvm-libtool-darwin` to write universal files containing archives. Reviewed by jhenderson, MaskRay, smeenai Differential Revision: https://reviews.llvm.org/D84858	2020-08-13 10:56:30 -07:00
Dávid Bolvanský	7129f2d26c	[BPI] Improve static heuristics for integer comparisons Similarly as for pointers, even for integers a == b is usually false. GCC also uses this heuristic. Reviewed By: ebrevnov Differential Revision: https://reviews.llvm.org/D85781	2020-08-13 19:54:27 +02:00
Fangrui Song	311d485856	[llvm-objdump] Change symbol name/PLT decoding errors to warnings If the referenced symbol of a J[U]MP_SLOT is invalid (e.g. symbol index 0), llvm-objdump -d will bail out: ``` error: 'a': st_name (0x326600) is past the end of the string table of size 0x7 ``` where 0x326600 is the st_name field of the first entry past the end of .symtab Change it to a warning to continue dumping. `X86/plt.test` uses a prebuilt executable, so I pick `ELF/AArch64/plt.test` which has a YAML input and can be easily modified. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85623	2020-08-13 08:13:42 -07:00
David Stenberg	ce5974df13	[InstCombine] Fix incorrect Modified status When removing instructions from unreachable blocks, and only debug info intrinsics were removed, InstCombine could incorrectly return a false Modified status. This is fixed by making removeAllNonTerminatorAndEHPadInstructions() also return how many debug info intrinsics were removed, and take that into account. This was caught using the check introduced by D80916. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D85839	2020-08-13 15:10:41 +02:00
Dávid Bolvanský	baa55bd4d6	Revert "[BPI] Improve static heuristics for integer comparisons" This reverts commit 44587e2f7e732604cd6340061d40ac21e7e188e5. Sanitizer tests need to be updated.	2020-08-13 14:37:40 +02:00
Dávid Bolvanský	f4c1a714d0	[BPI] Improve static heuristics for integer comparisons Similarly as for pointers, even for integers a == b is usually false. GCC also uses this heuristic. Reviewed By: ebrevnov Differential Revision: https://reviews.llvm.org/D85781	2020-08-13 14:23:58 +02:00
Dávid Bolvanský	aecc53e597	Revert "[BPI] Improve static heuristics for integer comparisons" This reverts commit 385c9d673f217e176b18e7bf6fe055154ac589c6.	2020-08-13 12:59:15 +02:00
Dávid Bolvanský	b38379d5d6	[BPI] Improve static heuristics for integer comparisons Similarly as for pointers, even for integers a == b is usually false. GCC also uses this heuristic. Reviewed By: ebrevnov Differential Revision: https://reviews.llvm.org/D85781	2020-08-13 12:45:40 +02:00
Xing GUO	b9e787ce80	[DWARFYAML] Replace InitialLength with Format and Length. NFC. This change replaces the InitialLength of pub-tables with Format and Length. All the InitialLength fields have been removed. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85880	2020-08-13 18:39:03 +08:00
David Sherwood	ac8a8b4ad9	[SVE][CodeGen] Fix scalable vector issues in DAGTypeLegalizer::GenWidenVectorStores In DAGTypeLegalizer::GenWidenVectorStores the algorithm assumes it only ever deals with fixed width types, hence the offsets for each individual store never take 'vscale' into account. I've changed the main loop in that function to use TypeSize instead of unsigned for tracking the remaining store amount and offset increment. In addition, I've changed the loop to use the new IncrementPointer helper function for updating the addresses in each iteration, since this handles scalable vector types. Whilst fixing this function I also fixed a minor issue in IncrementPointer whereby we were not adding the no-unsigned-wrap flag for the add instruction in the same way as the fixed width case does. Also, I've added a report_fatal_error in GenWidenVectorTruncStores, since this code currently uses a sequence of element-by-element scalar stores. I've added new tests in CodeGen/AArch64/sve-intrinsics-stores.ll CodeGen/AArch64/sve-st1-addressing-mode-reg-imm.ll for the changes in GenWidenVectorStores. Differential Revision: https://reviews.llvm.org/D84937	2020-08-13 11:07:17 +01:00
David Sherwood	8fe44dff5f	[CodeGen] In narrowExtractedVectorLoad bail out for scalable vectors In narrowExtractedVectorLoad there is an optimisation that tries to combine extract_subvector with a narrowing vector load. At the moment this produces warnings due to the incorrect calls to getVectorNumElements() for scalable vector types. I've got this working for scalable vectors too when the extract subvector index is a multiple of the minimum number of elements. I have added a new variant of the function: MachineFunction::getMachineMemOperand that copies an existing MachineMemOperand, but replaces the pointer info with a null version since we cannot currently represent scaled offsets. I've added a new test for this particular case in: CodeGen/AArch64/sve-extract-subvector.ll Differential Revision: https://reviews.llvm.org/D83950	2020-08-13 10:46:18 +01:00
Amara Emerson	a56d446c69	[GlobalISel] Implement bit-test switch table optimization. This is mostly a straight port from SelectionDAG. We re-use the actual bit-test analysis part from SwitchLoweringUtils, which was factored out earlier to support jump-tables. Differential Revision: https://reviews.llvm.org/D85233	2020-08-12 11:31:39 -07:00
Christopher Tetreault	76b23db219	[SVE] Remove default-false VectorType::get Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D84212	2020-08-12 10:37:05 -07:00
David Green	51c55be3c2	[Scheduler] Fix typo in comments. NFC	2020-08-12 18:36:05 +01:00
Xing GUO	748a235862	[DWARFYAML] Make the address size of compilation units optional. This patch makes the 'AddrSize' field optional. If the address size is missing, yaml2obj will infer it from the object file. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85805	2020-08-12 21:47:32 +08:00
Igor Kudrin	cde37ff605	[llvm-dwarfdump] Avoid crashing if an abbreviation offset is invalid. Note that DWARFUnit::getAbbreviations() returns nullptr if the abbreviations could not be read, but callers used the returned pointer without checking. Differential Revision: https://reviews.llvm.org/D85738	2020-08-12 16:01:53 +07:00
David Sherwood	96c98d5a46	[SVE][CodeGen] Fix issues with EXTRACT_SUBVECTOR when using scalable FP vectors In this patch I have fixed two issues: 1. Our SVE tuple get/set intrinsics were using the wrong constant type for the index passed to EXTRACT_SUBVECTOR. I have fixed this by using the function SelectionDAG::getVectorIdxConstant to create the value. Also, I have updated the documentation for EXTRACT_SUBVECTOR describing what type the constant index should be and we now enforce this when creating the node. 2. The AArch64 backend was missing the appropriate patterns for extracting certain subvectors (nxv4f16 and nxv2f32) from legal SVE types. I have added them as part of this patch. The only way that I could find to test the new patterns was to use the SVE tuple get intrinsics, although I realise it looks a bit unusual. Tests added here: test/CodeGen/AArch64/sve-extract-subvector.ll Differential Revision: https://reviews.llvm.org/D85516	2020-08-12 08:35:46 +01:00
Kiran Chandramohan	f33b4004b5	[MLIR,OpenMP] Lowering of parallel operation: proc_bind clause 2/n This patch adds the translation of the proc_bind clause in a parallel operation. The values that can be specified for the proc_bind clause are specified in the OMP.td tablegen file in the llvm/Frontend/OpenMP directory. From this single source of truth enumeration for proc_bind is generated in llvm and mlir (used in specification of the parallel Operation in the OpenMP dialect). A function to return the enum value from the string representation is also generated. A new header file (DirectiveEmitter.h) containing definitions of classes directive, clause, clauseval etc is created so that it can be used in mlir as well. Reviewers: clementval, jdoerfert, DavidTruby Differential Revision: https://reviews.llvm.org/D84347	2020-08-12 08:03:13 +01:00
Petr Hosek	6abbc429da	[CMake] Simplify CMake handling for zlib Rather than handling zlib manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-08-11 20:22:11 -07:00
Vedant Kumar	60a6ae1b59	Revert "[Instruction] Add updateLocationAfterHoist helper" This reverts commit 4a646ca9e2caf70d6312714770f516fb83b7e3cb. This is causing some bots to fail with "!dbg attachment points at wrong subprogram for function", like: http://lab.llvm.org:8011/builders/sanitizer-windows/builds/67958/steps/stage%201%20check/logs/stdio	2020-08-11 14:54:09 -07:00
Vedant Kumar	62d3804379	[Instruction] Add updateLocationAfterHoist helper Introduce a helper on Instruction which can be used to update the debug location after hoisting. Use this in GVN and LICM, where we were mistakenly introducing new line 0 locations after hoisting (the docs recommend dropping the location in this case). For more context, see the discussion in https://reviews.llvm.org/D60913. Differential Revision: https://reviews.llvm.org/D85670	2020-08-11 14:05:20 -07:00
Nikita Popov	587bdc1d95	[InstSimplify] Respect CanUseUndef in more places Similar to what we do in IIQ, add an isUndefValue() helper that checks for undef values while respecting CanUseUndef. This makes it much easier to search for places that don't respect the flag yet.	2020-08-11 21:53:33 +02:00
diggerlin	b434098ac8	[AIX][XCOFF] Change the operand of branch instructions from symbol name to qualified symbol name for function declarations SUMMARY: 1. This patch removes the setting of the storage class in the function getXCOFFSection and in the constructor of class MCSectionXCOFF. The MCSectionXCOFF class has the members XCOFF::StorageMappingClass MappingClass; XCOFF::SymbolType Type; XCOFF::StorageClass StorageClass; these attributes are only used in the XCOFFObjectWriter (the asm path does not need the StorageClass), yet the values of StorageClass, Type, and MappingClass had to be obtained before every call to getXCOFFSection. In fact, the StorageClass of the MCSectionXCOFF can be obtained from its delegated symbol. 2. The operand of branch instructions also changes from symbol name to qualified symbol name. For example, change bl .foo extern .foo to bl .foo[PR] extern .foo[PR] 3. If there is an indirect call reference to a function bar, extern .bar[PR] is also added. Reviewers: Jason liu, Xiangling Liao Differential Revision: https://reviews.llvm.org/D84765	2020-08-11 15:26:19 -04:00
Jessica Paquette	8e321c8c5a	[GlobalISel] Combine (logic_op (op x...), (op y...)) -> (op (logic_op x, y)) This implements ``` (logic_op (op x...), (op y...)) -> (op (logic_op x, y)) ``` when `op` is an extend, a shift, or an and. This is similar to `DAGCombiner::hoistLogicOpWithSameOpcodeHands` (with a bunch of missing cases, e.g. G_TRUNC, G_BITCAST, etc.) This is implemented so it works both pre and post-legalization. This also adds a general way to add a series of instructions in a combine. (`applyBuildInstructionSteps`). Differential Revision: https://reviews.llvm.org/D85050	2020-08-11 10:40:06 -07:00
Matt Arsenault	1c0be51aa1	GlobalISel: Fix typo	2020-08-11 13:08:56 -04:00
Lang Hames	f0d37fd69e	[ORC] Move file-descriptor based raw byte channel into a public header. This will enable re-use in other llvm tools.	2020-08-11 09:50:58 -07:00
Nikita Popov	577d874016	[InstSimplify] Forbid undef folds in expandBinOp This is the replacement for D84250 based on D84792. As we recursively fold with the same value twice, we need to disable undef folds, to prevent an undef from being folded to two different values. Reverting rG00f3579aea6e3d4a4b7464c3db47294f71cef9e4 and using the test case from https://reviews.llvm.org/D83360#2145793, it no longer performs the incorrect fold. Differential Revision: https://reviews.llvm.org/D85684	2020-08-11 18:39:24 +02:00
Jay Foad	928c1dd7ef	[GlobalISel] Add G_ABS This is equivalent to the new llvm.abs intrinsic added by D84125 with is_int_min_poison=0. Differential Revision: https://reviews.llvm.org/D85718	2020-08-11 16:34:37 +01:00
Sanjay Patel	5b7d18ac79	[InstSimplify] fold min/max with matching min/max operands I think this is the last remaining translation of an existing instcombine transform for the corresponding cmp+sel idiom. This interpretation is more general though - we can remove mismatched signed/unsigned combinations in addition to the more obvious cases. min/max(X, Y) must produce X or Y as the result, so this is just another clause in the existing transform that was already matching a min/max of min/max.	2020-08-11 11:23:15 -04:00
Valentin Clement	402fcd0af2	[flang][directives] Use TableGen information for clause classes in parse-tree This patch takes advantage of the directive information and tablegen generation to replace the clauses class parse tree and in the dump parse tree sections. Reviewed By: sscalpone Differential Revision: https://reviews.llvm.org/D85549	2020-08-11 10:44:14 -04:00
Matt Arsenault	674304dfb1	GlobalISel: Implement bitcast action for G_INSERT_VECTOR_ELT This mirrors the support for the equivalent extracts. This also creates a huge mess that would be greatly improved if we had any bit operation combines.	2020-08-11 10:39:14 -04:00
Matt Arsenault	c9ab6823ab	TableGen/GlobalISel: Hack the operand order for atomic_store ISD::ATOMIC_STORE arbitrarily has the operands in the opposite order from regular ISD::STORE, which always introduced an annoying duplication of patterns to handle both cases. Since in GlobalISel there's just the one G_STORE, we need to swap the operands to correctly emit the type check for the pointer operand. Some work started in 20aafa31569b5157e792daa8860d71dd0df8a53a to migrate SelectionDAG to use ISD::STORE for atomics, but that work seems to have stalled. Since this is pretty much the last operation that matters and isn't supported for AMDGPU, use this compatibility hack to unblock declaring it functionally complete. Not sure what's going on with the pending_phis AArch64 test. It seems it didn't always use atomics, and I'm not sure what it was originally testing matters anymore.	2020-08-11 10:22:44 -04:00
clementval	2623773dd2	Revert "[flang][directives] Use TableGen information for clause classes in parse-tree" This reverts commit bf93edc4758b512e84b70ec6335d15b0cf386a18. Buildbot failure	2020-08-11 09:54:04 -04:00
Valentin Clement	72a67d9950	[flang][directives] Use TableGen information for clause classes in parse-tree This patch takes advantage of the directive information and tablegen generation to replace the clauses class parse tree and in the dump parse tree sections. Reviewed By: sscalpone Differential Revision: https://reviews.llvm.org/D85549	2020-08-11 09:43:11 -04:00
David Stenberg	331775f51f	[DebugInfo] Allow GNU macro extension to be read Allow the GNU .debug_macro extension to be parsed and printed by llvm-dwarfdump. In an upcoming patch support will be added for emitting that format also. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D82974	2020-08-11 13:30:52 +02:00
David Stenberg	9fb94de7cb	[DebugInfo] Introduce GNU macro extension entry encodings This is a preparatory patch for allowing the GNU .debug_macro extension, which is a precursor to the DWARF 5 format, to be emitted by LLVM for earlier DWARF versions. The entries share the same encoding and behavior as in DWARF5; there are just more entries in the DWARF 5 format. Therefore, we could have used those existing DWARF 5 entries, but I think that explicitly referring to the GNU macro variants makes the code more clear. The defines that this patch introduces can be found in GCC in the dwarf2.h header: https://gcc.gnu.org/git/?p=gcc.git;a=blob;f=include/dwarf2.h;h=0b6facfd4cf4c02320c7328114231b128ab42d5e;hb=dccbf1e2a6e544f71b4a5795f0c79015db019fc3#l425 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D82972	2020-08-11 13:30:52 +02:00
Kerry McLaughlin	76e22108d4	[CodeGen] Refactor getMemBasePlusOffset & getObjectPtrOffset to accept a TypeSize Changes the Offset arguments to both functions from int64_t to TypeSize & updates all uses of the functions to create the offset using TypeSize::Fixed() Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D85220	2020-08-11 12:17:10 +01:00
Kai Nacke	b0aab57571	[SystemZ/ZOS] Add binary format goff and operating system zos to the triple Adds the binary format goff and the operating system zos to the triple class. goff is selected as default binary format if zos is chosen as operating system. No further functionality is added. Reviewers: efriedma, tahonermann, hubert.reinterpertcast, MaskRay Reviewed By: efriedma, tahonermann, hubert.reinterpertcast Differential Revision: https://reviews.llvm.org/D82081	2020-08-11 05:26:26 -04:00
Florian Hahn	cd4df5279d	[SCEVExpander] Add helper to clean up instrs inserted while expanding. SCEVExpander already tracks which instructions have been inserted in InsertedValues/InsertedPostIncValues. This patch adds an additional vector to collect the instructions in insertion order. This can then be used to remove exactly the instructions inserted by the expander. This replaces ExpandedValuesCleaner, which in some cases might remove values not inserted by the expander (e.g. if a value was dead before insertion and is then used during expansion). Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D84327	2020-08-11 09:30:31 +01:00
Shinji Okumura	aa6d7628ea	[Attributor][NFC] Connect AAPotentialValues with AAValueSimplify This patch enables `AAValueSimplify` to use information from `AAPotentialValues` Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D85668	2020-08-11 15:52:02 +09:00
Haowei Wu	f44281335e	Revert "Move ELFObjHandler to TextAPI library" This reverts commit e6f8ba12e6ea97cbf4374b70e78309c2d859ca1c due to build failures.	2020-08-10 21:31:29 -07:00
Haowei Wu	1fa1012fcc	Move ELFObjHandler to TextAPI library This change moves ELFObjHandler to llvm/TextAPI library so it can be used by different llvm tools.	2020-08-10 21:23:39 -07:00
Wang, Pengfei	72838e8fb2	[X86][FPEnv] Teach X86 mask compare intrinsics to respect strict FP semantics. When we use mask compare intrinsics under strict FP option, the masked elements shouldn't raise any exception. So, we cannot replace the intrinsic with a full compare + "and" operation. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D85385	2020-08-11 10:28:41 +08:00
Yuanfang Chen	08ee0eb96e	[CodeGen] Make MMI immutable NPM pass	2020-08-10 17:52:42 -07:00
Lang Hames	ce159bf33c	[llvm-jitlink] Update llvm-jitlink to use TargetProcessControl.	2020-08-10 17:19:48 -07:00
Johannes Doerfert	d5ca7458b9	[OpenMP][NFC] Reuse OMPIRBuilder `struct ident_t` handling in Clang Replace the `ident_t` handling in Clang with the methods offered by the OMPIRBuilder. This cuts down on the clang code as well as the differences between the two, making further transitions easier. Tests have changed but there should not be a real functional change. The most interesting difference is probably that we stop generating local ident_t allocations for now and just use globals. Given that this happens only with debug info, the location part of the `ident_t` is probably bigger than the test anyway. As the location part is already a global, we can avoid the allocation, memcpy, and store in favor of a constant global that is slightly bigger. This can be revisited if there are complications. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D80735	2020-08-10 17:13:26 -05:00
jasonliu	a056348a49	[XCOFF][AIX] Use TE storage mapping class when large code model is enabled Summary: Use TE SMC instead of TC SMC in large code model mode, so that large code model TOC entries could get placed after all the small code model TOC entries, which reduces the chance of TOC overflow. Reviewed By: Xiangling_L Differential Revision: https://reviews.llvm.org/D85455	2020-08-10 19:52:10 +00:00
Craig Topper	1e510916fc	[BreakFalseDeps][X86] Move operand loop out of X86's getUndefRegClearance and put in the pass. X86 is the only user of this interface in tree. Previously the X86 pass would loop over operands looking for one undef operand for the pass to fix. But there could theoretically be multiple operands to fix. So it makes more sense for the pass to do the looping and ask the target if an operand needs to be fixed.	2020-08-10 10:32:29 -07:00
Xiangling Liao	0a11ef2eea	[AIX] Static init frontend recovery and backend support On the frontend side, this patch recovers AIX static init implementation to use the linkage type and function names Clang chooses for sinit related function. On the backend side, this patch sets correct linkage and function names on aliases created for sinit/sterm functions. Differential Revision: https://reviews.llvm.org/D84534	2020-08-10 10:10:49 -04:00
James Henderson	73d127e3f5	[DebugInfo] Don't error for zero-length arange entries Although the DWARF specification states that .debug_aranges entries can't have length zero, these can occur in the wild. There's no particular reason to enforce this part of the spec, since functionally they have no impact. The patch removes the error and introduces a new warning for premature terminator entries which does not stop parsing. This is a relanding of cb3a598c87db, adding the missing obj2yaml part that was needed. Fixes https://bugs.llvm.org/show_bug.cgi?id=46805. See also https://reviews.llvm.org/D71932 which originally introduced the error. Reviewed by: ikudrin, dblaikie, Higuoxing Differential Revision: https://reviews.llvm.org/D85313	2020-08-10 14:57:52 +01:00
Matt Arsenault	b0828481d3	PeepholeOptimizer: Use Register	2020-08-10 08:49:36 -04:00
Sanjay Patel	65e73449e2	[InstCombine] auto-generate test checks; NFC	2020-08-10 08:27:38 -04:00
Nico Weber	962b0abc0f	Revert "[DebugInfo] Don't error for zero-length arange entries" This reverts commit cb3a598c87db2db997401b82dfb3f7f80707194e. Breaks build of check-llvm dep obj2yaml everywhere.	2020-08-10 08:20:35 -04:00
James Henderson	66278e7306	[DebugInfo] Don't error for zero-length arange entries Although the DWARF specification states that .debug_aranges entries can't have length zero, these can occur in the wild. There's no particular reason to enforce this part of the spec, since functionally they have no impact. The patch removes the error and introduces a new warning for premature terminator entries which does not stop parsing. Fixes https://bugs.llvm.org/show_bug.cgi?id=46805. See also https://reviews.llvm.org/D71932 which originally introduced the error. Reviewed by: ikudrin, dblaikie Differential Revision: https://reviews.llvm.org/D85313	2020-08-10 12:48:31 +01:00
Qiu Chaofan	4605d02c43	[PowerPC] Add intrinsic to read or set FPSCR register This patch introduces two intrinsics: llvm.ppc.setflm and llvm.ppc.readflm. They read from or write to FPSCR register (floating-point status & control) which contains rounding mode and exception status. To ensure correctness of program, we need to prevent FP operations from being moved across these intrinsics (mffs/mtfsf instruction), so here I set them as scheduling boundaries. We can relax such restriction if FPSCR is modeled well in the future. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D84914	2020-08-10 18:27:45 +08:00
Petar Avramovic	5d6a53d942	AMDGPU/GlobalISel: Lower G_FREM Add custom lower for G_FREM. Differential Revision: https://reviews.llvm.org/D84324	2020-08-10 10:10:46 +02:00
Vitaly Buka	5845ad5ad5	[NFC][Attributor] Add missing override	2020-08-09 23:30:42 -07:00
Shinji Okumura	6da8fa8433	[Attributor][NFC][AAPotentialValues] Change interface of PotentialValuesState Previously `PotentialValuesState` inherited `BooleanState`. We have to add `getAssumed` to the state in order to use `clampStateAndIndicateChange` (which will be used in `AAPotentialValuesArgument`). However `BooleanState::getAssumed` is not a virtual function and we cannot override it. Therefore, I changed the state not to inherit `BooleanState` and add `getAssumed` to it. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D85610	2020-08-10 09:18:10 +09:00
Florian Hahn	9f66518378	[InstSimplify] Make sure CanUseUndef is initialized in all cases. This should fix a bunch of buildbot failures.	2020-08-09 19:47:16 +01:00
Florian Hahn	564f5c4ac7	[InstSimplify/NewGVN] Add option to control the use of undef. Making use of undef is not safe if the simplification result is not used to replace all uses of the result. This leads to problems in NewGVN, which does not replace all uses in the IR directly. See PR33165 for more details. This patch adds an option to SimplifyQuery to disable the use of undef. Note that I've only guarded uses of isa<UndefValue>/m_Undef where SimplifyQuery is currently available. If we agree on the general direction, I'll update the remaining uses. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D84792	2020-08-09 19:16:56 +01:00
Florian Hahn	4fd7f95ec0	[SCEVExpander] Avoid re-using existing casts if it means updating users. Currently the SCEVExpander tries to re-use existing casts, even if they are not exactly at the insertion point it was asked to create the cast. To do so in some cases, it creates a new cast at the insertion point and updates all users to use the new cast. This behavior is problematic, because it changes the IR outside of the instructions created during the expansion. Therefore we cannot completely undo all changes made during expansion. This re-use should be only an extra optimization, so only using the new cast in the expanded instructions should not be a correctness issue. There are many cases where equivalent instructions are created during expansion. This patch also adjusts findInsertPointAfter to skip instructions inserted during expansion. This enables re-using existing casts without renaming any uses, by picking a better insertion point. Reviewed By: efriedma, lebedev.ri Differential Revision: https://reviews.llvm.org/D84399	2020-08-09 13:25:17 +01:00
Petr Hosek	6c27d09879	Revert "[CMake] Simplify CMake handling for zlib" This reverts commit ccbc1485b55ff4acd21bcfafbf7aec4ed0fd818d which is still failing on the Windows MLIR bots.	2020-08-08 17:08:23 -07:00
Petr Hosek	af8170b5ad	[CMake] Simplify CMake handling for zlib Rather than handling zlib manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-08-08 16:44:08 -07:00
Yuanfang Chen	526e29b5b9	Reland "Revert "[NewPM][CodeGen] Introduce machine pass and machine pass manager"" This relands commit 320eab2d558fde0b61437e9b9075bfd301c2c474. The test failed because it was looking for x86-linux target unconditionally. Now it gets the default target.	2020-08-07 16:40:49 -07:00
Arthur Eubanks	834a6fc438	[NewPM] Print 'Skipping pass' as pass instrumentation If OptNoneInstrumentation prints it instead, 'Skipping pass' will print for even required passes. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D85493	2020-08-07 15:02:02 -07:00
Sameer Arora	df248b9fac	[llvm-libtool-darwin] Add constant CPU_SUBTYPE_ARM64_V8 Add support for constant MachO::CPU_SUBTYPE_ARM64_V8. This constant is needed so as to match `llvm-libtool-darwin`'s behavior to that of cctools' libtool when `-arch_only` flag is passed in on command line. Reviewed by jhenderson, alexshap, smeenai Differential Revision: https://reviews.llvm.org/D85041	2020-08-07 14:09:27 -07:00
Vitaly Buka	6669d78639	Revert "[StackSafety] Skip ambiguous lifetime analysis" This reverts commit 0b2616a8045cb776ea1514c3401d0a8577de1060. Crashes with safe-stack.	2020-08-07 14:02:50 -07:00
Matt Arsenault	fc0dd4b853	GlobalISel: Handle zext(sext x) in artifact combiner This eliminates the illegal intermediate s8 value in the added test.	2020-08-07 16:37:46 -04:00
Yuanfang Chen	cee8d8ef70	Revert "[NewPM][CodeGen] Introduce machine pass and machine pass manager" This reverts commit 911565d1085d9447363fe8ad041817436c4998fe. Broke some non-Linux bots.	2020-08-07 11:59:58 -07:00
Yuanfang Chen	4240996330	[NewPM][CodeGen] Introduce machine pass and machine pass manager machine pass could define four methods: - `PreservedAnalyses run(MachineFunction &, MachineFunctionAnalysisManager &)` - `Error doInitialization(Module &, MachineFunctionAnalysisManager &)` - `Error doFinalization(Module &, MachineFunctionAnalysisManager &)` - `Error run(Module &, MachineFunctionAnalysisManager &)` machine pass manager: - MachineFunctionAnalysisManager: Basically an AnalysisManager<MachineFunction> augmented with the ability to register and query IR analyses - MachineFunctionPassManager: support only two methods, `addPass` and `run` Reviewed By: arsenm, asbirlea, aeubanks Differential Revision: https://reviews.llvm.org/D67687	2020-08-07 11:00:31 -07:00
Yuanfang Chen	377ad5f083	[NewPM] Only verify loop for nonskipped user loop pass No verification for pass managers since it is not needed. No verification for skipped loop pass since the asserted condition is not used. Add a BeforeNonSkippedPass callback for this. The callback needs more inputs than its parameters to work so the callback is added on-the-fly. Reviewed By: aeubanks, asbirlea Differential Revision: https://reviews.llvm.org/D84977	2020-08-07 11:00:31 -07:00
Bevin Hansson	7c243aea4b	[Intrinsic] Add sshl.sat/ushl.sat, saturated shift intrinsics. Summary: This patch adds two intrinsics, llvm.sshl.sat and llvm.ushl.sat, which perform signed and unsigned saturating left shift, respectively. These are useful for implementing the Embedded-C fixed point support in Clang, originally discussed in http://lists.llvm.org/pipermail/llvm-dev/2018-August/125433.html and http://lists.llvm.org/pipermail/cfe-dev/2018-May/058019.html Reviewers: leonardchan, craig.topper, bjope, jdoerfert Subscribers: hiraditya, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83216	2020-08-07 15:09:24 +02:00
Igor Kudrin	eabe546cbc	[DebugInfo] Clean up DIEUnit. NFC. This removes members of the DIEUnit class which were used only in unit tests. Note also that child classes shadowed some of these methods, namely, getDwarfVersion() was overridden in DwarfUnit and getLength() was overridden in DwarfCompileUnit. Differential Revision: https://reviews.llvm.org/D85436	2020-08-07 15:55:44 +07:00
Shinji Okumura	02e369eb69	[Attributor] AAPotentialValues Interface This is a split patch of D80991. This patch introduces AAPotentialValues and its interface only. For more detail of AAPotentialValues abstract attribute, see the original patch. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D83283	2020-08-07 17:35:12 +09:00
Christian Kühnel	e33dafd15e	Revert "[CMake] Simplify CMake handling for zlib" This reverts commit 1adc494bce44f6004994deed61b30d4b71fe1d05. This patch broke the Windows compilation on buildbot and pre-merge testing: http://lab.llvm.org:8011/builders/mlir-windows/builds/5945 https://buildkite.com/llvm-project/llvm-master-build/builds/780	2020-08-07 09:36:49 +02:00
biplmish	f87266ec05	[PowerPC] Implement Vector Extract Low/High Order Builtins in LLVM/Clang This patch implements the function prototypes vec_extractl and vec_extracth in altivec.h to utilize the vector extract double element instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D84622	2020-08-07 01:02:29 -05:00
QingShan Zhang	b0aa16911a	[PowerPC] Support constrained fp operation for setcc The constrained fp operation fcmp was added by https://reviews.llvm.org/D69281. This patch is trying to add the support for PowerPC backend. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D81727	2020-08-07 05:16:36 +00:00
Vitaly Buka	3b944733de	[StackSafety] Skip ambiguous lifetime analysis If we can't identify alloca used in lifetime marker we need to assume the worst case scenario. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D84630	2020-08-06 19:10:33 -07:00
Vitaly Buka	567e88646c	[LTO,NFC] Skip generateParamAccessSummary when empty addGlobalValueSummary can check newly added FunctionSummary and set HasParamAccess to mark that generateParamAccessSummary is needed. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D85182	2020-08-06 19:01:19 -07:00
Arthur Eubanks	f51f94852f	[NewPM] Add callback for skipped passes Parallel to https://reviews.llvm.org/D84772. Will use this for printing when a pass is skipped. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D85478	2020-08-06 18:58:59 -07:00
Matt Arsenault	b1038416e3	GlobalISel: Implement lower for G_INSERT_VECTOR_ELT	2020-08-06 19:29:17 -04:00
Snehasish Kumar	bfbcf062be	[NFC] Rename BBSectionsPrepare -> BasicBlockSections. Rename the BBSectionsPrepare pass as suggested by the review comment in https://reviews.llvm.org/D85368. Differential Revision: https://reviews.llvm.org/D85380	2020-08-06 13:12:06 -07:00
Matt Arsenault	fc03bd4465	GlobalISel: Implement fewerElementsVector for G_EXTRACT_VECTOR_ELT Use the same basic strategy as LegalizeVectorTypes. Try to index into smaller pieces if there's a constant index, and otherwise fall back to a stack temporary.	2020-08-06 14:33:16 -04:00
Matt Arsenault	f305dea485	AMDGPU: Define raw/struct variants of buffer atomic fadd Somehow the new FP atomic buffer intrinsics ended up using the legacy style for buffer intrinsics.	2020-08-06 13:36:19 -04:00
Simon Pilgrim	0add02e052	PDBExtras.h - remove unnecessary raw_ostream forward declaration. NFCI. We already need to include raw_ostream.h, also add missing StringRef.h implicit dependency.	2020-08-06 16:31:56 +01:00
Sanjay Patel	12ef22f0e5	[PatternMatch] allow intrinsic form of min/max with existing matchers I skimmed the existing users of these matchers and don't see any problems (eg, the caller assumes the matched value was a select instruction without checking). So I think we can generalize the matching to allow the new intrinsics or the cmp+select idioms. I did not find any unit tests for the matchers, so added some basics there. The instsimplify tests are adapted from existing tests for the cmp+select pattern and cover the folds in simplifyICmpWithMinMax(). Differential Revision: https://reviews.llvm.org/D85230	2020-08-06 10:50:24 -04:00
Raphael Isemann	bc5b29dac3	Revert "PDBExtras.h - remove unnecessary raw_ostream forward declaration. NFCI." This reverts commit 87c5437afd273e909e0fed3389de7531d5452ea5. The commit includes several headers in the middle of a function, which breaks pretty much everything.	2020-08-06 15:15:43 +02:00
Simon Pilgrim	a7159bc3c5	BitstreamRemarkParser.h - remove unnecessary includes. NFCI. Remove unused includes, moving to the lib header or cpp file as necessary.	2020-08-06 13:17:53 +01:00
Simon Pilgrim	e07ee0759f	Fix include sorting order. NFC	2020-08-06 11:46:53 +01:00
Simon Pilgrim	b4317ecea4	PDBExtras.h - remove unnecessary raw_ostream forward declaration. NFCI. We already need to include raw_ostream.h, also add missing StringRef.h and cstdint implicit dependencies. Remove unnecessary includes from PDBExtras.cpp	2020-08-06 11:28:42 +01:00
David Green	77d21dcd3f	[LoopVectorizer] Inloop vector reductions Arm MVE has multiple instructions such as VMLAVA.s8, which (in this case) can take two 128bit vectors, sign extend the inputs to i32, multiplying them together and sum the result into a 32bit general purpose register. So taking 16 i8's as inputs, they can multiply and accumulate the result into a single i32 without any rounding/truncating along the way. There are also reduction instructions for plain integer add and min/max, and operations that sum into a pair of 32bit registers together treated as a 64bit integer (even though MVE does not have a plain 64bit addition instruction). So giving the vectorizer the ability to use these instructions both enables us to vectorize at higher bitwidths, and to vectorize things we previously could not. In order to do that we need a way to represent that the reduction operation, specified with a llvm.experimental.vector.reduce when vectorizing for Arm, occurs inside the loop not after it like most reductions. This patch attempts to do that, teaching the vectorizer about in-loop reductions. It does this through a vplan recipe representing the reductions that the original chain of reduction operations is replaced by. Cost modelling is currently just done through a prefersInloopReduction TTI hook (which follows in a later patch). Differential Revision: https://reviews.llvm.org/D75069	2020-08-06 10:10:50 +01:00
Roman Lebedev	3b33b0cf64	[InstCombine] (-NSW x) s> x --> x s< 0 (PR39480) Name: (-x) s> x --> x s< 0 %neg_x = sub nsw i8 0, %x ; %x must not be INT_MIN %r = icmp sgt i8 %neg_x, %x => %r = icmp slt i8 %x, 0 https://rise4fun.com/Alive/ZslD https://bugs.llvm.org/show_bug.cgi?id=39480	2020-08-06 11:50:34 +03:00
Xing GUO	47c44c8042	[DWARFYAML][debug_info] Pull out dwarf::FormParams from DWARFYAML::Unit. Unit.Format, Unit.Version and Unit.AddrSize are replaced with dwarf::FormParams in D84496 to get rid of unnecessary functions getOffsetSize() and getRefSize(). However, that change makes it difficult to make AddrSize optional (Optional<uint8_t>). This change pulls out dwarf::FormParams from DWARFYAML::Unit and use it as a helper struct in DWARFYAML::emitDebugInfo(). Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D85296	2020-08-06 16:39:00 +08:00
Craig Topper	7ca0daba97	[X86] Rename X86::getImpliedFeatures to X86::updateImpliedFeatures and pass clang's StringMap directly to it. No point in building a vector of StringRefs for clang to apply to the StringMap. Just pass the StringMap and modify it directly.	2020-08-06 00:20:46 -07:00
Petr Hosek	70737c97db	[CMake] Simplify CMake handling for zlib Rather than handling zlib manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-08-05 16:07:11 -07:00
Greg Clayton	4e442757fd	Add verification for DW_AT_decl_file and DW_AT_call_file. LTO builds have been creating invalid DWARF and one of the errors was a file index that was out of bounds. "llvm-dwarfdump --verify" will check all file indexes for line tables already, but there are no checks for the validity of file indexes in attributes. The verification will verify if there is a DW_AT_decl_file/DW_AT_call_file that: - there is a line table for the compile unit - the file index is valid - the encoding is appropriate Tests are added that test all of the above conditions. Differential Revision: https://reviews.llvm.org/D84817	2020-08-05 15:30:13 -07:00

42139 Commits