llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-04-01 07:11:45 +00:00

Author	SHA1	Message	Date
Konstantin Zhuravlyov	de5b1ccd56	[AMDGPU] Change amd_kernel_code_t's minor version to 1 - We do emit amd_kernel_code_t v1.1 Differential Revision: https://reviews.llvm.org/D30433 llvm-svn: 296489	2017-02-28 17:17:52 +00:00
Adrian Prantl	cdf4ff52ee	Strip debug info when inlining into a nodebug function. The LLVM backend cannot produce any debug info for an llvm::Function without a DISubprogram attachment. When inlining a debug-info-carrying function into a nodebug function, there is therefore no reason to keep any debug info intrinsic calls or debug locations on the instructions. This fixes a problem discovered in PR32042. rdar://problem/30679307 llvm-svn: 296488	2017-02-28 16:58:13 +00:00
Vedant Kumar	32c8d92826	[llvm-cov] Error-out when an unsupported format is used (PR32087) llvm-svn: 296487	2017-02-28 16:57:28 +00:00
Craig Topper	140427ab5e	[DAGISel] When checking if chain node is foldable, make sure the intermediate nodes have a single use across all results not just the result that was used to reach the chain node. This recovers a test case that was severely broken by r296476, by making sure we don't create ADD/ADC that loads and stores when there is also a flag dependency. llvm-svn: 296486	2017-02-28 16:52:05 +00:00
Stanislav Mekhanoshin	a402b32514	[AMDGPU] Fix read-undef flags when schedule is reverted If two subregs of the same register are defined and we need to revert schedule changing def order, we will end up with both instructions having def,read-undef flags because adjustLaneLiveness() will only set this flag but will not remove it. Fix this by removing read-undef flags before calling adjustLaneLiveness. Differential Revision: https://reviews.llvm.org/D30428 llvm-svn: 296484	2017-02-28 16:26:27 +00:00
David Bozier	faeb49676c	[Stack Protection] Add diagnostic information for why stack protection was applied to a function Stack Smash Protection is not completely free, so in hot code, the overhead it causes can cause performance issues. By adding diagnostic information for which functions have SSP and why, a user can quickly determine what they can do to stop SSP being applied to a specific hot function. This change adds a remark that is reported by the stack protection code when an instruction or attribute is encountered that causes SSP to be applied. Patch by: James Henderson Differential Revision: https://reviews.llvm.org/D29023 llvm-svn: 296483	2017-02-28 16:02:37 +00:00
Simon Dardis	90ce6f59b6	[mips] Fix 64bit slt/sltu/nor with immediates Patch By: Alexander Richardson Reviewers: atanasyan, theraven, sdardis Differential Revision: https://reviews.llvm.org/D30330 llvm-svn: 296482	2017-02-28 15:55:23 +00:00
Daniel Sanders	56276c3edc	Revert r296474 - [globalisel] Change LLT constructor string into an LLT subclass that knows how to generate it. There's a circular dependency that's only revealed when LLVM_ENABLE_MODULES=1. llvm-svn: 296478	2017-02-28 15:00:27 +00:00
Nirav Dave	e24ecaa975	In visitSTORE, always use FindBetterChain, rather than only when UseAA is enabled. Recommitting after fixup of 32-bit aliasing sign offset bug in DAGCombiner. * Simplify Consecutive Merge Store Candidate Search Now that address aliasing is much less conservative, push through simplified store merging search and chain alias analysis which only checks for parallel stores through the chain subgraph. This is cleaner as the separation of non-interfering loads/stores from the store-merging logic. When merging stores, search up the chain through a single load, and find all possible stores by looking down through a load and a TokenFactor to all stores visited. This improves the quality of the output SelectionDAG and the output Codegen (save perhaps for some ARM cases where we correctly construct wider loads, but then promote them to float operations which appear but require more expensive constant generation). Some minor peephole optimizations to deal with improved SubDAG shapes (listed below) Additional Minor Changes: 1. Finishes removing unused AliasLoad code 2. Unifies the chain aggregation in the merged stores across code paths 3. Re-add the Store node to the worklist after calling SimplifyDemandedBits. 4. Increase GatherAllAliasesMaxDepth from 6 to 18. That number is arbitrary, but seems sufficient to not cause regressions in tests. 5. Remove Chain dependencies of Memory operations on CopyfromReg nodes as these are captured by data dependence 6. Forward loads-store values through tokenfactors containing {CopyToReg,CopyFromReg} Values. 7. Peephole to convert buildvector of extract_vector_elt to extract_subvector if possible (see CodeGen/AArch64/store-merge.ll) 8. Store merging for the ARM target is restricted to 32-bit as in some contexts invalid 64-bit operations are being generated. This can be removed once appropriate checks are added. This finishes the change Matt Arsenault started in r246307 and jyknight's original patch. Many tests required some changes as memory operations are now reorderable, improving load-store forwarding. One test in particular is worth noting: CodeGen/PowerPC/ppc64-align-long-double.ll - Improved load-store forwarding converts a load-store pair into a parallel store and a memory-realized bitcast of the same value. However, because we lose the sharing of the explicit and implicit store values we must create another local store. A similar transformation happens before SelectionDAG as well. Reviewers: arsenm, hfinkel, tstellarAMD, jyknight, nhaehnle llvm-svn: 296476	2017-02-28 14:24:15 +00:00
Daniel Sanders	bdc9c89429	[globalisel] Change LLT constructor string into an LLT subclass that knows how to generate it. Summary: This will allow future patches to inspect the details of the LLT. The implementation is now split between the Support and CodeGen libraries to allow TableGen to use this class without introducing layering concerns. Thanks to Ahmed Bougacha for finding a reasonable way to avoid the layering issue and providing the version of this patch without that problem. Reviewers: t.p.northover, qcolombet, rovka, aditya_nandakumar, ab, javed.absar Subscribers: arsenm, nhaehnle, mgorny, dberris, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D30046 llvm-svn: 296474	2017-02-28 14:21:31 +00:00
Diana Picus	c958dc921b	[ARM] GlobalISel: Lower i32 and fp call parameters on the stack Lower i32, float and double parameters that need to live on the stack. This boils down to creating some G_GEPs starting from the stack pointer and storing the values there. During the process we also keep track of the stack size and use the final value in the ADJCALLSTACKDOWN/UP instructions. We currently assert for smaller types, since they usually require extensions. They will be handled in a separate patch. llvm-svn: 296473	2017-02-28 14:17:53 +00:00
Diana Picus	9f47d8e6fa	[ARM] GlobalISel: Select 32-bit G_CONSTANT Put it into a register by means of a MOVi. llvm-svn: 296471	2017-02-28 13:05:42 +00:00
Diana Picus	74ef680dc6	[ARM] GlobalISel: Add mapping for G_CONSTANT Like G_FRAME_INDEX, G_CONSTANT has one register operand and one non-register operand. llvm-svn: 296469	2017-02-28 12:13:58 +00:00
Diana Picus	2640394619	[ARM] GlobalISel: Legalize 32-bit constants llvm-svn: 296468	2017-02-28 11:33:46 +00:00
Sanne Wouda	ef778d0fcb	[Assembler] Add test for !srcloc references in assembler diags Summary: clang adds !srcloc metadata to inline assembly in LLVM bitcode generated for inline assembly in C. The value of this !srcloc is passed to the diagnostics handler if the inline assembly generates a diagnostic. clang is able to turn this cookie back to a location in the C source file. To test this functionality without a dependency, make llc print the !srcloc metadata if it is present. The added test uses this mechanism to test that the correct !srclocs are passed to the diag handler. Reviewers: rengolin, rnk, echristo, grosbach, mehdi_amini Reviewed By: mehdi_amini Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D30167 llvm-svn: 296465	2017-02-28 10:34:48 +00:00
NAKAMURA Takumi	313d45f133	Reformat a blank line. llvm-svn: 296464	2017-02-28 10:15:25 +00:00
NAKAMURA Takumi	ccf0e89448	Revert r296442 (and r296443), "Allow externally dlopen-ed libraries to be registered as permanent libraries." It broke clang/test/Analysis/checker-plugins.c llvm-svn: 296463	2017-02-28 10:15:18 +00:00
Diana Picus	7cb8f6733d	[ARM] GlobalISel: Select G_GEP At this point, G_GEP is just an add, so we treat it exactly like a G_ADD. llvm-svn: 296462	2017-02-28 10:14:38 +00:00
Oliver Stannard	ccb96c68e1	[ARM] Diagnose PC-writing instructions in IT blocks In Thumb2, instructions which write to the PC are UNPREDICTABLE if they are in an IT block but not the last instruction in the block. Previously, we only diagnosed this for LDM instructions, this patch extends the diagnostic to cover all of the relevant instructions. Differential Revision: https://reviews.llvm.org/D30398 llvm-svn: 296459	2017-02-28 10:04:36 +00:00
Diana Picus	80a5ee84d6	[ARM] GlobalISel: Add reg bank mapping for G_GEP This should be the same as the mapping for G_ADD etc. llvm-svn: 296455	2017-02-28 09:35:10 +00:00
Diana Picus	95763abcda	[ARM] GlobalISel: Legalize G_GEP with 32-bit offsets At the moment we're only interested in GEPs for putting call parameters on the stack, so we'll stick to 32-bit offsets. llvm-svn: 296452	2017-02-28 09:02:42 +00:00
Artyom Skrobov	a31efc4b6c	Relate the CHECK: lines to the functions that they're checking [NFC] llvm-svn: 296450	2017-02-28 08:58:40 +00:00
Vadzim Dambrouski	1ba11fe675	Test commit, fix typo, NFC. llvm-svn: 296447	2017-02-28 08:27:43 +00:00
Chandler Carruth	f35bb1abdb	[IR] Add range accessors for the indices of a GEP instruction. These were noticed as missing in a code review. Add them and the boring unit test to make sure they compile and DTRT. llvm-svn: 296444	2017-02-28 08:04:20 +00:00
Vassil Vassilev	cc7b69a03f	Fix Win bots. llvm-svn: 296443	2017-02-28 07:26:21 +00:00
Vassil Vassilev	f324de55aa	Allow externally dlopen-ed libraries to be registered as permanent libraries. This is also useful in cases when llvm is in a shared library. First we dlopen the llvm shared library and then we register it as a permanent library in order to keep the JIT and other services working. Patch reviewed by Vedant Kumar (D29955)! llvm-svn: 296442	2017-02-28 07:11:59 +00:00
Sanjoy Das	41d809825d	[ImplicitNullCheck] Add alias analysis usage Summary: With this change ImplicitNullCheck optimization uses alias analysis and can use load/store memory access for implicit null check if there are other load/store before but memory accesses do not alias. Patch by Serguei Katkov! Reviewers: sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30331 llvm-svn: 296440	2017-02-28 07:04:49 +00:00
Xin Tong	3b4bcc0f56	Empty line. NFCI llvm-svn: 296438	2017-02-28 05:30:48 +00:00
Xin Tong	807f4b1e24	[LoopUnswitch] Common pushing LIC's user to worklist. llvm-svn: 296432	2017-02-28 03:32:41 +00:00
Matthias Braun	174ca3760f	Revert "Add MIR-level outlining pass" Revert Machine Outliner for now, as it breaks the asan bot. This reverts commit r296418. llvm-svn: 296426	2017-02-28 02:24:30 +00:00
Daniel Berlin	a5b1489dbe	This script was meant to be committed with the DebugCounter changes. llvm-svn: 296425	2017-02-28 02:19:11 +00:00
Lang Hames	e8296b21f0	[docs] Fix a think-o in the Programmer's Manual. llvm-svn: 296421	2017-02-28 01:35:31 +00:00
Amaury Sechet	d39f8946d0	Add test case for usubo combine. NFC. llvm-svn: 296420	2017-02-28 01:16:39 +00:00
Matthias Braun	00b30110fb	Add MIR-level outlining pass This is a patch for the outliner described in the RFC at: http://lists.llvm.org/pipermail/llvm-dev/2016-August/104170.html The outliner is a code-size reduction pass which works by finding repeated sequences of instructions in a program, and replacing them with calls to functions. This is useful to people working in low-memory environments, where sacrificing performance for space is acceptable. This adds an interprocedural outliner directly before printing assembly. For reference on how this would work, this patch also includes X86 target hooks and an X86 test. The outliner is run like so: clang -mno-red-zone -mllvm -enable-machine-outliner file.c Patch by Jessica Paquette<jpaquette@apple.com>! rdar://29166825 Differential Revision: https://reviews.llvm.org/D26872 llvm-svn: 296418	2017-02-28 00:33:32 +00:00
Amaury Sechet	330a189992	Add test case for computing known bits of subtraction operations. NFC llvm-svn: 296417	2017-02-28 00:15:13 +00:00
Michael Kuperstein	bbb5beaf34	[CGP] Split some critical edges coming out of indirect branches Splitting critical edges when one of the source edges is an indirectbr is hard in general (because it requires changing the memory the indirectbr reads). But if a block only has a single indirectbr predecessor (which is the common case), we can simulate splitting that edge by splitting the destination block, and retargeting the direct branches. This is motivated by the use of computed gotos in python 2.7: PyEval_EvalFrame() ends up using an indirect branch with ~100 successors, and passing a constant to each of those. Since MachineSink can't break indirect critical edges on demand (and doing this in MIR doesn't look feasible), this causes us to emit ~100 defs of registers containing constants in the predecessor block, where only one of those constants is used in each successor. So, at each computed goto, we needlessly spill about 100 constants to stack. The end result is that a clang-compiled python interpreter can be ~2.5x slower on a simple python reduction loop than a gcc-compiled interpreter. Differential Revision: https://reviews.llvm.org/D29916 llvm-svn: 296416	2017-02-28 00:11:34 +00:00
Zachary Turner	cd226b0757	[PDB] Make streams carry their own endianness. Before the endianness was specified on each call to read or write of the StreamReader / StreamWriter, but in practice it's extremely rare for streams to have data encoded in multiple different endiannesses, so we should optimize for the 99% use case. This makes the code cleaner and more general, but otherwise has NFC. llvm-svn: 296415	2017-02-28 00:04:07 +00:00
Eugene Zelenko	7c39ec2ebd	[DebugInfo] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 296413	2017-02-27 23:43:14 +00:00
Michael Kuperstein	8a9eb97aa0	[SLP] Load sorting should not try to sort things that aren't loads. We may get a VL where the first element is a load, but the others aren't. Trying to sort such VLs can only lead to sorrow. llvm-svn: 296411	2017-02-27 23:18:11 +00:00
Dan Gohman	a0a2f9eacb	[MC] Implement the COFF directives in MCNullStreamer. This fixes -filetype=null errors introduced in r296403. llvm-svn: 296410	2017-02-27 23:10:18 +00:00
Matt Arsenault	3168453e6e	AMDGPU: Basic folds for fmed3 intrinsic Constant fold, canonicalize constants to RHS, reduce to minnum/maxnum when inputs are nan/undef. llvm-svn: 296409	2017-02-27 23:08:49 +00:00
Zachary Turner	c6ab3e91eb	Remove some code accidentally left in. llvm-svn: 296407	2017-02-27 22:57:32 +00:00
Petr Hosek	cab367b07a	[AddressSanitizer] Put shadow at 0 for Fuchsia The Fuchsia ASan runtime reserves the low part of the address space. Patch by Roland McGrath Differential Revision: https://reviews.llvm.org/D30426 llvm-svn: 296405	2017-02-27 22:49:37 +00:00
Eugene Zelenko	d8db98615d	[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 296404	2017-02-27 22:45:06 +00:00
Dan Gohman	48b9ccfac2	[MC] Factor out non-COFF handling of COFF-specific directives. Instead of requiring every non-COFF MCObjectStreamer to implement the COFF hooks just to do an llvm_unreachable to say that they're not supported, do the llvm_unreachable in the default implementation, as suggested by rnk in https://reviews.llvm.org/D26722. llvm-svn: 296403	2017-02-27 22:44:37 +00:00
Dan Gohman	df23c64a6a	[WebAssembly] Add some comments and tidy up whitespace. llvm-svn: 296402	2017-02-27 22:41:39 +00:00
Matt Arsenault	824e186e4d	AMDGPU: Use v_med3_{f16|i16|u16} llvm-svn: 296401	2017-02-27 22:40:39 +00:00
Dan Gohman	f2e8848268	[WebAssembly] Split CFG-sorting into its own pass. NFC. CFG sorting was already an independent algorithm from block/loop insertion; this change makes it more convenient to debug. llvm-svn: 296399	2017-02-27 22:38:58 +00:00
Hans Wennborg	ff792be595	Revert r296366 "[InlineFunction] add nonnull assumptions based on argument attributes" It causes miscompiles e.g. during self-host of Clang (PR32082). llvm-svn: 296398	2017-02-27 22:33:02 +00:00
Zachary Turner	be1966c92e	Add missing namespace qualifier. llvm-svn: 296397	2017-02-27 22:17:50 +00:00
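
Note on 00b30110fb ("Add MIR-level outlining pass"): the commit message describes the outliner as finding repeated sequences of instructions and replacing them with calls to functions. The toy below illustrates only that idea on a list of instruction strings. It is a sketch under stated assumptions, not the pass itself: the function name `outline_once`, the fixed window length, and the `call outlined_0` spelling are all hypothetical, and the search is a plain window scan chosen for brevity rather than whatever algorithm the patch uses.

```python
from collections import defaultdict


def outline_once(instrs, length, name="outlined_0"):
    """Toy outliner: replace one repeated run of `length` instructions with a call.

    Returns (new_body, outlined), where outlined is (name, body) or None
    if no run of that length occurs at least twice without overlapping.
    """
    if length < 1:
        raise ValueError("length must be at least 1")

    # Record every start position of each window of `length` instructions.
    positions = defaultdict(list)
    for i in range(len(instrs) - length + 1):
        positions[tuple(instrs[i:i + length])].append(i)

    # Take the first window (in program order) that has at least two
    # non-overlapping occurrences.
    for i in range(len(instrs) - length + 1):
        window = tuple(instrs[i:i + length])
        starts, last_end = [], 0
        for s in positions[window]:
            if s >= last_end:
                starts.append(s)
                last_end = s + length
        if len(starts) >= 2:
            break
    else:
        # Nothing worth outlining: return the body unchanged.
        return list(instrs), None

    # Rewrite the body, emitting a call wherever a chosen occurrence starts.
    chosen = set(starts)
    out, i = [], 0
    while i < len(instrs):
        if i in chosen:
            out.append(f"call {name}")
            i += length
        else:
            out.append(instrs[i])
            i += 1
    return out, (name, list(window))


body = ["mov a", "add b", "ret x", "mov a", "add b", "nop"]
new_body, outlined = outline_once(body, 2)
```

Here `new_body` holds two calls in place of the two `mov a` / `add b` runs, and `outlined` carries the extracted sequence. Each call replaces `length` instructions with one, which is the space-for-speed trade the commit message mentions; a real pass would also have to weigh call and return overhead before deciding a candidate is profitable.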

145546 Commits