llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-29 14:20:29 +00:00

Author	SHA1	Message	Date
Nikolay Haustov	048a920e0e	AMDGPU/SI: Assembler: Unify parsing/printing of operands. Summary: The goal is for each operand type to have its own parse function and at the same time share common code for tracking state as different instruction types share operand types (e.g. glc/glc_flat, etc). Introduce parseAMDGPUOperand which can parse any optional operand. DPP and Clamp/OMod have custom handling for now. Sam also suggested to have class hierarchy for operand types instead of table. This can be done in separate change. Remove parseVOP3OptionalOps, parseDS*OptionalOps, parseFlatOptionalOps, parseMubufOptionalOps, parseDPPOptionalOps. Reduce number of definitions of AsmOperand's and MatchClasses' by using common base class. Rename AsmMatcher/InstPrinter methods accordingly. Print immediate type when printing parsed immediate operand. Use 'off' if offset/index register is unused instead of skipping it to make it more readable (also agreed with SP3). Update tests. Reviewers: tstellarAMD, SamWot, artem.tamazov Subscribers: qcolombet, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19584 llvm-svn: 268015	2016-04-29 09:02:30 +00:00
Simon Pilgrim	cacaf0a6c2	[InstCombine][SSE] Regenerated x86 pshufb tests llvm-svn: 268014	2016-04-29 08:53:35 +00:00
Zlatko Buljan	8adb396ea9	[mips][microMIPS] Fix offsets for LLE, LWE, SBE, SCE and SHE instructions Differential Revision: http://reviews.llvm.org/D18645 llvm-svn: 268012	2016-04-29 08:36:54 +00:00
David Majnemer	9514917d37	[DeadArgumentElimination] Propagate operand bundles to promoted call sites We neglected to transfer operand bundles when performing argument promotion. llvm-svn: 268008	2016-04-29 07:22:36 +00:00
Adam Nemet	01846957ca	[LoopDist] Add missing RUN line in test from r268006 llvm-svn: 268007	2016-04-29 07:16:00 +00:00
Adam Nemet	33fcad4825	[LoopDist] Also emit optimization remark on success (-Rpass=) The option -Rpass=loop-distribute now reports the loops that were distributed. llvm-svn: 268006	2016-04-29 07:10:46 +00:00
David Majnemer	2075b35f09	[SLPVectorizer] Add operand bundles to vectorized functions SLPVectorizing a call site should result in further propagation of its bundles. llvm-svn: 268004	2016-04-29 07:09:51 +00:00
David Majnemer	f7012da323	[LoopVectorize] Add operand bundles to vectorized functions Also, do not crash when calculating a cost model for loop-invariant token values. llvm-svn: 268003	2016-04-29 07:09:48 +00:00
Matt Arsenault	1b74f4bf74	AMDGPU: Stop reporting an addressing mode for unknown addrspace This was being treated the same as private, which has an immediate offset. For unknown, it probably means it's for a computation not actually being used for accessing memory, so it should not have a nontrivial addressing mode. llvm-svn: 268002	2016-04-29 06:25:10 +00:00
Matt Arsenault	6923c26f30	DivergenceAnalysis: Fix crash with unreachable blocks Unreachable blocks may not be in the dominator tree, so don't crash on them. llvm-svn: 268001	2016-04-29 06:17:47 +00:00
David Majnemer	e17dff606d	[ArgumentPromotion] Propagate operand bundles to promoted call sites We neglected to transfer operand bundles when performing argument promotion. This fixes PR27568. llvm-svn: 267986	2016-04-29 04:56:12 +00:00
Michael Zolotukhin	6be07401c1	[PR25281] Remove AAResultsWrapper from preserved analyses of loop vectorizer. We don't preserve AAResults, because, for one, we don't preserve SCEV-AA. That fixes PR25281. llvm-svn: 267980	2016-04-29 03:31:25 +00:00
Matthias Braun	5d4a43cf37	RegisterPressure: Fix default lanemask for missing regunit intervals In case of missing live intervals for a physical register getLanesWithProperty() would report 0 which was not a safe default in all situations. Add a parameter to pass in a safe default. No testcase because in-tree targets do not skip computing register unit live intervals. Also clean up the getXXX() functions to not perform the RequireLiveIntervals checks anymore so we do not even need to return safe defaults. llvm-svn: 267977	2016-04-29 02:44:54 +00:00
Vedant Kumar	37bef9dfdc	[llvm-cov] Don't emit 'nan%' in reports llvm-svn: 267971	2016-04-29 01:31:49 +00:00
Hal Finkel	52782fda9f	[LoopVectorize] Keep hints from original loop on the vector loop We need to keep loop hints from the original loop on the new vector loop. Failure to do this meant that, for example: void foo(int *b) { #pragma clang loop unroll(disable) for (int i = 0; i < 16; ++i) b[i] = 1; } this loop would be unrolled. Why? Because we'd vectorize it, thus dropping the hints that unrolling should be disabled, and then we'd unroll it. llvm-svn: 267970	2016-04-29 01:27:40 +00:00
David Majnemer	92772e4d05	[llvm-pdbdump] Restore error messages, handle bad block sizes We lost the ability to report errors, bring it back. Also, correctly validate the block size. llvm-svn: 267955	2016-04-28 23:47:27 +00:00
David Majnemer	b458e0b8ba	[llvm-pdbdump] Correctly read data larger than a block A bug was introduced when the code was refactored which resulted in a bad memory access. This fixes PR27565. llvm-svn: 267953	2016-04-28 23:24:23 +00:00
Adam Nemet	b7cc6fd680	[LoopDist] Emit optimization remarks (-Rpass) I closely followed the precedents set by the vectorizer: * With -Rpass-missed, the loop is reported with further details pointing to -Rpass-analysis. * -Rpass-analysis reports the details why distribution has failed. * Regardless of -Rpass*, when distribution fails for a loop where distribution was forced with the pragma, a warning is produced according to -Wpass-failed. In this case the analysis info is also printed even without -Rpass-analysis. llvm-svn: 267952	2016-04-28 23:08:32 +00:00
Hal Finkel	5a6e2ec855	[Inliner] Preserve llvm.mem.parallel_loop_access metadata When inlining a call site with llvm.mem.parallel_loop_access metadata, this metadata needs to be propagated to all cloned memory-accessing instructions. Otherwise, inlining parts of the loop body will invalidate the annotation. With this functionality, we now vectorize the following as expected: void Body(int *res, int *c, int *d, int *p, int i) { res[i] = (p[i] == 0) ? res[i] : res[i] + d[i]; } void Test(int *res, int *c, int *d, int *p, int n) { int i; #pragma clang loop vectorize(assume_safety) for (i = 0; i < 1600; i++) { Body(res, c, d, p, i); } } llvm-svn: 267949	2016-04-28 23:00:04 +00:00
Dehao Chen	40132d2919	Read discriminators correctly from object file. Summary: This is the follow-up patch for http://reviews.llvm.org/D19436 * Update the discriminator reading algorithm to match the assignment algorithm. * Add test to cover the new algorithm. Reviewers: dnovillo, echristo, dblaikie Subscribers: danielcdh, dblaikie, echristo, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D19522 llvm-svn: 267945	2016-04-28 22:09:37 +00:00
Marcin Koscielnicki	832d560a7e	[PowerPC] Fix the EH_SjLj_Setup pseudo. This instruction is just a control flow marker - it should not actually exist in the object file. Unfortunately, nothing catches it before it gets to AsmPrinter. If integrated assembler is used, it's considered to be a normal 4-byte instruction, and emitted as an all-0 word, crashing the program. With external assembler, a comment is emitted. Fixed by setting Size to 0 and handling it in MCCodeEmitter - this means the comment will still be emitted if integrated assembler is not used. This broke an ASan test, which has been disabled for a long time as a result (see the discussion on D19657). We can reenable it once this lands. llvm-svn: 267943	2016-04-28 21:24:37 +00:00
Kevin Enderby	211d6a827a	Fix a bug in llvm-objdump for -private-headers printing the LC_CODE_SIGNATURE Mach-O load command. rdar://25985653 llvm-svn: 267940	2016-04-28 21:07:20 +00:00
Krzysztof Parzyszek	89f6a784c5	[RDF] Improve handling of inline-asm - Keep implicit defs from inline-asm instructions. - Treat register references from inline-asm as fixed. llvm-svn: 267936	2016-04-28 20:33:33 +00:00
Kevin Enderby	57a8fdd741	Update llvm-objdump for disassembly of ARM Mach-O files to always include the opcode bytes. As this is the expected behavior of the old darwin otool(1) for ARM Mach-O files. rdar://25896249 llvm-svn: 267929	2016-04-28 20:14:13 +00:00
Zachary Turner	c1be2bb041	Read the rest of the DBI substreams, and parse source info. We now read out the rest of the substreams from the DBI streams. One of these substreams, the FileInfo substream, contains information about which source files contribute to each module (aka compiland). This patch additionally parses out the file information from that substream, and dumps it in llvm-pdbdump. Differential Revision: http://reviews.llvm.org/D19634 Reviewed by: ruiu llvm-svn: 267928	2016-04-28 20:05:18 +00:00
Kit Barton	d03785d120	This reverts commit r265505. Revert "[Power9] Implement add-pc, multiply-add, modulo, extend-sign-shift, random number, set bool, and dfp test significance". This patch has caused a functional regression in SPEC2k6 namd, and a performance regression in mesa-pipe. llvm-svn: 267927	2016-04-28 20:00:42 +00:00
Krzysztof Parzyszek	4d8da47e67	[Hexagon] Add instruction aliases for vector unsigned compare-equal Unsigned compare-equal instructions are mapped to signed compare-equal. llvm-svn: 267925	2016-04-28 19:49:18 +00:00
Matt Arsenault	28f0a3fe58	AMDGPU: Emit error if too much LDS is used llvm-svn: 267922	2016-04-28 19:37:35 +00:00
Krzysztof Parzyszek	f555db9453	Reset the TopRPTracker's position in ScheduleDAGMILive::initQueues ScheduleDAGMI::initQueues changes the RegionBegin to the first non-debug instruction. Since it does not track register pressure, it does not affect any RP trackers. ScheduleDAGMILive inherits initQueues from ScheduleDAGMI, and it does reset the TopRPTracker in its schedule method. Any derived, target-specific scheduler will need to do it as well, but the TopRPTracker is only exposed as a "const" object to derived classes. Without the ability to modify the tracker directly, this leaves a derived scheduler with a potential of having the TopRPTracker out-of-sync with the CurrentTop. The symptom of the problem: void llvm::ScheduleDAGMILive::scheduleMI(llvm::SUnit *, bool): Assertion `TopRPTracker.getPos() == CurrentTop && "out of sync"' failed. Differential Revision: http://reviews.llvm.org/D19438 llvm-svn: 267918	2016-04-28 19:17:44 +00:00
Matt Arsenault	f94836045a	AMDGPU: Fix mishandling array allocations when promoting alloca The canonical form for allocas is a single allocation of the array type. In case we see a non-canonical array alloca, make sure we aren't replacing this with an array N times smaller. llvm-svn: 267916	2016-04-28 18:38:48 +00:00
Krzysztof Parzyszek	ff5fb695cc	[Hexagon] Define certain aliases for vector instructions Specifically: Vd = #0 -> Vd = vxor(Vd, Vd) Vdd = #0 -> Vdd.w = vsub(Vdd.w, Vdd.w) Vdd = Vss -> Vdd = vcombine(Vss.H, Vss.L) llvm-svn: 267901	2016-04-28 16:43:16 +00:00
Simon Dardis	156870a1a4	[mips][atomics] Fix partword atomic binary operation implementation Currently Mips::emitAtomicBinaryPartword() does not properly respect the width of pointers. For MIPS64 this causes the memory address that the ll/sc sequence uses to be truncated. At runtime this causes a segmentation fault. This can be fixed by applying similar changes as r266204, so that a full 64bit pointer is loaded. Reviewers: dsanders Differential Review: http://reviews.llvm.org/D19651 llvm-svn: 267900	2016-04-28 16:26:43 +00:00
Arch D. Robison	606517b553	[SLPVectorizer] Extend SLP Vectorizer to deal with aggregates. The refactoring portion was done as r267748. http://reviews.llvm.org/D14185 llvm-svn: 267899	2016-04-28 16:11:45 +00:00
Krzysztof Parzyszek	14103a2bbf	[Hexagon] Handle double-vector registers as new-value producers Patch by Colin LeMahieu. llvm-svn: 267897	2016-04-28 15:54:48 +00:00
Adrian Prantl	cba0c028e4	Debug Info: Restore the pre-r240853 behavior for DWARF2 bitfields. The DWARF2 specification of DW_AT_bit_offset is ambiguous for little-endian machines, but by restoring to the old behavior we match what debuggers expect and what other popular compilers generate. llvm-svn: 267896	2016-04-28 15:37:52 +00:00
Adrian Prantl	ae61e8a345	Debug info: Support DWARF4 bitfields via DW_AT_data_bit_offset. The DWARF2 specification of DW_AT_bit_offset was written from the perspective of a big-endian machine with unclear semantics for other systems. DWARF4 deprecated DW_AT_bit_offset and introduced a new attribute DW_AT_data_bit_offset that simply counts the number of bits from the beginning of the containing entity regardless of endianness. After this patch LLVM emits DW_AT_bit_offset for DWARF 2 or 3 and DW_AT_data_bit_offset when DWARF 4 or later is requested. llvm-svn: 267895	2016-04-28 15:37:48 +00:00
Krzysztof Parzyszek	49d1f997e6	[RDF] Handle undefined registers in RDF copy propagation When updating the graph, make sure that new uses without reaching defs are handled correctly. llvm-svn: 267891	2016-04-28 15:09:19 +00:00
Simon Pilgrim	04933872c3	[InstCombine][SSE] Add MOVMSK support to SimplifyDemandedUseBits The MOVMSK instructions copy the sign bits of a vector's elements to the low bits of a scalar register and zero the high bits. This patch adds MOVMSK support to SimplifyDemandedUseBits so that it's aware that the upper bits are known to be zero. It also removes the call to MOVMSK if none of the lower bits are actually required and just returns zero. Differential Revision: http://reviews.llvm.org/D19614 llvm-svn: 267873	2016-04-28 12:22:53 +00:00
Matthias Braun	18562ab366	CodeGen: Add DetectDeadLanes pass. The DetectDeadLanes pass performs a dataflow analysis of used/defined subregister lanes across COPY instructions and instructions that will get lowered to copies. It detects dead definitions and uses reading undefined values which are obscured by COPY and subregister usage. These dead definitions cause trouble in the register coalescer which cannot deal with definitions suddenly becoming dead after coalescing COPY instructions. For now the pass only adds dead and undef flags to machine operands. It should be possible to extend it in the future to remove the dead instructions and redo the analysis for the affected virtual registers. Differential Revision: http://reviews.llvm.org/D18427 llvm-svn: 267851	2016-04-28 03:07:16 +00:00
Sanjay Patel	9fec0afb7a	Update test to use FileCheck Also, add some metadata to show what that currently looks like. llvm-svn: 267827	2016-04-28 00:29:27 +00:00
Bryan Chan	2567ab558c	[SystemZ] Support Swift Calling Convention Summary: Port rL265480, rL264754, rL265997 and rL266252 to SystemZ, in order to enable the Swift port on the architecture. SwiftSelf and SwiftError are assigned to R10 and R9, respectively, which are normally callee-saved registers. For more information, see: RFC: Implementing the Swift calling convention in LLVM and Clang https://groups.google.com/forum/#!topic/llvm-dev/epDd2w93kZ0 Reviewers: kbarton, manmanren, rjmccall, uweigand Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19414 llvm-svn: 267823	2016-04-28 00:17:23 +00:00
Peter Collingbourne	2442815aeb	LTO: Don't bother trying to mangle unnamed globals, as they can't be preserved with MustPreserveSymbols. Summary: Should fix sanitizer-windows bot. Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D19635 llvm-svn: 267820	2016-04-27 23:48:11 +00:00
Kevin Enderby	b798753bdb	Fix bugs in llvm-objdump printing the last word for -section in non i386 and x86 files. Two problems, 1) for the last 4 bytes it would print them as separate bytes not a word and 2) it would print the same last byte for those bytes less than a word. rdar://25938224 llvm-svn: 267819	2016-04-27 23:43:00 +00:00
Zachary Turner	7ccba4f0bc	Parse module information from DBI stream. This gets more data out of the DBI stream of the PDB. In particular it extracts the metadata for the list of modules (compilands) that this PDB contains info about, and adds support for dumping these fields to llvm-pdbdump. Differential Revision: http://reviews.llvm.org/D19570 Reviewed By: ruiu llvm-svn: 267818	2016-04-27 23:41:42 +00:00
Rong Xu	3a8084117b	more buildbot failure fix to r267792 __llvm_prf_nm length is embedded in llvm_used. Relax llvm_used check. llvm-svn: 267816	2016-04-27 23:23:53 +00:00
Rong Xu	b6a36f2009	[PGO] Promote indirect calls to conditional direct calls with value-profile This patch implements the transformation that promotes indirect calls to conditional direct calls when the indirect-call value profile meta-data is available. Differential Revision: http://reviews.llvm.org/D17864 llvm-svn: 267815	2016-04-27 23:20:27 +00:00
Mitch Bodart	fde6615c97	[X86] Enable the post-RA-scheduler for clang's default 32-bit cpu. For compilations with no explicit cpu specified, this exhibits nice gains on Silvermont, with neutral performance on big cores. Differential Revision: http://reviews.llvm.org/D19138 llvm-svn: 267809	2016-04-27 22:52:35 +00:00
Kevin Enderby	41f6b232d1	Fix a bug in llvm-objdump printing of 32-bit addresses for -section in non i386 and x86 files. rdar://25896202 llvm-svn: 267807	2016-04-27 22:36:18 +00:00
Quentin Colombet	d6bb035737	[X86][FastISel] Make sure we use the right register class when we select stores. llvm-svn: 267806	2016-04-27 22:33:42 +00:00
Rong Xu	db7372e42c	Fix buildbot failure due to r267792 Relax the test check as some targets do not have name compression. llvm-svn: 267803	2016-04-27 22:06:35 +00:00


36044 Commits