llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-03 09:28:58 +00:00

Author	SHA1	Message	Date
Hal Finkel	56c6162a55	Update BBVectorize to use aliasesUnknownInst. This allows BBVectorize to check the "unknown instruction" list in the alias sets. This is important to prevent instruction fusing from reordering function calls. Resolves PR11920. llvm-svn: 150250	2012-02-10 15:52:40 +00:00
Duncan Sands	931ce8ee15	Fix PR11948: the result type of an icmp may be a vector of boolean - don't assume it is a boolean. llvm-svn: 150247	2012-02-10 14:31:24 +00:00
Duncan Sands	205d9394e8	Revert commit 149912 (lattner) and add a testcase that shows the problem (which is that patterns no longer match for vectors of booleans, because you only get ConstantDataVector when the vector element type is i8, i16, etc, not when it is i1). Original commit message: Remove some dead code and tidy things up now that vectors use ConstantDataVector instead of always using ConstantVector. llvm-svn: 150246	2012-02-10 14:26:42 +00:00
Andrew Trick	c3cc8fa604	RegAlloc superpass: includes phi elimination, coalescing, and scheduling. Creates a configurable regalloc pipeline. Ensure specific llc options do what they say and nothing more: -reglloc=... has no effect other than selecting the allocator pass itself. This patch introduces a new umbrella flag, "-optimize-regalloc", to enable/disable the optimizing regalloc "superpass". This allows for example testing coalscing and scheduling under -O0 or vice-versa. When a CodeGen pass requires the MachineFunction to have a particular property, we need to explicitly define that property so it can be directly queried rather than naming a specific Pass. For example, to check for SSA, use MRI->isSSA, not addRequired<PHIElimination>. CodeGen transformation passes are never "required" as an analysis ProcessImplicitDefs does not require LiveVariables. We have a plan to massively simplify some of the early passes within the regalloc superpass. llvm-svn: 150226	2012-02-10 04:10:36 +00:00
Benjamin Kramer	1a2b069bb9	GlobalOpt: Be more aggressive about elminating side-effect free static dtors. GlobalOpt runs early in the pipeline (before inlining) and complex class hierarchies often introduce bitcasts or GEPs which weren't optimized away. Teach it to ignore side-effect free instructions instead of depending on other passes to remove them. llvm-svn: 150174	2012-02-09 14:26:06 +00:00
James Molloy	85be8f7f88	Teach the MC and disassembler about SoftFail, and hook it up to UNPREDICTABLE on ARM. Wire this to tBLX in order to provide test coverage. llvm-svn: 150169	2012-02-09 10:56:31 +00:00
NAKAMURA Takumi	81f7ad5b9b	test/CodeGen/X86/atom-lea-sp.ll: Add explicit -mtriple=i686-linux. llvm-svn: 150151	2012-02-09 05:12:58 +00:00
Evan Cheng	1be96ff50e	Commit Andy Zhang's test for the lea patch. llvm-svn: 150107	2012-02-08 22:33:17 +00:00
Kostya Serebryany	2de61e1628	[asan] unpoison the stack before every noreturn call. Fixes asan issue 37. llvm part llvm-svn: 150102	2012-02-08 21:36:17 +00:00
Elena Demikhovsky	87a6e08d3a	Fixed a bug in printing "cmp" pseudo ops. > This IR code > %res = call <8 x float> @llvm.x86.avx.cmp.ps.256(<8 x float> %a0, <8 x float> %a1, i8 14) > fails with assertion: > > llc: X86ATTInstPrinter.cpp:62: void llvm::X86ATTInstPrinter::printSSECC(const llvm::MCInst, unsigned int, llvm::raw_ostream&): Assertion `0 && "Invalid ssecc argument!"' failed. > 0 llc 0x0000000001355803 > 1 llc 0x0000000001355dc9 > 2 libpthread.so.0 0x00007f79a30575d0 > 3 libc.so.6 0x00007f79a23a1945 gsignal + 53 > 4 libc.so.6 0x00007f79a23a2f21 abort + 385 > 5 libc.so.6 0x00007f79a239a810 __assert_fail + 240 > 6 llc 0x00000000011858d5 llvm::X86ATTInstPrinter::printSSECC(llvm::MCInst const, unsigned int, llvm::raw_ostream&) + 119 I added the full testing for all possible pseudo-ops of cmp. I extended X86AsmPrinter.cpp and X86IntelInstPrinter.cpp. You'l also see lines alignments (unrelated to this fix) in X86IselLowering.cpp from my previous check-in. llvm-svn: 150068	2012-02-08 08:37:26 +00:00
Chad Rosier	b70d1dfae6	[fast-isel] Add support for SUBs with non-legal types. llvm-svn: 150047	2012-02-08 02:45:44 +00:00
Chad Rosier	66b35d7220	Add comment to test case. llvm-svn: 150046	2012-02-08 02:30:12 +00:00
Chad Rosier	1ef78d6989	[fast-isel] Add support for ORs with non-legal types. llvm-svn: 150045	2012-02-08 02:29:21 +00:00
Chad Rosier	26610906f0	[fast-isel] Add support for indirect branches. llvm-svn: 150014	2012-02-07 23:56:08 +00:00
Craig Topper	a8a69356e1	Add instruction selection for 256-bit VPSHUFD and 128-bit VPERMILPS/VPERMILPD. llvm-svn: 149968	2012-02-07 06:28:42 +00:00
Chad Rosier	945ab43c4f	[fast-isel] Add support for ADDs with non-legal types. llvm-svn: 149934	2012-02-06 23:50:07 +00:00
Kostya Serebryany	f4be131943	The patch resolves the conflict between AddressSanitizer and load widening (GVN). The problem initially reported by Mozilla folks (http://code.google.com/p/address-sanitizer/issues/detail?id=20), but it also prevents us from enabling LLVM bootstrap with AddressSanitizer. llvm-svn: 149925	2012-02-06 22:48:56 +00:00
Bill Wendling	2fbed70727	The 'unwind' instruction is deprecated and will be removed, making this test obsolete. llvm-svn: 149880	2012-02-06 18:18:47 +00:00
Nick Lewycky	bad48a142a	Teach GlobalOpt to handle atomic accesses to globals. * Most of the transforms come through intact by having each transformed load or store copy the ordering and synchronization scope of the original. * The transform that turns a global only accessed in main() into an alloca (since main is non-recursive) with a store of the initial value uses an unordered store, since it's guaranteed to be the first thing to happen in main. (Threads may have started before main (!) but they can't have the address of a function local before the point in the entry block we insert our code.) * The heap-SRoA transforms are disabled in the face of atomic operations. This can probably be improved; it seems odd to have atomic accesses to an alloca that doesn't have its address taken. AnalyzeGlobal keeps track of the strongest ordering found in any use of the global. This is more information than we need right now, but it's cheap to compute and likely to be useful. llvm-svn: 149847	2012-02-05 19:56:38 +00:00
Duncan Sands	fb60d2db35	Testcase for commit 149833 (use of an uninitialized variable noticed by GCC). llvm-svn: 149840	2012-02-05 19:27:57 +00:00
Duncan Sands	eb56d51cfb	Reduce the number of dom queries made by GVN's conditional propagation logic by half: isOnlyReachableViaThisEdge was trying to be clever and handle the case of a branch to a basic block which is contained in a loop. This costs a domtree lookup and is completely useless due to GVN's position in the pass pipeline: all loops have preheaders at this point, which means it is enough for isOnlyReachableViaThisEdge to check that Dst has only one predecessor. (I checked this theoretical argument by running over the entire nightly testsuite, and indeed it is so!). llvm-svn: 149838	2012-02-05 18:25:50 +00:00
Benjamin Kramer	8e54f21216	Testing vector code without sse doesn't make much sense. Should bring arm and ppc testers back to life (they default to -mcpu=generic) llvm-svn: 149821	2012-02-05 11:19:39 +00:00
Chris Lattner	4881c9ecb0	Add a test for the miscompilation my recent ConstantDataArray patches introduced, to make sure we don't regress on it in the future. llvm-svn: 149803	2012-02-05 02:37:36 +00:00
Craig Topper	c289726019	Remove most of the intrinsics for XOP VPCMOV instruction. They all aliased to the same instruction with different types. This would be better accomplished with casts in the not yet created xopintrin.h header file. llvm-svn: 149795	2012-02-05 00:55:56 +00:00
Hal Finkel	34ae699943	Boost the effective chain depth of loads and stores. By default, boost the chain depth contribution of loads and stores. This will allow a load/store pair to vectorize even when it would not otherwise be long enough to satisfy the chain depth requirement. llvm-svn: 149761	2012-02-04 04:14:04 +00:00
Chad Rosier	ec3053c33c	[fast-isel] HandlePHINodesInSuccessorBlocks() can promite i8 and i16 types too. llvm-svn: 149730	2012-02-04 00:39:19 +00:00
Chad Rosier	cff3c98417	[fast-isel] Add support for FPToUI. Also add test cases for FPToSI. llvm-svn: 149706	2012-02-03 20:27:51 +00:00
Chad Rosier	40b3e74387	[fast-isel] Add support for selecting UIToFP. llvm-svn: 149704	2012-02-03 19:42:52 +00:00
Nadav Rotem	5c5681cf27	The type-legalizer often scalarizes code. One of the common patterns is extract-and-truncate. In this patch we optimize this pattern and convert the sequence into extract op of a narrow type. This allows the BUILD_VECTOR dag optimizations to construct efficient shuffle operations in many cases. llvm-svn: 149692	2012-02-03 13:18:25 +00:00
Akira Hatanaka	874523adc5	Add a new MachineJumpTableInfo entry type, EK_GPRel64BlockAddress, which is needed to emit a 64-bit gp-relative relocation entry. Make changes necessary for emitting jump tables which have entries with directive .gpdword. This patch does not implement the parts needed for direct object emission or JIT. llvm-svn: 149668	2012-02-03 04:33:00 +00:00
Dan Gohman	d18622bd02	Fix SSAUpdaterImpl's RecordMatchingPHI to record exactly the PHI nodes which were matched, rather than climbing up the original PHI node's operands to rediscover PHI nodes for recording, since the PHI nodes found that are not necessarily part of the matched set. This fixes rdar://10589171. llvm-svn: 149654	2012-02-03 01:07:01 +00:00
Jim Grosbach	bc7e9b3c96	Revert "Disable InstCombine unsafe folding bitcasts of calls w/ varargs." This reverts commit d0e277d272d517ca1cda368267d199f0da7cad95. llvm-svn: 149647	2012-02-03 00:00:50 +00:00
Matt Beaumont-Gay	8b5dfe05f5	Unix line endings llvm-svn: 149615	2012-02-02 19:00:49 +00:00
NAKAMURA Takumi	a7f8fe6300	Move test/CodeGen/Generic/2012-02-01-CoalescerBug.ll to CodeGen/ARM, for now. It requires TARGETS=arm. I cannot reproduce a fixed issue with other targets. llvm-svn: 149604	2012-02-02 11:44:58 +00:00
Elena Demikhovsky	7ca11b6e3f	Optimization for SIGN_EXTEND operation on AVX. Special handling was added for v4i32 -> v4i64 and v8i16 -> v8i32 extensions. llvm-svn: 149600	2012-02-02 09:10:43 +00:00
Lang Hames	004f627ed6	Set EFLAGS correctly in EmitLoweredSelect on X86. llvm-svn: 149597	2012-02-02 07:48:37 +00:00
Lang Hames	2efb52b518	PR11868. The previous loop in LiveIntervals::join would sometimes fall over if more than two adjacent ranges needed to be merged. The new version should be able to handle an arbitrary sequence of adjancent ranges. llvm-svn: 149588	2012-02-02 05:37:34 +00:00
Andrew Trick	d09b64fc25	Instruction scheduling itinerary for Intel Atom. Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT. Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches. Adds a test to verify that the scheduler is working. Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP. Patch by Preston Gurd! llvm-svn: 149558	2012-02-01 23:20:51 +00:00
Mon P Wang	7313ffe333	Avoid creating an extract element to an illegal type after LegalizeTypes has run. llvm-svn: 149548	2012-02-01 22:15:20 +00:00
Andrew Trick	b4963dd8da	VLIW specific scheduler framework that utilizes deterministic finite automaton (DFA). This new scheduler plugs into the existing selection DAG scheduling framework. It is a top-down critical path scheduler that tracks register pressure and uses a DFA for pipeline modeling. Patch by Sergei Larin! llvm-svn: 149547	2012-02-01 22:13:57 +00:00
NAKAMURA Takumi	0bb21fdfce	test/CodeGen/X86/avx-minmax.ll: Relax expressions for Win32 targets. YMM arguments are passed as indirect on Win32 x64. llvm-svn: 149505	2012-02-01 14:35:29 +00:00
Elena Demikhovsky	455db87d41	Passing AVX 256-bit structures in Win64 was wrong. Fixed Win64 calling conventions. llvm-svn: 149494	2012-02-01 10:46:14 +00:00
Elena Demikhovsky	da37eb48d8	Optimization for "truncate" operation on AVX. Truncating v4i64 -> v4i32 and v8i32 -> v8i16 may be done with set of shuffles. llvm-svn: 149485	2012-02-01 07:56:44 +00:00
Hal Finkel	8cf5de5774	Add a basic-block autovectorization pass. This is the initial checkin of the basic-block autovectorization pass along with some supporting vectorization infrastructure. Special thanks to everyone who helped review this code over the last several months (especially Tobias Grosser). llvm-svn: 149468	2012-02-01 03:51:43 +00:00
Jim Grosbach	6186319c3f	Disable InstCombine unsafe folding bitcasts of calls w/ varargs. Changing arguments from being passed as fixed to varargs is unsafe, as the ABI may require they be handled differently (stack vs. register, for example). Remove two tests which rely on the bitcast being folded into the direct call, which is exactly the transformation that's unsafe. llvm-svn: 149457	2012-02-01 00:08:17 +00:00
Kevin Enderby	cb876a7560	Fixed a crash in llvm-mc for Mach-O when a symbol difference expression uses a symbol from an assignment. In this case the symbol did not have a fragment so MCObjectWriter::IsSymbolRefDifferenceFullyResolved() should not have been calling IsSymbolRefDifferenceFullyResolvedImpl() with a NULL fragment and should just have returned false in that case. llvm-svn: 149442	2012-01-31 23:02:57 +00:00
Craig Topper	2b764de6ab	Remove pcmpgt/pcmpeq intrinsics as clang is not using them. llvm-svn: 149367	2012-01-31 06:52:44 +00:00
Bill Wendling	7761976036	Remove all references to the old EH. There was always the current EH. -- Ministry of Truth llvm-svn: 149335	2012-01-31 02:09:07 +00:00
Bill Wendling	76beba7841	Update test to new EH model. llvm-svn: 149333	2012-01-31 02:05:13 +00:00
Bill Wendling	8402993dd4	Update test to new EH model. llvm-svn: 149332	2012-01-31 02:04:20 +00:00

1 2 3 4 5 ...

15592 Commits