llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-27 05:55:40 +00:00

Author	SHA1	Message	Date
Chris Lattner	759c4201e9	the new scalarrepl changes are optimizing away a temporary alloca in check242, which invalidates this test. This test is an x86-32 ABI test that is trying to be run in a target-independent way, which is not going to work very well. Just remove the test. llvm-svn: 61921	2009-01-08 07:58:23 +00:00
Chris Lattner	60a03a2f36	This implements the second half of the fix for PR3290, handling loads from allocas that cover the entire aggregate. This handles some memcpy/byval cases that are produced by llvm-gcc. This triggers a few times in kc++ (with std::pair<std::_Rb_tree_const_iterator <kc::impl_abstract_phylum*>,bool>) and once in 176.gcc (with %struct..0anon). llvm-svn: 61915	2009-01-08 05:42:05 +00:00
Misha Brukman	6338af14f6	Fix off-by-one error in traversing an array; this fixes a test. The error was reported by gcc-4.3.0 during compilation. llvm-svn: 61896	2009-01-07 23:07:29 +00:00
Duncan Sands	a254acd1d3	Remove alloca tracking from nocapture analysis. Not only was it not very helpful, it was also wrong! The problem is shown in the testcase: the alloca might be passed to a nocapture callee which dereferences it and returns the original pointer. But because it was a nocapture call we think we don't need to track its uses, but we do. llvm-svn: 61876	2009-01-07 19:39:06 +00:00
Chris Lattner	8adf14ea21	Implement the first half of PR3290: if there is a store of an integer to a (transitive) bitcast the alloca and if that integer has the full size of the alloca, then it clobbers the whole thing. Handle this by extracting pieces out of the stored integer and filing them away in the SROA'd elements. This triggers fairly frequently because the CFE uses integers to pass small structs by value and the inliner exposes these. For example, in kimwitu++, I see a bunch of these with i64 stores to "%struct.std::pair<std::_Rb_tree_const_iterator<kc::impl_abstract_phylum*>,bool>" In 176.gcc I see a few i32 stores to "%struct..0anon". In the testcase, this is a difference between compiling test1 to: _test1: subl $12, %esp movl 20(%esp), %eax movl %eax, 4(%esp) movl 16(%esp), %eax movl %eax, (%esp) movl (%esp), %eax addl 4(%esp), %eax addl $12, %esp ret vs: _test1: movl 8(%esp), %eax addl 4(%esp), %eax ret The second half of this will be to handle loads of the same form. llvm-svn: 61853	2009-01-07 08:11:13 +00:00
Evan Cheng	a70ecc2f51	The coalescer does not coalesce a virtual register to a physical register if any of the physical register's sub-register live intervals overlaps with the virtual register. This is overly conservative. It prevents a extract_subreg from being coalesced away: v1024 = EDI // not killed = = EDI One possible solution is for the coalescer to examine the sub-register live intervals in the same manner as the physical register. Another possibility is to examine defs and uses (when needed) of sub-registers. Both solutions are too expensive. For now, look for "short virtual intervals" and scan instructions to look for conflict instead. This is a small win on x86-64. e.g. It shaves 403.gcc by ~80 instructions. llvm-svn: 61847	2009-01-07 02:08:57 +00:00
Chris Lattner	f6de7aa2c9	add a testcase. llvm-svn: 61845	2009-01-07 01:48:08 +00:00
Dan Gohman	ca4475dd7b	Add patterns to match conditional moves with loads folded into their left operand, rather than their right. Do this by commuting the operands and inverting the condition. llvm-svn: 61842	2009-01-07 01:00:24 +00:00
Dan Gohman	2682e8745c	X86_COND_C and X86_COND_NC are alternate mnemonics for X86_COND_B and X86_COND_AE, respectively. llvm-svn: 61835	2009-01-07 00:15:08 +00:00
Dan Gohman	4edc9d725b	Now that fold-pcmpeqd-0.ll is effectively testing that scheduling helps avoid the need for spilling, add a new testcase that tests that the pcmpeqd used for V_SETALLONES is changed to a constant-pool load as needed. llvm-svn: 61831	2009-01-06 23:48:10 +00:00
Dan Gohman	e033f7c41e	Revert r42653 and forward-port the code that lets INC64_32r be converted to LEA64_32r in x86's convertToThreeAddress. This replaces code like this: movl %esi, %edi inc %edi with this: lea 1(%rsi), %edi which appears to be beneficial. llvm-svn: 61830	2009-01-06 23:34:46 +00:00
Dan Gohman	b19f5073f9	Fix a bug in ComputeLinearIndex computation handling multi-level aggregate types. Don't increment the current index after reaching the end of a struct, as it will already be pointing at one-past-the end. This fixes PR3288. llvm-svn: 61828	2009-01-06 22:53:52 +00:00
Scott Michel	c30557841b	CellSPU: - Fix bugs 3194, 3195: i128 load/stores produce correct code (although, we need to ensure that i128 is 16-byte aligned in real life), and 128 zero- extends are supported. - New td file: SPU128InstrInfo.td: this is where all new i128 support should be put in the future. - Continue to hammer on i64 operations and test cases; ensure that the only remaining problem will be i64 mul. llvm-svn: 61784	2009-01-06 03:36:14 +00:00
Dan Gohman	cf1ac86514	Delete this test; it's a duplicate of 2006-07-03-schedulers.ll. llvm-svn: 61781	2009-01-06 01:36:23 +00:00
Dan Gohman	1cdb677fc8	Use a latency value of 0 for the artificial edges inserted by AddPseudoTwoAddrDeps. This lets the scheduling infrastructure avoid recalculating node heights. In very large testcases this was a major bottleneck. Thanks to Roman Levenstein for finding this! As a side effect, fold-pcmpeqd-0.ll is now scheduled better and it no longer requires spilling on x86-32. llvm-svn: 61778	2009-01-06 01:19:04 +00:00
Chris Lattner	e10764369d	make m_ConstantInt(int64_t) safely match ConstantInt's that are larger than i64. This fixes an instcombine crash on PR3235. llvm-svn: 61775	2009-01-05 23:45:50 +00:00
Bill Wendling	4e68cc9ef0	Strength test. llvm-svn: 61755	2009-01-05 21:27:59 +00:00
Duncan Sands	130c00e4b2	Teach the internalize pass to also internalize global aliases. llvm-svn: 61754	2009-01-05 21:24:45 +00:00
Evan Cheng	36e238a4d3	Find loop back edges only after empty blocks are eliminated. llvm-svn: 61752	2009-01-05 21:17:27 +00:00
Chris Lattner	d72ab018c0	testcase for bill's patch. llvm-svn: 61751	2009-01-05 21:07:34 +00:00
Duncan Sands	3b98802e9a	Delete unused global aliases with internal linkage. In fact this also deletes those with linkonce linkage, however this is currently dead because for the moment aliases aren't allowed to have this linkage type. llvm-svn: 61742	2009-01-05 20:37:33 +00:00
Duncan Sands	e4ef28d52c	Don't spew bitcode to standard out if this test fails, like it is right now. llvm-svn: 61690	2009-01-05 10:52:29 +00:00
Torok Edwin	e97eb75c44	This test passes again, unXFAIL. llvm-svn: 61688	2009-01-05 09:30:47 +00:00
Chris Lattner	29c7f160c4	alignment of 0 is not valid. llvm-svn: 61682	2009-01-05 08:14:35 +00:00
Scott Michel	733d5f71a0	CellSPU: - Teach SPU64InstrInfo.td about the remaining signed comparisons, update tests accordingly. llvm-svn: 61672	2009-01-05 04:05:53 +00:00
Scott Michel	06c324c6c7	CellSPU: - Add an 8-bit operation test, which doesn't do much at this point. llvm-svn: 61665	2009-01-05 01:35:22 +00:00
Scott Michel	0d9d939406	CellSPU: - Fix (brcond (setq ...)) bug, where BRNZ should have been used vice BRZ. - Kill unused/unnecessary nodes in SPUNodes.td - Beef out the i64operations.c test harness to use a lot of unaligned loads, test loops and LLVM loop/basic block optimizations; run the test harness successfully on real Cell hardware. llvm-svn: 61664	2009-01-05 01:34:35 +00:00
Nick Lewycky	6685977938	Run a post-pass that marks known function declarations by name. llvm-svn: 61632	2009-01-04 20:27:34 +00:00
Bill Wendling	dd61282551	XFAIL this test. The xform was removed. llvm-svn: 61624	2009-01-04 06:32:28 +00:00
Dan Gohman	2a079de3f5	Fix a DAGCombiner abort on an invalid shift count constant. This fixes PR3250. llvm-svn: 61613	2009-01-03 19:22:06 +00:00
Scott Michel	0309418000	CellSPU: - Remove custom lowering for BRCOND - Add remaining functionality for branches in SPUInstrInfo, such as branch condition reversal and load/store folding. Updated BrCond test to reflect branch reversal. llvm-svn: 61597	2009-01-03 00:27:53 +00:00
Nick Lewycky	2c01a8db3d	Don't try to analyze this "backward" case. This is overly conservative pending a correct solution. llvm-svn: 61589	2009-01-02 18:54:17 +00:00
Duncan Sands	c087ba24aa	When calculating 'nocapture' argument attributes, allow the argument to be stored to an alloca by tracking uses of the alloca. This occurs 4 times (out of 7121, 0.05%) in MultiSource/Applications, so may not be worth it. On the other hand, it is easy to do and fairly cheap. The functions it helps are: W_addcom and W_addlit in spiff; process_args (argv) in d (make_dparser); ercPixConcealIMB in JM/ldecod. llvm-svn: 61570	2009-01-02 11:54:37 +00:00
Chris Lattner	f28c74870f	Reimplement the old and horrible bison parser for .ll files with a nice and clean recursive descent parser. This change has a couple of ramifications: 1. The parser code is about 400 lines shorter (in what we maintain, not including what is autogenerated). 2. The code should be significantly faster than the old code because we don't have to work around bison's poor handling of datatypes with ctors/dtors. This also makes the code much more resistant to memory leaks. 3. We now get caret diagnostics from the .ll parser, woo. 4. The actual diagnostics emited from the parser are completely different so a bunch of testcases had to be updated. 5. I now disallow "%ty = type opaque %ty = type i32". There was no good reason to support this, it was just an accident of the old implementation. I have no reason to think that anyone is actually using this. 6. The syntax for sticking a global variable has changed to make it unambiguous. I don't think anyone is depending on this since only clang supports this and it is not solid yet, so I'm not worried about anything breaking. 7. This gets rid of the last use of bison, and along with it the .cvs files. I'll prune this from the makefiles as a subsequent commit. There are a few minor cleanups that can be done after this commit (suggestions welcome!) but this passes dejagnu testing and is ready for its time in the limelight. llvm-svn: 61558	2009-01-02 07:01:27 +00:00
Evan Cheng	c52f942d67	Do not isel load folding bt instructions for pentium m, core, core2, and AMD processors. These are significantly slower than a load followed by a bt of a register. llvm-svn: 61557	2009-01-02 05:35:45 +00:00
Evan Cheng	57115c1887	Use movaps / movd to extract vector element 0 even with sse4.1. It's still cheaper than pextrw especially if the value is in memory. llvm-svn: 61555	2009-01-02 05:29:08 +00:00
Nick Lewycky	0993a85522	Remove the cyclic part of this test, it was passing for the wrong reason. Two functions which mutually require each other to be nocapture are not currently supported. llvm-svn: 61553	2009-01-02 03:52:27 +00:00
Nick Lewycky	6c53fbb21d	Make adding nocapture a bit stronger. FreeInst is nocapture. Also, functions that don't write can't leak a pointer except through the return value, so a void readonly function is implicitly nocapture. Test these, and add a test that verifies that f1 calling f2 with an otherwise dead pointer gets both of them marked nocapture. llvm-svn: 61552	2009-01-02 03:46:56 +00:00
Chris Lattner	2d3e57c337	rename a file to follow naming conventions. llvm-svn: 61550	2009-01-02 01:52:35 +00:00
Duncan Sands	253f6a5dce	Add tests for two types of traps that escape analysis might one day fall into. llvm-svn: 61549	2009-01-02 00:55:51 +00:00
Misha Brukman	361188d5bc	* Quoted the executable 'runtest' to emphasize the binary needed; otherwise, some unlucky souls start looking for a 'dejagnu' binary... * Properly capitalized LLVM. llvm-svn: 61546	2009-01-01 20:26:05 +00:00
Duncan Sands	190d6bc636	Fix PR3274: when promoting the condition of a BRCOND node, promote from i1 all the way up to the canonical SetCC type. In order to discover an appropriate type to use, pass MVT::Other to getSetCCResultType. In order to be able to do this, change getSetCCResultType to take a type as an argument, not a value (this is also more logical). llvm-svn: 61542	2009-01-01 15:52:00 +00:00
Bill Wendling	efbe8b808c	Add transformation: xor (or (icmp, icmp), true) -> and(icmp, icmp) This is possible because of De Morgan's law. llvm-svn: 61537	2009-01-01 01:18:23 +00:00
Duncan Sands	e112cf52cb	Look through phi nodes and select instructions when calculating nocapture attributes. llvm-svn: 61535	2008-12-31 20:21:34 +00:00
Bill Wendling	e288a29970	This is not failing on Darwin for some reason. XFAIL for other platforms. llvm-svn: 61533	2008-12-31 19:26:09 +00:00
Misha Brukman	d7b9e41922	Removed extra spaces. llvm-svn: 61527	2008-12-31 17:38:27 +00:00
Duncan Sands	36db5853cb	Rename AddReadAttrs to FunctionAttrs, and teach it how to work out (in a very simplistic way) which function arguments (pointer arguments only) are only dereferenced and so do not escape. Mark such arguments 'nocapture'. llvm-svn: 61525	2008-12-31 16:14:43 +00:00
Bill Wendling	c73286fc81	XFAIL test caused by r61493. Apparently, this is expected? llvm-svn: 61516	2008-12-31 08:26:55 +00:00
Scott Michel	c163bf5042	XFAIL this for now until I can figure out what's going on. llvm-svn: 61512	2008-12-31 00:08:25 +00:00
Scott Michel	12a5f7cfb9	Fix test erratum (which is wierd: works locally for me?) llvm-svn: 61511	2008-12-30 23:52:05 +00:00

1 2 3 4 5 ...

6201 Commits