RPCSX/llvm - llvm - Gitea: Git with a cup of tea

RPCSX/llvm

mirror of https://github.com/RPCSX/llvm.git synced 2024-12-04 01:43:06 +00:00

Author	SHA1	Message	Date
Chris Lattner	db5f5e64fe	Turn on earlycse by default. This seems to be a small performance improvement in the generated code, and speeds up 'opt -std-compile-opts' compile time on 176.gcc from 24.84s to 23.2s (about 7%). This also resolves a specific code quality issue in rdar://7352081 which was generating poor code for: int t(int a, int b) { if (a & b & 1) return a & b; return 3; } git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122740 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 06:19:09 +00:00
Nick Lewycky	9959706735	Further expand what a call graph pass may do. The rationale is that after analyzing a function in the SCC, we may want to modify it in a way that requires us to update its uses (f.e. to replace the call with a constant) or its users (f.e. to call it with fewer arguments). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122739 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 06:16:07 +00:00
Chris Lattner	75637154c3	earlycse can do trivial with-a-block dead store elimination as well. This deletes 60 stores in 176.gcc that largely come from bitfield code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122736 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 04:17:24 +00:00
Cameron Zwarich	53eeba586d	Use a RecyclingAllocator to allocate values for MachineCSE's ScopedHashTable for a 28% speedup of MachineCSE time on 403.gcc. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122735 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 04:07:46 +00:00
Nick Lewycky	041ee7e6fa	Permit CallGraphSCCPasses readonly access to the direct callers of the functions in their SCC as they already have with the direct callees. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122734 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 04:01:44 +00:00
Chris Lattner	71230acbbe	switch the load table to use a recycling bump pointer allocator, speeding earlycse up by 6%. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122733 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 03:53:50 +00:00
Chris Lattner	ef87fc2e0a	now that loads are in their own table, we can implement store->load forwarding. This allows EarlyCSE to zap 600 more loads from 176.gcc. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122732 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 03:46:34 +00:00
Chris Lattner	85db61066a	split loads and calls into separate tables. Loads are now just indexed by their pointer instead of using MemoryValue to wrap it. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122731 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 03:41:27 +00:00
Chris Lattner	03d49e955e	add a testcase for readonly call CSE git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122730 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 03:33:47 +00:00
Chris Lattner	a60a8b0eb7	various cleanups, no functionality change. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122729 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 03:28:23 +00:00
Nick Lewycky	9ad1cb59de	Add spliceFunction to the CallGraph interface. This allows users to efficiently update a callGraph when performing the common operation of splicing the body to a new function and updating all callers (such as via RAUW). No users yet, though this is intended for DeadArgumentElimination as part of PR8887. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122728 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 03:19:35 +00:00
Chris Lattner	8e7f0d70c7	Teach EarlyCSE to do trivial CSE of loads and read-only calls. On 176.gcc, this catches 13090 loads and calls, and increases the number of simple instructions CSE'd from 29658 to 36208. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122727 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 03:18:43 +00:00
Chris Lattner	152096275a	add a handy typedef. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122726 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 03:16:20 +00:00
Chris Lattner	f19745947d	rename InstValue to SimpleValue, add some comments. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122725 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 02:20:48 +00:00
Michael J. Spencer	f0da7299fa	CMake: Add missing source file. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122724 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 02:13:05 +00:00
Chris Lattner	82dcd5edd2	Allocate nodes for the scoped hash table from a recyling bump pointer allocator. This speeds up early cse by about 20% git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122723 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 01:42:46 +00:00
Chris Lattner	4f20c6d354	really get this working with a custom allocator. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122722 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 01:38:29 +00:00
Chris Lattner	61a10a0dc9	Enhance ScopedHashTable to allow it to take an allocator argument. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122721 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 01:29:37 +00:00
Chris Lattner	d957c71791	reduce redundancy in the hashing code and other misc cleanups. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122720 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 01:10:08 +00:00
Cameron Zwarich	832f61117d	Add a new loop-instsimplify pass, with the intention of replacing the instance of instcombine that is currently in the middle of the loop pass pipeline. This commit only checks in the pass; it will hopefully be enabled by default later. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122719 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-03 00:25:16 +00:00
Chris Lattner	67678c8307	fix some pastos git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122718 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 23:29:58 +00:00
Chris Lattner	91139ccd99	add DEBUG and -stats output to earlycse. Teach it to CSE the rest of the non-side-effecting instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122716 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 23:19:45 +00:00
Chris Lattner	cc9eab26b3	Enhance earlycse to do CSE of casts, instsimplify and die. Add a testcase. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122715 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 23:04:14 +00:00
Chris Lattner	9fc5cdf77c	split dom frontier handling stuff out to its own DominanceFrontier header, so that Dominators.h is just domtree. Also prune #includes a bit. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122714 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 22:09:33 +00:00
Chris Lattner	12be936cc9	sketch out a new early cse pass. No functionality yet. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122713 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 21:47:05 +00:00
Chris Lattner	63f9c3c49a	fix a miscompilation of tramp3d-v4: when forming a memcpy, we have to make sure that the loop we're promoting into a memcpy doesn't mutate the input of the memcpy. Before we were just checking that the dest of the memcpy wasn't mod/ref'd by the loop. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122712 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 21:14:18 +00:00
Chris Lattner	8e08e73f0e	If a loop iterates exactly once (has backedge count = 0) then don't mess with it. We'd rather peel/unroll it than convert all of its stores into memsets. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122711 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 20:24:21 +00:00
Benjamin Kramer	80220369b0	Try to reuse the value when lowering memset. This allows us to compile: void test(char *s, int a) { __builtin_memset(s, a, 15); } into 1 mul + 3 stores instead of 3 muls + 3 stores. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122710 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 19:57:05 +00:00
Benjamin Kramer	8c06aa1c59	Lower the i8 extension in memset to a multiply instead of a potentially long series of shifts and ors. We could implement a DAGCombine to turn x * 0x0101 back into logic operations on targets that doesn't support the multiply or it is slow (p4) if someone cares enough. Example code: void test(char *s, int a) { __builtin_memset(s, a, 4); } before: _test: ## @test movzbl 8(%esp), %eax movl %eax, %ecx shll $8, %ecx orl %eax, %ecx movl %ecx, %eax shll $16, %eax orl %ecx, %eax movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret after: _test: ## @test movzbl 8(%esp), %eax imull $16843009, %eax, %eax ## imm = 0x1010101 movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122707 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 19:44:58 +00:00
Oscar Fuentes	3458534f11	A workaround for a bug in cmake 2.8.3 diagnosed on PR 8885. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122706 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 19:32:31 +00:00
Nick Lewycky	d081b04f99	Also remove functions that use complex constant expressions in terms of another function. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122705 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 19:16:44 +00:00
Chris Lattner	62c50fdf69	enhance loop idiom recognition to scan all unconditionally executed blocks in a loop, instead of just the header block. This makes it more aggressive, able to handle Duncan's Ada examples. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122704 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 19:01:03 +00:00
Chris Lattner	81a866d615	make inSubLoop much more efficient. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122703 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 18:53:08 +00:00
Chris Lattner	83fc584ea8	rip out isExitBlockDominatedByBlockInLoop, calling DomTree::dominates instead. isExitBlockDominatedByBlockInLoop is a relic of the days when domtree was just a tree and didn't have DFS numbers. Checking DFS numbers is faster and easier than "limiting the search of the tree". git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122702 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 18:45:39 +00:00
Chris Lattner	bdce5720ad	add a list of opportunities for future improvement. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122701 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 18:32:09 +00:00
Chris Lattner	527b47d189	update a bunch of entries. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122700 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 18:31:38 +00:00
Duncan Sands	67fb341f8b	Fix PR8702 by not having LoopSimplify claim to preserve LCSSA form. As described in the PR, the pass could break LCSSA form when inserting preheaders. It probably would be easy enough to fix this, but since currently we always go into LCSSA form after running this pass, doing so is not urgent. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122695 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 13:38:21 +00:00
Cameron Zwarich	86a582ddfb	Remove an unused member function. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122693 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 12:37:22 +00:00
Oscar Fuentes	449d4e12c9	Propagate to parent scope changes made to CMAKE_CXX_FLAGS. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122692 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 12:30:18 +00:00
Cameron Zwarich	94942cec11	Fix a typo in a variable name. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122691 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 12:17:10 +00:00
Cameron Zwarich	63e07d72f8	Move a load into the only branch where it is used and eliminate a temporary. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122690 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 10:50:14 +00:00
Cameron Zwarich	2a8c22aa68	Add the explanatory comment from r122680's commit message to the code itself. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122689 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 10:40:14 +00:00
Cameron Zwarich	19feb4ca8a	Tidy up indentation. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122688 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 10:10:02 +00:00
Cameron Zwarich	a4d93162cb	Fix a typo, which should also fix the failure on llvm-x86_64-linux-checks. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122687 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 10:06:44 +00:00
Chris Lattner	cf078f2b20	Allow loop-idiom to run on multiple BB loops, but still only scan the loop header for now for memset/memcpy opportunities. It turns out that loop-rotate is successfully rotating loops, but DOESN'T MERGE THE BLOCKS, turning "for loops" into 2 basic block loops that loop-idiom was ignoring. With this fix, we form many many more memcpy and memsets than before, including on the "history" loops in the viterbi benchmark, which look like this: for (j=0; j<MAX_history; ++j) { history_new[i][j+1] = history[2*i][j]; } Transforming these loops into memcpy's speeds up the viterbi benchmark from 11.98s to 3.55s on my machine. Woo. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122685 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 07:58:36 +00:00
Cameron Zwarich	c9e152b778	Remove the #ifdef'd code for balancing the eval-link data structure. It doesn't compile, and everyone's tests have shown it to be slower in practice, even for quite large graphs. I also hope to do an optimization that is only correct with the simpler data structure, which would break this even further. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122684 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 07:53:49 +00:00
Chris Lattner	6b04160dd8	remove debugging code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122683 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 07:37:13 +00:00
Chris Lattner	4ce31fb574	add some -stats output. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122682 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 07:36:44 +00:00
Chris Lattner	d9e079706e	improve loop rotation to use CodeMetrics to analyze the size of a loop header instead of its own code size estimator. This allows it to handle bitcasts etc more precisely. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122681 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 07:35:53 +00:00
Cameron Zwarich	8252ad1351	Speed up dominator computation some more by optimizing bucket processing. When naively implemented, the Lengauer-Tarjan algorithm requires a separate bucket for each vertex. However, this is unnecessary, because each vertex is only placed into a single bucket (that of its semidominator), and each vertex's bucket is processed before it is added to any bucket itself. Instead of using a bucket per vertex, we use a single array Buckets that has two purposes. Before the vertex V with DFS number i is processed, Buckets[i] stores the index of the first element in V's bucket. After V's bucket is processed, Buckets[i] stores the index of the next element in the bucket to which V now belongs, if any. Reading from the buckets can also be optimized. Instead of processing the bucket of V's parent at the end of processing V, we process the bucket of V itself at the beginning of processing V. This means that the case of the root vertex can be simplified somewhat. It also means that we don't need to look up the DFS number of the semidominator of every node in the bucket we are processing, since we know it is the current index being processed. This is a 6.5% speedup running -domtree on test-suite + SPEC2000/2006, with larger speedups of around 12% on the larger benchmarks like GCC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122680 91177308-0d34-0410-b5e6-96231b3b80d8	2011-01-02 07:03:00 +00:00

1 2 3 4 5 ...

68930 Commits