llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-23 04:16:42 +00:00

Author	SHA1	Message	Date
Renato Golin	cf89d692dc	Clang was not parsing target triples involving EABI and was generating wrong IR (wrong PCS) and passing the wrong information down llc via the target-triple printed in IR. I've fixed this by adding the parsing of EABI into LLVM's Triple class and using it to choose the correct PCS in Clang's Tools. A Clang patch is on its way to use this infrastructure. llvm-svn: 123990	2011-01-21 18:25:47 +00:00
Oscar Fuentes	da69ba084d	Handles libffi on the CMake build. Patch by arrowdodger! llvm-svn: 123976	2011-01-21 15:42:54 +00:00
Bruno Cardoso Lopes	2f96371a7a	Fix the encoding of QADD/SUB, QDADD/SUB. While qadd16, qadd8 use "rd, rn, rm", qadd and qdadd uses "rd, rm, rn", the same applies to the 'sub' variants. This is described in ARM manuals and matches the encoding used by the gnu assembler. llvm-svn: 123975	2011-01-21 14:07:40 +00:00
Venkatraman Govindaraju	6a083c355b	Implement support for byval arguments in Sparc backend. llvm-svn: 123974	2011-01-21 14:00:01 +00:00
Nick Lewycky	5de10fb201	SCCP doesn't actually preserve the CFG. It will delete and insert terminator instructions. llvm-svn: 123973	2011-01-21 08:38:09 +00:00
Andrew Trick	e0bccb5f87	Enable support for precise scheduling of the instruction selection DAG. Disable using "-disable-sched-cycles". For ARM, this enables a framework for modeling the cpu pipeline and counting stalls. It also activates several heuristics to drive scheduling based on the model. Scheduling is inherently imprecise at this stage, and until spilling is improved it may defeat attempts to schedule. However, this framework provides greater control over tuning codegen. Although the flag is not target-specific, it should have very little affect on the default scheduler used by x86. The only two changes that affect x86 are: - scheduling a high-latency operation bumps the current cycle so independent operations can have their latency covered. i.e. two independent 4 cycle operations can produce results in 4 cycles, not 8 cycles. - Two operations with equal register pressure impact and no latency-based stalls on their uses will be prioritized by depth before height (height is irrelevant if no stalls occur in the schedule below this point). llvm-svn: 123971	2011-01-21 06:19:05 +00:00
Andrew Trick	7155e98904	Convert -enable-sched-cycles and -enable-sched-hazard to -disable flags. They are still not enable in this revision. Added TargetInstrInfo::isZeroCost() to fix a fundamental problem with the scheduler's model of operand latency in the selection DAG. Generalized unit tests to work with sched-cycles. llvm-svn: 123969	2011-01-21 05:51:33 +00:00
Chris Lattner	f225708ef1	fix PR9013, an infinite loop in instcombine. llvm-svn: 123968	2011-01-21 05:29:50 +00:00
Chris Lattner	9a4cefc8ee	update obsolete comment. llvm-svn: 123965	2011-01-21 05:08:26 +00:00
Nick Lewycky	c4300debc2	Don't try to pull vector bitcasts that change the number of elements through a select. A vector select is pairwise on each element so we'd need a new condition with the right number of elements to select on. Fixes PR8994. llvm-svn: 123963	2011-01-21 02:30:43 +00:00
Michael J. Spencer	e4fe76fa22	Object: Fix type punned pointer issues by making DataRefImpl a union and using intptr_t. llvm-svn: 123962	2011-01-21 02:27:02 +00:00
Nick Lewycky	50f86f3414	Add a constant folding of casts from zero to zero. Fixes PR9011! While here, I'd like to complain about how vector is not an aggregate type according to llvm::Type::isAggregateType(), but they're listed under aggregate types in the LangRef and zero vectors are stored as ConstantAggregateZero. llvm-svn: 123956	2011-01-21 01:12:09 +00:00
Evan Cheng	52fe62c996	Don't be overly aggressive with CSE of "ldr constantpool". If it's a pc-relative value, the "add pc" must be CSE'ed at the same time. We could follow the same approach as T2 by adding pseudo instructions that combine the ldr + "add pc". But the better approach is to use movw + movt (which I will enable soon), so I'll leave this as a TODO. llvm-svn: 123949	2011-01-20 23:55:07 +00:00
Tobias Grosser	ea8985cc25	Implement requiredTransitive The PassManager did not implement the transitivity of requiredTransitive. This was unnoticed since 2006. llvm-svn: 123942	2011-01-20 21:03:22 +00:00
Bruno Cardoso Lopes	6aeb2e320f	Fix the encoding and parsing of clrex instruction llvm-svn: 123936	2011-01-20 19:18:32 +00:00
Bruno Cardoso Lopes	c0f87c11d6	Change instruction names for consistency llvm-svn: 123930	2011-01-20 18:36:07 +00:00
Bruno Cardoso Lopes	5f06c0aa3b	Add cdp/cdp2 instructions for thumb/thumb2 llvm-svn: 123929	2011-01-20 18:32:09 +00:00
Bruno Cardoso Lopes	3584c02d83	- Use a more appropriate name for Owen's ARM Parser isMCR hack since the same operands can be present in cdp/cdp2 instructions. Also increase the hack with cdp/cdp2 instructions. - Fix the encoding of cdp/cdp2 instructions for ARM (no thumb and thumb2 yet) and add testcases for t hem. llvm-svn: 123927	2011-01-20 18:06:58 +00:00
Jakob Stoklund Olesen	60743dd45f	SplitKit requires that all defs are in place before calling useIntv(). The value mapping gets confused about which original values have multiple new definitions so they may need phi insertions. This could probably be simplified by letting enterIntvBefore() take a live range to be added following the instruction. As long as the range stays inside the same basic block, value mapping shouldn't be a problem. llvm-svn: 123926	2011-01-20 17:45:23 +00:00
Jakob Stoklund Olesen	360b0921ac	Add LiveIntervalMap::dumpCache() to print out the cache used by the ssa update algorithm. llvm-svn: 123925	2011-01-20 17:45:20 +00:00
Bruno Cardoso Lopes	75712e8a7a	Add mcr2 and mrc2 support to thumb2 targets llvm-svn: 123919	2011-01-20 16:58:48 +00:00
Bruno Cardoso Lopes	f377d1721e	Add mcr* and mr*c support to thumb targets llvm-svn: 123917	2011-01-20 16:35:57 +00:00
Kalle Raiskila	070fb5e54d	Allow sign-extending of i8 and i16 to i128 on SPU. llvm-svn: 123912	2011-01-20 15:49:06 +00:00
Duncan Sands	1faa8712c9	At -O123 the early-cse pass is run before instcombine has run. According to my auto-simplier the transform most missed by early-cse is (zext X) != 0 -> X != 0. This patch adds this transform and some related logic to InstructionSimplify and removes some of the logic from instcombine (unfortunately not all because there are several situations in which instcombine can improve things by making new instructions, whereas instsimplify is not allowed to do this). At -O2 this often results in more than 15% more simplifications by early-cse, and results in hundreds of lines of bitcode being eliminated from the testsuite. I did see some small negative effects in the testsuite, for example a few additional instructions in three programs. One program, 483.xalancbmk, got an additional 35 instructions, which seems to be due to a function getting an additional instruction and then being inlined all over the place. llvm-svn: 123911	2011-01-20 13:21:55 +00:00
Bruno Cardoso Lopes	0312bb222e	Refactor mcr* and mr*c instructions into classes with the same encoding. No functionality change. llvm-svn: 123910	2011-01-20 13:17:59 +00:00
Eric Christopher	d17b5b7988	My editor's indent went crazy. Fix. llvm-svn: 123909	2011-01-20 08:56:34 +00:00
Eric Christopher	f7579ff174	Expand invalid return values for umulo and smulo. Handle these similarly to add/sub by doing the normal operation and then checking for overflow afterwards. This generally relies on the DAG handling the later invalid operations as well. Fixes the 64-bit part of rdar://8622122 and rdar://8774702. llvm-svn: 123908	2011-01-20 08:54:28 +00:00
Evan Cheng	d9fdc9771e	Correct itinerary entry for t2MOV_pic_ga_add_pc. llvm-svn: 123907	2011-01-20 08:43:03 +00:00
Evan Cheng	6dc21c7358	Sorry, several patches in one. TargetInstrInfo: Change produceSameValue() to take MachineRegisterInfo as an optional argument. When in SSA form, targets can use it to make more aggressive equality analysis. Machine LICM: 1. Eliminate isLoadFromConstantMemory, use MI.isInvariantLoad instead. 2. Fix a bug which prevent CSE of instructions which are not re-materializable. 3. Use improved form of produceSameValue. ARM: 1. Teach ARM produceSameValue to look pass some PIC labels. 2. Look for operands from different loads of different constant pool entries which have same values. 3. Re-implement PIC GA materialization using movw + movt. Combine the pair with a "add pc" or "ldr [pc]" to form pseudo instructions. This makes it possible to re-materialize the instruction, allow machine LICM to hoist the set of instructions out of the loop and make it possible to CSE them. It's a bit hacky, but it significantly improve code quality. 4. Some minor bug fixes as well. With the fixes, using movw + movt to materialize GAs significantly outperform the load from constantpool method. 186.crafty and 255.vortex improved > 20%, 254.gap and 176.gcc ~10%. llvm-svn: 123905	2011-01-20 08:34:58 +00:00
Michael J. Spencer	d54bad63f7	Object: Add ELF support. llvm-svn: 123896	2011-01-20 06:38:47 +00:00
Michael J. Spencer	d4d4ab7712	Object: Add COFF Support. llvm-svn: 123895	2011-01-20 06:38:34 +00:00
Andrew Trick	bf079d8831	Selection DAG scheduler register pressure heuristic fixes. Added a check for already live regs before claiming HighRegPressure. Fixed a few cases of checking the wrong number of successors. Added some tracing until these heuristics are better understood. llvm-svn: 123892	2011-01-20 06:21:59 +00:00
Jakob Stoklund Olesen	ea33059ff5	Check that a live range exists before shortening it. This fixes PR8989. The live range may have been deleted earlier because of rematerialization. llvm-svn: 123891	2011-01-20 06:20:02 +00:00
Jakob Stoklund Olesen	bb94da29b2	Add hidden -verify-coalescing to run the machine code verifier before and after register coalescing. llvm-svn: 123890	2011-01-20 06:20:00 +00:00
Venkatraman Govindaraju	5280b2876f	Sparc backend: Implements a delay slot filler that attempt to fill delay slots with useful instructions. llvm-svn: 123884	2011-01-20 05:08:26 +00:00
Cameron Zwarich	bf70725ae3	Update a comment. llvm-svn: 123879	2011-01-20 03:58:43 +00:00
Jakob Stoklund Olesen	c387993232	Fix bug found by new clang warning. llvm-svn: 123872	2011-01-20 02:43:19 +00:00
Eric Christopher	58f8058502	Use only one API at a time. llvm-svn: 123866	2011-01-20 01:29:23 +00:00
Eric Christopher	1b0e5debb4	If we can, lower the multiply part of a umulo/smulo call to a libcall with an invalid type then split the result and perform the overflow check normally. Fixes the 32-bit parts of rdar://8622122 and rdar://8774702. llvm-svn: 123864	2011-01-20 00:29:24 +00:00
Devang Patel	729c5e59af	Fix debug info for merged global. llvm-svn: 123862	2011-01-20 00:02:16 +00:00
Jakob Stoklund Olesen	69294ae8d7	Divert Hopfield network debug output. It is very noisy. llvm-svn: 123859	2011-01-19 23:14:59 +00:00
Jakob Stoklund Olesen	c47bd85657	Don't accidentally leave small gaps in the live ranges when leaving the active interval after an instruction. The leaveIntvAfter() method only adds liveness from the instruction's boundary index to the inserted copy. Ideally, SplitKit should be smarter about this, perhaps by combining useIntv() and leaveIntvAfter() into one method that guarantees continuity. llvm-svn: 123858	2011-01-19 23:14:56 +00:00
Jim Grosbach	868d2877dc	Make sure to propogate the error code when we fail to parse a modifier. llvm-svn: 123857	2011-01-19 23:06:07 +00:00
Devang Patel	574e10fa1e	Fix register address expression. Patch by Ken Dyck. llvm-svn: 123856	2011-01-19 23:04:47 +00:00
Jakob Stoklund Olesen	77738dd84e	Implement RAGreedy::splitAroundRegion and remove loop splitting. Region splitting includes loop splitting as a subset, and it is more generic. The splitting heuristics for variables that are live in more than one block are now: 1. Try to create a region that covers multiple basic blocks. 2. Try to create a new live range for each block with multiple uses. 3. Spill. Steps 2 and 3 are similar to what the standard spiller is doing. llvm-svn: 123853	2011-01-19 22:11:48 +00:00
Nick Lewycky	51c13384f5	Similarly, analyze truncate through multiply. llvm-svn: 123842	2011-01-19 18:56:00 +00:00
Nick Lewycky	9867e58096	Add a missed SCEV fold that is required to continue analyzing the IR produced by indvars through the scev expander. trunc(add x, y) --> add(trunc x, y). Currently SCEV largely folds the other way which is probably wrong, but preserved to minimize churn. Instcombine doesn't do this fold either, demonstrating a missed optz'n opportunity on code doing add+trunc+add. llvm-svn: 123838	2011-01-19 16:59:46 +00:00
Bruno Cardoso Lopes	0f7a30b1cb	Fix the encoding of mrrc and mcrr family of instructions. Also add testcases for mcr and mrc llvm-svn: 123837	2011-01-19 16:56:52 +00:00
Rafael Espindola	ce499efe1d	Add unnamed_addr when we can show that address of a global is not used. llvm-svn: 123834	2011-01-19 16:32:21 +00:00
Nick Lewycky	5a538b62ca	Add a missing SCEV simplification sext(zext x) --> zext x. llvm-svn: 123832	2011-01-19 15:56:12 +00:00

1 2 3 4 5 ...

45036 Commits