llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-19 01:27:40 +00:00

Author	SHA1	Message	Date
Tim Northover	29e73e0f55	Refactor isInTailCallPosition handling This change came about primarily because of two issues in the existing code. Neither of: define i64 @test1(i64 %val) { %in = trunc i64 %val to i32 tail call i32 @ret32(i32 returned %in) ret i64 %val } define i64 @test2(i64 %val) { tail call i32 @ret32(i32 returned undef) ret i32 42 } should be tail calls, and the function sameNoopInput is responsible. The main problem is that it is completely symmetric in the "tail call" and "ret" value, but in reality different things are allowed on each side. For these cases: 1. Any truncation should lead to a larger value being generated by "tail call" than needed by "ret". 2. Undef should only be allowed as a source for ret, not as a result of the call. Along the way I noticed that a mismatch between what this function treats as a valid truncation and what the backends see can lead to invalid calls as well (see x86-32 test case). This patch refactors the code so that instead of being based primarily on values which it recurses into when necessary, it starts by inspecting the type and considers each fundamental slot that the backend will see in turn. For example, given a pathological function that returned {{}, {{}, i32, {}}, i32} we would consider each "real" i32 in turn, and ask if it passes through unchanged. This is much closer to what the backend sees as a result of ComputeValueVTs. Aside from the bug fixes, this eliminates the recursion that's going on and, I believe, makes the bulk of the code significantly easier to understand. The trade-off is the nasty iterators needed to find the real types inside a returned value. llvm-svn: 187787	2013-08-06 09:12:35 +00:00
Serge Pavlov	a57ba3eab8	Unbreak Debug build on Windows llvm-svn: 187786	2013-08-06 08:44:18 +00:00
Craig Topper	4a0f26ca0c	Simplify vector lane handling math a bit. No functional change intended. llvm-svn: 187783	2013-08-06 07:23:12 +00:00
Craig Topper	70290dd386	Simplify math a little bit. llvm-svn: 187781	2013-08-06 06:54:25 +00:00
NAKAMURA Takumi	0eb9242c56	Target/*/CMakeLists.txt: Add the dependency to CommonTableGen explicitly for each corresponding CodeGen. Without explicit dependencies, both per-file action and in-CommonTableGen action could run in parallel. It races to emit .inc files simultaneously. llvm-svn: 187780	2013-08-06 06:38:37 +00:00
Craig Topper	7302a71ae6	Replace EVT with MVT in isHorizontalBinOp as it is only called with legal types. llvm-svn: 187779	2013-08-06 06:05:05 +00:00
NAKAMURA Takumi	c42a37f99e	AsmPrinter/CMakeLists.txt: Add explicit dependency to intrinsics_gen here. llvm-svn: 187778	2013-08-06 05:56:39 +00:00
NAKAMURA Takumi	0790962a19	Option/CMakeLists.txt: Don't use target_link_libraries. LLVMBuild knows dependencies. llvm-svn: 187777	2013-08-06 05:56:32 +00:00
Craig Topper	8ee8f7ec58	Put an llvm_unreachable at the end of getSplatIndex as its loop should never find all undef elements. llvm-svn: 187775	2013-08-06 05:41:22 +00:00
Craig Topper	e1c2f3da6c	Check against >= 0 instead of != -1 in getSplatIndex because it generally compiles to better code and is equivalent for shuffle indices. llvm-svn: 187774	2013-08-06 05:07:37 +00:00
Craig Topper	936da35b0d	Remove trailing whitespace and fix an 80-column violation. No functional change. llvm-svn: 187773	2013-08-06 05:01:21 +00:00
Craig Topper	c020365ad6	Simplify code slightly. No functional change. llvm-svn: 187771	2013-08-06 04:12:40 +00:00
Tom Stellard	e4e3be6f50	Factor FlattenCFG out from SimplifyCFG Patch by: Mei Ye llvm-svn: 187764	2013-08-06 02:43:45 +00:00
Eric Christopher	c5f9312717	Allow 4 as a valid debug info version. llvm-svn: 187763	2013-08-06 01:38:27 +00:00
Shuxin Yang	2ee8d113cd	Add some comments to the LTOCodeGenerator class llvm-svn: 187761	2013-08-06 00:45:32 +00:00
Matt Arsenault	de2f38a2db	Fix missing -*- C++ -*-s llvm-svn: 187758	2013-08-06 00:16:21 +00:00
Bill Wendling	21cb95c7bf	FileCheckize some of the testcases. llvm-svn: 187756	2013-08-05 23:43:18 +00:00
Bill Wendling	1ce6e81da8	Fix grammar. llvm-svn: 187755	2013-08-05 23:29:16 +00:00
Tom Stellard	f94818ae61	R600/SI: Add missing test for r187749 llvm-svn: 187754	2013-08-05 22:45:56 +00:00
Eric Christopher	77c487801a	Recommit previous cleanup with a fix for c++98 ambiguity. llvm-svn: 187752	2013-08-05 22:32:28 +00:00
Tom Stellard	a96032b0e7	R600: Implement TargetLowering::getVectorIdxTy() We use MVT::i32 for the vector index type, because we use 32-bit operations to calculate offsets when dynamically indexing vectors. llvm-svn: 187749	2013-08-05 22:22:07 +00:00
Tom Stellard	fdf221305c	TargetLowering: Add getVectorIdxTy() function v2 This virtual function can be implemented by targets to specify the type to use for the index operand of INSERT_VECTOR_ELT, EXTRACT_VECTOR_ELT, INSERT_SUBVECTOR, EXTRACT_SUBVECTOR. The default implementation returns the result from TargetLowering::getPointerTy(). The previous code was using TargetLowering::getPointerTy() for vector indices, because this is guaranteed to be legal on all targets. However, using TargetLowering::getPointerTy() can be a problem for targets with pointer sizes that differ across address spaces. On such targets, when vectors need to be loaded or stored to an address space other than the default 'zero' address space (which is the address space assumed by TargetLowering::getPointerTy()), having an index that is a different size than the pointer can lead to inefficient pointer calculations (e.g. 64-bit adds for a 32-bit address space). There is no intended functionality change with this patch. llvm-svn: 187748	2013-08-05 22:22:01 +00:00
Eric Christopher	1c0455141c	Revert "Use existing builtin hashing functions to make this routine more" This reverts commit r187745. llvm-svn: 187747	2013-08-05 22:07:30 +00:00
Eric Christopher	8884e7fbaa	Use existing builtin hashing functions to make this routine more simple. llvm-svn: 187745	2013-08-05 22:00:50 +00:00
Eric Christopher	57a9194e5f	Change parent hashing algorithm to be non-recursive and elaborate greatly on many comments in the code. llvm-svn: 187742	2013-08-05 21:40:57 +00:00
Michael Gottesman	9c47125a1e	[bugpoint] Allow the user to specify the path to opt on the commandline. llvm-svn: 187739	2013-08-05 21:07:07 +00:00
Peter Collingbourne	42b450c977	Introduce an optimisation for special case lists with large numbers of literal entries. Our internal regex implementation does not cope with large numbers of anchors very efficiently. Given a ~3600-entry special case list, regex compilation can take on the order of seconds. This patch solves the problem for the special case of patterns matching literal global names (i.e. patterns with no regex metacharacters). Rather than forming regexes from literal global name patterns, add them to a StringSet which is checked before matching against the regex. This reduces regex compilation time by an order of roughly thousands when reading the aforementioned special case list, according to a completely unscientific study. No test cases. I figure that any new tests for this code should check that regex metacharacters are properly recognised. However, I could not find any documentation which documents the fact that the syntax of global names in special case lists is based on regexes. The extent to which regex syntax is supported in special case lists should probably be decided on/documented before writing tests. Differential Revision: http://llvm-reviews.chandlerc.com/D1150 llvm-svn: 187732	2013-08-05 17:48:04 +00:00
Peter Collingbourne	8df077ca5e	Introduce Regex::isLiteralERE function. This will be used to implement an optimisation for literal entries in special case lists. Differential Revision: http://llvm-reviews.chandlerc.com/D1278 llvm-svn: 187731	2013-08-05 17:47:59 +00:00
Aaron Ballman	5633b24d1d	Silencing an MSVC11 type conversion warning. llvm-svn: 187727	2013-08-05 13:47:03 +00:00
Alexey Samsonov	186358278d	80-cols llvm-svn: 187725	2013-08-05 13:19:49 +00:00
Elena Demikhovsky	c018c20682	LLVM Interpreter: This patch implements vector support for cast operations (zext, sext, uitofp, sitofp, trunc, fpext, fptosi, fptrunc, bitcast) and shift operations (shl, ashr, lshr) for integer and floating point data types. Added tests. Done by Yuri Veselov (mailto:Yuri.Veselov@intel.com). llvm-svn: 187724	2013-08-05 12:17:06 +00:00
Richard Sandiford	39f379d037	[SystemZ] Use BRCT and BRCTG to eliminate add-&-compare sequences This patch just uses a peephole test for "add; compare; branch" sequences within a single block. The IR optimizers already convert loops to decrement-and-branch-on-nonzero form in some cases, so even this simplistic test triggers many times during a clang bootstrap and projects/test-suite run. It looks like there are still cases where we need to more strongly prefer branches on nonzero though. E.g. I saw a case where a loop that started out with a check for 0 ended up with a check for -1. I'll try to look at that sometime. I ended up adding the Reference class because MachineInstr::readsRegister() doesn't check for subregisters (by design, as far as I could tell). llvm-svn: 187723	2013-08-05 11:23:46 +00:00
Benjamin Kramer	8730825a6d	Don't leak passes if added outside of the area determined by Started/Stopped flags. llvm-svn: 187722	2013-08-05 11:11:11 +00:00
Richard Sandiford	92b3a35a0a	[SystemZ] Add definitions for BRCT and BRCTG llvm-svn: 187721	2013-08-05 11:07:38 +00:00
Richard Sandiford	eefa00392f	[SystemZ] Use LOAD AND TEST to eliminate comparisons against zero llvm-svn: 187720	2013-08-05 11:03:20 +00:00
Richard Sandiford	18ee3fd7df	[SystemZ] Add LOAD AND TEST instructions Just the definitions and MC support. The next patch uses them for codegen. llvm-svn: 187719	2013-08-05 11:00:53 +00:00
Richard Sandiford	b7d4e305e3	[SystemZ] Split out comparison elimination into a separate pass Perhaps predictably, doing comparison elimination on the fly during SystemZLongBranch turned out to be a bad idea. The next patches make use of LOAD AND TEST and BRANCH ON COUNT, both of which require changes to earlier instructions. No functionality change intended. llvm-svn: 187718	2013-08-05 10:58:53 +00:00
Elena Demikhovsky	cb3f9da2e3	AVX-512 set: added mask operations, lowering BUILD_VECTOR for i1 vector types. Added intrinsics and tests. llvm-svn: 187717	2013-08-05 08:52:21 +00:00
Nadav Rotem	cc36bd142d	Update the release notes about the status of the vectorizers. llvm-svn: 187714	2013-08-05 04:31:05 +00:00
Nadav Rotem	3cf6d1b3c9	Update the docs. llvm-svn: 187713	2013-08-05 04:27:34 +00:00
Reed Kotler	d5b7892552	Add the saving of S2. This is needed for some of the floating point helper functions. This can be optimized out later when the remaining parts of the helper function work is moved into the Mips16HardFloat pass. For now it forces us to use the 32 bit save/restore instructions instead of the 16 bit ones. llvm-svn: 187712	2013-08-04 23:56:53 +00:00
Bob Wilson	24b2d75ac5	Remove "lto_on_osx" xfails, now that -rdynamic works on Darwin. Note that this will require a recent version of the linker for Darwin builds with LTO to pass these tests. llvm-svn: 187711	2013-08-04 23:55:24 +00:00
Bob Wilson	014d6cf72a	Build with the $RDYNAMIC flag on Darwin as well as other platforms. Part of <rdar://problem/14620988> llvm-svn: 187710	2013-08-04 22:06:11 +00:00
Benjamin Kramer	c63386d01a	X86: Turn fp selects into mask operations. double test(double a, double b, double c, double d) { return a<b ? c : d; } before: _test: ucomisd %xmm0, %xmm1 ja LBB0_2 movaps %xmm3, %xmm2 LBB0_2: movaps %xmm2, %xmm0 after: _test: cmpltsd %xmm1, %xmm0 andpd %xmm0, %xmm2 andnpd %xmm3, %xmm0 orpd %xmm2, %xmm0 Small speedup on Benchmarks/SmallPT llvm-svn: 187706	2013-08-04 12:05:16 +00:00
Elena Demikhovsky	2f33e9fa89	AVX-512 set: added VEXTRACTPS instruction llvm-svn: 187705	2013-08-04 10:46:07 +00:00
Tim Northover	da32ed4814	X86: specify CPU on new test to fix atom buildbot Apparently Atoms use lea for stack adjustment, which we weren't looking for. llvm-svn: 187704	2013-08-04 10:00:45 +00:00
Tim Northover	d7e748d087	X86: correct tail return address calculation Due to the weird and wonderful usual arithmetic conversions, some calculations involving negative values were getting performed in uint32_t and then promoted to int64_t, which is really not a good idea. Patch by Katsuhiro Ueno. llvm-svn: 187703	2013-08-04 09:35:57 +00:00
Benjamin Kramer	1a7f3f9391	AsmParser: Store MacroLikeBodies on the side so they don't get leaked. llvm-svn: 187702	2013-08-04 09:06:29 +00:00
Reed Kotler	338c130a3e	Clean up code for Mips16 large frame handling. llvm-svn: 187701	2013-08-04 01:13:25 +00:00
Benjamin Kramer	a913e72728	PPCAsmParser: Stop leaking names. Store them in a place that gets cleaned up properly. llvm-svn: 187700	2013-08-03 22:43:29 +00:00
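
The r187703 entry above describes calculations on negative values being performed in uint32_t and only then promoted to int64_t. A minimal C++ sketch of that pitfall follows. The function names and the sample values are hypothetical and are not taken from the patch, and the sketch assumes a platform where int is 32 bits wide.

```cpp
#include <cstdint>

// Hypothetical helpers for illustration only; not code from the LLVM patch.
// Assumes int is 32 bits, so uint32_t * int32_t is evaluated as unsigned int.

// Problematic shape: delta is converted to uint32_t, the product wraps
// modulo 2^32, and only the wrapped result is widened to int64_t.
int64_t scaledOffsetWrapped(uint32_t slotSize, int32_t delta) {
  return slotSize * delta;
}

// Safer shape: widen to a signed 64-bit type before doing the arithmetic.
int64_t scaledOffsetWidened(uint32_t slotSize, int32_t delta) {
  return static_cast<int64_t>(slotSize) * delta;
}
```

Under the stated assumption, calling both with slotSize = 8 and delta = -2 gives 4294967280 from the first function and -16 from the second, which is the kind of discrepancy the commit message warns about.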


94561 Commits