llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-11 13:37:07 +00:00

Author	SHA1	Message	Date
NAKAMURA Takumi	0dd5b347b1	[CMake] Add RuntimeDyld to libdeps corresponding to r229343. llvm-svn: 229351	2015-02-16 02:13:30 +00:00
Chandler Carruth	a34a4a834e	[x86] Teach the 128-bit vector shuffle lowering routines to take advantage of the existence of a reasonable blend instruction. The 256-bit vector shuffle lowering has leveraged the general technique of decomposed shuffles and blends for quite some time, but this never made it back into the 128-bit code, and there are a large number of patterns where this is substantially better. For example, this removes almost all domain crossing in vector shuffles that involve some blend and some permutation with SSE4.1 and later. See the massive reduction in 'shufps' for integer test cases in this commit. This isn't perfect yet for a few reasons: 1) The v8i16 shuffle lowering continues to plague me. We don't always form an unpack-based blend when that would be better. But the wins pretty drastically outstrip the losses here. 2) The v16i8 shuffle lowering is just a disaster here. I never went and implemented blend support here for some terrible reason. I'll do that next probably. I've not updated it for now. More variations on this technique are coming as well -- we don't shuffle-into-unpack or shuffle-into-palignr, both of which would also be profitable. Note that some test cases grow significantly in the number of instructions, but I expect to actually be faster. We use pshufd+pshufd+blendw instead of a single shufps, but the pshufd's are very likely to pipeline well (two ports on most modern intel chips) and the blend is a very fast instruction. The domain switch penalty will essentially always be more than a blend instruction, which is the only increase in tree height. llvm-svn: 229350	2015-02-16 01:52:02 +00:00
Chandler Carruth	572fa3dbba	[x86] Clean up a few test cases with the update script. NFC llvm-svn: 229349	2015-02-16 01:39:50 +00:00
Craig Topper	916052961d	[X86] Remove gcc builtins for AVX2 psll_dq and psrl_dq intrinsics. Clang no longer needs them. llvm-svn: 229347	2015-02-16 00:42:36 +00:00
Filipe Cabecinhas	e4564d63bb	[Bitcode reader] Fix a few assertions when reading invalid files Summary: When creating {insert,extract}value instructions from a BitcodeReader, we weren't verifying the fields were valid. Bugs found with afl-fuzz Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7325 llvm-svn: 229345	2015-02-16 00:03:11 +00:00
Lang Hames	dbb9827d2f	[ExecutionEngine] Fix dependence issue by moving RTDyldMemoryManager into RuntimeDyld. This should fix http://llvm.org/PR22593. llvm-svn: 229343	2015-02-15 23:22:43 +00:00
Benjamin Kramer	d21608f72f	MinGW's snprintf is not exposed through std::. llvm-svn: 229342	2015-02-15 23:17:20 +00:00
Aaron Ballman	0b45511a2e	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition. llvm-svn: 229340	2015-02-15 22:54:22 +00:00
Benjamin Kramer	99efbfba33	Format: Modernize using variadic templates. Introduces a subset of C++14 integer sequences in STLExtras. This is just enough to support unpacking a std::tuple into the arguments of snprintf, we can add more of it when it's actually needed. Also removes an ancient macro hack that leaks a macro into the global namespace. Clean up users that made use of the convenient hack. llvm-svn: 229337	2015-02-15 22:15:41 +00:00
Aaron Ballman	0e19b5d670	Removing LLVM_EXPLICIT, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition. llvm-svn: 229335	2015-02-15 22:00:20 +00:00
Aaron Ballman	f48307f858	Since MSVC 1800 is our lowest common denominator, we don't need an explicit check for it in these macros any longer; NFC. llvm-svn: 229333	2015-02-15 21:21:52 +00:00
Benjamin Kramer	1256b8262c	CommandLine: Use variadic templates to simplify opt constructors. llvm-svn: 229332	2015-02-15 21:11:25 +00:00
Zachary Turner	1008f9bfa4	llvm-pdbdump: Fix warning caused by missing sentinel value. llvm-svn: 229331	2015-02-15 20:37:44 +00:00
Zachary Turner	e9cd3c4989	llvm-pdbdump: Add flags controlling the type of values to dump. llvm-svn: 229330	2015-02-15 20:27:53 +00:00
Benjamin Kramer	7ce6e93852	FoldingSet: Replace faux variadics with real variadics. NFC. llvm-svn: 229328	2015-02-15 20:12:17 +00:00
Benjamin Kramer	c1b3bb4e32	Remove LLVM_HAS_VARIADIC_TEMPLATES and all the faux variadic workarounds guarded by it. We no longer support compilers without variadic template support. llvm-svn: 229324	2015-02-15 19:34:28 +00:00
Benjamin Kramer	4579422ac9	Update the docs to require at least MSVC 2013. llvm-svn: 229323	2015-02-15 19:34:17 +00:00
Philip Reames	673440a8bc	Revert 229175 This change is a logical suspect in 22587 and 22590. Given it's of minimal importanance and I can't get clang to build on my home machine, I'm reverting so that I can deal with this next week. llvm-svn: 229322	2015-02-15 19:07:31 +00:00
Simon Pilgrim	9849f0e2cd	Added (still inefficient) shuffle test case for PR21138 llvm-svn: 229321	2015-02-15 18:21:39 +00:00
NAKAMURA Takumi	3e2afcf188	Reapply r229185(cbieneman) -- Raising minimum required Visual Studio version to 2013. This is based on the discussions on: [LLVMdev] [RFC] Raising LLVM minimum required MSVC version to 2013 for trunk llvm-svn: 229320	2015-02-15 17:53:10 +00:00
Hal Finkel	afd400b12e	[ADCE] Convert another loop for a range-based for We can use a range-based for for the operands loop too; NFC. llvm-svn: 229319	2015-02-15 15:51:25 +00:00
Hal Finkel	bc23b1a680	[ADCE] Use inst_range and range-based fors Convert a few loops to range-based fors; NFC. llvm-svn: 229318	2015-02-15 15:51:23 +00:00
Hal Finkel	20dadad9f5	[ADCE] Fix formatting of pointer types We prefer to put the * with the variable, not with the type; NFC. llvm-svn: 229317	2015-02-15 15:47:52 +00:00
Hal Finkel	f67e71db4d	[ADCE] Fix capitalization of another local variable Bring another local variable in compliance with our naming conventions, NFC. llvm-svn: 229316	2015-02-15 15:45:30 +00:00
Hal Finkel	6660cfbeba	[ADCE] Fix capitalization of some local variables Bring some local variables in compliance with our naming conventions, NFC. llvm-svn: 229315	2015-02-15 15:45:28 +00:00
Simon Pilgrim	dc017ce5f6	Added some test cases of missed opportunities to use unpckl/unpckh shuffles llvm-svn: 229313	2015-02-15 15:07:45 +00:00
Simon Pilgrim	9264c389f1	Coding style fixes to recent patches. NFC. llvm-svn: 229312	2015-02-15 14:19:29 +00:00
Simon Pilgrim	ddbf019542	[X86][AVX2] vpslldq/vpsrldq byte shifts for AVX2 This patch refactors the existing lowerVectorShuffleAsByteShift function to add support for 256-bit vectors on AVX2 targets. It also fixes a tablegen issue that prevented the lowering of vpslldq/vpsrldq vec256 instructions. Differential Revision: http://reviews.llvm.org/D7596 llvm-svn: 229311	2015-02-15 13:19:52 +00:00
Chandler Carruth	eb48186ef8	[x86] Add the test case from PR22412, we now get this right even with the new vector shuffle legality. llvm-svn: 229310	2015-02-15 12:45:05 +00:00
Chandler Carruth	2fac6b1c98	[x86] Teach the decomposed shuffle/blend lowering to use an early blend when that will allow it to lower with a single permute instead of multiple permutes. It tries to detect when it will only have to do a single permute in either case to maximize folding of loads and such. This cuts a lot of the avx2 shuffle permute counts in half. =] llvm-svn: 229309	2015-02-15 12:42:15 +00:00
Chandler Carruth	0b684c6980	[SDAG] Teach the SelectionDAG to canonicalize vector shuffles of splats directly into blends of the splats. These patterns show up even very late in the vector shuffle lowering where we don't have any chance for DAG combining to kick in, and blending is a tremendously simpler operation to model. By coercing the shuffle into a blend we can much more easily match and lower shuffles of splats. Immediately with this change there are significantly more blends being matched in the x86 vector shuffle lowering. llvm-svn: 229308	2015-02-15 12:18:12 +00:00
Chandler Carruth	fa7b9ba328	[x86] Teach the shuffle mask equivalence test to look through build vectors and detect equivalent inputs. This lets the code match unpck-style instructions when only one of the inputs are lined up but the other input is a splat and so which lanes we pull from doesn't matter. Today, this doesn't really happen, but just by accident. I have a patch that normalizes how we shuffle splats, and with that patch this will be necessary for a lot of the mask equivalence tests to work. I don't really know how to write a test case for this specific change until the other change lands though. llvm-svn: 229307	2015-02-15 12:07:55 +00:00
Chandler Carruth	941ec691e8	[x86] Tweak the ordering of unpack matching vs. element insertion, and don't try to do element insertion for non-zero-index floating point vectors. We don't have any useful patterns or lowering for element insertion into high elements of a floating point vector, and the generic shuffle lowering will end up being better -- namely it will fall back to unpck. But we should try to handle other forms of element insertion before matching unpck patterns. While this doesn't matter much right now, I'm working on a patch that makes unpck matching much more powerful, and that patch will break without this re-ordering. llvm-svn: 229306	2015-02-15 12:01:14 +00:00
Arnaud A. de Grandmaison	1c6984dc94	[PBQP] Assert conservativelly allocatable nodes are spilled by choice. llvm-svn: 229302	2015-02-15 10:35:31 +00:00
Chandler Carruth	1d8146cec4	[x86] Stop shuffling zero vectors. =] I was somewhat surprised this pattern really came up, but it does. It seems better to just directly handle it than try to special case every place where we end up forming a shuffle that devolves to a shuffle of a zero vector. llvm-svn: 229301	2015-02-15 10:34:52 +00:00
Chandler Carruth	c868c80ecb	[x86] Use a more helpful parenthesizing of these comparisons. Silences a -Wparentheses complaint from GCC. llvm-svn: 229300	2015-02-15 10:15:20 +00:00
Chandler Carruth	5c0c778648	[x86] When splitting 256-bit vectors into 128-bit vectors, don't extract subvectors from buildvectors. That doesn't really make any sense and it breaks all of the down-stream matching of buildvectors to cleverly lower shuffles. With this, we now get the shift-based lowering of 256-bit vector shuffles with AVX1 when we split them into 128-bit vectors. We also do much better on the zero-extension patterns, although there remains quite a bit of room for improvement here. llvm-svn: 229299	2015-02-15 10:12:02 +00:00
Chandler Carruth	5cf2555d91	[x86] Make computing the zeroable elements slightly more powerful, at least in theory. I don't actually have a test case that benefits from this, but theoretically, it could come up, and I don't want to try to think about whether this is the culprit or something else is, so I'd rather just make this code powerful. =/ Makes me sad that I can't really test it though. llvm-svn: 229298	2015-02-15 09:33:36 +00:00
Michael Kuperstein	c9049f9057	gold-plugin: fix test to allow default visibility on local symbols GNU ld sets default, not hidden, visibility on local symbols. Having default or hidden visibility on local symbols makes no difference in run-time behavior. Patch by: H.J. Lu <hjl.tools@gmail.com> llvm-svn: 229297	2015-02-15 09:32:30 +00:00
Chandler Carruth	65b90c2fa8	[x86] Update some tests with the latest version of my script and llc. This mostly adds some shuffle decode comments and cleans up indentation. llvm-svn: 229296	2015-02-15 09:26:15 +00:00
Chandler Carruth	8b98b98e16	[x86] Add a slight variation on some of the other generic shuffle lowerings -- one which decomposes into an initial blend followed by a permute. Particularly on newer chips, blends are handled independently of shuffles and so this is much less bottlenecked on the single port that floating point shuffles are executed with on Intel. I'll be adding this lowering to a bunch of other code paths in subsequent commits to handle still more places where we can effectively leverage blends when they're available in the ISA. llvm-svn: 229292	2015-02-15 08:26:30 +00:00
Elena Demikhovsky	79bb080d9c	Enabled cost calculation for masked memory operations. We already have implementation for cost calculation for masked memory operations. I just call it from the loop vectorizer. llvm-svn: 229290	2015-02-15 08:08:48 +00:00
Craig Topper	be42a00218	[X86] Add assembly parser support for mnemonic aliases for AVX-512 vpcmp instructions. llvm-svn: 229287	2015-02-15 07:13:48 +00:00
Chandler Carruth	85a4ad9fb4	[x86] Add a test case for PR22390 which was a dup of PR22377 and fixed by r229285. This is a nice different test case though, so I'd like to have the extra testing of these kinds of patterns. llvm-svn: 229286	2015-02-15 07:05:50 +00:00
Chandler Carruth	6153ae3921	[x86] Fix PR22377, a regression with the new vector shuffle legality test. This was just a matter of the DAG combine for vector shuffles being too aggressive. This is a bit of a grey area, but I think generally if we can re-use intermediate shuffles, we should. Certainly, given the test cases I have available, this seems like the right call. llvm-svn: 229285	2015-02-15 07:01:10 +00:00
Chandler Carruth	635ad2f50d	[x86] Switch a collection of tests explicitly to the new vector shuffle legality test (essentially, everything is legal). I'm planning to make this the default shortly, but I'd like to fix a collection of the bugs it exposes first, and this will let me easily test them. It also showcases both the improvements and a few of the regressions triggered by the change. The biggest improvements by far are the significantly reduced shuffling and domain crossing in the combining test case. The biggest regressions are missing some clever blending patterns. llvm-svn: 229284	2015-02-15 06:37:21 +00:00
Chandler Carruth	83f63dfef3	[x86] Remove the now-default-on flag for the new vector shuffle lowering strategy from a bunch of tests. llvm-svn: 229283	2015-02-15 06:20:51 +00:00
Craig Topper	e9ad59aeaf	[X86] Add assembler predicates for the rest of the AVX512 feature flags. This makes the assembly matching consistent across all AVX512 instructions. Without this we were allowing some AVX512 instructions to be parsed always, but not the foundation instructions. llvm-svn: 229280	2015-02-15 04:54:55 +00:00
Craig Topper	dc76cc8405	[X86] Add the remaining 11 possible exact ModRM formats. This makes their encodings linear which can then be used to simplify some other code. llvm-svn: 229279	2015-02-15 04:16:44 +00:00
David Blaikie	9f95d71989	FileCheck-ize a test to make it easier to migrate to typeless pointers llvm-svn: 229278	2015-02-15 04:14:00 +00:00

1 2 3 4 5 ...

113385 Commits