llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-26 13:36:28 +00:00

Author	SHA1	Message	Date
Tobias Grosser	acbe458a3f	Add half support to LLVM (for OpenCL) Submitted by: Anton Lokhmotov <Anton.Lokhmotov@arm.com> Approved by: o Anton Korobeynikov o Micah Villmow o David Neto llvm-svn: 157393	2012-05-24 15:59:06 +00:00
Patrik Hägglund	6f837093ba	Fixed typo in r156905. llvm-svn: 157320	2012-05-23 12:34:56 +00:00
Chris Lattner	9b30a6d7da	small refinement to r157218 to save a tiny amount of table size in the common case. llvm-svn: 157312	2012-05-23 05:19:18 +00:00
Nuno Lopes	944814b41a	revert my previous patches that introduced an additional parameter to the objectsize intrinsic. After a lot of discussion, we realized it's not the best option for run-time bounds checking llvm-svn: 157255	2012-05-22 15:25:31 +00:00
Pete Cooper	869fc085f9	Added address space qualifier to intrinsic PointerType arguments. llvm-svn: 157218	2012-05-21 23:21:28 +00:00
Stepan Dyatkovskiy	98d41ce6af	PR1255 (case ranges: work with ConstantRangesSet instead of ConstantInt) related changes for Execution and Verifier. llvm-svn: 157183	2012-05-21 10:44:40 +00:00
Benjamin Kramer	e11ffa2475	Move CallbackVHs dtor inline, it can be devirtualized in many cases. Move the other virtual methods out of line as they are only called from within Value.cpp anyway. llvm-svn: 157123	2012-05-19 19:15:25 +00:00
Chris Lattner	3d3d2be19b	enhance the intrinsic info stuff to emit encodings that don't fit in 32-bits into a separate side table, using the handy SequenceToOffsetTable class. This encodes all these weird things into another 256 bytes, allowing all intrinsics to be encoded this way. llvm-svn: 156995	2012-05-17 15:55:41 +00:00
Manuel Klimek	74eaba6d7b	Fix compile error. llvm-svn: 156986	2012-05-17 09:32:05 +00:00
Chris Lattner	cd2c5863d5	Genericize the intrinsics descriptor decoding a bit to make room for future expansion, no functionality change yet though. llvm-svn: 156979	2012-05-17 05:13:57 +00:00
Chris Lattner	288872000a	finish encoding all of the interesting details of intrinsics. Now intrinsics are only rejected because they can't be encoded into a 32-bit unit, not because they contain an unencodable feature. llvm-svn: 156978	2012-05-17 05:03:24 +00:00
Chris Lattner	8962c547fa	strengthen the intrinsic descriptor stuff to be able to handle sin, cos and other intrinsics that use passed-in arguments. llvm-svn: 156977	2012-05-17 04:30:58 +00:00
Chris Lattner	294ca0bfbc	Significantly reduce the compiled size of Functions.cpp by turning a big blob of tblgen generated code (for Intrinsic::getType) into a table. This handles common cases right now, but I plan to extend it to handle all cases and merge in type verification logic as well in follow-on patches. llvm-svn: 156905	2012-05-16 06:34:44 +00:00
Bill Wendling	aad3af5f7c	Use ArrayRef instead of an explicit vector type. llvm-svn: 156755	2012-05-14 07:53:40 +00:00
Stepan Dyatkovskiy	fa0cf8dc2e	Recommitted r156374 with critical fixes in BitcodeReader/Writer: Ordinary patch for PR1255. Added new case-ranges oriented methods for adding/removing cases in SwitchInst. After this patch cases will be internally represented as ConstantArray-s instead of ConstantInt; externally, cases are wrapped within the ConstantRangesSet object. Old methods of SwitchInst also work well, but are marked as deprecated. So at this stage we have no side effects except that I added support for case ranges in BitcodeReader/Writer; of course a test for Bitcode is also added. Old "switch" format is also supported. llvm-svn: 156704	2012-05-12 10:48:17 +00:00
Jay Foad	65d25fa204	Teach Function::hasAddressTaken that BlockAddress doesn't really take the address of a function. llvm-svn: 156703	2012-05-12 08:30:16 +00:00
Joel Jones	305ecb3495	Fix a problem with incomplete equality testing of PHINodes in Instruction::IsIdenticalToWhenDefined. This manifested itself when inlining two calls to the same function. The inlined function had a switch statement that returned one of a set of global variables. Without this modification, the two phi instructions that chose values from the branches of the switch instruction inlined from the callee were considered equivalent and jump-threading replaced a load for the first switch value with a phi selecting from the second switch, thereby producing incorrect code. This patch has been tested with "make check-all", "lnt runtest nt", and llvm self-hosted, and on the original program that had this problem, wireshark. <rdar://problem/11025519> llvm-svn: 156548	2012-05-10 15:59:41 +00:00
Hans Wennborg	879332e389	Introduce llvm-c function LLVMPrintModuleToFile. This lets you save the textual representation of the LLVM IR to a file. Before this patch it could only be printed to STDERR from llvm-c. Patch by Carlo Kok! llvm-svn: 156479	2012-05-09 16:54:17 +00:00
Nuno Lopes	e8880a9916	change the objectsize intrinsic signature: add a 3rd parameter to denote the maximum runtime performance penalty that the user is willing to accept. This commit only adds the parameter. Code taking advantage of it will follow. llvm-svn: 156473	2012-05-09 15:52:43 +00:00
Stepan Dyatkovskiy	b150cd5ced	Rejected r156374: Ordinary PR1255 patch. Due to clang-x86_64-debian-fnt buildbot failure. llvm-svn: 156377	2012-05-08 08:33:21 +00:00
Craig Topper	77b1a4cee5	Remove 256-bit AVX non-temporal store intrinsics. Similar was previously done for 128-bit. llvm-svn: 156375	2012-05-08 06:58:15 +00:00
Stepan Dyatkovskiy	33fd2a5bf4	Ordinary patch for PR1255. Added new case-ranges oriented methods for adding/removing cases in SwitchInst. After this patch cases will be internally represented as ConstantArray-s instead of ConstantInt; externally, cases are wrapped within the ConstantRangesSet object. Old methods of SwitchInst also work well, but are marked as deprecated. So at this stage we have no side effects except that I added support for case ranges in BitcodeReader/Writer; of course a test for Bitcode is also added. Old "switch" format is also supported. llvm-svn: 156374	2012-05-08 06:36:08 +00:00
Dan Gohman	25a863dcf7	Reapply r155682, making constant folding more consistent, with a fix to work properly with how the code handles all-undef PHI nodes. llvm-svn: 155721	2012-04-27 17:50:22 +00:00
NAKAMURA Takumi	a28147f072	Revert r155682, "Use ConstantExpr::getExtractElement when constant-folding vectors" It broke stage2 build. stage1/clang sometimes crashed. llvm-svn: 155699	2012-04-27 07:59:20 +00:00
Dan Gohman	a72b2f97a6	Use ConstantExpr::getExtractElement when constant-folding vectors instead of getAggregateElement. This has the advantage of being more consistent and allowing higher-level constant folding to proceed even if an inner extract element cannot be folded. Make ConstantFoldInstruction call ConstantFoldConstantExpression on the instruction's operands, making it more consistent with ConstantFoldConstantExpression itself. This makes sure that ConstantExprs get TargetData-aware folding before being handed off as operands for further folding. This causes more expressions to be folded, but due to a known shortcoming in constant folding, this currently has the side effect of stripping a few more nuw and inbounds flags in the non-targetdata side of constant-fold-gep.ll. This is mostly harmless. This fixes rdar://11324230. llvm-svn: 155682	2012-04-27 00:54:36 +00:00
Bill Wendling	d9d9230b83	Don't forget to reset 'first operand' flag when we're setting the MDNodeOperand value. llvm-svn: 155599	2012-04-26 00:38:42 +00:00
Nadav Rotem	3c817bb807	ConstantFoldSelectInstruction swapped the operands of the select. Fix 12592. Patch by Matt Pharr. llvm-svn: 155480	2012-04-24 20:18:49 +00:00
Bill Wendling	0f9f58c75a	Cleanup whitespace. llvm-svn: 155328	2012-04-23 00:23:33 +00:00
Bill Wendling	2b77fec649	Limit the number of times we recurse through this algorithm. All of the instructions are processed. So there's no need to look at them if they're used as operands of other instructions. llvm-svn: 155327	2012-04-23 00:22:55 +00:00
Bill Wendling	8d86028029	Add a flag to the struct type finder to collect only those types which have names. This saves collecting types we normally don't care about. llvm-svn: 155300	2012-04-21 23:59:16 +00:00
Bill Wendling	be493e63ea	Revert r155241, which is causing some breakage. llvm-svn: 155253	2012-04-20 23:11:38 +00:00
Bill Wendling	bb9c301c28	If we discover all of the named structs in a module, then don't bother to process any more Values. llvm-svn: 155241	2012-04-20 21:56:24 +00:00
Craig Topper	7c784d86eb	Remove AVX vpermil intrinsics. I removed their uses from clang headers and builtins a while back. llvm-svn: 154985	2012-04-18 05:24:00 +00:00
Eric Christopher	2ec1742f9b	Typo. llvm-svn: 154879	2012-04-16 23:54:31 +00:00
Duncan Sands	518668bd76	Remove support for the special 'fast' value for fpmath accuracy for the moment. llvm-svn: 154850	2012-04-16 19:39:33 +00:00
Duncan Sands	f61d49df40	Make it possible to indicate relaxed floating point requirements at the IR level through the use of 'fpmath' metadata. Currently this only provides a 'fpaccuracy' value, which may be a number in ULPs or the keyword 'fast', however the intent is that this will be extended with additional information about NaN's, infinities etc later. No optimizations have been hooked up to this so far. llvm-svn: 154822	2012-04-16 16:28:59 +00:00
Duncan Sands	40d080e3b7	Rename "fpaccuracy" metadata to the more generic "fpmath". That's because I'm thinking of generalizing it to be able to specify other freedoms beyond accuracy (such as that NaN's don't have to be respected). I'd like the 3.1 release (the first one with this metadata) to have the more generic name already rather than having to auto-upgrade it in 3.2. llvm-svn: 154744	2012-04-14 12:36:06 +00:00
Dan Gohman	cde3a46455	Def here is an Instruction, so !isa<Instruction>(Def) is always false, as Eli noticed. llvm-svn: 154641	2012-04-13 00:50:57 +00:00
Dan Gohman	c0a906405e	Add forms of dominates and isReachableFromEntry that accept a Use directly instead of a user Instruction. This allows them to test whether a def dominates a particular operand if the user instruction is a PHI. llvm-svn: 154631	2012-04-12 23:31:46 +00:00
Benjamin Kramer	eba5ed591b	Cache the hash value of the operands in the MDNode. FoldingSet is implemented as a chained hash table. When there is a hash collision during insertion, which is common as we fill the table until a load factor of 2.0 is hit, we walk the chained elements, comparing every operand with the new element's operands. This can be very expensive if the MDNode has many operands. We sacrifice a word of space in MDNode to cache the full hash value, reducing compares on collision to a minimum. MDNode grows from 28 to 32 bytes + operands on x86. On x86_64 the new bits fit nicely into existing padding, not growing the struct at all. The actual speedup depends a lot on the test case and is typically between 1% and 2% for C++ code with clang -c -O0 -g. llvm-svn: 154497	2012-04-11 14:06:54 +00:00
Benjamin Kramer	3a0f5a0df3	Compute hashes directly with hash_combine instead of taking a detour through FoldingSetNodeID. llvm-svn: 154495	2012-04-11 14:06:39 +00:00
Bill Wendling	16712e549c	The MDString class stored a StringRef to the string which was already in a StringMap. This was redundant and unnecessarily bloated the MDString class. Because the MDString class is a "Value" and will never have a "name", and because the Name field in the Value class is a pointer to a StringMap entry, we repurpose the Name field for an MDString. It stores the StringMap entry in the Name field, and uses the normal methods to get the string (name) back. PR12474 llvm-svn: 154429	2012-04-10 20:12:16 +00:00
Duncan Sands	f25460b85f	Express the number of ULPs in fpaccuracy metadata as a real rather than a rational number, eg as 2.5 rather than 5, 2. OK'd by Peter Collingbourne. llvm-svn: 154387	2012-04-10 08:22:43 +00:00
Bill Wendling	15f23d4018	Remove the 'Parent' pointer from the MDNodeOperand class. An MDNode has a list of MDNodeOperands allocated directly after it as part of its allocation. Therefore, the Parent of the MDNodeOperands can be found by walking back through the operands to the beginning of that list. Mark the first operand's value pointer as being the 'first' operand so that we know where the beginning of said list is. This saves a lot of space during LTO with -O0 -g flags. llvm-svn: 154280	2012-04-08 10:20:49 +00:00
Bill Wendling	d13bec8fa9	Allow subclasses of the ValueHandleBase to store information as part of the value pointer by making the value pointer into a pointer-int pair with 2 bits available for flags. llvm-svn: 154279	2012-04-08 10:16:43 +00:00
Bill Wendling	e3c2c36927	The speedup doesn't appear to have been from this, but was an anomaly of my testing machine. llvm-svn: 153951	2012-04-03 11:19:21 +00:00
Bill Wendling	3f12fbd290	Reserve space for the eventual filling of the vector. This gives a small speedup. llvm-svn: 153949	2012-04-03 10:50:09 +00:00
Duncan Sands	5165327295	I noticed in passing that the Metadata getIfExists method was creating a new node and returning it if one didn't exist. llvm-svn: 153798	2012-03-31 08:20:11 +00:00
Rafael Espindola	151b420718	Handle unreachable code in the dominates functions. This changes users when needed for correctness, but still doesn't clean up code that now unnecessarily checks for reachability. llvm-svn: 153755	2012-03-30 16:46:21 +00:00
Douglas Gregor	d7dc901945	Add missing include of <new> llvm-svn: 153436	2012-03-26 14:04:17 +00:00


4188 Commits