llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-16 08:08:01 +00:00

Author	SHA1	Message	Date
Eric Christopher	b4999f7af6	Fix a spurious if. llvm-svn: 255321	2015-12-11 00:51:59 +00:00
Akira Hatanaka	ef959cd337	[LazyValueInfo] Stop inserting overdefined values into ValueCache to reduce memory usage. Previously, LazyValueInfoCache inserted overdefined lattice values into both ValueCache and OverDefinedCache. This wasn't necessary and was causing LazyValueInfo to use an excessive amount of memory in some cases. This patch changes LazyValueInfoCache to insert overdefined values only into OverDefinedCache. The memory usage decreases by 70 to 75% when one of the files in llvm is compiled. rdar://problem/11388615 Differential revision: http://reviews.llvm.org/D15391 llvm-svn: 255320	2015-12-11 00:49:47 +00:00
Kyle Butt	d374fb8cd3	[PPC]: Peephole optimize small accesses to aligned globals. Access to aligned globals gives us a chance to peephole optimize nonzero offsets. If a struct is 4-byte aligned, then accesses to bytes 0-3 won't overflow the available displacement. For example: addis 3, 2, b4v@toc@ha addi 4, 3, b4v@toc@l lbz 5, b4v@toc@l(3) ; This is the result of the current peephole lbz 6, 1(4) ; optimizer lbz 7, 2(4) lbz 8, 3(4) If b4v is 4-byte aligned, we can skip using register 4 because we know that b4v@toc@l+{1,2,3} won't overflow 32K, and instead generate: addis 3, 2, b4v@toc@ha lbz 4, b4v@toc@l(3) lbz 5, b4v@toc@l+1(3) lbz 6, b4v@toc@l+2(3) lbz 7, b4v@toc@l+3(3) Saving a register and an addition. Larger alignments allow larger structures/arrays to be optimized. llvm-svn: 255319	2015-12-11 00:47:36 +00:00
Hans Wennborg	b179aaa5e1	Check in the script for building Win snapshots llvm-svn: 255318	2015-12-11 00:43:42 +00:00
Vedant Kumar	6f3e981379	[ProfileData] clang-format TextInstrProfReader::hasFormat. NFC. llvm-svn: 255317	2015-12-11 00:40:05 +00:00
Cong Hou	649a5e80cc	[X86][SSE] Update the cost table for integer-integer conversions on SSE2/SSE4.1. Previously in the conversion cost table there are no entries for integer-integer conversions on SSE2. This will result in imprecise costs for certain vectorized operations. This patch adds those entries for SSE2 and SSE4.1. The cost numbers are counted from the result of running llc on the new test case in this patch. Differential revision: http://reviews.llvm.org/D15132 llvm-svn: 255315	2015-12-11 00:31:39 +00:00
Xinliang David Li	d864384cd6	Format fix (NFC) llvm-svn: 255313	2015-12-10 23:48:05 +00:00
Eric Christopher	5f9e955382	s/need/needs llvm-svn: 255306	2015-12-10 22:29:26 +00:00
Eric Christopher	971f116a1c	Fix (bitcast (fabs x)), (bitcast (fneg x)) and (bitcast (fcopysign cst, x)) combines for ppc_fp128, since signbit computation is more complicated. Discussion thread: http://lists.llvm.org/pipermail/llvm-dev/2015-November/092863.html Patch by Tim Shen! llvm-svn: 255305	2015-12-10 22:09:06 +00:00
Eric Christopher	486717bbd5	Attempt to fix the ReST compilation to html of the C API docs. llvm-svn: 255304	2015-12-10 22:04:11 +00:00
Eric Christopher	9456f0c8a7	More non-ascii quote characters. llvm-svn: 255303	2015-12-10 21:47:38 +00:00
Eric Christopher	4d1733dc41	Clarify some of the wording on adding a new subcomponent to the C API. llvm-svn: 255302	2015-12-10 21:46:24 +00:00
Eric Christopher	cb61a69d4f	Fix non-ascii quotes. llvm-svn: 255301	2015-12-10 21:38:56 +00:00
Eric Christopher	40a2fd91c0	Add C API guidelines to the developer policy to match discussions on the llvm mailing lists. llvm-svn: 255300	2015-12-10 21:33:53 +00:00
Kyle Butt	706d39a054	PPC: Teach FMA mutate to respect register classes. This was causing bad code gen and assembly that won't assemble, as mixed altivec and vsx code would end up with a vsx high register assigned to an altivec instruction, which won't work. Constraining the classes allows the optimization to proceed. llvm-svn: 255299	2015-12-10 21:28:40 +00:00
Chris Bieneman	28da4411df	[CMake] Add LLVM_BUILD_INSTRUMENTED option to enable building with -fprofile-instr-generate This is the first step in supporting PGO data generation via CMake. I've marked the option as advanced and experimental until it is fleshed out further. llvm-svn: 255298	2015-12-10 21:19:07 +00:00
Mike Aizatsky	ea27e92765	[LibFuzzer] Introducing FUZZER_FLAG_UNSIGNED and using it for seeding. Differential Revision: http://reviews.llvm.org/D15339 done llvm-svn: 255296	2015-12-10 20:41:53 +00:00
JF Bastien	b5effc7a01	EarlyCSE: add tests Summary: As a follow-up to rL255054 I wasn't able to convince myself that the code did what I thought, so I wrote more tests. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15371 llvm-svn: 255295	2015-12-10 20:24:34 +00:00
Xinliang David Li	7d948a5ebf	Add a forward declaration (NFC) llvm-svn: 255292	2015-12-10 20:13:41 +00:00
Cong Hou	2cb9520fd0	Delete a duplicate branch in IfConversion.cpp. NFC. llvm-svn: 255291	2015-12-10 19:57:22 +00:00
Simon Pilgrim	fcb5c2d030	[DAGCombiner] Fix PR25763 - vector comparison constant folding + sign-extension PR25763 demonstrated an issue with D14683 - vector comparison constant folding only works for i1 results, so we need to split off the sign-extension of the result to the required type. Luckily this can be done with the existing type legalization code. llvm-svn: 255289	2015-12-10 19:47:06 +00:00
Chad Rosier	f118529c0e	[DSE] Disable non-local DSE to see if the bots go green. I see a few bots timing out, so I'm speculatively disabling r255247. llvm-svn: 255286	2015-12-10 19:23:02 +00:00
Rafael Espindola	43562b355d	Fix another case where the linkage was not set. llvm-svn: 255272	2015-12-10 18:44:26 +00:00
Rong Xu	1cf66e30a9	[PGO] Use %t as the temporary profdata filename in the test cases. Using %t rather than %T/<specific_name> as the temporary profdata filename. llvm-svn: 255271	2015-12-10 18:24:44 +00:00
Duncan P. N. Exon Smith	fe4d347815	Verifier: Avoid quadratic checking of aggregates for bad bitcasts Avoid O(N^2) behaviour when checking for bad bitcasts in `ConstantExpr`s buried inside of aggregate initializers to `GlobalVariable`s. I've: - centralized the "visited" set for recursing through `ConstantExpr`s so that expressions are only visited once per Verifier run, - removed the duplicate logic for the stack visit, and - avoided recursing into other `GlobalValue`s. This recovers roughly a 100x time difference in clang compiles of a particular input file (filled with large cross-referencing tables) that depends on whether `-disable-llvm-verifier` is on. This slowdown was caused by r187506, which introduced these checks. Now, avoiding `-disable-llvm-verifier` only causes a 2x slowdown for this case. (Interestingly, dumping the textual IR for this file starts at least 50GB of global variable initializers (I don't know the total, since I killed the dump)...) llvm-svn: 255269	2015-12-10 17:56:06 +00:00
Chad Rosier	b8c929dbff	[DeadStoreElimination] Use range-based loops. NFC. llvm-svn: 255265	2015-12-10 17:27:18 +00:00
Nathan Slingerland	5f9f1eadf1	[ProfileData] Add unit test infrastructure for sample profile reader/writer Summary: Adds support for in-memory round-trip of sample profile data along with basic round trip unit tests. This will also make it easier to include unit tests for future changes to sample profiling. Reviewers: davidxl, dnovillo, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15211 llvm-svn: 255264	2015-12-10 17:21:42 +00:00
Pirama Arumuga Nainar	45a5052ae3	Fix fptosi, fptoui from f16 vectors to i8, i16 vectors Summary: Convert f16 vectors to corresponding f32 vectors before doing the conversion to int. Add tests for v4f16, v8f16. Reviewers: ab, jmolloy Subscribers: llvm-commits, srhines Differential Revision: http://reviews.llvm.org/D14936 llvm-svn: 255263	2015-12-10 17:16:49 +00:00
Sanjay Patel	25a4b4195f	[InstCombine] fold bitcasts around an extractelement (3rd try) This is a redo of r255137 (reverted at r255227) which was a redo of r255124 (reverted at r255126) with a fixed check for a scalar source type and an added test for the failure that caused the revert. Original commit message: Example: bitcast (extractelement (bitcast <2 x float> %X to <2 x i32>), 1) to float ---> extractelement <2 x float> %X, i32 1 This is part of fixing PR25543: https://llvm.org/bugs/show_bug.cgi?id=25543 The next step will be to generalize this fold: trunc ( lshr ( bitcast X) ) -> extractelement (X) Ie, I'm hoping to replace the existing transform of: bitcast ( trunc ( lshr ( bitcast X))) added by: http://reviews.llvm.org/rL112232 with 2 less specific transforms to catch the case in the bug report. Differential Revision: http://reviews.llvm.org/D14879 llvm-svn: 255261	2015-12-10 17:09:28 +00:00
Teresa Johnson	087bc3b677	[ThinLTO] Debug message cleanup (NFC) Added some missing spaces between the module identifier and the start of the debug message. Also added a ":" after the module identifier to make this look a little nicer. llvm-svn: 255259	2015-12-10 16:39:07 +00:00
Rafael Espindola	9a2fc3b415	Avoid undefined behavior when vector is empty. Found by ubsan. llvm-svn: 255258	2015-12-10 16:35:06 +00:00
Sanjay Patel	c59d0a1e2c	remove duplicated comments and don't repeat function names in comments; NFC llvm-svn: 255257	2015-12-10 16:34:21 +00:00
Teresa Johnson	99c029d45a	[ThinLTO] Release files in gold plugin during combined index (take 2) Ensure we release the files even when they don't hold a function index summary section, by restructuring the control flow a little bit. llvm-svn: 255256	2015-12-10 16:11:23 +00:00
Dan Gohman	efa19c5d71	[WebAssembly] Tighten up several CHECK tests. llvm-svn: 255255	2015-12-10 14:52:34 +00:00
Rafael Espindola	c944447e56	Split lib/Linker in two. A linker normally has two stages: symbol resolution and "moving stuff". In lib/Linker there is the complication of lazy linking some globals, but it was still far more mixed than it needed to be. This splits the linker into a lower level IRMover and the linker proper. The IRMover just takes a list of globals to move and a callback that lets the user control what is lazy linked. The main motivation is that now tools/gold (and soon lld) can use their own symbol resolution to instruct IRMover what to do. llvm-svn: 255254	2015-12-10 14:19:35 +00:00
Dan Gohman	8e9d77609c	[WebAssembly] Make WebAssemblyStoreResults only return true when it has a change. llvm-svn: 255253	2015-12-10 14:17:36 +00:00
Dan Gohman	cea85bef3f	[WebAssembly] Fix WebAssemblyPeephole to set Changed to true when making changes. llvm-svn: 255252	2015-12-10 14:16:34 +00:00
Dan Gohman	8299a6675e	[WebAssembly] Declare that WebAssemblyPeephole does not modify the CFG. llvm-svn: 255251	2015-12-10 14:12:04 +00:00
Dan Gohman	92d2c287a0	[WebAssembly] Remove an unneeded getAnalysisUsage override. llvm-svn: 255250	2015-12-10 14:10:04 +00:00
Chad Rosier	4a619fca22	[DeadStoreElimination] Add support for non-local DSE. We extend the search for redundant stores to predecessor blocks that unconditionally lead to the block BB with the current store instruction. That also includes single-block loops that unconditionally lead to BB, and if-then-else blocks where then- and else-blocks unconditionally lead to BB. http://reviews.llvm.org/D13363 Patch by Ivan Baev <ibaev@codeaurora.org>! llvm-svn: 255247	2015-12-10 13:51:43 +00:00
Nemanja Ivanovic	1c3c6f9380	Bitcasts between FP and INT values using direct moves This patch corresponds to review: http://reviews.llvm.org/D15286 LLVM IR frequently contains bitcast operations between floating point and integer values of the same width. Doing this through memory operations is quite expensive on PPC. This patch allows the use of direct register moves between FPRs and GPRs for lowering bitcasts. llvm-svn: 255246	2015-12-10 13:35:28 +00:00
Amjad Aboud	85f2758759	Macro debug info support in LLVM IR Introduced DIMacro and DIMacroFile debug info metadata in the LLVM IR to support macros. Differential Revision: http://reviews.llvm.org/D14687 llvm-svn: 255245	2015-12-10 12:56:35 +00:00
Silviu Baranga	c26660fa6c	[LLE] Use the PredicatedScalarEvolution interface to query SCEVs for dependences Summary: LAA uses the PredicatedScalarEvolution interface, so it can produce forward/backward dependences having SCEVs that are AddRecExprs only after being transformed by PredicatedScalarEvolution. Use PredicatedScalarEvolution to get the expected expressions. Reviewers: anemet Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D15382 llvm-svn: 255241	2015-12-10 11:07:18 +00:00
Jonas Paulsson	34ea6437fb	[PostRA scheduling] Allow a target to do scheduling when it wants post RA. SystemZ needs to do its scheduling after branch relaxation, which can only happen after block placement, and therefore the standard PostRAScheduler point in the pass sequence is too early. TargetMachine::targetSchedulesPostRAScheduling() is a new method that signals on returning true that target will insert the final scheduling pass on its own. Reviewed by Hal Finkel llvm-svn: 255234	2015-12-10 09:10:07 +00:00
Akira Hatanaka	a1488717da	Revert r255137. This commit broke apple's internal bot. llvm-svn: 255227	2015-12-10 08:00:52 +00:00
Sanjoy Das	d85ded90d0	Add arg_begin() and arg_end() to CallInst and InvokeInst; NFCI - This simplifies the CallSite class, arg_begin / arg_end are now simple wrapper getters. - In several places, we were creating CallSite instances solely to call arg_begin and arg_end. With this change, that's no longer required. llvm-svn: 255226	2015-12-10 06:39:02 +00:00
Craig Topper	ef92c59e21	[X86] Fix a couple cases where bitwise and logical operations were being mixed. NFC llvm-svn: 255224	2015-12-10 06:09:41 +00:00
Alexey Bataev	f26d973dc0	[OPENMP] Make -fopenmp turn on OpenMP support by default. Patch turns on OpenMP support in clang by default after fixing OpenMP buildbots. Differential Revision: http://reviews.llvm.org/D13802 llvm-svn: 255222	2015-12-10 05:45:58 +00:00
Dan Gohman	aa6cf3803d	[WebAssembly] Implement mixed-type ISD::FCOPYSIGN. ISD::FCOPYSIGN permits its operands to have differing types, and DAGCombiner uses this. Add some def : Pat rules to expand this out into an explicit conversion and a normal copysign operation. llvm-svn: 255220	2015-12-10 04:55:31 +00:00
Dan Gohman	0ff43d51a6	[WebAssembly] Implement fma. It is lowered to a libcall for now, but this is expected to change in the future. llvm-svn: 255219	2015-12-10 04:52:33 +00:00

124875 Commits