llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-11 21:45:16 +00:00

Author	SHA1	Message	Date
Ehsan Amiri	b2ddc31073	[PPC] Adding the removed testcase again This testcase was originally part of r284995, but I put it in a wrong directory. So I removed it. Before adding it back I did some small enhancements. Also I changed the assertions a little bit, to take into account the impact of some changes performed since code review is done. This is similar to changes done for another testcase in the original commit. See: https://reviews.llvm.org/D23614#577749 Basically for instead of vxor we now generate xxlxor in some cases, which is better. llvm-svn: 285333	2016-10-27 19:10:09 +00:00
Haicheng Wu	78fd509204	[LoopUnroll] Check partial unrolling is enabled before initialization. NFC. Differential Revision: https://reviews.llvm.org/D23891 llvm-svn: 285330	2016-10-27 18:40:02 +00:00
Simon Pilgrim	68d206a9d2	[X86][AVX512] Fix MUL v8i64 costs on non-AVX512DQ targets llvm-svn: 285329	2016-10-27 18:32:06 +00:00
Sanjay Patel	6b6d270c74	[InstCombine] handle simple vector integer constants in IsFreeToInvert llvm-svn: 285318	2016-10-27 17:30:50 +00:00
Simon Pilgrim	02a2816235	[X86][AVX512DQ] Move v2i64 and v4i64 MUL lowering to tablegen As suggested by @igorb on D26011 llvm-svn: 285313	2016-10-27 17:07:40 +00:00
Saleem Abdulrasool	5d2e860995	ARM: ensure that the Windows DBZ check is in range The Windows ARM target expects the compiler to emit a division-by-zero check. The check would use the form of: cmp r?, #0 cbz .Ltrap b .Lbody .Lbody: ... .Ltrap: udf #249 @ __brkdiv0 This works great most of the time. However, if the body of the function is greater than 127 bytes, the branch target limitation of cbz becomes an issue. This occurs in the unoptimized code generation cases sometimes (like in compiler-rt). Since this is a matter of correctness, possibly pay a small penalty instead. We now form this slightly differently: cbnz .Lbody udf #249 @ __brkdiv0 .Lbody: ... The positive case is through the branch instead of being the next instruction. However, because of the basic block layout, the negated branch is going to be a short distance always (2 bytes away, after the inserted __brkdiv0). The new t__brkdiv0 instruction is required to explicitly mark the instruction as a terminator as the generic UDF instruction is not a terminator. Addresses PR30532! llvm-svn: 285312	2016-10-27 16:59:22 +00:00
Greg Clayton	c59e887dd6	Switch all DWARF variables for tags, attributes and forms over to use the llvm::dwarf enumerations instead of using raw uint16_t values. This allows easier debugging as users can see the values of the enumerations in the variables view that will show the enumeration string instead of just a number. https://reviews.llvm.org/D26013 llvm-svn: 285309	2016-10-27 16:32:04 +00:00
Dehao Chen	ecb41605f5	Add Loop Sink pass to reverse the LICM based of basic block frequency. Summary: LICM may hoist instructions to preheader speculatively. Before code generation, we need to sink down the hoisted instructions inside to loop if it's beneficial. This pass is a reverse of LICM: looking at instructions in preheader and sinks the instruction to basic blocks inside the loop body if basic block frequency is smaller than the preheader frequency. Reviewers: hfinkel, davidxl, chandlerc Subscribers: anna, modocache, mgorny, beanz, reames, dberlin, chandlerc, mcrosier, junbuml, sanjoy, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D22778 llvm-svn: 285308	2016-10-27 16:30:08 +00:00
Vasileios Kalintiris	119c41482e	[mips] Do not allow -opt-bisect-limit to skip the PIC call optimization pass. r282428 added the MipsOptimizePICCall as an opt-in pass that can be skipped when using the -opt-bisect-limit option. However, this pass is needed because it generates code that conforms to the o32 ABI specification by using the $t9 register for PIC calls with JALR instructions. This bug was exposed by the fact that skipFunction() also checks for the "optnone" attribute. This caused functions with that attribute to break the requirements of the o32 ABI. llvm-svn: 285305	2016-10-27 15:50:36 +00:00
Simon Pilgrim	268d797e5a	[X86][AVX512DQ] Improve lowering of MUL v2i64 and v4i64 With DQI but without VLX, lower v2i64 and v4i64 MUL operations with v8i64 MUL (vpmullq). Updated cost table accordingly. Differential Revision: https://reviews.llvm.org/D26011 llvm-svn: 285304	2016-10-27 15:27:00 +00:00
Sanjay Patel	87ccc5a816	[ValueTracking] fix matchSelectPattern to allow vector splat folds of min/max/abs/nabs llvm-svn: 285303	2016-10-27 15:26:10 +00:00
Benjamin Kramer	49bf5a5fa4	Remove duplicated default move ctors/move assign. No functional change. llvm-svn: 285302	2016-10-27 15:23:44 +00:00
Sanjay Patel	e121eccc6f	[InstCombine] add tests for missing folds of vector abs/nabs/min/max llvm-svn: 285299	2016-10-27 15:02:45 +00:00
Bjorn Pettersson	bbb6863e88	Fix memory issue in AttrBuilder::removeAttribute uses. Summary: Found when running Valgrind. This removes two unnecessary assignments when using AttrBuilder::removeAttribute. AttrBuilder::removeAttribute returns a reference to the object. As the LHSes were the same as the callees, the assignments resulted in memcpy calls where dst = src. Commited on behalf-of: dstenb (David Stenberg) Reviewers: mkuper, rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25460 llvm-svn: 285298	2016-10-27 14:48:09 +00:00
Krzysztof Parzyszek	e14110d47f	[Hexagon] Do not expand ISD::SELECT for HVX vectors llvm-svn: 285297	2016-10-27 14:30:16 +00:00
Simon Pilgrim	e1bf270f66	[DAGCombiner] Add vector demanded elements support to computeKnownBits Currently computeKnownBits returns the common known zero/one bits for all elements of vector data, when we may only be interested in one/some of the elements. This patch adds a DemandedElts argument that allows us to specify the elements we actually care about. The original computeKnownBits implementation calls with a DemandedElts demanding all elements to match current behaviour. Scalar types set this to 1. The approach was found to be easier than trying to add a per-element known bits solution, for a similar usefulness given the combines where computeKnownBits is typically used. I've only added support for a few opcodes so far (the ones that have proven straightforward to test), all others will default to demanding all elements but can be updated in due course. DemandedElts support could similarly be added to computeKnownBitsForTargetNode in a future commit. Differential Revision: https://reviews.llvm.org/D25691 llvm-svn: 285296	2016-10-27 14:29:28 +00:00
Sanjay Patel	6941aa6e91	[InstCombine] auto-generate better checks; NFC llvm-svn: 285293	2016-10-27 13:55:37 +00:00
George Rimar	df30d629a9	Revert r285285 "[Object/ELF] - Fixed behavior when SectionHeaderTable->sh_size is too large." It broke BB. llvm-svn: 285288	2016-10-27 12:18:50 +00:00
Alexey Bataev	b6c0260e86	[SLP] Fix for PR30626: Compiler crash inside SLP Vectorizer. After successfull horizontal reduction vectorization attempt for PHI node vectorizer tries to update root binary op by combining vectorized tree and the ReductionPHI node. But during vectorization this ReductionPHI can be vectorized itself and replaced by the `undef` value, while the instruction itself is marked for deletion. This 'marked for deletion' PHI node then can be used in new binary operation, causing "Use still stuck around after Def is destroyed" crash upon PHI node deletion. Also the test is fixed to make it perform actual testing. Differential Revision: https://reviews.llvm.org/D25671 llvm-svn: 285286	2016-10-27 12:02:28 +00:00
George Rimar	2f2500e562	[Object/ELF] - Fixed behavior when SectionHeaderTable->sh_size is too large. Elf.h already has code checking that section table does not go past end of file. Problem is that this check may not work on values greater than UINT64_MAX / Header->e_shentsize because of calculation overflow. Parch fixes the issue. Differential revision: https://reviews.llvm.org/D25432 llvm-svn: 285285	2016-10-27 11:50:04 +00:00
George Rimar	9ee5fd9330	[Object/ELF] - Do not allow overflow when checking section size/offset. Overflow was the reason of incorrect passing the check, patch fixes the case. Differentail revision: https://reviews.llvm.org/D25514 llvm-svn: 285284	2016-10-27 11:44:56 +00:00
George Rimar	f764591abe	[Object/ELF] - Do not crash if string table sh_size is equal to zero. Revealed using "id_000038,sig_11,src_000015,op_havoc,rep_16" from PR30540, when sh_size was 0, crash happened. Differential revision: https://reviews.llvm.org/D25091 llvm-svn: 285282	2016-10-27 11:41:57 +00:00
Sam Parker	cc86b4dc71	[ARM] Add newline char to test. Missed a newline in the previous commit. Differential Revision: https://reviews.llvm.org/D26027 llvm-svn: 285280	2016-10-27 10:43:02 +00:00
Sam Parker	5118fae0d2	[ARM] Predicate UMAAL selection on hasDSP. UMAAL is a DSP instruction and it is not available on thumbv7m (Cortex-M3) and thumbv6m (Cortex-M0+1) targets. Also fix wrong CHECK prefix in longMAC.ll test. Patch by Vadzim Dambrouski. Differential Revision: https://reviews.llvm.org/D25890 llvm-svn: 285278	2016-10-27 09:47:10 +00:00
Dylan McKay	de46f31ff7	[AVR] Generate all of the TableGen files we need This enables generation of all of the TableGen files that are used downstream. llvm-svn: 285274	2016-10-27 08:20:47 +00:00
Nicolai Haehnle	1a1eaef0b2	AMDGPU: Fix SILoadStoreOptimizer when writes cannot be merged due register dependencies Summary: When finding a match for a merge and collecting the instructions that must be moved, keep in mind that the instruction we merge might actually use one of the defs that are being moved. Fixes piglit spec/arb_enhanced_layouts/execution/component-layout/vs-tcs-load-output[-indirect]. The fact that the ds_read in the test case is not eliminated suggests that there might be another problem related to alias analysis, but that's a separate problem: this pass should still work correctly even when earlier optimization passes missed something or were disabled. Reviewers: tstellarAMD, arsenm Subscribers: kzhuravl, wdng, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25829 llvm-svn: 285273	2016-10-27 08:15:07 +00:00
Dylan McKay	261a06f328	[AVR] Compile the disassembler This also updates references of 'TheAVRTarget' to the new 'getTheAVRTarget()' method. llvm-svn: 285272	2016-10-27 08:09:15 +00:00
Dylan McKay	d6e1be8ce1	[AVR] Add AVRISelDAGToDAG.cpp Summary: This pulls the AVR instruction selector in-tree. Reviewers: arsenm, kparzysz Subscribers: llvm-commits, wdng, beanz, japaric, mgorny Differential Revision: https://reviews.llvm.org/D25278 llvm-svn: 285270	2016-10-27 07:03:47 +00:00
Dylan McKay	a02b1d7d51	[AVR] Add the machine code emitter Reviewers: arsenm, kparzysz Subscribers: wdng, beanz, japaric, llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D25388 llvm-svn: 285269	2016-10-27 06:56:46 +00:00
Nemanja Ivanovic	54552bfee6	[PowerPC] - No SExt/ZExt needed for count trailing zeros This patch corresponds to review: https://reviews.llvm.org/D25896 It just eliminates the redundant ZExt after a count trailing zeros instruction. llvm-svn: 285267	2016-10-27 05:17:58 +00:00
Kostya Serebryany	531e48b45b	[libFuzzer] speculatively trying to fix the Mac build; second attempt llvm-svn: 285262	2016-10-27 00:36:38 +00:00
NAKAMURA Takumi	59b3cfe715	xray-extract.cc: Quick fix for mingw, to avoid errc::protocol_error. errc::protocol_error is winsock-oriented and mingw doesn't include it. (MS does, though) llvm-svn: 285261	2016-10-27 00:34:24 +00:00
Kostya Serebryany	616a813fef	[libFuzzer] revert 285259 -- hit commit too soon llvm-svn: 285260	2016-10-27 00:24:34 +00:00
Kostya Serebryany	f35719f98d	[libFuzzer] speculatively trying to fix the Mac build llvm-svn: 285259	2016-10-27 00:22:39 +00:00
Davide Italiano	072144e29f	[IR] Retire unused getGEPReturnType overload. NFCI. llvm-svn: 285257	2016-10-26 23:46:16 +00:00
Tim Shen	e7784c3e65	[APFloat] Fix APFloat::getExactInverse when the input is nullptr. This is a regression introduced by r285105. Reviewers: kbarton, echristo, iteratee, eugenis Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D26017 llvm-svn: 285256	2016-10-26 23:31:41 +00:00
Justin Bogner	4a132b1a38	llvm-objdump: Make some error messages more consistent Most of the version of report_error were quoting the filename and printing a colon between the file name and the error message, but this one wasn't doing either of those. Fix the output to be more consistent. llvm-svn: 285252	2016-10-26 22:37:52 +00:00
Vedant Kumar	b60ba08907	[utils] Add a '--unified-report' option to the code coverage prep script In --unified-report mode, a single coverage report is prepared for all specified binaries and written to report_dir. This mode is compatible with all existing script options, including the --restrict mode which is used to limit coverage reporting to certain files or directories. This should not break any existing users of the script. llvm-svn: 285249	2016-10-26 22:07:39 +00:00
Vedant Kumar	f146b009b8	[utils] Use print_function in the code coverage prep script, NFC. llvm-svn: 285248	2016-10-26 22:07:37 +00:00
Vedant Kumar	0ef1917cef	[utils] Add an '--only-merge' option to the code coverage prep script In --only-merge mode, the script terminates after the profile merging step. This makes the script less stateful: it's more natural to split the merge out into a separate step instead of relying on the first invocation of the script to do it. This should not break any existing users of the script. llvm-svn: 285247	2016-10-26 22:07:35 +00:00
Evandro Menezes	194192393f	[AArch64] Create feature set for Samsung Exynos-M2 Since Exynos-M2 improved the FP square root unit a bit over the one in Exynos-M1, it does not benefit from using the Newton series for such operations. llvm-svn: 285246	2016-10-26 22:06:20 +00:00
Victor Leschuk	8e524a13df	DebugInfo: fix incorrect alignment type (NFC) Change type of some missed DebugInfo-related alignment variables, that are still uint64_t, to uint32_t. Original change introduced in r284482. llvm-svn: 285242	2016-10-26 21:32:29 +00:00
Reid Kleckner	7dfe7f827f	[lit] Work around Windows MSys command line tokenization bug Summary: This will allow us to revert LLD r284768, which added spaces to get MSys echo to print what we want. Reviewers: ruiu, inglorion, rafael Subscribers: modocache, llvm-commits Differential Revision: https://reviews.llvm.org/D26009 llvm-svn: 285237	2016-10-26 20:29:27 +00:00
Ehsan Amiri	aacb024df4	[PPC] Remove testcase from incorrect directory During my last commit this testcase was put in an incorrect directory. Removing it. Will put it in the right directory when I can verify everything is correct. llvm-svn: 285233	2016-10-26 20:16:59 +00:00
Tim Northover	b805bcfa72	ARM: don't rely on push/pop reglists being in order when folding SP adjust. It would be a very nice invariant to rely on, but unfortunately it doesn't necessarily hold (and the causes of mis-sorted reglists appear to be quite varied) so to be robust the frame lowering code can't assume that the first register in the list is also the first one that actually gets pushed. Should fix an issue where we were turning something like: push {r8, r4, r7, lr} sub sp, #24 into nonsense like: push {r2, r3, r4, r5, r6, r7, r8, r4, r7, lr} llvm-svn: 285232	2016-10-26 20:01:00 +00:00
Nemanja Ivanovic	897a508260	Do not assume that FP vector operands are never legalized by expanding This patch ensures that if a floating point vector operand is legalized by expanding, it is legalized through the stack rather than by calling DAGTypeLegalizer::IntegerToVector which will cause a failure since the operand is a non-integer type. This fixes PR 30715. llvm-svn: 285231	2016-10-26 19:51:35 +00:00
Sanjoy Das	4802c96296	Simplify `x >=u x >> y` and `x >=u x udiv y` Summary: Extends InstSimplify to handle both `x >=u x >> y` and `x >=u x udiv y`. This is a folloup of rL258422 and https://github.com/rust-lang/rust/pull/30917 where llvm failed to optimize away the bounds checking in a binary search. Patch by Arthur Silva! Reviewers: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25941 llvm-svn: 285228	2016-10-26 19:18:43 +00:00
Chad Rosier	7c0f8709c5	Revert "[AliasSetTracker] Make AST smarter about intrinsics that don't actually affect memory." This reverts commit r285191. LICM appears to rely on the Alias Set Tracker hitting lifetime markers to prevent code from being moved outside of the original scope. llvm-svn: 285227	2016-10-26 19:18:19 +00:00
Nemanja Ivanovic	2991b16281	[PowerPC] Implement vec_insert_exp builtins - llvm portion This revision corresponds to review: https://reviews.llvm.org/D25957. Committing on behalf of Zaara Syeda. llvm-svn: 285225	2016-10-26 19:03:40 +00:00
Kostya Serebryany	f4f60661d6	[libFuzzer] simplify TracePC::HandleTrace even further. Also, when dealing with -exit_on_src_pos, symbolize every PC only once llvm-svn: 285223	2016-10-26 18:52:04 +00:00

... 2 3 4 5 6 ...

140093 Commits