llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-11 13:37:07 +00:00

Author	SHA1	Message	Date
Dehao Chen	52b358f670	Reimplement discriminator assignment algorithm. Summary: The new algorithm is more efficient (O(n), n is number of basic blocks). And it is guaranteed to cover all cases of multiple BB mapped to same line. Reviewers: dblaikie, davidxl, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14738 llvm-svn: 253594	2015-11-19 19:53:05 +00:00
Davide Italiano	73a83d035c	[AddressSanitizer] assert(false) -> llvm_unreachable and remove return. llvm-svn: 253591	2015-11-19 19:28:23 +00:00
Jun Bum Lim	96d6289a92	[AArch64] Refactoring aarch64-ldst-opt. NCF. Summary : * Rename isSmallTypeLdMerge() to isNarrowLoad(). * Rename NumSmallTypeMerged to NumNarrowTypePromoted. * Use Subtarget defined as a member variable. llvm-svn: 253587	2015-11-19 18:41:27 +00:00
Chad Rosier	7f187d5c67	[LIR] Fix 80-column from previous commit. llvm-svn: 253586	2015-11-19 18:25:11 +00:00
Chad Rosier	f0129c2dbd	[LIR] Sink checks into function to enable future refactoring. NFC. The purpose of this change is help delineate the memset and memcpy optimizations with the overall goal of resolving PR25520. llvm-svn: 253585	2015-11-19 18:22:21 +00:00
James Molloy	2208ca52dd	[GlobalOpt] Localize some globals that have non-instruction users We currently bail out of global localization if the global has non-instruction users. However, often these can be simple bitcasts or constant-GEPs, which we can easily turn into instructions before localizing. Be a bit more aggressive. llvm-svn: 253584	2015-11-19 18:04:33 +00:00
Sanjay Patel	770d3e85af	update comment and error message; NFC 'notail' was added in: http://reviews.llvm.org/rL252368 llvm-svn: 253580	2015-11-19 17:35:55 +00:00
Chad Rosier	03bb5a2376	[LIR] Use the more appropriate method. NFC. llvm-svn: 253578	2015-11-19 17:27:28 +00:00
Jun Bum Lim	5b665228a6	[AArch64]Extend merging narrow loads into a wider load This change extends r251438 to handle more narrow load promotions including byte type, unscaled, and signed. For example, this change will convert : ldursh w1, [x0, #-2] ldurh w2, [x0, #-4] into ldur w2, [x0, #-4] asr w1, w2, #16 and w2, w2, #0xffff llvm-svn: 253577	2015-11-19 17:21:41 +00:00
Sanjay Patel	7659e7f765	this new test file was accidentally left out of r253573 llvm-svn: 253574	2015-11-19 16:39:00 +00:00
Sanjay Patel	938dbcf1dc	[CGP] despeculate expensive cttz/ctlz intrinsics This is another step towards allowing SimplifyCFG to speculate harder, but then have CGP clean things up if the target doesn't like it. Previous patches in this series: http://reviews.llvm.org/D12882 http://reviews.llvm.org/D13297 D13297 should catch most expensive ops, but speculation of cttz/ctlz requires special handling because of weirdness in the intrinsic definition for handling a zero input (that definition can probably be blamed on x86). For example, if we have the usual speculated-by-select expensive op pattern like this: %tobool = icmp eq i64 %A, 0 %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 true) ; is_zero_undef == true %cond = select i1 %tobool, i64 64, i64 %0 ret i64 %cond There's an instcombine that will turn it into: %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 false) ; is_zero_undef == false This CGP patch is looking for that case and despeculating it back into: entry: %tobool = icmp eq i64 %A, 0 br i1 %tobool, label %cond.end, label %cond.true cond.true: %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 true) ; is_zero_undef == true br label %cond.end cond.end: %cond = phi i64 [ %0, %cond.true ], [ 64, %entry ] ret i64 %cond This unfortunately may lead to poorer codegen (see the changes in the existing x86 test), but if we increase speculation in SimplifyCFG (the next step in this patch series), then we should avoid those kinds of cases in the first place. The need for this patch was originally mentioned here: http://reviews.llvm.org/D7506 with follow-up here: http://reviews.llvm.org/D7554 Differential Revision: http://reviews.llvm.org/D14630 llvm-svn: 253573	2015-11-19 16:37:10 +00:00
Hans Wennborg	1ead7346cd	X86: More efficient legalization of wide integer compares In particular, this makes the code for 64-bit compares on 32-bit targets much more efficient. Example: define i32 @test_slt(i64 %a, i64 %b) { entry: %cmp = icmp slt i64 %a, %b br i1 %cmp, label %bb1, label %bb2 bb1: ret i32 1 bb2: ret i32 2 } Before this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax setae %al cmpl 16(%esp), %ecx setge %cl je .LBB2_2 movb %cl, %al .LBB2_2: testb %al, %al jne .LBB2_4 movl $1, %eax retl .LBB2_4: movl $2, %eax retl After this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax sbbl 16(%esp), %ecx jge .LBB1_2 movl $1, %eax retl .LBB1_2: movl $2, %eax retl Differential Revision: http://reviews.llvm.org/D14496 llvm-svn: 253572	2015-11-19 16:35:08 +00:00
NAKAMURA Takumi	d16e26a317	TargetParser.cpp: Fixup -- StringRef::startswith() is better here. NFC. llvm-svn: 253570	2015-11-19 15:42:52 +00:00
Mehdi Amini	4a5d6965a4	Remove unused function parameter (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253569	2015-11-19 15:42:34 +00:00
Diego Novillo	0b8eea8df5	SamplePGO - Sort samples by source location when emitting as text. When dumping function samples or writing them out as text format, it helps if the samples are emitted sorted by source location. The sorting of the maps is a bit slow, so we only do it on demand. llvm-svn: 253568	2015-11-19 15:33:08 +00:00
Zoran Jovanovic	c71feceb04	[mips] Add tests for ROL and ROR macros expansion Author: obucina llvm-svn: 253567	2015-11-19 15:04:31 +00:00
NAKAMURA Takumi	eb19f391e2	llvm/lib/Support/TargetParser.cpp: Rework llvm::ARM::getArchExtFeature() to avoid abuse of Twine in r253470. llvm-svn: 253566	2015-11-19 15:03:11 +00:00
Chad Rosier	7d466a8b34	[LV] Add a helper function, isReductionVariable. NFC. llvm-svn: 253565	2015-11-19 14:19:06 +00:00
Zoran Jovanovic	e136ae56de	[mips] Expansion of ROL and ROR macros Author: obucina Reviewers: dsanders Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D10611 llvm-svn: 253564	2015-11-19 14:15:03 +00:00
Elena Demikhovsky	6aa44f30d0	AVX-512: Fixed COPY_TO_REGCLASS for mask registers Copying one mask register to another under BW should be done with kmovq instruction, otherwise we can loose some bits. Copying 8 bits under DQ may be done with kmovb. Differential Revision: http://reviews.llvm.org/D14812 llvm-svn: 253563	2015-11-19 13:13:00 +00:00
Artyom Skrobov	1dce73e2bc	Removing specific target from the generic test llvm-svn: 253562	2015-11-19 12:24:47 +00:00
Simon Pilgrim	015080a582	[X86][AVX] Fix lowering of X86ISD::VZEXT_MOVL for 128-bit -> 256-bit extension The lowering patterns for X86ISD::VZEXT_MOVL for 128-bit to 256-bit vectors were just copying the lower xmm instead of actually masking off the first scalar using a blend. Fix for PR25320. Differential Revision: http://reviews.llvm.org/D14151 llvm-svn: 253561	2015-11-19 12:18:37 +00:00
Alexey Bataev	3304453808	Alternative to long nops for X86 CPUs, by Andrey Turetsky Make X86AsmBackend generate smarter nops instead of a bunch of 0x90 for code alignment for CPUs which don't support long nop instructions. Differential Revision: http://reviews.llvm.org/D14178 llvm-svn: 253557	2015-11-19 11:44:35 +00:00
Dan Liew	db929f853a	[lit] Fix bug when using Python3 where a failing test would not show the script when running a ShTest with an external or internal shell. This bug is caused by use of the ``map`` function in Python 3 which returns an iterable (rather than a list in Python 2). After the iterable is exhausted it won't return any more output and consequently when ``_runShTest()`` tries to access the ``script`` which has already been iterated over it is empty. Converting to a list immediatley after calling ``map()`` fixes this. This fixes the ``tests/shtest-format.py`` test when running under Python3 which was previously failing. llvm-svn: 253556	2015-11-19 11:35:42 +00:00
NAKAMURA Takumi	ce69d484e2	gold-plugin.cpp: Fix diagnosticHandler corresponding to r253540. llvm-svn: 253553	2015-11-19 10:43:44 +00:00
James Molloy	b585b0aee8	[FunctionAttrs] Provide a mechanism for adding function attributes from the command line This provides a way to force a function to have certain attributes from the command line. This can be useful when debugging or doing workload exploration, where manually editing IR is tedious or not possible (due to build systems etc). The syntax is -force-attribute=function_name:attribute_name All function attributes are parsed except alignstack as it requires an argument. llvm-svn: 253550	2015-11-19 08:49:57 +00:00
Igor Breger	0a68600909	AVX512: Implemented encoding, intrinsics and DAG lowering for VMOVDDUP instructions. Differential Revision: http://reviews.llvm.org/D14702 llvm-svn: 253548	2015-11-19 08:26:56 +00:00
Igor Breger	fb07c48ef1	AVX512: Implemented encoding for the vmovss.s and vmovsd.s instructions. Differential Revision: http://reviews.llvm.org/D14771 llvm-svn: 253547	2015-11-19 07:58:33 +00:00
Igor Breger	8a3c708d1f	AVX512: Implemented encoding for the follow instructions. vmovapd.s, vmovaps.s, vmovdqa32.s, vmovdqa64.s, vmovdqu16.s, vmovdqu32.s, vmovdqu64.s, vmovdqu8.s, vmovupd.s, vmovups.s Differential Revision: http://reviews.llvm.org/D14768 llvm-svn: 253546	2015-11-19 07:43:43 +00:00
Elena Demikhovsky	fea4d52acf	Pointers in Masked Load, Store, Gather, Scatter intrinsics The masked intrinsics support all integer and floating point data types. I added the pointer type to this list. Added tests for CodeGen and for Loop Vectorizer. Updated the Language Reference. Differential Revision: http://reviews.llvm.org/D14150 llvm-svn: 253544	2015-11-19 07:17:16 +00:00
Pete Cooper	b753649d63	Revert "Change memcpy/memset/memmove to have dest and source alignments." This reverts commit r253511. This likely broke the bots in http://lab.llvm.org:8011/builders/clang-ppc64-elf-linux2/builds/20202 http://bb.pgr.jp/builders/clang-3stage-i686-linux/builds/3787 llvm-svn: 253543	2015-11-19 05:56:52 +00:00
Mehdi Amini	b5fccc4f2e	Do not require a Context to extract the FunctionIndex from Bitcode (NFC) The LLVMContext was only used for Diagnostic. Pass a DiagnosticHandler instead. Differential Revision: http://reviews.llvm.org/D14794 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253540	2015-11-19 05:52:29 +00:00
Nathan Slingerland	99b48e24dd	[Support] Disable SaturatingMultiply() unit test while investigating Ubsan detected undefined behavior in the MathExtras SaturatingMultiply test. This change disables the test while it is being investigated. llvm-svn: 253539	2015-11-19 05:20:17 +00:00
Weiming Zhao	081f41bef8	Fix bug 25440: GVN assertion after coercing loads Optimizations like LoadPRE in GVN will insert new instructions. If the insertion point is in a already processed BB, they should get a value number explicitly. If the insertion point is after current instruction, then just leave it. However, current GVN framework has no support for it. In this patch, we just bail out if a VN can't be found. Dfferential Revision: http://reviews.llvm.org/D14670 A test/Transforms/GVN/pr25440.ll M lib/Transforms/Scalar/GVN.cpp llvm-svn: 253536	2015-11-19 02:45:18 +00:00
Reid Kleckner	c49dc4bb0d	Fix UMR in llvm-nm on IR object files in printDarwinSymbol llvm-svn: 253529	2015-11-19 00:51:50 +00:00
Quentin Colombet	2d801721a4	[X86] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14156 rdar://problem/21118279 llvm-svn: 253528	2015-11-19 00:38:00 +00:00
Cong Hou	7900e5e76e	Fix several long lines (>80) in LoopVectorize.cpp. NFC. llvm-svn: 253527	2015-11-19 00:32:30 +00:00
Reid Kleckner	131251d95d	Don't search for third party libraries while using MSan On the average user's system, those libraries will not be compiled with MSan. Prior to this change, the LLVM test suite was full of false positives from calls from third party libraries to MSan interceptors like strlen. We can remove this check if MSan ever grows a suppression mechanism similar to TSan's. llvm-svn: 253526	2015-11-19 00:05:21 +00:00
Reid Kleckner	d3fd331575	Disable Go bindings test with MSan, it has tons of linker errors llvm-svn: 253525	2015-11-19 00:05:20 +00:00
Reid Kleckner	709b4c44da	Initialize PersistentId for HandleSDNode, as these will never be inserted into the DAG llvm-svn: 253524	2015-11-19 00:05:09 +00:00
Davide Italiano	4a84641b2a	[SimplifyLibCalls] New trick: pow(x, 0.5) -> sqrt(x) under -ffast-math. Differential Revision: http://reviews.llvm.org/D14466 llvm-svn: 253521	2015-11-18 23:21:32 +00:00
Quentin Colombet	bab97ab25b	[AArch64] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14360 rdar://problem/20820748 llvm-svn: 253520	2015-11-18 23:12:20 +00:00
Mehdi Amini	c848f65f7c	Fix returned value for GVN: could return "false" even after modifying the IR This bug would manifest in some very specific cases where all the following conditions are fullfilled: - GVN didn't remove block - The regular GVN iteration didn't change the IR - PRE is enabled - PRE will not split critical edge - The last instruction processed by PRE didn't change the IR Because the CallGraph PassManager relies on this returned value to decide if it needs to recompute a node after the execution of Function passes, not returning the right value can lead to unexpected results. Fix for: https://llvm.org/bugs/show_bug.cgi?id=24715 Patch by Wenxiang Qiu <vincentqiuuu@gmail.com> From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253518	2015-11-18 22:49:49 +00:00
Chris Bieneman	76a5e53fc4	[CMake] Support -fvisibility-inlines-hidden when LLVM_ENABLE_PIC=Off I'm unaware of any reasons why -fvisibility-inlines-hidden would depend on PIC, and since autoconf supports this flag without PIC, we should support it in CMake too. llvm-svn: 253517	2015-11-18 22:49:26 +00:00
Xinliang David Li	d80e9e19b2	Minor cleanups (from review feedback) 1. remove uneeded header inclusion 2. use reinterpret_cast instead of c ctyle 3. other format change llvm-svn: 253515	2015-11-18 22:42:27 +00:00
Davide Italiano	d84ba23a15	[BuildLibCalls] EmitStrNLen() is dead code. Garbage collect. llvm-svn: 253514	2015-11-18 22:29:38 +00:00
Pete Cooper	aca4c5cdc6	Change memcpy/memset/memmove to have dest and source alignments. Note, this was reviewed (and more details are in) http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html These intrinsics currently have an explicit alignment argument which is required to be a constant integer. It represents the alignment of the source and dest, and so must be the minimum of those. This change allows source and dest to each have their own alignments by using the alignment attribute on their arguments. The alignment argument itself is removed. There are a few places in the code for which the code needs to be checked by an expert as to whether using only src/dest alignment is safe. For those places, they currently take the minimum of src/dest alignments which matches the current behaviour. For example, code which used to read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* %dest, i8* %src, i32 500, i32 8, i1 false) will now read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 8 %dest, i8* align 8 %src, i32 500, i1 false) For out of tree owners, I was able to strip alignment from calls using sed by replacing: (call.llvm\.memset.)i32\ [0-9]\,\ i1 false\) with: $1i1 false) and similarly for memmove and memcpy. I then added back in alignment to test cases which needed it. A similar commit will be made to clang which actually has many differences in alignment as now IRBuilder can generate different source/dest alignments on calls. In IRBuilder itself, a new argument was added. Instead of calling: CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, / isVolatile / false) you now call CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, SrcAlign, / isVolatile */ false) There is a temporary class (IntegerAlignment) which takes the source alignment and rejects implicit conversion from bool. This is to prevent isVolatile here from passing its default parameter to the source alignment. Note, changes in future can now be made to codegen. I didn't change anything here, but this change should enable better memcpy code sequences. Reviewed by Hal Finkel. llvm-svn: 253511	2015-11-18 22:17:24 +00:00
Jingyue Wu	acb193e196	[doc] fix a wrong link llvm-svn: 253509	2015-11-18 22:01:44 +00:00
Simon Pilgrim	8ddf1acb2a	[DAGCombiner] Vector constant folding for comparisons This patch adds support for vector constant folding of integer/float comparisons. This requires FoldConstantVectorArithmetic to support scalar constant operands (in this case ISD::CONDCASE). In future we should be able to support other scalar constant types as necessary (and possibly start calling FoldConstantVectorArithmetic for all node creations) Differential Revision: http://reviews.llvm.org/D14683 llvm-svn: 253504	2015-11-18 21:17:19 +00:00
Tim Northover	cf7a7cf40d	ARM: make sure backend is consistent about exception handling method. It turns out we decide whether to use SjLj exceptions or some alternative in two separate places in the backend, and they disagreed with each other. This led to inconsistent code and is generally a terrible idea. So make them consistent and add an assert that they do match (unfortunately MCAsmInfo isn't available in opt, so it can't be used to initialise the CodeGen version directly). llvm-svn: 253502	2015-11-18 21:10:39 +00:00

... 3 4 5 6 7 ...

124170 Commits