llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-14 17:28:53 +00:00

Author	SHA1	Message	Date
Peter Zotov	23a2524793	[C API] PR19859: Add functions to query and modify branches. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220817	2014-10-28 19:46:56 +00:00
Peter Zotov	711b85ab7d	[OCaml] PR19859: Add tests for reading the values of numeric constants. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220816	2014-10-28 19:46:52 +00:00
Peter Zotov	927677ad96	[OCaml] PR19859: Add Llvm.{fcmp_predicate,float_of_const}. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220815	2014-10-28 19:46:48 +00:00
Peter Zotov	e9d029fa22	[C API] PR19859: Add LLVMGetFCmpPredicate and LLVMConstRealGetDouble. Patch by Gabriel Radanne <drupyog@zoho.com>. llvm-svn: 220814	2014-10-28 19:46:44 +00:00
Saleem Abdulrasool	c568d1c120	Transforms: reapply SVN r219899 This restores the commit from SVN r219899 with an additional change to ensure that the CodeGen is correct for the case that was identified as being incorrect (originally PR7272). In the case that during inlining we need to synthesize a value on the stack (i.e. for passing a value byval), then any function involving that alloca must be stripped of its tailness as the restriction that it does not access the parent's stack no longer holds. Unfortunately, a single alloca can cause a rippling effect through out the inlining as the value may be aliased or may be mutated through an escaped external call. As such, we simply track if an alloca has been introduced in the frame during inlining, and strip any tail calls. llvm-svn: 220811	2014-10-28 18:27:37 +00:00
Robert Khasanov	4bea47e6eb	[AVX512] Fix VSQRT packed instructions internal names. No functional change llvm-svn: 220808	2014-10-28 18:22:41 +00:00
Robert Khasanov	2ca56ad410	[AVX512] Extended avx512_sqrt_packed (sqrt instructions) to VL subset. Refactored through AVX512_maskable llvm-svn: 220806	2014-10-28 18:15:20 +00:00
Robert Khasanov	d134194df8	[AVX-512] Expanded rsqrt/rcp instructions to VL subset. Refactored multiclass through AVX512_maskable llvm-svn: 220783	2014-10-28 16:37:13 +00:00
Robert Khasanov	2b3eb6af20	[AVX512] Removed special case for cmp instructions in getVectorMaskingNode. Now cmp intrinsics lower as other intrinsics through VSELECT, and then VSELECT tranforms to AND in PerformSELECTCombine. No functional change. llvm-svn: 220779	2014-10-28 16:17:14 +00:00
Robert Khasanov	fa811056f3	[x86] Simplify vector selection if condition value type matches vselect value type and true value is all ones or false value is all zeros. This transformation worked if selector is produced by SETCC, however SETCC is needed only if we consider to swap operands. So I replaced SETCC check for this case. Added tests for vselect of <X x i1> values. llvm-svn: 220777	2014-10-28 15:59:40 +00:00
Aaron Ballman	7d8105e34e	Silencing an "enumeral and non-enumeral type in conditional expression" warning; NFC. llvm-svn: 220775	2014-10-28 13:12:13 +00:00
Robert Khasanov	a87de33c61	[AVX512] Bring back vector-shuffle lowering support through broadcasts Ffter commit at rev219046 512-bit broadcasts lowering become non-optimal. Most of tests on broadcasting and embedded broadcasting were changed and they doesn’t produce efficient code. Example below is from commit changes (it’s the first test from test/CodeGen/X86/avx512-vbroadcast.ll): define <16 x i32> @_inreg16xi32(i32 %a) { ; CHECK-LABEL: _inreg16xi32: ; CHECK: ## BB#0: -; CHECK-NEXT: vpbroadcastd %edi, %zmm0 +; CHECK-NEXT: vmovd %edi, %xmm0 +; CHECK-NEXT: vpbroadcastd %xmm0, %ymm0 +; CHECK-NEXT: vinserti64x4 $1, %ymm0, %zmm0, %zmm0 ; CHECK-NEXT: retq %b = insertelement <16 x i32> undef, i32 %a, i32 0 %c = shufflevector <16 x i32> %b, <16 x i32> undef, <16 x i32> zeroinitializer ret <16 x i32> %c } Here, 256-bit broadcast was generated instead of 512-bit one. In this patch 1) I added vector-shuffle lowering through broadcasts 2) Removed asserts and branches likes because this is incorrect - assert(Subtarget->hasDQI() && "We can only lower v8i64 with AVX-512-DQI"); 3) Fixed lowering tests llvm-svn: 220774	2014-10-28 12:28:51 +00:00
NAKAMURA Takumi	e24697a037	Reformat partially, where I touched for whitespace changes. llvm-svn: 220773	2014-10-28 11:54:52 +00:00
NAKAMURA Takumi	1e3f8944dc	LoopRerollPass.cpp: Use range-based loop. NFC. llvm-svn: 220772	2014-10-28 11:54:05 +00:00
NAKAMURA Takumi	f2d570a79b	Untabify and whitespace cleanups. llvm-svn: 220771	2014-10-28 11:53:30 +00:00
Peter Zotov	da5425e252	[OCaml] Enable -g for debug builds. We don't care about pre-3.12.1 anymore. llvm-svn: 220767	2014-10-28 06:15:41 +00:00
Peter Zotov	41625b6878	[OCaml] Fix whitespace. llvm-svn: 220766	2014-10-28 06:15:18 +00:00
David Blaikie	35385231e3	Minimize the scope of some variables, NFC. llvm-svn: 220759	2014-10-28 02:57:26 +00:00
Reid Kleckner	af3046bd9e	X86: Implement the vectorcall calling convention This is a Microsoft calling convention that supports both x86 and x86_64 subtargets. It passes vector and floating point arguments in XMM0-XMM5, and passes them indirectly once they are consumed. Homogenous vector aggregates of up to four elements can be passed in sequential vector registers, but this part is not implemented in LLVM and will be handled in Clang. On 32-bit x86, it is similar to fastcall in that it uses ecx:edx as integer register parameters and is callee cleanup. On x86_64, it delegates to the normal win64 calling convention. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D5943 llvm-svn: 220745	2014-10-28 01:29:26 +00:00
Tim Northover	136fb499eb	AArch64: enable Cortex-A57 FP balancing on Cortex-A53. Benchmarks have shown that it's harmless to the performance there, and having a unified set of passes between the two cores where possible helps big.LITTLE deployment. Patch by Z. Zheng. llvm-svn: 220744	2014-10-28 01:24:32 +00:00
Rafael Espindola	fc1860dc1f	Remove the PreserveSource linker mode. I noticed that it was untested, and forcing it on caused some tests to fail: LLVM :: Linker/metadata-a.ll LLVM :: Linker/prefixdata.ll LLVM :: Linker/type-unique-odr-a.ll LLVM :: Linker/type-unique-simple-a.ll LLVM :: Linker/type-unique-simple2-a.ll LLVM :: Linker/type-unique-simple2.ll LLVM :: Linker/type-unique-type-array-a.ll LLVM :: Linker/unnamed-addr1-a.ll LLVM :: Linker/visibility1.ll If it is to be resurrected, it has to be fixed and we should probably have a -preserve-source command line option in llvm-mc and run tests with and without it. llvm-svn: 220741	2014-10-28 00:24:16 +00:00
NAKAMURA Takumi	231973f544	AArch64InstrInfo.h: Fix a warning introduced in clang r220703. [-Winconsistent-missing-override] llvm-svn: 220739	2014-10-27 23:29:27 +00:00
Adam Nemet	f35a0bba1b	[AVX512] Add vpermil variable version This is implemented via a multiclass that derives from the vperm imm multiclass. Fixes <rdar://problem/18426089> llvm-svn: 220737	2014-10-27 23:08:40 +00:00
Adam Nemet	134480598b	[AVX512] Clean up avx512_perm_imm to use X86VectorVTInfo No functionality change. No change in X86.td.expanded except that we only set the CD8 attributes for the memory variants. (This shouldn't be used unless we have a memory operand.) llvm-svn: 220736	2014-10-27 23:08:37 +00:00
Adam Nemet	8cf84f2568	[AVX512] Derive vpermil* from avx512_perm_imm This used to derive from avx512_pshuf_imm which is confusing. NFC. Compared X86.td.expanded. llvm-svn: 220735	2014-10-27 23:08:34 +00:00
Adam Nemet	78db8293e5	[AVX512] Fix copy-and-paste bugs in vpermil 1) i512mem -> f512mem (this is the packed FP input being permuted) 2) element size is 64 bits in EVEX_CD8 for PD. (A good illustration why X86VectorVTInfo is useful) llvm-svn: 220734	2014-10-27 23:08:31 +00:00
Rafael Espindola	5655c1ba55	Make it easier to pass a custom diagnostic handler to the IR linker. llvm-svn: 220732	2014-10-27 23:02:10 +00:00
Pete Cooper	01b2132972	Fix a stackmap bug introduced in r220710. For a call to not return in to the stackmap shadow, the shadow must end with the call. To do this, we must insert any required nops before the call, and not after it. llvm-svn: 220728	2014-10-27 22:38:45 +00:00
Jingyue Wu	72d50ea9d3	[ScalarEvolution] Guard dump() with #if to be consistent with its definition in ScalarEvolution.cpp llvm-svn: 220721	2014-10-27 21:14:41 +00:00
Rafael Espindola	01a14a8846	Fix bug where sys::Wait could wait on wrong pid. Setting ChildPid to -1 would cause waitpid to wait for any child process. Patch by Daniel Reynaud! llvm-svn: 220717	2014-10-27 20:30:04 +00:00
Juergen Ributzka	b3117b0b86	[FastISel][AArch64] Emit immediate version of icmp (subs) for null pointer check. This is a minor change to use the immediate version when the operand is a null value. This should get rid of an unnecessary 'mov' instruction in debug builds and align the code more with the one generated by SelectionDAG. This fixes rdar://problem/18785125. llvm-svn: 220713	2014-10-27 19:58:36 +00:00
Juergen Ributzka	76c57b0570	[FastISel][AArch64] Optimize compare-and-branch for i1 to use 'tbz'. Minor enhancement to use 'tbz' for i1 compare-and-branch to get rid of an 'and' instruction. This fixes rdar://problem/18784953. llvm-svn: 220712	2014-10-27 19:46:23 +00:00
Pete Cooper	87efb91a50	Stackmap shadows should consider call returns a branch target. To avoid emitting too many nops, a stackmap shadow can include emitted instructions in the shadow, but these must not include branch targets. A return from a call should count as a branch target as patching over the instructions after the call would lead to incorrect behaviour for threads currently making that call, when they return. llvm-svn: 220710	2014-10-27 19:40:35 +00:00
Juergen Ributzka	3423c5ccda	[FastISel][AArch64] Use 'cbz' also for null values (pointers). The pattern matching for a 'ConstantInt' value was too restrictive. Checking for a 'Constant' with a bull value is sufficient for using an 'cbz/cbnz' instruction. This fixes rdar://problem/18784732. llvm-svn: 220709	2014-10-27 19:38:05 +00:00
Juergen Ributzka	64c2c99226	[FastISel][AArch64] Don't fold the 'and' instruction into the 'tbz/tbnz' instruction if it is in a different basic block. This fixes a bug where the input register was not defined for the 'tbz/tbnz' instruction. This happened, because we folded the 'and' instruction from a different basic block. This fixes rdar://problem/18784013. llvm-svn: 220704	2014-10-27 19:16:48 +00:00
Juergen Ributzka	57783726dd	[FastISel][AArch64] Fix load/store with frame indices. At higher optimization levels the LLVM IR may contain more complex patterns for loads/stores from/to frame indices. The 'computeAddress' function wasn't able to handle this and triggered an assertion. This fix extends the possible addressing modes for frame indices. This fixes rdar://problem/18783298. llvm-svn: 220700	2014-10-27 18:21:58 +00:00
Kostya Serebryany	581ef503d5	[asan] experimental tracing for indirect calls, llvm part. llvm-svn: 220699	2014-10-27 18:13:56 +00:00
Lang Hames	45a17ea71e	[PBQP] Remove a spurious 'typename' keyword. This was causing an error on MSVC. llvm-svn: 220690	2014-10-27 17:59:51 +00:00
Lang Hames	6c2c2c8e3d	[PBQP] Clarify ambiguous-looking typedef. This was causing an error on the hexagon bots. llvm-svn: 220689	2014-10-27 17:52:05 +00:00
Lang Hames	77d387a954	[PBQP] Unique allowed-sets for nodes in the PBQP graph and use pairs of these sets as keys into a cache of interference matrice values in the Interference constraint adder. Creating interference matrices was one of the large remaining time-sinks in PBQP. Caching them reduces the total compile time (when using PBQP) on the nightly test suite by ~10%. llvm-svn: 220688	2014-10-27 17:44:25 +00:00
Michael Gottesman	d4b999b2da	Add MapVector::rbegin(), MapVector::rend() to completment MapVector::begin(), MapVector::end(). These just delegate to the underlying vector type in the MapVector. Also just add in some sanity unittests. llvm-svn: 220687	2014-10-27 17:20:53 +00:00
NAKAMURA Takumi	a1ef3346aa	Prune CRLF. llvm-svn: 220678	2014-10-27 12:37:26 +00:00
Oliver Stannard	9a595b2769	[ARM] Select VMAXNM and VMINNM regardless of operand order Currently, the ARM backend will select the VMAXNM and VMINNM for these C expressions: (a < b) ? a : b (a > b) ? a : b but not these expressions: (a > b) ? b : a (a < b) ? b : a This patch allows all of these expressions to be matched. llvm-svn: 220671	2014-10-27 09:23:02 +00:00
Yuri Gorshenin	0701551505	[asan-asm-instrumentation] Added comment describing how asm instrumentation works. Summary: [asan-asm-instrumentation] Added comment describing how asm instrumentation works. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5970 llvm-svn: 220670	2014-10-27 08:38:54 +00:00
Rui Ueyama	a1707d7277	Re-commit r220667. C++ source given to check_cxx_source_compile should have define "main". llvm-svn: 220669	2014-10-27 08:16:18 +00:00
NAKAMURA Takumi	f7d87a387c	Fix unicode chars into ascii in comment lines. llvm-svn: 220668	2014-10-27 08:08:18 +00:00
Rui Ueyama	f8270e4d8e	Revert "Include stddef.h before including cxxabi.h" to un-break buildbot This reverts commit r220665. llvm-svn: 220667	2014-10-27 08:03:21 +00:00
Rui Ueyama	82ca2f4718	Include stddef.h before including cxxabi.h On FreeBSD 10.0, size_t needs to be defined before including cxxabi.h. Currenty HAVE_CXXABI_H is not defined on FreeBSD because of that reason. This patch teaches cmake and configure how to include it. http://reviews.llvm.org/D5940 llvm-svn: 220665	2014-10-27 07:37:57 +00:00
David Majnemer	61455bd9bc	InstCombine: Fix a combine assuming that icmp operands were integers An icmp may have pointer arguments, it isn't limited to integers or vectors of integers. This fixes PR21388. llvm-svn: 220664	2014-10-27 05:47:49 +00:00
Rafael Espindola	1b11b02e09	LinkModules.cpp: don't repeat names in comments. llvm-svn: 220662	2014-10-27 02:35:46 +00:00

1 2 3 4 5 ...

109134 Commits