llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-16 16:16:45 +00:00

Author	SHA1	Message	Date
Adam Nemet	134480598b	[AVX512] Clean up avx512_perm_imm to use X86VectorVTInfo No functionality change. No change in X86.td.expanded except that we only set the CD8 attributes for the memory variants. (This shouldn't be used unless we have a memory operand.) llvm-svn: 220736	2014-10-27 23:08:37 +00:00
Adam Nemet	8cf84f2568	[AVX512] Derive vpermil* from avx512_perm_imm This used to derive from avx512_pshuf_imm which is confusing. NFC. Compared X86.td.expanded. llvm-svn: 220735	2014-10-27 23:08:34 +00:00
Adam Nemet	78db8293e5	[AVX512] Fix copy-and-paste bugs in vpermil 1) i512mem -> f512mem (this is the packed FP input being permuted) 2) element size is 64 bits in EVEX_CD8 for PD. (A good illustration why X86VectorVTInfo is useful) llvm-svn: 220734	2014-10-27 23:08:31 +00:00
Rafael Espindola	5655c1ba55	Make it easier to pass a custom diagnostic handler to the IR linker. llvm-svn: 220732	2014-10-27 23:02:10 +00:00
Pete Cooper	01b2132972	Fix a stackmap bug introduced in r220710. For a call to not return in to the stackmap shadow, the shadow must end with the call. To do this, we must insert any required nops before the call, and not after it. llvm-svn: 220728	2014-10-27 22:38:45 +00:00
Rafael Espindola	01a14a8846	Fix bug where sys::Wait could wait on wrong pid. Setting ChildPid to -1 would cause waitpid to wait for any child process. Patch by Daniel Reynaud! llvm-svn: 220717	2014-10-27 20:30:04 +00:00
Juergen Ributzka	b3117b0b86	[FastISel][AArch64] Emit immediate version of icmp (subs) for null pointer check. This is a minor change to use the immediate version when the operand is a null value. This should get rid of an unnecessary 'mov' instruction in debug builds and align the code more with the one generated by SelectionDAG. This fixes rdar://problem/18785125. llvm-svn: 220713	2014-10-27 19:58:36 +00:00
Juergen Ributzka	76c57b0570	[FastISel][AArch64] Optimize compare-and-branch for i1 to use 'tbz'. Minor enhancement to use 'tbz' for i1 compare-and-branch to get rid of an 'and' instruction. This fixes rdar://problem/18784953. llvm-svn: 220712	2014-10-27 19:46:23 +00:00
Pete Cooper	87efb91a50	Stackmap shadows should consider call returns a branch target. To avoid emitting too many nops, a stackmap shadow can include emitted instructions in the shadow, but these must not include branch targets. A return from a call should count as a branch target as patching over the instructions after the call would lead to incorrect behaviour for threads currently making that call, when they return. llvm-svn: 220710	2014-10-27 19:40:35 +00:00
Juergen Ributzka	3423c5ccda	[FastISel][AArch64] Use 'cbz' also for null values (pointers). The pattern matching for a 'ConstantInt' value was too restrictive. Checking for a 'Constant' with a bull value is sufficient for using an 'cbz/cbnz' instruction. This fixes rdar://problem/18784732. llvm-svn: 220709	2014-10-27 19:38:05 +00:00
Juergen Ributzka	64c2c99226	[FastISel][AArch64] Don't fold the 'and' instruction into the 'tbz/tbnz' instruction if it is in a different basic block. This fixes a bug where the input register was not defined for the 'tbz/tbnz' instruction. This happened, because we folded the 'and' instruction from a different basic block. This fixes rdar://problem/18784013. llvm-svn: 220704	2014-10-27 19:16:48 +00:00
Juergen Ributzka	57783726dd	[FastISel][AArch64] Fix load/store with frame indices. At higher optimization levels the LLVM IR may contain more complex patterns for loads/stores from/to frame indices. The 'computeAddress' function wasn't able to handle this and triggered an assertion. This fix extends the possible addressing modes for frame indices. This fixes rdar://problem/18783298. llvm-svn: 220700	2014-10-27 18:21:58 +00:00
Kostya Serebryany	581ef503d5	[asan] experimental tracing for indirect calls, llvm part. llvm-svn: 220699	2014-10-27 18:13:56 +00:00
Lang Hames	77d387a954	[PBQP] Unique allowed-sets for nodes in the PBQP graph and use pairs of these sets as keys into a cache of interference matrice values in the Interference constraint adder. Creating interference matrices was one of the large remaining time-sinks in PBQP. Caching them reduces the total compile time (when using PBQP) on the nightly test suite by ~10%. llvm-svn: 220688	2014-10-27 17:44:25 +00:00
NAKAMURA Takumi	a1ef3346aa	Prune CRLF. llvm-svn: 220678	2014-10-27 12:37:26 +00:00
Oliver Stannard	9a595b2769	[ARM] Select VMAXNM and VMINNM regardless of operand order Currently, the ARM backend will select the VMAXNM and VMINNM for these C expressions: (a < b) ? a : b (a > b) ? a : b but not these expressions: (a > b) ? b : a (a < b) ? b : a This patch allows all of these expressions to be matched. llvm-svn: 220671	2014-10-27 09:23:02 +00:00
Yuri Gorshenin	0701551505	[asan-asm-instrumentation] Added comment describing how asm instrumentation works. Summary: [asan-asm-instrumentation] Added comment describing how asm instrumentation works. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5970 llvm-svn: 220670	2014-10-27 08:38:54 +00:00
NAKAMURA Takumi	f7d87a387c	Fix unicode chars into ascii in comment lines. llvm-svn: 220668	2014-10-27 08:08:18 +00:00
David Majnemer	61455bd9bc	InstCombine: Fix a combine assuming that icmp operands were integers An icmp may have pointer arguments, it isn't limited to integers or vectors of integers. This fixes PR21388. llvm-svn: 220664	2014-10-27 05:47:49 +00:00
Rafael Espindola	1b11b02e09	LinkModules.cpp: don't repeat names in comments. llvm-svn: 220662	2014-10-27 02:35:46 +00:00
David Blaikie	5a24603108	Remove some unnecessary casts. llvm-svn: 220658	2014-10-26 23:37:04 +00:00
Arnold Schwaighofer	574ccf8f46	Add an option to the LTO code generator to disable vectorization during LTO We used to always vectorize (slp and loop vectorize) in the LTO pass pipeline. r220345 changed it so that we used the PassManager's fields 'LoopVectorize' and 'SLPVectorize' out of the desire to be able to disable vectorization using the cl::opt flags 'vectorize-loops'/'slp-vectorize' which the before mentioned fields default to. Unfortunately, this turns off vectorization because those fields default to false. This commit adds flags to the LTO library to disable lto vectorization which reconciles the desire to optionally disable vectorization during LTO and the desired behavior of defaulting to enabled vectorization. We really want tools to set PassManager flags directly to enable/disable vectorization and not go the route via cl::opt flags in PassManagerBuilder.cpp. llvm-svn: 220652	2014-10-26 21:50:58 +00:00
Elena Demikhovsky	467839f2f5	AVX-512: Fixed encoding of VPBROADCASTM and added SKX forms of this instruction llvm-svn: 220638	2014-10-26 09:52:24 +00:00
Andrew Trick	6aca626b5c	LSR: Minor cleanup after Daniel's patch. Combine the Inserted an Done sets into a Visited set. llvm-svn: 220623	2014-10-25 19:59:30 +00:00
Andrew Trick	b8b40e0564	Fix LSR compile time. This is a simple fix that brings the compilation time from 5min to 5s on a specific real-world example. It's a large chain of computation in a crypto routine (always a problem for SCEV). A unit test is not feasible and there would be no way to check it. The fix is just basic good practice for dealing with SCEVs, there's no risk of regression. Patch by Daniel Reynaud! llvm-svn: 220622	2014-10-25 19:42:07 +00:00
Jingyue Wu	23067c1fec	[SeparateConstOffsetFromGEP] Fixed a bug related to unsigned modulo The dividend in "signed % unsigned" is treated as unsigned instead of signed, causing unexpected behavior such as -64 % (uint64_t)24 == 0. Added a regression test in split-gep.ll Patched by Hao Liu. llvm-svn: 220618	2014-10-25 18:34:03 +00:00
Benjamin Kramer	d9a51baf8c	Clean up assume intrinsic pattern matching, no need to check that the argument is a value. Also make it const safe and remove superfluous casting. NFC. llvm-svn: 220616	2014-10-25 18:09:01 +00:00
Jingyue Wu	9c957b2e8e	[SeparateConstOffsetFromGEP] Fixed a bug in rebuilding OR expressions The two operands of the new OR expression should be NextInChain and TheOther instead of the two original operands. Added a regression test in split-gep.ll. Hao Liu reported this bug, and provded the test case and an initial patch. Thanks! llvm-svn: 220615	2014-10-25 17:36:21 +00:00
Simon Pilgrim	c20b3d1fdd	[X86][SSE] Vector integer/float conversion memory folding Tidied up some entries in the folding tables so that they are under the correct comment section (they were categorised as AVX2 instructions when they're AVX1). Minor patch agreed with qcolombet. llvm-svn: 220613	2014-10-25 08:11:20 +00:00
David Majnemer	c9904e0a8e	InstCombine: Remove overzealous asserts These asserts can trigger if the worklist iteration order is sufficiently unlucky. Instead of adding special case logic to handle these edge conditions, just bail out on trying to transform them: InstSimplify will get them when it reaches them on the worklist. This fixes PR21378. N.B. No test case is included because any test would rely on the fragile worklist iteration order. llvm-svn: 220612	2014-10-25 07:13:13 +00:00
Rafael Espindola	0d61f7182e	Allow the C API users to keep relying on the OutMessages parameter. Should fix the Ocaml tests. llvm-svn: 220611	2014-10-25 04:31:08 +00:00
Rafael Espindola	67b4b608c2	Update the error handling of lib/Linker. Instead of passing a std::string&, use the new diagnostic infrastructure. llvm-svn: 220608	2014-10-25 04:06:10 +00:00
Jingyue Wu	376aaf44c4	[NVPTX] aligned byte-buffers for vector return types Summary: Fixes PR21100 which is caused by inconsistency between the declared return type and the expected return type at the call site. The new behavior is consistent with nvcc and the NVPTXTargetLowering::getPrototype function. Test Plan: test/Codegen/NVPTX/vector-return.ll Reviewers: jholewinski Reviewed By: jholewinski Subscribers: llvm-commits, meheff, eliben, jholewinski Differential Revision: http://reviews.llvm.org/D5612 llvm-svn: 220607	2014-10-25 03:46:16 +00:00
Evgeniy Stepanov	4f152d1382	[msan] Make -msan-check-constant-shadow a bit stronger. Allow (under the experimental flag) non-Instructions to participate in MSan checks. llvm-svn: 220601	2014-10-24 23:34:15 +00:00
Rafael Espindola	ee06e286d8	Modernize the error handling of the Materialize function. llvm-svn: 220600	2014-10-24 22:50:48 +00:00
Kevin Enderby	9e33867d25	Fix a Mach-O assembler segfault for a subtraction expression with an undefined symbol. In a Mach-O object file a relocatable expression of the form SymbolA - SymbolB + constant is allowed when both symbols are defined in a section. But when either symbol is undefined it is an error. The code was crashing when it had an undefined symbol in this case. And should have printed a error message using the location information in the relocation entry. rdar://18678402 llvm-svn: 220599	2014-10-24 22:39:40 +00:00
Frederic Riss	1a2ce34071	Sink DwarfUnit::constructImportedEntityDIE into DwarfCompileUnit. So that it has access to getOrCreateGlobalVariableDIE. If we ever support decsribing using directive in C++ classes (thus requiring support in type units), it will certainly use another mechanism anyway. Differential Revision: http://reviews.llvm.org/D5975 llvm-svn: 220594	2014-10-24 21:31:09 +00:00
Simon Pilgrim	29b3d266a4	[X86][SSE] Bitcast assertion in XFormVExtractWithShuffleIntoLoad Minor patch to fix an issue in XFormVExtractWithShuffleIntoLoad where a load is unary shuffled, then bitcast (to a type with the same number of elements) before extracting an element. An undef was created for the second shuffle operand using the original (post-bitcasted) vector type instead of the pre-bitcasted type like the rest of the shuffle node - this was then causing an assertion on the different types later on inside SelectionDAG::getVectorShuffle. Differential Revision: http://reviews.llvm.org/D5917 llvm-svn: 220592	2014-10-24 21:04:41 +00:00
Colin LeMahieu	a5b4b06680	[Hexagon] Resubmission of 220427 Modified library structure to deal with circular dependency between HexagonInstPrinter and HexagonMCInst. Adding encoding bits for add opcode. Adding llvm-mc tests. Removing unit tests. http://reviews.llvm.org/D5624 llvm-svn: 220584	2014-10-24 19:00:32 +00:00
Matt Arsenault	cef46eb164	Fix copy paste comment llvm-svn: 220581	2014-10-24 18:13:10 +00:00
Rafael Espindola	e4018b9baa	Don't ever call materializeAllPermanently during LTO. To do this, change the representation of lazy loaded functions. The previous representation cannot differentiate between a function whose body has been removed and one whose body hasn't been read from the .bc file. That means that in order to drop a function, the entire body had to be read. llvm-svn: 220580	2014-10-24 18:13:04 +00:00
Sanjay Patel	4e42d925c1	Allow AVX vrsqrtps generation. This is a follow-on to r220570 that allows a 256-bit (v8f32) version of vrsqrtps to be generated. llvm-svn: 220579	2014-10-24 17:59:18 +00:00
David Blaikie	21ab861fa1	DebugInfo: Sink DwarfDebug::ScopeVariables down into DwarfFile (part of refactoring to allow subprogram emission in both the skeleton and main units to enable -gmlt-like data to be included in the skeleton for live inlined backtracing purposes) llvm-svn: 220578	2014-10-24 17:57:34 +00:00
David Blaikie	1054961712	Remove DwarfDebug::FirstCU as it has no use It was only being used as a flag to identify the lack of debug info from within endModule - use the section labels for that instead. llvm-svn: 220575	2014-10-24 17:53:38 +00:00
Sanjay Patel	d9b7837012	Use rsqrt (X86) to speed up reciprocal square root calcs This is a first step for generating SSE rsqrt instructions for reciprocal square root calcs when fast-math is allowed. For now, be conservative and only enable this for AMD btver2 where performance improves significantly - for example, 29% on llvm/projects/test-suite/SingleSource/Benchmarks/BenchmarkGame/n-body.c (if we convert the data type to single-precision float). This patch adds a two constant version of the Newton-Raphson refinement algorithm to DAGCombiner that can be selected by any target via a parameter returned by getRsqrtEstimate().. See PR20900 for more details: http://llvm.org/bugs/show_bug.cgi?id=20900 Differential Revision: http://reviews.llvm.org/D5658 llvm-svn: 220570	2014-10-24 17:02:16 +00:00
Daniel Sanders	3747982b4e	[mips] Replace MipsABIEnum with a MipsABIInfo class. Summary: No functional change yet, it's just an object replacement for an enum. It will allow us to gather ABI information in a single place so that we can start testing for properties of the ABI's instead of the ABI itself. For example we will eventually be able to use: ABI.MinStackAlignmentInBytes() instead of: (isABI_N32() \|\| isABI_N64()) ? 16 : 8 which is clearer and more maintainable. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://reviews.llvm.org/D3341 llvm-svn: 220568	2014-10-24 16:15:27 +00:00
Benjamin Kramer	f5c5d61947	[Object] Fix MachO's getUuid to return a pointer into the object instead of a dangling ArrayRef. This works because uuid's are always little endian so it's not swapped. Fixes use-after-return reported by asan. llvm-svn: 220567	2014-10-24 15:52:05 +00:00
Daniel Sanders	656a7ed742	[mips] Fix >80-column line llvm-svn: 220564	2014-10-24 14:46:00 +00:00
Daniel Sanders	5f05e5e050	[mips] Remove redundant code in RetCC_MipsN. NFC. Summary: i32 is always promoted to i64 so it no longer makes sense to assign i32 to registers. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5964 llvm-svn: 220561	2014-10-24 13:49:54 +00:00
Daniel Sanders	d767694dc4	[mips] For N32/N64, structs must be passed in the upper bits of a register. Summary: Most structs were fixed by r218451 but those of between >32-bits and <64-bits remained broken since they were not marked with [ASZ]ExtUpper. This patch fixes the remaining cases by using CCPromoteToUpperBitsInType<i64> on i64's in addition to i32 and smaller. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5963 llvm-svn: 220556	2014-10-24 13:09:19 +00:00

1 2 3 4 5 ...

73772 Commits