llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-11 21:45:16 +00:00

Author	SHA1	Message	Date
Craig Topper	96f30be3bf	[X86] Encode the EVEX2VEX exception list information in .td files instead of the emitter source. Rather than having an exclusion list in tablegen sources, add a flag to the X86 instruction records that can be used to suppress checking for convertibility. llvm-svn: 334971	2018-06-18 18:47:07 +00:00
Michael Berg	a174a3ecfa	[NFC] make MIFlag accessor functions consistant with usage model llvm-svn: 334970	2018-06-18 18:37:48 +00:00
Florian Hahn	749dd0855f	[VPlan] Add VPInstruction to VPRecipe transformation. This patch introduces a VPInstructionToVPRecipe transformation, which allows us to generate code for a VPInstruction based VPlan re-using the existing infrastructure. Reviewers: dcaballe, hsaito, mssimpso, hfinkel, rengolin, mkuper, javed.absar, sguggill Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D46827 llvm-svn: 334969	2018-06-18 18:28:49 +00:00
Lang Hames	d48b44008c	[ORC] Add an initial implementation of a replacement CompileOnDemandLayer. CompileOnDemandLayer2 is a replacement for CompileOnDemandLayer built on the ORC Core APIs. Functions in added modules are extracted and compiled lazily. CompileOnDemandLayer2 supports multithreaded JIT'd code, and compilation on multiple threads. llvm-svn: 334967	2018-06-18 18:01:43 +00:00
Lang Hames	4be38d1349	[ORC] Keep weak flag on VSO symbol tables during materialization, but treat materializing weak symbols as strong. This removes some elaborate flag tweaking and plays nicer with RuntimeDyld, which relies of weak/common flags to determine whether it should emit a given weak definition. (Switching to strong up-front makes it appear as if there is already an overriding definition, which would require an extra back-channel to override). llvm-svn: 334966	2018-06-18 18:01:41 +00:00
Krzysztof Parzyszek	deaab0b105	Shrink interval after moving copy in removePartialRedundancy llvm-svn: 334963	2018-06-18 17:16:39 +00:00
Andrea Di Biagio	f6e62b21be	[llvm-mca] Use an ordered map to collect hardware statistics. NFC. Histogram entries are now ordered by key. This should improves their readability when statistics are printed. llvm-svn: 334961	2018-06-18 17:04:56 +00:00
Nirav Dave	168f8d1e75	Fix typoed cast to avoid assertion in MCFragment::dump. llvm-svn: 334959	2018-06-18 16:26:11 +00:00
Simon Pilgrim	b5b2604517	[SLPVectorizer] Tidyup isShuffle helper Ensure we keep track of the input vectors in all cases instead of just for SK_Select. Ideally we'd reuse the shuffle mask pattern matching in TargetTransformInfo::getInstructionThroughput here to easily add support for all TargetTransformInfo::ShuffleKind without mass code duplication, I've added a TODO for now but D48236 should help us here. Differential Revision: https://reviews.llvm.org/D48023 llvm-svn: 334958	2018-06-18 16:25:01 +00:00
Craig Topper	19bb50c6a9	[TableGen] Make TiedAsmOperandTable in the AsmMatcher 'static' since its at file scope. llvm-svn: 334957	2018-06-18 16:17:46 +00:00
Craig Topper	987002011e	[TableGen] Remove unused member variable. I think this became unused after r324196. llvm-svn: 334956	2018-06-18 16:17:45 +00:00
Florian Hahn	7611816b75	[VPlanRecipeBase] Add eraseFromParent(). Reviewers: dcaballe, hsaito, mkuper, hfinkel Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D48081 llvm-svn: 334951	2018-06-18 15:18:48 +00:00
Sander de Smalen	b9e05bec1f	[AArch64][SVE] Asm: Support for saturating INC/DEC (64bit scalar) instructions. Summary: The variants added by this patch are: - SQINC (signed increment) - UQINC (unsigned increment) - SQDEC (signed decrement) - UQDEC (unsigned decrement) For example: uqincw x0, all, mul #4 Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Differential Revision: https://reviews.llvm.org/D47715 llvm-svn: 334948	2018-06-18 14:47:52 +00:00
Simon Pilgrim	6c92f29bd5	[X86][BtVer2] Flag AVX2+ scheduler classes as unsupported Jaguar only supports up to AVX1 Differential Revision: https://reviews.llvm.org/D48274 llvm-svn: 334947	2018-06-18 14:31:14 +00:00
Andrea Di Biagio	e46200d8e6	[llvm-mca] Add tests for XOP and AVX512 instructions that implicitly clear the upper portion of a super-register. When the destination register of a XOP instruction is an XMM register, bits [255:128] of the corresponding YMM register are cleared. When the destination register of a EVEX encoded instruction is an XMM/YMM register, the upper bits of the corresponding ZMM are cleared. On processors that feature AVX512, a write to an XMM registers always clears the upper portion of the corresponding ZMM register if the instruction is VEX or EVEX encoded. These new tests show some interesting cases which aren't correctly analyzed by llvm-mca. The lack of knowledge related to the implicit update on the super-registers is addressed by D48225. llvm-svn: 334945	2018-06-18 14:00:30 +00:00
Florian Hahn	4b3cbfeb6f	[VPlan] Fix sanitizer problem with insertBefore. llvm-svn: 334943	2018-06-18 13:51:28 +00:00
Sander de Smalen	df252ecda0	[TableGen][AsmMatcherEmitter] Allow tied operands of different classes in aliases. Allow a tied operand of a different operand class in InstAliases, so that the operand can be printed (and added to the MC instruction) as the appropriate register. For example, 'GPR64as32', which would be printed/parsed as a 32bit register and should match a tied 64bit register operand, where the former is a sub-register of the latter. This patch also generalizes the constraint checking to an overrideable method in MCTargetAsmParser, so that target asmparsers can specify whether a given operand satisfies the tied register constraint. Reviewers: olista01, rengolin, fhahn, SjoerdMeijer, samparker, dsanders, craig.topper Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47714 llvm-svn: 334942	2018-06-18 13:39:29 +00:00
Paul Robinson	600459b595	Update copyright year to 2018. llvm-svn: 334936	2018-06-18 12:22:17 +00:00
Simon Pilgrim	38ebb0dfc5	[SLPVectorizer] Avoid calling const VL.size() repeatedly in for-loop. NFCI. llvm-svn: 334934	2018-06-18 11:35:36 +00:00
Florian Hahn	f09a59c9cb	[VPlanRecipeBase] Add insertBefore helper. Reviewers: dcaballe, mkuper, hfinkel, hsaito, mssimpso Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D48080 llvm-svn: 334933	2018-06-18 11:34:17 +00:00
Clement Courbet	bdb322c5a1	[llvm-exegesis] Optionally ignore instructions without a sched class. Summary: See PR37602. Reviewers: RKSimon Subscribers: llvm-commits, tschuett Differential Revision: https://reviews.llvm.org/D48267 llvm-svn: 334932	2018-06-18 11:27:47 +00:00
Sander de Smalen	6864576067	[AArch64][SVE] Asm: Support for vector element compares. This patch adds instructions for comparing elements from two vectors, e.g. cmpgt p0.s, p0/z, z0.s, z1.s and also adds support for comparing to a 64-bit wide element vector, e.g. cmpgt p0.s, p0/z, z0.s, z1.d The patch also contains aliases for certain comparisons, e.g.: cmple p0.s, p0/z, z0.s, z1.s => cmpge p0.s, p0/z, z1.s, z0.s cmplo p0.s, p0/z, z0.s, z1.s => cmphi p0.s, p0/z, z1.s, z0.s cmpls p0.s, p0/z, z0.s, z1.s => cmphs p0.s, p0/z, z1.s, z0.s cmplt p0.s, p0/z, z0.s, z1.s => cmpgt p0.s, p0/z, z1.s, z0.s llvm-svn: 334931	2018-06-18 10:59:19 +00:00
Clement Courbet	cafa0cab01	[X86] Fix NOOP sched overrides on BDW/HSW/SKL. Summary: Noop certainly does not use resources. Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits, gchatelet Differential Revision: https://reviews.llvm.org/D48028 llvm-svn: 334927	2018-06-18 06:48:22 +00:00
Craig Topper	78ffc4f614	[X86] Create X86InstrFMA3Group objects fully in a static table instead of on the heap. NFCI Previously we heap allocated the X86InstrFMA3Group objects which were created by passing them small register/memory opcode arrays that existed as individual static tables. Rather than a bunch of small static arrays we now have one large static table of X86InstrFMA3Group objects. Rather than storing a pointer to the opcode arrays in the X86InstrFMA3Group object, we now store have a register and memory array as part of the object. If a group doesn't have memory or register opcodes, the array entries will be 0. This greatly simplifies the destruction of the X86InstrFMA3Info object. We no longer need to delete the X86InstrFMA3Group objects as we destruct the DenseMap. And we don't need to keep track of which ones we already deleted. This reduces the llc binary size on my local machine by ~50k. I can only assume that's really due to the fact that we had something like 512 small static arrays that we passed to the init functions either one at a time or in pairs. So there were between 256 and 512 distinct calls to the init functions in the initOnceImpl method. llvm-svn: 334925	2018-06-18 06:32:22 +00:00
Craig Topper	5679b5a5ef	[X86] Add '.s' aliases to the assembler for the various redundant move encodings to match gas and our EVEX instructions. We already have these aliases for EVEX enocded instructions, but not for the GPR, MMX, SSE, and VEX versions. Also remove the vpextrw.s EVEX alias. That's not something gas implements. llvm-svn: 334922	2018-06-18 05:00:50 +00:00
Craig Topper	58ed1c810a	[X86] Move the 'vmovq.s' and similar assembly strings for EVEX vector moves with reversed operands to InstAliases. The .s assembly strings allow the reversed forms to be targeted from assembly which matches gas behavior. But when printing the instructions we should print them without the .s to match other tooling like objdump. By using InstAliases we can use the normal string in the instruction and just hide it from the assembly parser. Ideally we'd add the .s versions to the legacy SSE and VEX versions as well for full compatibility with gas. Not sure how we got to state where only EVEX was supported. llvm-svn: 334920	2018-06-18 01:28:05 +00:00
Craig Topper	94258135d2	[TableGen] Prevent double flattening of InstAlias asm strings in the asm matcher emitter. Unlike CodeGenInstruction, CodeGenInstAlias was flatting asm strings in its constructor. For instructions it was the users responsibility to flatten the string. AsmMatcherEmitter didn't know this and treated them the same. This caused double flattening of InstAliases. This is mostly harmless unless the desired assembly string contains curly braces. The second flattening wouldn't know to ignore these and would remove the curly braces. And for variant 1 it would remove the contents of them as well. To mitigate this, this patch makes removes the flattening from the CodeGenIntAlias constructor and modifies AsmWriterEmitter to account for the flattening not having been done. llvm-svn: 334919	2018-06-18 01:28:01 +00:00
Lang Hames	cfaaedd3cb	[ORC] Remove redundant condition llvm-svn: 334918	2018-06-17 23:54:58 +00:00
Lang Hames	287c3db240	[ORC] Only notify queries that they are resolved/ready when the query state changes. This guards against redundant notifications. llvm-svn: 334916	2018-06-17 18:59:01 +00:00
Craig Topper	d9f71cdd4e	[X86] Add all the FMA instructions direclty to the load folding table instead of proxying through X86InstrFMA3Info. These increases the size of the static tables, but is closer to what we would get if used the autogenerated table directly. This reduces the remaining large deltas between what's in the manual table and what's in the autogenerated table. llvm-svn: 334915	2018-06-17 18:00:16 +00:00
Lang Hames	5f1c62332e	[ORC] Suppress an unused variable warning for a debug-mode only use. llvm-svn: 334911	2018-06-17 17:18:12 +00:00
Lang Hames	d0ab0c2460	[ORC] Erase empty dependence sets when adding new symbol dependencies. llvm-svn: 334910	2018-06-17 16:59:53 +00:00
Lang Hames	9d5f10e891	[ORC] In MaterializationResponsibility, only maintain the Materializing flag on symbols in debug mode. The MaterializationResponsibility class hijacks the Materializing flag to track symbols that have not yet been resolved in order to guard against redundant resolution. Since this is an API contract check and only enforced in debug mode there is no reason to maintain the flag state in release mode. llvm-svn: 334909	2018-06-17 16:59:52 +00:00
Craig Topper	77b20be268	[X86] Pass the parent SDNode to X86DAGToDAGISel::selectScalarSSELoad to simplify the hasSingleUseFromRoot handling. Some of the calls to hasSingleUseFromRoot were passing the load itself. If the load's chain result has a user this would count against that. By getting the true parent of the match and ensuring any intermediate between the match and the load have a single use we can avoid this case. isLegalToFold will take care of checking users of the load's data output. This fixed at least fma-scalar-memfold.ll to succed without the peephole pass. llvm-svn: 334908	2018-06-17 16:29:46 +00:00
Simon Pilgrim	cbf937f1bc	[llvm-mca][X86] Add some avx512f/avx512vl resource test placeholders There are a lot of instructions to add under these ISAs (and the other AVX512 variants) but this should demonstrate how to test for the EVEX instructions with different maskings llvm-svn: 334907	2018-06-17 16:25:48 +00:00
Sander de Smalen	013bed785d	[AArch64][SVE] Asm: Support for bitwise operations on predicate vectors. This patch adds support for instructions performing bitwise operations on predicate vectors, including AND, BIC, EOR, NAND, NOR, ORN, ORR, and their status flag setting variants ANDS, BICS, EORS, NANDS, ORNS, ORRS. This patch also adds several aliases: orr p0.b, p1/z, p1.b, p1.b => mov p0.b, p1.b orrs p0.b, p1/z, p1.b, p1.b => movs p0.b, p1.b and p0.b, p1/z, p2.b, p2.b => mov p0.b, p1/z, p2.b ands p0.b, p1/z, p2.b, p2.b => movs p0.b, p1/z, p2.b eor p0.b, p1/z, p2.b, p1.b => not p0.b, p1/z, p2.b eors p0.b, p1/z, p2.b, p1.b => nots p0.b, p1/z, p2.b llvm-svn: 334906	2018-06-17 10:48:21 +00:00
Sander de Smalen	1afe6f4625	[AArch64][SVE] Asm: Support for SEL (vector/predicate) instructions. Support for SVE's predicated select instructions to select elements from either vector, both in a data-vector and a predicate-vector variant. llvm-svn: 334905	2018-06-17 10:11:04 +00:00
Jonas Hahnfeld	4e517a57ce	[NVPTX] Ignore target-cpu and -features for inlining We don't want to prevent inlining because of target-cpu and -features attributes that were added to newer versions of LLVM/Clang: There are no incompatible functions in PTX, ptxas will throw errors in such cases. Differential Revision: https://reviews.llvm.org/D47691 llvm-svn: 334904	2018-06-17 09:55:20 +00:00
Heejin Ahn	fa383144a4	[WebAssembly] Simple comment fix. NFC. llvm-svn: 334899	2018-06-17 00:37:56 +00:00
Craig Topper	29015a63e6	[X86] More additions to the load folding tables based on the autogenerated tables. Including more additions for NotMemoryFoldable to remove some entries from the autogenerated table. llvm-svn: 334898	2018-06-16 23:25:50 +00:00
Craig Topper	9470965797	[X86] Hide POP16/32/64rmr and PUSH16/32/64rmr instructions from the assembly parser. These all have a short form encoding that the assembler already prefers. Though that preference seems to only be based on order in the .td fie. Hiding the long form saves space in the table and prevents us from breaking the implicit order based priority. llvm-svn: 334897	2018-06-16 23:25:48 +00:00
Craig Topper	3e8074d360	[X86] Fix an inconsistency between AVX512 and AVX/SSE version on a couple instructions. VMOVPQIto64Zmr is not a 64-bit mode only instruction. But I don't know how to test this because VMOVPQIto64mr should always have priority over it in 32-bit mode since its only advantage is XMM16-XMM31 which aren't usable in 32-bit mode. VMOVPQIto64Zrr is a 64-bit mode only instruction, but we don't need to explicitly mark it as such because it uses a GR64 register which won't parse in 32-bit mode. llvm-svn: 334896	2018-06-16 23:25:47 +00:00
Michael Zolotukhin	eff2ee4997	CorrelatedValuePropagation: Preserve DT. Summary: We only modify CFG in a couple of places, and we can preserve DT there with a little effort. Reviewers: davide, vsk Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D48059 llvm-svn: 334895	2018-06-16 18:57:31 +00:00
Florian Hahn	2183664a75	[Dominators] Change getNode parameter type to const NodeT * (NFC). DominatorTreeBase::getNode does not modify its parameter and this change allows callers that only have access to const pointers to use it without casting. Reviewers: kuhar, dblaikie, chandlerc Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D48231 llvm-svn: 334892	2018-06-16 14:47:05 +00:00
Benjamin Kramer	697f605eee	Fix namespaces. No functionality change. llvm-svn: 334890	2018-06-16 13:37:52 +00:00
Florian Hahn	d812d14da5	Revert r334887, as GCC 4.8 does not have is_trivially_copy_constructible & co llvm-svn: 334889	2018-06-16 13:00:33 +00:00
Florian Hahn	c9aad1c77a	[SmallSet] Avoid using is_trivially_XXX<>::value which is C++17 llvm-svn: 334888	2018-06-16 12:50:32 +00:00
Florian Hahn	ea374da437	[SmallSet] Add SmallSetIterator. This patch adds a simple const_iterator implementation for SmallSet by delegating to either a SmallVector::const_iterator or std::set::const_iterator, depending on which storage is used by the SmallSet. Reviewers: dblaikie, craig.topper Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D47942 llvm-svn: 334887	2018-06-16 12:36:19 +00:00
Stanislav Mekhanoshin	1479defe70	[AMDGPU] setcc (select cc, CT, CF), CF, eq \| ne -> xor cc, -1 \| cc This is the common case in the BE when we serialize condition and then rematerialize it. Use either original or inverted condition. Differential Revision: https://reviews.llvm.org/D48246 llvm-svn: 334882	2018-06-16 03:46:59 +00:00
Nirav Dave	bbd897ed7a	Avoid needing to walk out legalization tables. NFCI. Relanding after fixing expensive check from modifying tables. To avoid redundant work, during DAG legalization we keep tables mapping pre-legalized SDValues to post-legalized SDValues and a SDValue-to-SDValue map to enable fast node replacements. However, as the keys are nodes which may be reused it is possible that an entry in a table refers to a now deleted node N (that should have been renamed by the value replacement map) while a new node N' exists. If N' is then replaced that entry would be wrong. Previously we avoided this by when potentially violating this property, walking every table and updating all node pointers. This is very expensive but hopefully rare occurance. This patch assigns each instance of a SDValue used in legalization a unique id and uses these ids in the legalization tables. This avoids any such aliasing issue, avoiding the full table search and allowing more aggressive incremental table pruning. In some cases this is a 1000x speedup to compilation. Reviewers: jyknight, echristo, bogner, tra Reviewed By: bogner Subscribers: dberris, grandinj, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D47959 llvm-svn: 334880	2018-06-16 02:51:29 +00:00

... 2 3 4 5 6 ...

165601 Commits