llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-24 22:40:36 +00:00

Author	SHA1	Message	Date
Clement Courbet	9c2e0e0fa4	[llvm-exegesis] Add loop mode for repeating the snippet. Summary: Before this change the Executable function was made by duplicating the snippet. This change adds a --repetion-mode={loop\|duplicate} flag that allows choosing between this behaviour and wrapping the snippet instructions in a loop. The new mode can help measurements when the snippet fits in the DSB by short-cirtcuiting decoding. The loop adds a dec + jmp to the measurements, but since these are not part of the critical path, they execute in parallel with the measured code and do not impact measurements in practice. Overview of the change: - New SnippetRepetitor abstraction that handles repeating the snippet. The assembler delegates repeating the instructions to this class. - ExegesisTarget learns how to decrement loop counter and jump. - Some refactoring of the assembler into FunctionFiller/BasicBlockFiller. Reviewers: gchatelet Subscribers: mgorny, tschuett, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68125 llvm-svn: 373083	2019-09-27 12:56:24 +00:00
Guillaume Chatelet	114e854bc6	[Alignment][NFC] Remove unneeded llvm:: scoping on Align types llvm-svn: 373081	2019-09-27 12:54:21 +00:00
Nico Weber	6a8ef0b7b3	Reland "gn build: (manually) merge r373028" This relands r373029, reverted in 373033, because r373028 relanded in r373066. llvm-svn: 373070	2019-09-27 11:37:59 +00:00
Sam Parker	e8556a8898	[NFC][ARM] Add some tail-predication tests Use different data types for some simple loops. llvm-svn: 373064	2019-09-27 10:33:53 +00:00
Simon Pilgrim	85979f36a0	[SLPVectorizer][X86] Regenerate arith-fp tests llvm-svn: 373063	2019-09-27 10:04:25 +00:00
Simon Pilgrim	9314ff93e0	[UpdateTestChecks] Fix wildcard support on DOS prompts D64572 / rL365818 changed the way that the file paths were collected, which meant we lost the file pattern expansion necessary when working with DOS command prompt llvm-svn: 373062	2019-09-27 10:04:16 +00:00
Hans Wennborg	e727b1924f	Revert r372893 "[CodeGen] Replace -max-jump-table-size with -max-jump-table-targets" This caused severe compile-time regressions, see PR43455. > Modern processors predict the targets of an indirect branch regardless of > the size of any jump table used to glean its target address. Moreover, > branch predictors typically use resources limited by the number of actual > targets that occur at run time. > > This patch changes the semantics of the option `-max-jump-table-size` to limit > the number of different targets instead of the number of entries in a jump > table. Thus, it is now renamed to `-max-jump-table-targets`. > > Before, when `-max-jump-table-size` was specified, it could happen that > cluster jump tables could have targets used repeatedly, but each one was > counted and typically resulted in tables with the same number of entries. > With this patch, when specifying `-max-jump-table-targets`, tables may have > different lengths, since the number of unique targets is counted towards the > limit, but the number of unique targets in tables is the same, but for the > last one containing the balance of targets. > > Differential revision: https://reviews.llvm.org/D60295 llvm-svn: 373060	2019-09-27 09:54:26 +00:00
Thomas Preud'homme	1349d8564d	[sancov][NFC] Make filename Regexes "const" Summary: The const-correctness of match() was fixed in rL372764, which allows such static Regex objects to be marked const. Reviewers: thopre Reviewed By: thopre Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68091 llvm-svn: 373058	2019-09-27 09:39:13 +00:00
Roman Lebedev	c09b37c168	[NFC][InstCombine] Revisit shift-by-signext tests llvm-svn: 373055	2019-09-27 09:09:15 +00:00
Guillaume Chatelet	7c8bc22ca6	[Alignment][NFC] MaybeAlign in GVNExpression Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67922 llvm-svn: 373054	2019-09-27 08:56:43 +00:00
Alexandros Lamprineas	67cf4fb1cb	[MC][ARM] vscclrm disassembles as vldmia Happens only when the mve.fp subtarget feature is enabled: $ llvm-mc -triple thumbv8.1m.main -mattr=+mve.fp,+8msecext -disassemble <<< "0x9f,0xec,0x08,0x0b" .text vldmia pc, {d0, d1, d2, d3} $ llvm-mc -triple thumbv8.1m.main -mattr=+8msecext -disassemble <<< "0x9f,0xec,0x08,0x0b" .text vscclrm {d0, d1, d2, d3, vpr} Assembling returns the correct encoding with or without mve.fp: $ llvm-mc -triple thumbv8.1m.main -mattr=+mve.fp,+8msecext -show-encoding <<< "vscclrm {d0-d3, vpr}" .text vscclrm {d0, d1, d2, d3, vpr} @ encoding: [0x9f,0xec,0x08,0x0b] $ llvm-mc -triple thumbv8.1m.main -mattr=+8msecext -show-encoding <<< "vscclrm {d0-d3, vpr}" .text vscclrm {d0, d1, d2, d3, vpr} @ encoding: [0x9f,0xec,0x08,0x0b] The problem seems to be in the TableGen description of VSCCLRMD. The least significant bit should be set to zero. Differential Revision: https://reviews.llvm.org/D68025 llvm-svn: 373052	2019-09-27 08:22:24 +00:00
Clement Courbet	e063a909a2	[llvm-exegesis] Refactor how forbidden registers are computed. Summary: Right now latency generation can incorrectly select the scratch register as a dependency-carrying register. - Move the logic for preventing register selection from Uops implementation to common SnippetGenerator class. - Aliasing detection now takes a set of forbidden registers just like random register assignment does. Reviewers: gchatelet Subscribers: tschuett, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68084 llvm-svn: 373048	2019-09-27 08:04:10 +00:00
Wei Mi	e03a35d303	[LoopInfo] Remove duplicates in ExitBlocks to reduce the compile time of hasDedicatedExits. For the compile time problem described in https://reviews.llvm.org/D67359, turns out the root cause is there are many duplicates in ExitBlocks so the algorithm complexity of hasDedicatedExits gets very high. If we remove the duplicates, the compile time issue is gone. Thanks to Philip Reames for raising a good question and it leads me to find the root cause. Differential Revision: https://reviews.llvm.org/D68107 llvm-svn: 373045	2019-09-27 05:43:31 +00:00
Wei Mi	92134e706c	Revert "[LoopInfo] Limit the iterations to check whether a loop has dedicated exits" Get a better approach in https://reviews.llvm.org/D68107 to solve the problem. Revert the initial patch and will commit the new one soon. This reverts commit rL372990. llvm-svn: 373044	2019-09-27 05:43:30 +00:00
Thomas Lively	6acb2d6fe1	[WebAssembly] v128.andnot Summary: As specified at https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#bitwise-and-not Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68113 llvm-svn: 373041	2019-09-27 02:11:40 +00:00
Thomas Lively	01ad7e0891	[WebAssembly] SIMD Load and extend operations Summary: As specified at https://github.com/webassembly/simd/blob/master/proposals/simd/SIMD.md#load-and-extend. These instructions are behind the unimplemented-simd128 target feature for now because they have not been implemented in V8 yet. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68058 llvm-svn: 373040	2019-09-27 02:06:50 +00:00
Nicholas Allegra	52b783a790	Revert r373034 It breaks the build on MSVC. llvm-svn: 373039	2019-09-27 01:58:31 +00:00
Peter Collingbourne	cf3bd23479	Speculative fix for gcc build. llvm-svn: 373038	2019-09-27 01:35:04 +00:00
Peter Collingbourne	855343fa29	hwasan: Compatibility fixes for short granules. We can't use short granules with stack instrumentation when targeting older API levels because the rest of the system won't understand the short granule tags stored in shadow memory. Moreover, we need to be able to let old binaries (which won't understand short granule tags) run on a new system that supports short granule tags. Such binaries will call the __hwasan_tag_mismatch function when their outlined checks fail. We can compensate for the binary's lack of support for short granules by implementing the short granule part of the check in the __hwasan_tag_mismatch function. Unfortunately we can't do anything about inline checks, but I don't believe that we can generate these by default on aarch64, nor did we do so when the ABI was fixed. A new function, __hwasan_tag_mismatch_v2, is introduced that lets code targeting the new runtime avoid redoing the short granule check. Because tag mismatches are rare this isn't important from a performance perspective; the main benefit is that it introduces a symbol dependency that prevents binaries targeting the new runtime from running on older (i.e. incompatible) runtimes. Differential Revision: https://reviews.llvm.org/D68059 llvm-svn: 373035	2019-09-27 01:02:10 +00:00
Nicholas Allegra	cfc930fd01	[Consumed][NFC] Refactor handleCall to take function argument list. Differential Revision: https://reviews.llvm.org/D67569 llvm-svn: 373034	2019-09-26 23:47:18 +00:00
Dmitri Gribenko	31df45cd54	Revert "gn build: (manually) merge r373028" This reverts commit r373029, which depends on r373028, which I reverted. llvm-svn: 373033	2019-09-26 23:30:47 +00:00
Craig Topper	dcc2e53be1	[X86] Remove CodeGenOnly instructions added in r373021, but keep the isel patterns and add COPY_TO_REGCLASS to them. llvm-svn: 373031	2019-09-26 23:22:15 +00:00
Nico Weber	46ae7724f1	gn build: (manually) merge r373028 llvm-svn: 373029	2019-09-26 23:17:25 +00:00
Craig Topper	43c1486a3b	[X86] Remove unused arguments from a tablegen multiclass. NFC llvm-svn: 373026	2019-09-26 22:56:19 +00:00
Craig Topper	2bf95fafce	[X86] Add VMOVSSZrrk/VMOVSDZrrk/VMOVSSZrrkz/VMOVSDZrrkz to getUndefRegClearance. We have isel patterns that can put an IMPLICIT_DEF on one of the sources for these instructions. So we should make sure we break any dependencies there. This should be done by just using one of the other sources. llvm-svn: 373025	2019-09-26 22:56:06 +00:00
Changpeng Fang	446072f9b7	Remove the AliasAnalysis argument in function areMemAccessesTriviallyDisjoint Reviewers: arsenm Differential Revision: https://reviews.llvm.org/D58360 llvm-svn: 373024	2019-09-26 22:53:44 +00:00
Craig Topper	e756487aa3	[X86] Add CodeGenOnly instructions for (f32 (X86selects $mask, (loadf32 addr), fp32imm0) to use masked MOVSS from memory. Similar for f64 and having a non-zero passthru value. We were previously not trying to fold the load at all. Using a CodeGenOnly instruction allows us to use FR32X/FR64X as the register class to avoid a bunch of COPY_TO_REGCLASS. llvm-svn: 373021	2019-09-26 22:23:09 +00:00
Jordan Rupprecht	6f4086ac07	Revert [SLP] Fix for PR31847: Assertion failed: (isLoopInvariant(Operands[i], L) && "SCEVAddRecExpr operand is not loop-invariant!") This reverts r372626 (git commit 6a278d9073bdc158d31d4f4b15bbe34238f22c18) llvm-svn: 373019	2019-09-26 22:09:17 +00:00
Kit Barton	cc445c5dbe	[LoopFusion] Add ability to fuse guarded loops Summary: This patch extends the current capabilities in loop fusion to fuse guarded loops (as defined in https://reviews.llvm.org/D63885). The patch adds the necessary safety checks to ensure that it safe to fuse the guarded loops (control flow equivalent, no intervening code, and same guard conditions). It also provides an alternative method to perform the actual fusion of guarded loops. The mechanics to fuse guarded loops are slightly different then fusing non-guarded loops, so I opted to keep them separate methods. I will be cleaning this up in later patches, and hope to converge on a single method to fuse both guarded and non-guarded loops, but for now I think the review will be easier to keep them separate. Reviewers: jdoerfert, Meinersbur, dmgreen, etiotto, Whitney Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65464 llvm-svn: 373018	2019-09-26 21:42:45 +00:00
Zhaoshi Zheng	c18f26ae8c	[Unroll] Do NOT unroll a loop with small runtime upperbound For a runtime loop if we can compute its trip count upperbound: Don't unroll if: 1. loop is not guaranteed to run either zero or upperbound iterations; and 2. trip count upperbound is less than UnrollMaxUpperBound Unless user or TTI asked to do so. If unrolling, limit unroll factor to loop's trip count upperbound. Differential Revision: https://reviews.llvm.org/D62989 Change-Id: I6083c46a9d98b2e22cd855e60523fdc5a4929c73 llvm-svn: 373017	2019-09-26 21:40:27 +00:00
Stanislav Mekhanoshin	7b6da0fe8f	[AMDGPU] copy OtherPredicates from pseudo to VOP3_Real Differential Revision: https://reviews.llvm.org/D68102 llvm-svn: 373015	2019-09-26 21:06:17 +00:00
Roman Lebedev	6139cdd29c	[DAGCombine][X86][AArch64][NFC] Add tests for shift-by-signext llvm-svn: 373014	2019-09-26 20:49:49 +00:00
Roman Lebedev	4a15c8df9d	[InstCombine][NFC] Add tests for shift-by-signext llvm-svn: 373013	2019-09-26 20:49:30 +00:00
Roman Lebedev	8df84495ee	[InstCombine][NFC] Regenerate load-cmp.ll test llvm-svn: 373012	2019-09-26 20:49:21 +00:00
Whitney Tsang	c92726d91d	[LOOPGUARD] Disable loop with multiple loop exiting blocks. Summary: As discussed in the loop group meeting. With the current definition of loop guard, we should not allow multiple loop exiting blocks. For loops that has multiple loop exiting blocks, we can simply unable to find the loop guard. When getUniqueExitBlock() obtains a vector size not equals to one, that means there is either no exit blocks or there exists more than one unique block the loop exit to. If we don't disallow loop with multiple loop exit blocks, then with our current implementation, there can exist exit blocks don't post dominated by the non pre-header successor of the guard block. Reviewer: reames, Meinersbur, kbarton, etiotto, bmahjour Reviewed By: Meinersbur, kbarton Subscribers: fhahn, hiraditya, llvm-commits Tag: LLVM Differential Revision: https://reviews.llvm.org/D66529 llvm-svn: 373011	2019-09-26 20:20:42 +00:00
Xiangling Liao	be30366603	[AIX]Emit function descriptor csect in assembly This patch emits the function descriptor csect for functions with definitions under both 32-bit/64-bit mode on AIX. Differential Revision: https://reviews.llvm.org/D66724 llvm-svn: 373009	2019-09-26 19:38:32 +00:00
Bob Haarman	1e425c53e9	[NFC][emacs] remove out-of-date comment from tablegen-mode.el Summary: The syntax table was originally based on and attributed to jasmin.el, but was rewritten in r45192, so the comment that says the code comes from jasmin.el is no longer accurate. This change removes the comment, shortening the code a bit. Reviewers: MaskRay, lattner Reviewed By: MaskRay Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68042 llvm-svn: 373008	2019-09-26 19:36:01 +00:00
David Bolvansky	c5809d1ef6	[NFC] Precommit tests for D68089 llvm-svn: 373006	2019-09-26 19:01:18 +00:00
Kevin P. Neal	a4dc431d14	Document requirement of function attributes with constrained floating point. Reviewed by: andrew.w.kaylor, uweigand, efriedma Approved by: andrew.w.kaylor Differential Revision: https://reviews.llvm.org/D67839 llvm-svn: 373002	2019-09-26 17:50:25 +00:00
Simon Pilgrim	2dfd238c4b	DAGISelMatcherOpt - TGParser::ParseOperation - silence static analyzer cast_or_null<CheckTypeMatcher> null dereference warning. NFCI. The static analyzer is warning about a potential null dereference, replace with an null/isa assertion and cast<CheckTypeMatcher>. llvm-svn: 373001	2019-09-26 17:38:47 +00:00
Craig Topper	25d257199b	[InstCombine] Use m_Zero instead of isNullValue() when checking if a GEP index is all zeroes to prevent an infinite loop. The test case here previously infinite looped. Only one element from the GEP is used so SimplifyDemandedVectorElts would replace the other lanes in each index with undef leading to the first index being <0, undef, undef, undef>. But there's a GEP transform that tries to replace an index into a 0 sized type with a zero index. But the zero index check only works on ConstantInt 0 or ConstantAggregateZero so it would turn the index back to zeroinitializer. Resulting in a loop. The fix is to use m_Zero() to allow a vector of zeroes and undefs. Differential Revision: https://reviews.llvm.org/D67977 llvm-svn: 373000	2019-09-26 17:20:50 +00:00
Simon Pilgrim	af806870e9	TGParser::ParseOperation - silence static analyzer dyn_cast<TypedInit> null dereference warning. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<TypedInit> directly and if not assert will fire for us. I've also pulled out the repeated getType() call which was the only user of the pointer. llvm-svn: 372997	2019-09-26 17:11:02 +00:00
Simon Pilgrim	acfd7dffd8	ConstantFold - silence static analyzer dyn_cast<ExtractValueInst> null dereference warning. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<ExtractValueInst> directly and if not assert will fire for us. llvm-svn: 372993	2019-09-26 16:30:36 +00:00
Simon Pilgrim	ec57a7d95f	ARMBaseInstrInfo getOperandLatency - silence static analyzer dyn_cast<> null dereference warnings. NFCI. The static analyzer is warning about potential null dereferences, but we should be able to use cast<> directly and if not assert will fire for us. llvm-svn: 372992	2019-09-26 16:05:55 +00:00
Jinsong Ji	a533429dfb	[PowerPC] Fix typo in rL372985 llvm-svn: 372991	2019-09-26 15:49:11 +00:00
Wei Mi	72db08839a	[LoopInfo] Limit the iterations to check whether a loop has dedicated exits for extreme large case. We had a case that a single loop which has 4000 exits and the average number of predecessors of each exit is > 1000, and we found compiling the case spent a significant amount of time on checking whether a loop has dedicated exits. This patch adds a limit for the iterations to the check. With the patch, the time to compile our testcase reduced from 1000s to 200s (clang release build). Differential Revision: https://reviews.llvm.org/D67359 llvm-svn: 372990	2019-09-26 15:36:25 +00:00
Jakub Kuderski	cc245c26cb	Handle successor's PHI node correctly when flattening CFG merges two if-regions Summary: FlattenCFG merges two 'if' basicblocks by inserting one basicblock to another basicblock. The inserted basicblock can have a successor that contains a PHI node whoes incoming basicblock is the inserted basicblock. Since the existing code does not handle it, it becomes a badref. if (cond1) statement if (cond2) statement successor - contains PHI node whose predecessor is cond2 --> if (cond1 \|\| cond2) statement (BB for cond2 was deleted) successor - contains PHI node whose predecessor is cond2 --> bad ref! Author: Jaebaek Seo Reviewers: asbirlea, kuhar, tstellar, chandlerc, davide, dexonsmith Reviewed By: kuhar Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68032 llvm-svn: 372989	2019-09-26 15:20:17 +00:00
Matt Arsenault	0129f9ebba	Fix typo llvm-svn: 372988	2019-09-26 15:20:16 +00:00
Guillaume Chatelet	2c58ce940f	[NFC] Rearrange Value::getPointerAlignment Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67918 llvm-svn: 372987	2019-09-26 15:13:05 +00:00
Sean Fertile	a227f477f9	Updated comments in LWZtoc pseudo expansion. Refined a couple of the comments in the LWZtoc expansion code based on a post commit review comment. llvm-svn: 372986	2019-09-26 15:12:30 +00:00

1 2 3 4 5 ...

185491 Commits