llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-06 03:08:43 +00:00

Author	SHA1	Message	Date
Robert Lougher	46e12bc5ef	Resubmit r356511 "[TailCallElim] Add tailcall elimination pass to LTO pipelines" Failing LLD tests have been fixed in r356593. llvm-svn: 356594	2019-03-20 19:08:18 +00:00
Tim Renouf	0fe4e29071	[AMDGPU] Added MsgPack format PAL metadata Summary: PAL metadata now supports both the old linear reg=val pairs format and the new MsgPack format. The MsgPack format uses YAML as its textual representation. On output to YAML, a mnemonic name is provided for some hardware registers. Differential Revision: https://reviews.llvm.org/D57028 Change-Id: I2bbaabaaca4b3574f7e03b80fbef7c7a69d06a94 llvm-svn: 356591	2019-03-20 18:47:21 +00:00
Philip Reames	0f5ababc89	Simplify operands of masked stores and scatters based on demanded elements If we know we're not storing a lane, we don't need to compute the lane. This could be improved by using the undef element result to further prune the mask, but I want to separate that into its own change since it's relatively likely to expose other problems. Differential Revision: https://reviews.llvm.org/D57247 llvm-svn: 356590	2019-03-20 18:44:58 +00:00
Alina Sbirlea	875454e0eb	[LICM & MemorySSA] Don't sink/hoist stores in the presence of ordered loads. Summary: Before this patch, if any Use existed in the loop, with a defining access in the loop, we conservatively decide to not move the store. What this approach was missing, is that ordered loads are not Uses, they're Defs in MemorySSA. So, even when the clobbering walker does not find that volatile load to interfere, we still cannot hoist a store past a volatile load. Resolves PR41140. Reviewers: george.burgess.iv Subscribers: sanjoy, jlebar, Prazek, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59564 llvm-svn: 356588	2019-03-20 18:33:37 +00:00
Nikita Popov	8f1e59217b	[ValueTracking] Compute range for abs without nsw This is a small followup to D59511. The code that was moved into computeConstantRange() there is a bit overly conversative: If the abs is not nsw, it does not compute any range. However, abs without nsw still has a well-defined contiguous unsigned range from 0 to SIGNED_MIN. This is a lot less useful than the usual 0 to SIGNED_MAX range, but if we're already here we might as well specify it... Differential Revision: https://reviews.llvm.org/D59563 llvm-svn: 356586	2019-03-20 18:16:02 +00:00
Nikita Popov	b4e86424c1	[InstCombine] Fold add nuw + uadd.with.overflow Fold add nuw and uadd.with.overflow with constants if the addition does not overflow. Part of https://bugs.llvm.org/show_bug.cgi?id=38146. Patch by Dan Robertson. Differential Revision: https://reviews.llvm.org/D59471 llvm-svn: 356584	2019-03-20 18:00:27 +00:00
Jordan Rupprecht	1994dd4be1	[Remarks] Fix mismatched delete due to missing virtual destructor This fixes an asan failure introduced in r356519. llvm-svn: 356583	2019-03-20 17:44:24 +00:00
Tim Renouf	dfe137c4db	[AMDGPU] Factored PAL metadata handling out into its own class Summary: This commit introduces a new AMDGPUPALMetadata class that: * is inside the AMDGPU target; * keeps an in-memory representation of PAL metadata; * provides a method to read the frontend-supplied metadata from LLVM IR; * provides methods for the asm printer to set metadata items; * provides methods to write the metadata as a binary blob to put in a .note record or as an asm directive; * provides a method to read the metadata as a binary blob from a .note record. Because llvm-readobj cannot call directly into a target, I had to remove llvm-readobj's ability to dump PAL metadata, pending a resolution to https://reviews.llvm.org/D52821 Differential Revision: https://reviews.llvm.org/D57027 Change-Id: I756dc830894fcb6850324cdcfa87c0120eb2cf64 llvm-svn: 356582	2019-03-20 17:42:00 +00:00
Nico Weber	217c395e5b	Remove HAVE_REALPATH from config.h Its last use was removed in r352916. No behavior change. Differential Revision: https://reviews.llvm.org/D59601 llvm-svn: 356579	2019-03-20 17:26:11 +00:00
Dmitry Preobrazhensky	30e86bc12d	[AMDGPU][MC] Corrected checks for DS offset0 range See bug 40889: https://bugs.llvm.org/show_bug.cgi?id=40889 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D59313 llvm-svn: 356576	2019-03-20 17:13:58 +00:00
Sanjay Patel	39c0da6484	[CGP] fix formatting; NFC llvm-svn: 356572	2019-03-20 16:47:53 +00:00
Clement Courbet	9500985700	Fix sanitizer failures for 356550. Mark bcmp as having optimized codegen, so that asan can detect it and mark users as nobuiltin. llvm-svn: 356568	2019-03-20 16:14:59 +00:00
Nico Weber	3370100855	gn build: Add build files for some clang-tools-extra Adds clang-change-namespace, clang-move, clang-query, clang-reorder-fields. Differential Revision: https://reviews.llvm.org/D59554 llvm-svn: 356567	2019-03-20 16:14:16 +00:00
Sanjay Patel	6a34adbd3f	[CGP] convert chain of 'if' to 'switch'; NFC This should be extended, but CGP does some strange things, so I'm intentionally not changing the potential order of any transforms yet. llvm-svn: 356566	2019-03-20 15:53:06 +00:00
Nico Weber	e66d4cf93b	gn build: Merge r356508 llvm-svn: 356563	2019-03-20 15:41:25 +00:00
Dmitry Preobrazhensky	0da2711d8a	[AMDGPU][MC][GFX9] Added support of operands shared_base, shared_limit, private_base, private_limit, pops_exiting_wave_id See bug 39297: https://bugs.llvm.org/show_bug.cgi?id=39297 Reviewers: artem.tamazov, arsenm, rampitec Differential Revision: https://reviews.llvm.org/D59290 llvm-svn: 356561	2019-03-20 15:40:52 +00:00
Nico Weber	69fb053d1b	gn build: Merge r356519 llvm-svn: 356560	2019-03-20 15:36:11 +00:00
Sanjay Patel	f0650ecc18	[CGP][x86] add tests for usubo regression (PR41129); NFC llvm-svn: 356559	2019-03-20 15:02:35 +00:00
Sjoerd Meijer	24bd340cde	Follow up of rL356555 Pacify buildbot that complained about a member function not marked with override. llvm-svn: 356557	2019-03-20 14:33:39 +00:00
Sjoerd Meijer	612776ec7f	[TTI] getMemcpyCost This adds new function getMemcpyCost to TTI so that the cost of a memcpy can be modeled and queried. The default implementation returns Expensive, but targets can override this function to model the cost more accurately. Differential Revision: https://reviews.llvm.org/D59252 llvm-svn: 356555	2019-03-20 14:15:46 +00:00
George Rimar	5602fd9683	[llvm-objcopy] - Use replaceSectionReferences to update the sections for symbols in symbol table. If the compression was used and we had a symbol not involved in relocation, we never updated its section and it was silently removed from the output. Differential revision: https://reviews.llvm.org/D59542 llvm-svn: 356554	2019-03-20 13:57:47 +00:00
Simon Pilgrim	b64c65b9e9	Remove out of date comment. NFCI. DAGCombiner::convertBuildVecZextToZext just requires the extractions to be sequential, they don't have to start from 0'th index. llvm-svn: 356552	2019-03-20 12:24:15 +00:00
Clement Courbet	39ecd2c585	[ExpandMemCmp] Trigger on bcmp too. Summary: Fixes 41150. Reviewers: gchatelet Subscribers: hiraditya, llvm-commits, ckennelly, sbenza, jyknight Tags: #llvm Differential Revision: https://reviews.llvm.org/D59593 llvm-svn: 356550	2019-03-20 11:51:11 +00:00
Simon Pilgrim	1e0c591375	[X86] Use getConstantOperandAPInt to detect out-of-range shifts. llvm-svn: 356549	2019-03-20 11:41:52 +00:00
Andrea Di Biagio	9b556c1aa8	[X86] Remove X86 specific dag nodes for RDTSC/RDTSCP/RDPMC. NFCI This patch removes the following dag node opcodes from namespace X86ISD: RDTSC_DAG, RDTSCP_DAG, RDPMC_DAG The logic that expands RDTSC/RDPMC/XGETBV intrinsics is basically the same. The only differences are: RDTSC/RDTSCP don't implicitly read ECX. RDTSCP also implicitly writes ECX. I moved the common expansion logic into a helper function with the goal to get rid of code repetition. That helper is now used for the expansion of RDTSC/RDTSCP/RDPMC/XGETBV intrinsics. No functional change intended. Differential Revision: https://reviews.llvm.org/D59547 llvm-svn: 356546	2019-03-20 11:21:15 +00:00
Sylvestre Ledru	f72a86b296	[perf][DebugInfo] follow up for "add SectionedAddress to DebugInfo interfaces" Summary: Fix the build failure when perf jit is enabled Reviewers: avl, dblaikie Reviewed By: avl Subscribers: modocache, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59189 llvm-svn: 356542	2019-03-20 10:02:18 +00:00
David Stuttard	25d4c77ffe	[AMDGPU] Allow MIMG with no uses in adjustWritemask in isel Summary: If an MIMG instruction has managed to get through to adjustWritemask in isel but has no uses (and doesn't enable TFC) then prevent an assertion by not attempting to adjust the writemask. The instruction will be removed anyway. Change-Id: I9a5dba6bafe1f35ac99c1b73df390936e2ac27a7 Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58964 llvm-svn: 356540	2019-03-20 09:29:55 +00:00
Philip Reames	216e43ed29	[instcombine] Add todos describing missing transforms for masked.* intrinsics llvm-svn: 356536	2019-03-20 03:36:05 +00:00
Craig Topper	8e62d2c40a	[X86] Remove X32 check lines from a test that doesn't have an X32 FileCheck prefix. Regenerate the test using update_llc_test_checks. NFC llvm-svn: 356535	2019-03-20 03:13:28 +00:00
Douglas Yung	b9932c7b68	Retry to add workaround to build scoped enums with VS2015. NFCI. We need this as we still have internal build bots on VS2015. llvm-svn: 356534	2019-03-20 01:52:40 +00:00
Douglas Yung	034040844e	Revert "Add workaround to build scoped enums with VS2015. NFCI." This reverts commit 6080a6fb1949a2bdf053245d6062c7bf58dae7a6 (r356532). Clang does not accept this syntax, so reverting this until I can find something that works across all compilers. llvm-svn: 356533	2019-03-20 00:41:12 +00:00
Douglas Yung	aef16dc3c1	Add workaround to build scoped enums with VS2015. NFCI. We need this as we still have internal build bots on VS2015. llvm-svn: 356532	2019-03-20 00:26:56 +00:00
Craig Topper	6fda1bfa03	[X86] Re-disable cmpxchg16b for 32-bit mode assembly parsing. This was broken recently when I factored the 64 bit mode check into hasCmpxchg16 without thinking about the AssemblerPredicate. llvm-svn: 356531	2019-03-19 23:57:16 +00:00
Eli Friedman	3f1eea6936	[ARM] Make sure to save/restore LR when we use tBfar. This change does two things. One, it ensures compilation will abort instead of miscompiling if ARMFrameLowering::determineCalleeSaves chooses not to save LR in a case where it's necessary. Two, it changes the way we estimate the size of a function to be more conservative in the presence of constant pool entries and jump tables. EstimateFunctionSizeInBytes probably still isn't really conservative enough, but I'm not sure how we can come up with a reliable estimate before constant islands runs. Differential Revision: https://reviews.llvm.org/D59439 llvm-svn: 356527	2019-03-19 21:48:08 +00:00
Amara Emerson	36fb50659e	[AArch64][GlobalISel] Add an optimization to select vector DUP instructions. This adds pattern matching for the insert+shufflevector sequence so we can generate dup instructions instead of the current TBL sequence. Differential Revision: https://reviews.llvm.org/D59558 llvm-svn: 356526	2019-03-19 21:43:05 +00:00
Amara Emerson	3e478dfc75	[AArch64][GlobalISel] Make v4s32 G_IMPLICIT_DEF legal. llvm-svn: 356525	2019-03-19 21:43:02 +00:00
Reid Kleckner	1d2281fae7	Remove MSVC compat hack since the inline keyword was added in 2015 Our minimum MSVC toolchain requirement is greater than 2015, so we don't need this conditional macro anymore. New versions of MSVC apparently have a header, xkeycheck.h, to check that keywords haven't been redefined. Fixes PR41144 llvm-svn: 356524	2019-03-19 21:40:59 +00:00
Francis Visoiu Mistrih	2c08b15b1a	[Remarks] Fix gcc build for r356519 Fails here: http://lab.llvm.org:8011/builders/clang-cmake-x86_64-sde-avx512-linux/builds/20046/steps/build%20stage%201/logs/stdio llvm-svn: 356522	2019-03-19 21:32:03 +00:00
Florian Hahn	a22a579f5c	[DwarfDebug] Add triple to test. llvm-svn: 356521	2019-03-19 21:18:59 +00:00
Nikita Popov	954c893921	[InstSimplify] Add additional cmp of abs without nsw tests; NFC llvm-svn: 356520	2019-03-19 21:12:21 +00:00
Francis Visoiu Mistrih	aad252295b	Reland "[Remarks] Add a new Remark / RemarkParser abstraction" This adds a Remark class that allows us to share code when working with remarks. The C API has been updated to reflect this. Instead of the parser generating C structs, it's now using a C++ object that is used through opaque pointers in C. This gives us much more flexibility on what changes we can make to the internal state of the object and interacts much better with scenarios where the library is used through dlopen. * C API updates: * move from C structs to opaque pointers and functions * the remark type is now an enum instead of a string * unit tests updates: * use mostly the C++ API * keep one test for the C API * rename to YAMLRemarksParsingTest * a typo was fixed: AnalysisFPCompute -> AnalysisFPCommute. * a new error message was added: "expected a remark tag." * llvm-opt-report has been updated to use the C++ parser instead of the C API Differential Revision: https://reviews.llvm.org/D59049 Original llvm-svn: 356491 llvm-svn: 356519	2019-03-19 21:11:07 +00:00
Robert Lougher	a0c4fb1375	Revert r356511 "[TailCallElim] Add tailcall elimination pass to LTO pipelines" Due to buildbot failures (LLD tests). llvm-svn: 356516	2019-03-19 20:54:20 +00:00
Florian Hahn	21369d219c	[DwarfDebug] Skip entries to big for 16 bit size field in Dwarf < 5. Nothing prevents entries from being bigger than the 16 bit size field in Dwarf < 5. For entries that are too big, just emit an empty entry instead of crashing. This fixes PR41038. Reviewers: probinson, aprantl, davide Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D59518 llvm-svn: 356514	2019-03-19 20:37:06 +00:00
Robert Lougher	0f1b915264	[TailCallElim] Add tailcall elimination pass to LTO pipelines LTO provides additional opportunities for tailcall elimination due to link-time inlining and visibility of nocapture attribute. Testing showed negligible impact on compilation times. Differential Revision: https://reviews.llvm.org/D58391 llvm-svn: 356511	2019-03-19 20:24:28 +00:00
Philip Reames	26c2a9475e	Demanded elements support for masked.load and masked.gather Teach instcombine to propagate demanded elements through a masked load or masked gather instruction. This is in the broader context of improving vector pointer instcombine under https://reviews.llvm.org/D57140. Differential Revision: https://reviews.llvm.org/D57372 llvm-svn: 356510	2019-03-19 20:10:00 +00:00
Matt Arsenault	929b067e02	CodeGen: Refactor regallocator command line and target selection This will allow targets more flexibility to replace the register allocator core passes. In a future commit, AMDGPU will run the core register assignment passes twice, and will also want to disallow using the standard -regalloc option. llvm-svn: 356506	2019-03-19 19:33:12 +00:00
Matt Arsenault	abfe49c4b1	RegAllocFast: Do not allocate registers for undef uses Do not actually allocate a register for an undef use. Previously we we would create unnecessary reload instruction for undef uses where the register wasn't live. Patch by Matthias Braun llvm-svn: 356501	2019-03-19 19:16:04 +00:00
Matt Arsenault	be591983cd	RegAllocFast: Remove early selection loop, the spill calculation will report cost 0 anyway for free regs The 2nd loop calculates spill costs but reports free registers as cost 0 anyway, so there is little benefit from having a separate early loop. Surprisingly this is not NFC, as many register are marked regDisabled so the first loop often picks up later registers unnecessarily instead of the first one available in the allocation order... Patch by Matthias Braun llvm-svn: 356499	2019-03-19 19:01:34 +00:00
Simon Pilgrim	2fece1c29e	Fix for ABS legalization on PPC buildbot. llvm-svn: 356498	2019-03-19 18:55:46 +00:00
Philip Reames	74f713dd57	Allow unordered loads to be considered invariant in CodeGen The actual code change is fairly straight forward, but exercising it isn't. First, it turned out we weren't adding the appropriate flags in SelectionDAG. Second, it turned out that we've got some optimization gaps, so obvious test cases don't work. My first attempt (in atomic-unordered.ll) points out a deficiency in our peephole-opt folding logic which I plan to fix separately. Instead, I'm exercising this through MachineLICM. Differential Revision: https://reviews.llvm.org/D59375 llvm-svn: 356494	2019-03-19 18:27:18 +00:00

1 2 3 4 5 ...

176482 Commits