llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-02 00:16:25 +00:00

Author	SHA1	Message	Date
Sanjay Patel	c445b48e19	[x86] add test for setcc sub->shift transform; NFC llvm-svn: 353618	2019-02-09 16:41:20 +00:00
Simon Pilgrim	8f0a16a31c	[X86] Regenerate test. llvm-svn: 353616	2019-02-09 16:27:19 +00:00
Sanjay Patel	e05a91cc53	[TargetLowering] avoid miscompile in setcc transform (PR40657) llvm-svn: 353615	2019-02-09 15:59:02 +00:00
George Rimar	9b6f6de2f1	[yaml2elf.cpp] - Fix compilation under linux. Fixes errors like: /home/ssglocal/clang-cmake-x86_64-sde-avx512-linux/clang-cmake-x86_64-sde-avx512-linux/llvm/tools/yaml2obj/yaml2elf.cpp:597:5: error: need ‘typename’ before ‘ELFT:: Xword’ because ‘ELFT’ is a dependent scope ELFT::Xword Tag = (ELFT::Xword)DE.Tag; llvm-svn: 353614	2019-02-09 15:18:52 +00:00
George Rimar	24002d5f1d	[yaml2elf] - An attemp to fix s390x BB after r353607. s390x is big-endian and seems r353607 had an issue with endianess, Bot was unhappy: http://lab.llvm.org:8011/builders/clang-s390x-linux-lnt/builds/11168/steps/ninja%20check%201/logs/stdio This should fix it. llvm-svn: 353613	2019-02-09 15:03:19 +00:00
Nikita Popov	9ae731593f	Revert "[SelectionDAG] Extract [US]MULO expansion into TL method; NFC" This reverts commit r353611. Triggers an assertion during the libcall expansion on ARM. llvm-svn: 353612	2019-02-09 13:54:02 +00:00
Nikita Popov	ad643aaa07	[SelectionDAG] Extract [US]MULO expansion into TL method; NFC In preparation for supporting vector expansion. Also drop a variant of ExpandLibCall, of which the MULO expansions were the only user. llvm-svn: 353611	2019-02-09 13:29:22 +00:00
Simon Pilgrim	f09ddc4760	[X86][SSE] Generalize X86ISD::BLENDI support to more value types D42042 introduced the ability for the ExecutionDomainFixPass to more easily change between BLENDPD/BLENDPS/PBLENDW as the domains required. With this ability, we can avoid most bitcasts/scaling in the DAG that was occurring with X86ISD::BLENDI lowering/combining, blend with the vXi32/vXi64 vectors directly and use isel patterns to lower to the float vector equivalent vectors. This helps the shuffle combining and SimplifyDemandedVectorElts be more aggressive as we lose track of fewer UNDEF elements than when we go up/down through bitcasts. I've introduced a basic blend(bitcast(x),bitcast(y)) -> bitcast(blend(x,y)) fold, there are more generalizations I can do there (e.g. widening/scaling and handling the tricky v16i16 repeated mask case). The vector-reduce-smin/smax regressions will be fixed in a future improvement to SimplifyDemandedBits to peek through bitcasts and support X86ISD::BLENDV. Differential Revision: https://reviews.llvm.org/D57888 llvm-svn: 353610	2019-02-09 13:13:59 +00:00
George Rimar	a30611d91e	[lib/ObjectYAML] - Fix BB after r353607 [2]. NFC. The second and the last place it seems. Error was: [ 4%] Building CXX object lib/Support/CMakeFiles/LLVMSupport.dir/Error.cpp.o /Users/buildslave/as-bldslv9_new/lld-x86_64-darwin13/llvm.src/lib/ObjectYAML/ELFYAML.cpp:993:15: error: unused variable 'Object' [-Werror,-Wunused-variable] const auto Object = static_cast<ELFYAML::Object >(IO.getContext()); llvm-svn: 353609	2019-02-09 12:14:20 +00:00
George Rimar	eaded9e364	[lib/ObjectYAML] - Fix BB after r353607. NFC. Error was: [ 4%] Building CXX object lib/Support/CMakeFiles/LLVMSupport.dir/DAGDeltaAlgorithm.cpp.o /Users/buildslave/as-bldslv9_new/lld-x86_64-darwin13/llvm.src/lib/ObjectYAML/ELFYAML.cpp:666:15: error: unused variable 'Object' [-Werror,-Wunused-variable] const auto Object = static_cast<ELFYAML::Object >(IO.getContext()); (http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/29920) llvm-svn: 353608	2019-02-09 12:04:39 +00:00
George Rimar	2a4dd5ebf5	[yaml2obj][obj2yaml] - Add support for dumping/parsing .dynamic sections. This teaches the tools to parse and dump the .dynamic section and its dynamic tags. Differential revision: https://reviews.llvm.org/D57691 llvm-svn: 353606	2019-02-09 11:34:28 +00:00
Fangrui Song	cf69aa8d04	[GlobalOpt] Simplify __cxa_atexit elimination cxxDtorIsEmpty checks callers recursively to determine if the __cxa_atexit-registered function is empty, and eliminates the __cxa_atexit call accordingly. This recursive check is unnecessary as redundant instructions and function calls can be removed by early-cse and inliner. In addition, cxxDtorIsEmpty does not mark visited function and it may visit a function exponential times (multiplication principle). llvm-svn: 353603	2019-02-09 09:18:37 +00:00
Petr Hosek	9cc4992edf	[CMake] Don't set <PROJECT>_STANDALONE_BUILD We shouldn't be treating runtimes builds as standalone builds since we have enough of the context loaded into the runtimes environment. Differential Revision: https://reviews.llvm.org/D57992 llvm-svn: 353601	2019-02-09 03:06:56 +00:00
Hubert Tong	9928505155	[MC] Clean up unused inline function and non-anchor defaulted destructors; NFCI Summary: Take care of some missing clean-ups that belong with r249548 and some other copy/paste that had happened. In particular, the destructors are no longer vtable anchors after r249548; and `setSectionName` in `MCSectionWasm` is private and unused since r313058 culled its only caller. The destructors are now implicitly defined, and the unused function is removed. Reviewers: nemanjai, jasonliu, grosbach Reviewed By: nemanjai Subscribers: sbc100, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57182 llvm-svn: 353597	2019-02-09 02:11:51 +00:00
Gabor Buella	9dff2aa9d7	Extra processing for BitCast + PHI in InstCombine For some specific cases with bitcast A->B->A with intervening PHI nodes InstCombiner::optimizeBitCastFromPhi transformation creates extra PHI nodes, which are actually a copy of already created PHI or in another words, they are redundant. These extra PHI nodes could lead to extra move instructions generated after DeSSA transformation. This happens when several conditions are met - SROA kicks in and creates new alloca; - there is a simple assignment L = R, which falls under 'canonicalize loads' done by combineLoadToOperationType (this transformation is by default). Exactly this transformation is the reason of bitcasts generated; - the alloca is then used in A->B->A + PHI chain; - there is a loop unrolling. As a result optimizeBitCastFromPhi creates as many of PHI nodes for each new SROA alloca as loop unrolling factor is. These new extra PHI nodes are redundant actually except of one and should not be created. Moreover the idea of optimizeBitCastFromPhi is to get rid of the cast (when possible) but that doesn't happen in these conditions. The proposed fix is to do the cast replacement for the whole calculated/accumulated PHI closure not for one cast only, which is an argument to the optimizeBitCastFromPhi. These will help to accomplish several things: 1) avoid extra PHI nodes generated as all casts which may trigger optimizeBitCastFromPhi transformation will be replaced, 3) bitcasts will be replaced, and 3) create more opportunities to remove dead code, which appears after the replacement. A new test case shows that it's possible to get rid of all bitcasts completely and get quite good code reduction. Author: Igor Tsimbalist <igor.v.tsimbalist@intel.com> Reviewed By: Carrot Differential Revision: https://reviews.llvm.org/D57053 llvm-svn: 353595	2019-02-09 01:44:28 +00:00
Stanislav Mekhanoshin	a488a69a64	[AMDGPU] Split idot4/8 signed and unsigned tests. NFC. llvm-svn: 353593	2019-02-09 01:02:28 +00:00
Mikhail R. Gadelha	7db7f7f32f	This reverts commit 1440a848a635849b97f7a5cfa0ecc40d37451f5b. and commit a1853e834c65751f92521f7481b15cf0365e796b. They broke arm and aarch64 llvm-svn: 353590	2019-02-09 00:46:12 +00:00
Jessica Paquette	cb5da3f61a	Recommit "[GlobalISel] Introduce a generic floating point floor opcode, G_FFLOOR"" After r353586, we won't fail on the AMDGPU floor pattern that was killing the importer before. llvm-svn: 353589	2019-02-09 00:37:31 +00:00
Stanislav Mekhanoshin	4063d7d54a	[AMDGPU] Split dot-insts feature Differential Revision: https://reviews.llvm.org/D57971 llvm-svn: 353587	2019-02-09 00:34:21 +00:00
Jessica Paquette	4ef25cfd76	[GlobalISel] Skip patterns that define complex suboperands twice instead of dying If we run into a pattern that looks like this: add (complex $x, $y) (complex $x, $z) We should skip the pattern instead of asserting/doing something unpredictable. This makes us return an Error in that case, and adds a testcase for skipped patterns. Differential Revision: https://reviews.llvm.org/D57980 llvm-svn: 353586	2019-02-09 00:29:13 +00:00
Nico Weber	075a821712	gn build: Merge r353566 llvm-svn: 353585	2019-02-09 00:21:06 +00:00
Sergey Dmitriev	963f14674f	[NFC] Avoid passing blocks vector to the OutlineRegionInfo constructor by value. Reviewers: vsk, fhahn, davidxl Reviewed By: vsk Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57957 llvm-svn: 353582	2019-02-08 23:52:15 +00:00
Sanjay Patel	e314e9709a	[x86] add test for miscompiling setcc transform (PR40657); NFC llvm-svn: 353580	2019-02-08 23:34:57 +00:00
Francis Visoiu Mistrih	aa89ecf9c8	Re-apply r353553 "[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder" With a fix after r353563 that adds some more opcodes. llvm-svn: 353579	2019-02-08 23:34:11 +00:00
Francis Visoiu Mistrih	4b7c8f212e	Revert r353553 "[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder" This reverts commit r353553. This breaks CodeGen/AArch64/GlobalISel/legalize-ext-csedebug-output.mir: http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/57963/console llvm-svn: 353575	2019-02-08 22:49:43 +00:00
Craig Topper	27aa226f8e	[Docs] Use code-block:: text for part of the callbr documentation to attempt to make the bot happy. llvm-svn: 353567	2019-02-08 21:09:33 +00:00
Craig Topper	7cd7459241	[X86] Add FPCW as an implicit use on floating point load instructions. These instructions can generate a stack overflow exception so technically they read the stack overflow exception mask bit. llvm-svn: 353564	2019-02-08 20:50:09 +00:00
Craig Topper	ea7e6b3857	Implementation of asm-goto support in LLVM This patch accompanies the RFC posted here: http://lists.llvm.org/pipermail/llvm-dev/2018-October/127239.html This patch adds a new CallBr IR instruction to support asm-goto inline assembly like gcc as used by the linux kernel. This instruction is both a call instruction and a terminator instruction with multiple successors. Only inline assembly usage is supported today. This also adds a new INLINEASM_BR opcode to SelectionDAG and MachineIR to represent an INLINEASM block that is also considered a terminator instruction. There will likely be more bug fixes and optimizations to follow this, but we felt it had reached a point where we would like to switch to an incremental development model. Patch by Craig Topper, Alexander Ivchenko, Mikhail Dvoretckii Differential Revision: https://reviews.llvm.org/D53765 llvm-svn: 353563	2019-02-08 20:48:56 +00:00
Vedant Kumar	1ffce06c6d	[CodeExtractor] Restore outputs after creating exit stubs When CodeExtractor saves the result of InvokeInst at the first insertion point of the 'normal destination' basic block, this block can be omitted in the outlined region, so store is placed outside of the function. The suggested solution is to process saving outputs after creating exit stubs for new function, and stores will be placed in that blocks before return in this case. Patch by Sergei Kachkov! Fixes llvm.org/PR40455. Differential Revision: https://reviews.llvm.org/D57919 llvm-svn: 353562	2019-02-08 20:48:04 +00:00
Matt Arsenault	c611b8c0f6	AMDGPU/GlobalISel: Fix broken tests llvm-svn: 353559	2019-02-08 19:59:39 +00:00
Matt Arsenault	002a487960	AMDGPU: Eliminate GPU specific SubtargetFeatures Inline compatability is determined from the individual feature bits. These are just sets of the separate features, but will always be treated as incompatible unless they are specifically ignored. Defining the ISA version number here in tablegen would be nice, but it turns out this wasn't actually used. llvm-svn: 353558	2019-02-08 19:59:32 +00:00
Nemanja Ivanovic	fd45134124	[DAGCombine] Optimize pow(X, 0.75) to sqrt(X) * sqrt(sqrt(X)) The sqrt case is faster and we already do this for the case where the exponent is 0.25. This adds the 0.75 case which is also not sensitive to signed zeros. Patch by Whitney Tsang (Whitney) Differential revision: https://reviews.llvm.org/D57434 llvm-svn: 353557	2019-02-08 19:50:58 +00:00
Aditya Nandakumar	1cbeacb1ea	[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder https://reviews.llvm.org/D57932 Add some logging + tests to make sure CSEInfo prints debug output. reviewed by: arsenm llvm-svn: 353553	2019-02-08 19:41:13 +00:00
Jonathan Metzman	49e5279126	Document libFuzzer on Windows. Summary: Document that libFuzzer supports Windows, how to get it, and its limitations. Reviewers: kcc, morehouse, rnk, metzman Reviewed By: kcc, rnk, metzman Subscribers: hans, rnk Differential Revision: https://reviews.llvm.org/D57597 llvm-svn: 353551	2019-02-08 19:35:04 +00:00
Rong Xu	10486b5c91	[Cmake] Add an option to build LLVM using the experimental new pass manager Add LLVM_USE_NEWPM to build LLVM using the experimental new pass manager. Differential Revision: http://reviews.llvm.org/D57068 llvm-svn: 353550	2019-02-08 19:31:03 +00:00
Matt Arsenault	52c196b513	AMDGPU: Remove GCN features and predicates These are no longer necessary since the R600 tablegen files are split out now. llvm-svn: 353548	2019-02-08 19:18:01 +00:00
Reid Kleckner	9cb616246a	[InstrProf] Implement static profdata registration Summary: The motivating use case is eliminating duplicate profile data registered for the same inline function in two object files. Before this change, users would observe multiple symbol definition errors with VC link, but links with LLD would succeed. Users (Mozilla) have reported that PGO works well with clang-cl and LLD, but when using LLD without this static registration, we would get into a "relocation against a discarded section" situation. I'm not sure what happens in that situation, but I suspect that duplicate, unused profile information was retained. If so, this change will reduce the size of such binaries with LLD. Now, Windows uses static registration and is in line with all the other platforms. Reviewers: davidxl, wmi, inglorion, void, calixte Subscribers: mgorny, krytarowski, eraman, fedor.sergeev, hiraditya, #sanitizers, dmajor, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D57929 llvm-svn: 353547	2019-02-08 19:03:50 +00:00
Simon Pilgrim	9dc95b1f60	[TargetLowering] Use ISD::FSHR in expandFixedPointMul Replace OR(SHL,SRL) pattern with ISD::FSHR (legalization expands this later if necessary) - this helps with the scale == 0 'undefined' drop-through case that was discussed on D55720. llvm-svn: 353546	2019-02-08 18:57:38 +00:00
Jonas Devlieghere	0ede6b8de7	[test] Run the verifier for dsymutil module tests Dsymutil has an option "verify" that runs the dwarf verifier on the generated dSYM. This patch enables this for the module tests. llvm-svn: 353544	2019-02-08 18:43:11 +00:00
Simon Pilgrim	57a256b699	[TargetLowering] Add SimplifyDemandedBits funnel shift support llvm-svn: 353539	2019-02-08 17:19:01 +00:00
Teresa Johnson	8b33f920de	ArgumentPromotion should copy all metadata to new Function Summary: ArgumentPromotion had code to specifically move the dbg metadata over to the new function, but other metadata such as the function_entry_count !prof metadata was not. Replace code that moved dbg metadata with a call to copyMetadata. The old metadata is automatically removed when the old Function is removed. Reviewers: davidxl Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57846 llvm-svn: 353537	2019-02-08 17:08:27 +00:00
Craig Topper	fef2e4caad	[X86] Remove isReMaterializable from X87 floating point constant loads and constant pool loads. Summary: These instructions update FPSW so they aren't generically safe to rematerialize into any location if FPSW is live for a comparison result. They also use FPCW for exception masking control. Though the only exception they can generate is stack overflow and we manage the stack ourselves so that's not really going to occur. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57934 llvm-svn: 353536	2019-02-08 17:07:54 +00:00
Simon Pilgrim	eb1c360b05	[X86] Add basic funnel shift demanded bits tests llvm-svn: 353534	2019-02-08 16:51:16 +00:00
Sanjay Patel	a9c1b4b1bb	[x86] fix formatting; NFC (test commit #2 migrating to git) llvm-svn: 353533	2019-02-08 16:48:40 +00:00
Carl Ritson	5948082154	[AMDGPU] Fix CS scratch setup on pre-GCN3 ASICs Summary: Prior to GCN3 s_load_dword offsets are in dwords rather than bytes. Thus the scratch buffer descriptor offset must be adjusted for pre-GCN3 ASICs. Reviewers: nhaehnle, tpr Reviewed By: nhaehnle Subscribers: sheredom, arsenm, kzhuravl, jvesely, wdng, yaxunl, dstuttard, t-tye, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D56496 llvm-svn: 353530	2019-02-08 15:41:11 +00:00
Nirav Dave	99f1b60bb1	Revert r353416 "[DAG] Cleanup unused nodes on failed store-to-load forward combine." This cleanup causes out-of-tree crashes. llvm-svn: 353527	2019-02-08 15:21:13 +00:00
Matt Arsenault	75b6905f36	AMDGPU/GlobalISel: Fix shift legalization for non-power-of-2 clampScalar doesn't do anything for non-power-of-2 in range. There should probably be a combination rule to reduce the number of matching rules. llvm-svn: 353526	2019-02-08 15:06:24 +00:00
Dmitry Preobrazhensky	bdac3185da	[AMDGPU][MC] Added support of lds_direct operand See bug 39293: https://bugs.llvm.org/show_bug.cgi?id=39293 Reviewers: artem.tamazov, rampitec Differential Revision: https://reviews.llvm.org/D57889 llvm-svn: 353524	2019-02-08 14:57:37 +00:00
Matt Arsenault	e16bc10b18	AMDGPU/GlobalISel: Fix non-power-of-2 implicit_def llvm-svn: 353522	2019-02-08 14:46:27 +00:00
Eugene Leviant	34b036da71	[llvm-objcopy] Add few file processing directives Differential revision: https://reviews.llvm.org/D57877 llvm-svn: 353521	2019-02-08 14:37:54 +00:00

... 3 4 5 6 7 ...

175176 Commits