llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-11 13:37:07 +00:00

Author	SHA1	Message	Date
Xiang1 Zhang	61465474e9	Bugfix for buildbot failure at commit 43f031d31264d20cfb8f1ebd606c66e57c231d4d Author: Xiang1 Zhang <xiang1.zhang@intel.com> Date: Fri Apr 3 11:25:38 2020 +0800 Enable IBT(Indirect Branch Tracking) in JIT with CET(Control-flow Enforcement Technology)	2020-04-03 13:25:35 +08:00
Scott Constable	f8acc0c70c	[X86] Refactor X86IndirectThunks.cpp to Accommodate Mitigations other than Retpoline Introduce a ThunkInserter CRTP base class from which new thunk types can inherit, e.g., thunks to mitigate https://software.intel.com/security-software-guidance/software-guidance/load-value-injection. Differential Revision: https://reviews.llvm.org/D76811	2020-04-02 22:09:54 -07:00
LLVM GN Syncbot	6c5517bebe	[gn build] Port 71e8021d82c	2020-04-03 04:56:02 +00:00
Scott Constable	04c04cabf9	[X86][NFC] Generalize the naming of "Retpoline Thunks" and related code to "Indirect Thunks" There are applications for indirect call/branch thunks other than retpoline for Spectre v2, e.g., https://software.intel.com/security-software-guidance/software-guidance/load-value-injection Therefore it makes sense to refactor X86RetpolineThunks as a more general capability. Differential Revision: https://reviews.llvm.org/D76810	2020-04-02 21:55:13 -07:00
laith sakka	86ebc533d4	Handle exp2 with proper vectorization and lowering to SVML calls Summary: Add mapping from exp2 math functions to corresponding SVML calls. This is a follow up and extension for llvm diff https://reviews.llvm.org/D19544 Test Plan: - update test case and run ninja check. - run tests locally Reviewers: wenlei, hoyFB, mmasten, mzolotukhin, spatel Reviewed By: spatel Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77114	2020-04-02 21:11:13 -07:00
Hongtao Yu	c2770242a6	Fix a bug in the inliner that causes subsequent double inlining Summary: A recent change in the instruction simplifier enables a call to a function that just returns one of its parameters to be simplified as simply loading the parameter. This exposes a bug in the inliner where double inlining may be involved which in turn may cause compiler ICE when an already-inlined callsite is reused for further inlining. To put it simply, in a C program like the following, when the function call second(t) is inlined, its code t = third(t) will be reduced to just loading the return value of the callsite first(). This causes the inliner internal data structure to register the first() callsite for the call edge representing the third() call, and therefore incurs a double inlining when both call edges are considered an inline candidate. I'm making a fix to break the inliner from reusing a callsite for new call edges. ``` void top() { int t = first(); second(t); } void second(int t) { t = third(t); fourth(t); } int third(int t) { return t; } ``` The actual failing case is much trickier than the example here and is only reproducible with the legacy inliner. The way the legacy inliner works is to process each SCC in a bottom-up order. That means in reality function first may be already inlined into top, or function third is either inlined to second or is folded into nothing. To repro the failure seen from building a large application, we need to figure out a way to confuse the inliner so that the bottom-up inlining is not fulfilled. I'm doing this by making the second call indirect so that the alias analyzer fails to figure out the right call graph edge from top to second and top can be processed before second during the bottom-up.
We also need to tweak the test code so that when the inlining of top happens, the function body of second is not that optimized, by delaying the pass of function attribute deducer (i.e, which tells function third has no side effect and just returns its parameter). Since the CGSCC pass is iterative, additional calls are added to top to postpone the inlining of second to the second round right after the first function attribute deducing pass is done. I haven't been able to repro the failure with the new pass manager since the processing order of ininlined callsites is a bit different, but in theory the issue could happen there too. Note that this fix could introduce a side effect that blocks the simplification of inlined code, specifically for a call site that can be folded to another call site. I hope this can probably be complemented by subsequent inlining or folding, as shown in the attached unit test. The ideal fix should be to separate the use of VMap. However, in reality this failing pattern shouldn't happen often. And even if it happens, there should be a good chance that the non-folded call site will be refolded by iterative inlining or subsequent simplification. Reviewers: wenlei, davidxl, tejohnson Reviewed By: wenlei, davidxl Subscribers: eraman, nikic, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76248	2020-04-02 21:08:05 -07:00
Xiang1 Zhang	4a90154b10	Enable IBT(Indirect Branch Tracking) in JIT with CET(Control-flow Enforcement Technology) Summary: This patch comes from H.J.'s `2bd54ce7fa`. This patch fixes the llvm unit tests that fail when running on a CET machine (e.g. ExecutionEngine/MCJIT/MCJITTests). The main reason we enable IBT when the JIT is compiled with CET is that the JIT doesn't know whether its caller program is CET-enabled. If the JIT's caller program is non-CET, it doesn't matter whether the JIT generates CET code. But if the JIT's caller program is CET-enabled, the JIT must generate CET code or it will cause control protection exceptions. I have tested the patch with llvm-unit-test and llvm-test-suite on a CET machine, and it passed. H.J. also tested it by building and running VNCserver (Virtual Network Console), and it works too. (Without this patch, VNCserver crashes on a CET machine.) Reviewers: hjl.tools, craig.topper, LuoYuanke, annita.zhang, pengfei Subscribers: tstellar, efriedma, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76900	2020-04-03 11:44:07 +08:00
Jessica Paquette	c863afa226	[AArch64][GlobalISel] Constrain reg operands in selectBrJT This was causing a machine verifier failure on the test suite. Make sure that we don't end up with a weird register class here. Failure for reference: * Bad machine code: Illegal virtual register for instruction * - function: check_constrain - basic block: %bb.1 (0x7f8b70839f80) - instruction: early-clobber %6:gpr64, early-clobber %7:gpr64sp = JumpTableDest32 %5:gpr64, %1:gpr64sp, %jump-table.0 - operand 3: %1:gpr64sp Expected a GPR64 register, but got a GPR64sp register Differential Revision: https://reviews.llvm.org/D77349	2020-04-02 20:34:11 -07:00
Wenju He	bd8a5b6d45	[x86] Fix Intel OpenCL builtin CalleeSavedRegs on skx Summary: Align with AVX512 builtins implementations, some of which don't preserve rdi. Reviewers: yubing, tianqing, craig.topper Reviewed By: craig.topper Subscribers: yaxunl, Anastasia, hiraditya Differential Revision: https://reviews.llvm.org/D77032	2020-04-03 11:27:40 +08:00
Qiu Chaofan	22d4391a3c	[PowerPC] Remove unnecessary XSRSP instruction MI peephole will remove unnecessary FRSP instructions. This patch removes such unnecessary XSRSP. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D77208	2020-04-03 11:05:14 +08:00
Nico Weber	4b82ef3815	This might actually fix the Windows bots after a16ba6fea.	2020-04-02 22:22:54 -04:00
Nico Weber	4cfb8c14dd	another wild guess at the win bot-only failures	2020-04-02 21:23:05 -04:00
Jun Ma	c5c5d9a37e	[Coroutines] Simplify implementation using removePredecessor Differential Revision: https://reviews.llvm.org/D77035	2020-04-03 09:20:07 +08:00
Austin Kerbow	7db103f3ff	[AMDGPU] Handle SMRD signed offset immediate Summary: This fixes a few issues related to SMRD offsets. On gfx9 and gfx10 we have a signed byte offset immediate, however we can overflow into a negative since we treat it as unsigned. Also, the SMRD SOFFSET sgpr is an unsigned offset on all subtargets. We sometimes tried to use negative values here. Third, S_BUFFER instructions should never use a signed offset immediate. Differential Revision: https://reviews.llvm.org/D77082	2020-04-02 17:41:52 -07:00
Adrian Prantl	419e268c4e	Teach the stripNonLineTableDebugInfo pass about the llvm.dbg.label intrinsic. Debug info for labels is not generated at -gline-tables-only, so this pass should remove them. Differential Revision: https://reviews.llvm.org/D77345	2020-04-02 17:39:33 -07:00
Adrian Prantl	65c9f7617e	Teach the stripNonLineTableDebugInfo pass about the llvm.dbg.addr intrinsic. This patch also strips llvm.dbg.addr intrinsics when downgrading debug info to linetables-only. Differential Revision: https://reviews.llvm.org/D77343	2020-04-02 17:39:33 -07:00
Nico Weber	88cb3a4bb1	try more to appease win bots	2020-04-02 20:27:31 -04:00
Nico Weber	1d99d4330d	Try again to get tests passing again on Windows. Things pass locally, but some tests on some bots are still unhappy. I'm not sure why. See if using forward slashes as before helps.	2020-04-02 20:00:38 -04:00
Lang Hames	da5bdc7302	Re-apply 0071eaaf089, "[ORC] Export __cxa_atexit ...", with fixes. Forgot to include part of the testcase. Thanks to Nico for spotting that and reverting!	2020-04-02 16:03:35 -07:00
Matt Arsenault	ba546232a7	AMDGPU: Fix broken check lines	2020-04-02 18:52:49 -04:00
Hubert Tong	7251d92ce1	[llvm-objdump][COFF][NFC] Split format-specific interfaces; add namespace Summary: This patch addresses, for the interfaces implemented by `COFFDump.cpp`, multiple issues identified with the current structure of `llvm-objdump.h` in the review of D72973. This patch moves implementation details of the tool into an `llvm::objdump` namespace for external linkage names, splits the implementation details into separate headers for each implementation file, and uses qualified names when declaring members of the `llvm::objdump` namespace in place of leaving the namespace definition open. Reviewers: jhenderson, DiggerLin, jasonliu, daltenty, MaskRay Reviewed By: jhenderson, MaskRay Subscribers: MaskRay, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77285	2020-04-02 18:42:13 -04:00
Julian Lettner	6d17f38bd6	[lit] Refine filter error handling Picking a default filter `.*` that matches everything lets us streamline some error handling code.	2020-04-02 14:45:54 -07:00
Julian Lettner	0ac61a75cf	[lit] Remove unnecessary indirection in progress_callback On shutdown, the result complete handler is not racing with the main thread anymore because we are now always waiting for process pool termination via ``` finally: pool.join() ```	2020-04-02 14:45:54 -07:00
Matt Arsenault	ef4124b0b5	AMDGPU: Use 128-bit DS operations by default	2020-04-02 17:17:47 -04:00
Matt Arsenault	0d45a4284f	AMDGPU: Add some tests for exotic denormal mode combinations	2020-04-02 17:17:12 -04:00
Matt Arsenault	40d3e7765c	AMDGPU: Remove denormal subtarget features Switch to using the denormal-fp-math/denormal-fp-math-f32 attributes.	2020-04-02 17:17:12 -04:00
Matt Arsenault	8bdcb1c2a9	AMDGPU: Assume f32 denormals are enabled by default This will likely introduce catastrophic performance regressions on older subtargets, but should be correct. A follow up change will remove the old fp32-denormals subtarget features, and switch to using the new denormal-fp-math/denormal-fp-math-f32 attributes. Frontends should be making sure to add the denormal-fp-math-f32 attribute when appropriate to avoid performance regressions.	2020-04-02 17:17:12 -04:00
Duncan P. N. Exon Smith	3a5bd3107b	utils: Tweak clang-parse-diagnostics-file for modules includes Diagnostics from modules do not have a `main-file` listed. Tweak `clang-parse-diagnostics-file` to patch this up. Previously, the call to `os.path.basename` would crash. Radar-Id: rdar://problem/59000292	2020-04-02 14:16:26 -07:00
Nico Weber	b63fb1d467	Reland "Make it possible for lit.site.cfg to contain relative paths, and use it for llvm and clang" The problem on Windows was that the \b in "..\bin" was interpreted as an escape sequence. Use r"" strings to prevent that. This reverts commit ab11b9eefa16661017c2c7b3b34c46b069f43fb7, with raw strings in the lit.site.cfg.py.in files. Differential Revision: https://reviews.llvm.org/D77184	2020-04-02 16:12:03 -04:00
Cyndy Ishida	619546feb9	[llvm][TextAPI] adding inlining reexported libraries support Summary: [llvm][TextAPI] adding inlining reexported libraries support * this patch adds reader/writer support for MachO tbd files. The usecase is to represent reexported libraries in top level library that won't need to exist for linker indirection because all of the needed content will be inlined in the same document. Reviewers: ributzka, steven_wu, jhenderson Reviewed By: ributzka Subscribers: JDevlieghere, hiraditya, mgrang, dexonsmith, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67646	2020-04-02 13:05:08 -07:00
Craig Topper	a316b89267	[X86] Enable combineExtSetcc for vectors larger than 256 bits when we've disabled 512 bit vectors. The compares are going to be type legalized to 256 bits so we might as well fold the extend.	2020-04-02 12:44:27 -07:00
Fangrui Song	59c870af32	Reland D75382 "[lld] Initial commit for new Mach-O backend" With a fix for http://lab.llvm.org:8011/builders/clang-cmake-armv8-lld/builds/3636 Also trims some unneeded dependencies.	2020-04-02 12:03:43 -07:00
Nico Weber	529e4baab6	Revert "Make it possible for lit.site.cfg to contain relative paths, and use it for llvm and clang" This reverts commit fb80b6b2d58c476747a3206bd4371b787108591b and follow-up 631ee8b24adf36359b61ecb47484e8e82de35be8. Seems to not work on Windows: http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast/builds/31684 http://lab.llvm.org:8011/builders/llvm-clang-win-x-aarch64/builds/6512 Let's revert while I investigate.	2020-04-02 15:00:09 -04:00
Nico Weber	52b31b6f35	Make fb80b6b2d58c4 actually work. I broke it with last-minute changes right before committing. Differential Revision: https://reviews.llvm.org/D77184	2020-04-02 14:28:34 -04:00
Anna Thomas	ac950491d8	[InlineFunction] Update valid return attributes at callsite within callee body Consider a callee function that has a call (C) within it which feeds into the return. When we inline that callee into a callsite that has return attributes, we can backward propagate valid attributes to the call (C) within that inlined callee body. This is safe to do so only if we can guarantee transfer of execution to successor in the window of instructions between return value (i.e. the call C) and the return instruction. Also, this is valid only for attributes which are a property of a callsite and not those that are not dependent on the ABI, or a property of the call itself. Reviewed-By: reames, jdoerfert Differential Revision: https://reviews.llvm.org/D76140	2020-04-02 14:13:12 -04:00
Matt Arsenault	5836a23b95	AMDGPU: Hack out noinline on functions using LDS globals This is a workaround for clang adding noinline to all functions at -O0. Previously, we would just add alwaysinline, and the verifier would complain about having both noinline and alwaysinline. We currently can't truly codegen this case as a freestanding function, so override the user forcing noinline.	2020-04-02 14:12:07 -04:00
Nico Weber	16fa9028fd	Make it possible for lit.site.cfg to contain relative paths, and use it for llvm and clang Currently, all generated lit.site.cfg files contain absolute paths. This makes it impossible to build on one machine, and then transfer the build output to another machine for test execution. Being able to do this is useful for several use cases: 1. When running tests on an ARM machine, it would be possible to build on a fast x86 machine and then copy build artifacts over after building. 2. It allows running several test suites (clang, llvm, lld) on 3 different machines, reducing test time from sum(each test suite time) to max(each test suite time). This patch makes it possible to pass a list of variables that should be relative in the generated lit.site.cfg.py file to configure_lit_site_cfg(). The lit.site.cfg.py.in file needs to call `path()` on these variables, so that the paths are converted to absolute form at lit start time. The testers would have to have an LLVM checkout at the same revision, and the build dir would have to be at the same relative path as on the builder. This does not yet cover how to figure out which files to copy from the builder machine to the tester machines. (One idea is to look at the `--graphviz=test.dot` output and copy all inputs of the `check-llvm` target.) Differential Revision: https://reviews.llvm.org/D77184	2020-04-02 13:53:16 -04:00
Sanjay Patel	54629ce96c	[InstCombine] try to reduce shuffle with bitcasted operand shuf (bitcast X), undef, Mask --> bitcast X' The 'inverse shuffles' test (shuf_bitcast_operand) is a pattern in the motivating examples from PR35454: https://bugs.llvm.org/show_bug.cgi?id=35454 (see also D76727) We can deal with this class of patterns in generic instcombine because we are not creating any new shuffles, just a bitcast. Alive2 proof: http://volta.cs.utah.edu:8080/z/mwDUZf Differential Revision: https://reviews.llvm.org/D76844	2020-04-02 13:44:50 -04:00
Sanjay Patel	1e7408d565	[VectorCombine] transform bitcasted shuffle to narrower elements bitcast (shuf V, MaskC) --> shuf (bitcast V), MaskC' We do not attempt this in InstCombine because we do not want to change types and create new shuffle ops that are potentially not lowered as well as the original code. Here, we can check the cost model to see if it is worthwhile. I've aggressively enabled this transform even if the types are the same size and/or equal cost because moving the bitcast allows InstCombine to make further simplifications. In the motivating cases from PR35454: https://bugs.llvm.org/show_bug.cgi?id=35454 ...this is enough to let instcombine and the backend eliminate the redundant shuffles, but we probably want to extend VectorCombine to handle the inverse pattern (shuffle-of-bitcast) to get that simplification directly in IR. Differential Revision: https://reviews.llvm.org/D76727	2020-04-02 13:30:22 -04:00
Stanislav Mekhanoshin	0097125e1d	[AMDGPU] Fix crash in SILoadStoreOptimizer SILoadStoreOptimizer::checkAndPrepareMerge() expects base and paired instruction to come in order and scans MBB from base to the paired instruction. The original order can change if there was a dependent instruction in between and the base instruction was moved. Fixed by bailing out of the optimization. In theory it might still be possible to perform a merge by swapping instructions, but in practice it bails anyway because it finds a dependency on that same instruction which has resulted in the base move. Differential Revision: https://reviews.llvm.org/D77245	2020-04-02 10:26:47 -07:00
Sanjay Patel	c0d3491f6b	[InstCombine] add tests for cmyk benchmark; NFC These are versions of a function that regressed with: rGf2fbdf76d8d0 That particular problem occurs with an instcombine-simplifycfg-instcombine sequence, but we can show that it exists within instcombine only with other variations of the pattern.	2020-04-02 13:00:46 -04:00
LLVM GN Syncbot	48b726dc65	[gn build] Port c00cb76274f	2020-04-02 16:36:36 +00:00
LLVM GN Syncbot	09b61bacb4	[gn build] Port 24bb2d1e776	2020-04-02 16:36:35 +00:00
Nico Weber	0d00935cb5	Revert "[gn build] Port 03f43b3aca36" This reverts commit 45b6364e8d74f6038e94b760f017e03740acf725, 03f43b3aca36 was reverted in af39151f3c54.	2020-04-02 12:36:06 -04:00
Benjamin Kramer	d8fb1c1c6d	[LoopDataPrefetch] Remove unused include that's a layering violation	2020-04-02 17:46:10 +02:00
Jonas Paulsson	41c67725a7	NFC: Comment in TargetTransformInfo.h reformatted (by Michael Kruse).	2020-04-02 17:40:53 +02:00
Benjamin Kramer	11841a12af	Revert "[SimplifyLibCalls] Erase replaced instructions" This reverts commit 2a77544ad5911a38f81c0300385033fced1cc66d. This introduces a use-after-free in Transforms/InstCombine/sincospi.ll. Found by asan.	2020-04-02 17:30:47 +02:00
Tyker	cf3e3f7a6f	[NFC] remove declaration that shouldn't be there	2020-04-02 17:09:16 +02:00
Alexander Lanin	d41f20efe6	[docs] use git diff instead of git format-patch Uploading output from `git format-patch` fails when version has more than 2 dots, e.g. git version 2.24.1.windows.2 which is currently recommended by e.g. GitExtensions or 2.24.1.rc on Linux. Differential Revision: https://reviews.llvm.org/D72374	2020-04-02 07:20:27 -07:00
Jonas Paulsson	df92dbf944	[SystemZ] Add isCommutable flag on vector instructions. This does not change much in code generation, but in rare cases MachineCSE can figure out that an instruction is redundant after commuting it. Review: Ulrich Weigand	2020-04-02 16:06:15 +02:00
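
The escape-sequence problem described in b63fb1d467 (the `\b` in `"..\bin"`) can be illustrated with a minimal Python sketch. The path value below is a hypothetical stand-in chosen for illustration, not the actual content of any lit.site.cfg.py.in file:

```python
# Hypothetical relative path, as a build system might substitute it into a
# generated Python config file on Windows (assumption for illustration).
plain = "..\bin"   # in a normal string literal, "\b" is parsed as backspace (0x08)
raw = r"..\bin"    # in a raw string literal, the backslash stays a backslash

# Show the difference between the two literals.
print(repr(plain), len(plain))
print(repr(raw), len(raw))
```

The plain literal ends up one character shorter and no longer contains a path separator, which is consistent with the commit's choice of `r""` strings.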


194322 Commits