llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-12 14:06:36 +00:00

Author	SHA1	Message	Date
Sanjay Patel	e7d64b577c	[X86] add an exedepfix entry for movq == movlps == movlpd This is a 1-line patch (with a TODO for AVX because that will affect even more regression tests) that lets us substitute the appropriate 64-bit store for the float/double/int domains. It's not clear to me exactly what the difference is between the 0xD6 (MOVPQI2QImr) and 0x7E (MOVSDto64mr) opcodes, but this is apparently the right choice. Differential Revision: http://reviews.llvm.org/D8691 llvm-svn: 235014	2015-04-15 15:47:51 +00:00
Sanjay Patel	f5d182f516	[x86] Implement combineRepeatedFPDivisors Set the transform bar at 2 divisions because the fastest current x86 FP divider circuit is in SandyBridge / Haswell at 10 cycle latency (best case) relative to a 5 cycle multiplier. So that's the worst case for this transform (no latency win), but multiplies are obviously pipelined while divisions are not, so there's still a big throughput win which we would expect to show up in typical FP code. These are the sequences I'm comparing: divss %xmm2, %xmm0 mulss %xmm1, %xmm0 divss %xmm2, %xmm0 Becomes: movss LCPI0_0(%rip), %xmm3 ## xmm3 = mem[0],zero,zero,zero divss %xmm2, %xmm3 mulss %xmm3, %xmm0 mulss %xmm1, %xmm0 mulss %xmm3, %xmm0 [Ignore for the moment that we don't optimize the chain of 3 multiplies into 2 independent fmuls followed by 1 dependent fmul...this is the DAG version of: https://llvm.org/bugs/show_bug.cgi?id=21768 ...if we fix that, then the transform becomes even more profitable on all targets.] Differential Revision: http://reviews.llvm.org/D8941 llvm-svn: 235012	2015-04-15 15:22:55 +00:00
Chandler Carruth	2645427642	[MBP] Spell the conditions the same way through out this if statement. NFC. llvm-svn: 235009	2015-04-15 13:39:42 +00:00
Chandler Carruth	c6290b7bc5	[MBP] Sink a comment into the if block to which it pertains. This makes the content of the comment make much more sense. llvm-svn: 235007	2015-04-15 13:26:41 +00:00
Chandler Carruth	656797ac96	[MBP] Fix a really misleading typo in a comment. llvm-svn: 235006	2015-04-15 13:19:54 +00:00
Rafael Espindola	55bc28240e	Write section and section table entries in the same order. We had two different orders, which has no value. llvm-svn: 235004	2015-04-15 13:07:47 +00:00
Daniel Sanders	7cc2751f01	[msp430] Only support the 'm' inline assembly memory constraint. NFC. Summary: MSP430 doesn't seem to have any additional constraints. Therefore remove the target hook. No functional change intended. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8208 llvm-svn: 235003	2015-04-15 12:51:28 +00:00
Yaron Keren	5ae04648e7	Clarify that Twine::toVector appends the Twine to the given SmallString. That's the way it works now, since toVector does not clear the given SmallString before printing to it. llvm-svn: 235000	2015-04-15 11:27:32 +00:00
Filipe Cabecinhas	2b78cf49b7	Revert "Verify sizes when trying to read a VBR" This reverts r234984 since it seems to break some bots (most of them seemed arm*-selfhost). llvm-svn: 234998	2015-04-15 11:10:17 +00:00
Toma Tabacu	d9c6117843	[mips] [IAS] Refactor the function which checks for the availability of AT. NFC. Summary: Refactor MipsAsmParser::getATReg to return an internal register number instead of a register index. Also change all the int's to unsigned, seeing as the current AT register index is stored as an unsigned in MipsAssemblerOptions. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8478 llvm-svn: 234996	2015-04-15 10:48:56 +00:00
Filipe Cabecinhas	612f7a52fa	Verify sizes when trying to read a VBR Also added an assert to ReadVBR64. llvm-svn: 234984	2015-04-15 08:48:08 +00:00
Yaron Keren	066194e0a6	Fix lib\support\Windows/TimeValue.inc(48): warning C4189: 'Error' : local variable is initialized but not referenced. llvm-svn: 234982	2015-04-15 07:45:52 +00:00
Daniel Jasper	266ce3788c	Re-apply r234898 and fix tests. This commit makes LLVM not estimate branch probabilities when doing a single bit bitmask tests. The code that originally made me discover this is: if ((a & 0x1) == 0x1) { .. } In this case we don't actually have any branch probability information and should not assume to have any. LLVM transforms this into: %and = and i32 %a, 1 %tobool = icmp eq i32 %and, 0 So, in this case, the result of a bitwise and is compared against 0, but nevertheless, we should not assume to have probability information. CodeGen/ARM/2013-10-11-select-stalls.ll started failing because the changed probabilities changed the results of ARMBaseInstrInfo::isProfitableToIfCvt() and led to an Ifcvt of the diamond in the test. AFAICT, the test was never meant to test this and thus changing the test input slightly to not change the probabilities seems like the best way to preserve the meaning of the test. llvm-svn: 234979	2015-04-15 06:24:07 +00:00
Lang Hames	922165b836	[RuntimeDyld] Add casts to make delta computation 64-bit. Hopefully this will fix the i686/msvc build failure described at: http://bb.pgr.jp/builders/ninja-clang-i686-msc18-R/builds/803 llvm-svn: 234977	2015-04-15 04:46:01 +00:00
Lang Hames	78fb616ba9	[RuntimeDyld] Make sure we emit MachO __eh_frame and __gcc_except_tab sections, even if there are no references to them in the code. This allows exceptions thrown from JIT'd code to be caught by the JIT itself. llvm-svn: 234975	2015-04-15 03:39:22 +00:00
Richard Trieu	e64e0c21b9	Change range-based for-loop to use const auto&. No functionality change. llvm-svn: 234974	2015-04-15 03:17:49 +00:00
Duncan P. N. Exon Smith	8654474680	uselistorder: Remove the global bits Remove all the global bits to do with preserving use-list order by moving the `cl::opt`s to the individual tools that want them. There's a minor functionality change to `libLTO`, in that you can't send in `-preserve-bc-uselistorder=false`, but making that bit settable (if it's worth doing) should be through explicit LTO API. As a drive-by fix, I removed some includes of `UseListOrder.h` that were made unnecessary by recent commits. llvm-svn: 234973	2015-04-15 03:14:06 +00:00
Alexei Starovoitov	d7dbc8fd97	[bpf] fix build fix build due to refactoring in DIL/MDL and raw_pwrite_stream llvm-svn: 234971	2015-04-15 02:48:57 +00:00
Duncan P. N. Exon Smith	8141c202b6	uselistorder: Pull the bit through PrintModulePass Now the callers of `PrintModulePass()` (etc.) that care about use-list order in assembly pass in the flag. llvm-svn: 234969	2015-04-15 02:38:06 +00:00
Duncan P. N. Exon Smith	16fd8a7766	uselistorder: Pull the assembly bit up out of the printer Pull the `-preserve-ll-uselistorder` bit up through all the callers of `Module::print()`. I converted callers of `operator<<` to `Module::print()` where necessary to pull the bit through. llvm-svn: 234968	2015-04-15 02:12:41 +00:00
Duncan P. N. Exon Smith	58a1cb0418	uselistorder: Start pulling out -preserve-ll-uselistorder For consistency, start pulling out `-preserve-ll-uselistorder`. I'll drop the global state for both eventually. This pulls it up to `Module::print()` (but not past there). llvm-svn: 234966	2015-04-15 01:36:30 +00:00
Richard Trieu	5eedf6b231	Change range-based for-loops to be -Wrange-loop-analysis clean. No functionality change. llvm-svn: 234963	2015-04-15 01:21:15 +00:00
Duncan P. N. Exon Smith	393c4f570f	uselistorder: Pull bit through BitcodeWriterPass Now the callers of `BitcodeWriterPass` decide whether or not to preserve bitcode use-list order. llvm-svn: 234959	2015-04-15 00:34:24 +00:00
Duncan P. N. Exon Smith	9a09dfde5f	Fix build error from r234957 llvm-svn: 234958	2015-04-15 00:13:51 +00:00
Duncan P. N. Exon Smith	b222408637	uselistorder: Pull the bit through WriteToBitcodFile() Change the callers of `WriteToBitcodeFile()` to pass `true` or `shouldPreserveBitcodeUseListOrder()` explicitly. I left the callers that want to send `false` alone. I'll keep pushing the bit higher until hopefully I can delete the global `cl::opt` entirely. llvm-svn: 234957	2015-04-15 00:10:50 +00:00
Duncan P. N. Exon Smith	94762eaaad	uselistorder: Thread bit through ValueEnumerator Canonicalize access to whether to preserve use-list order in bitcode on a `bool` stored in `ValueEnumerator`. Next step, expose this as a `bool` through `WriteBitcodeToFile()`. llvm-svn: 234956	2015-04-14 23:45:11 +00:00
Rafael Espindola	cf2ab6b126	Use the ability to pwrite to simplify the ELF writer. Now we don't have to do 2 synchronized passes to compute offsets and then write the file. This also includes a fix for the corner case of seeking in /dev/null. It is not an error, but on some systems (Linux) the returned offset is always 0. An error is signaled by returning -1. This is checked by the existing tests now that "clang -o /dev/null ..." seeks. llvm-svn: 234952	2015-04-14 22:54:16 +00:00
Rafael Espindola	aeb03deb16	Use raw_pwrite_stream in the object writer/streamer. The ELF object writer will take advantage of that in the next commit. llvm-svn: 234950	2015-04-14 22:14:34 +00:00
Reid Kleckner	9e6bccc300	[WinEH] Avoid emitting xdata tables twice for cleanups Since adding invokes of llvm.donothing to cleanups, we come here now, and trivial EH cleanup usage from clang fails to compile. llvm-svn: 234948	2015-04-14 21:42:36 +00:00
Ed Maste	b8c5d92511	Correct 'teh' and other typos / repeated words. Patch by Eitan Adler. Differential Revision: http://reviews.llvm.org/D8514 llvm-svn: 234939	2015-04-14 20:52:58 +00:00
Reid Kleckner	5f4bd4ef8d	[Inliner] Don't inline functions with frameescape calls Inlining such intrinsics is very difficult, since you need to simultaneously transform many calls to llvm.framerecover and potentially duplicate the functions containing them. Normally this intrinsic isn't added until EH preparation, which is part of the backend pass pipeline after inlining. However, if it were to get fed through the inliner, this change will ensure that it doesn't break the code. llvm-svn: 234937	2015-04-14 20:38:14 +00:00
Daniel Berlin	101acbc47e	Only recalculate DFS Numbers if invalid. Invalidate DFS numbers on reset. Add unit test to verify recalculation llvm-svn: 234933	2015-04-14 19:49:26 +00:00
Daniel Berlin	b83bd6bed3	Make updateDFSNumbers API public Summary: There are a number of passes that could be sped up by using dominator tree DFS numbers to order or compare things across multiple bbs (MemorySSA, MergedLoadStoreMotion, EarlyCSE, Sinking, GVN, NewGVN, for starters :P). For example, GVN/CSE elimination can be done with a simple stack/etc (instead of full-on scoped hash table or repeated leader set walks) if the DFS pair is stored next to leaders. The dominator tree keeps them, and the DOM tree nodes expose them as public, but you have no guarantee they are up to date (and in fact, if you split blocks or whatever during your pass, they definitely won't be) This means passes either have to compute their own versions[1], or make 32 queries, or .... Rather than try to hide this, i just made the API public, and make it do nothing if the numbers are already valid. [1] Which we want as a non-recursive walk, which is not pretty, sadly, because it cannot use the depth first iterators since you don't get called on the way back up. So you either have to do one walk with po_iterator and one with df_iterator, or write your own non-recursive walk that looks identical to the one in updateDFSNumbers. Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8946 llvm-svn: 234930	2015-04-14 19:09:16 +00:00
Duncan P. N. Exon Smith	c26c755f93	verify-uselistorder: More outs() and errs(), less dbgs() Change all the normally relevant output in `verify-uselistorder` from using `dbgs()` to using `outs()` and `errs()`. Now you don't need `-debug=uselistorder` to figure out what's going on (or at what stage verification failed, or to get the paths of the left-behind temporary files). This is a debugging tool, so I put the logging messages on `outs()` and the error messages on `errs()`. I also adjusted the output to be less *loud. Not sure why I was so ``-happy when I first wrote this. llvm-svn: 234929	2015-04-14 19:04:03 +00:00
Duncan P. N. Exon Smith	a8fd793406	IR: Set -preserve-bc-uselistorder=false by default But keep it on by default in `llvm-as`, `opt`, `bugpoint`, `llvm-link`, `llvm-extract`, and `LTOCodeGenerator`. Part of PR5680. llvm-svn: 234921	2015-04-14 18:33:00 +00:00
Duncan P. N. Exon Smith	ae92d0d958	IR: Rename 'use-list-order' options to 'uselistorder' Rename options to be consistent with the name of `verify-uselistorder`, and update `DEBUG_TYPE` (etc.) to be consistent. llvm-svn: 234919	2015-04-14 18:19:27 +00:00
Daniel Berlin	811864ca3d	Add ladder graph utility llvm-svn: 234917	2015-04-14 18:14:38 +00:00
David Blaikie	fca1b58443	DebugInfo: Pubnames: Do not include variable declarations in pubnames This causes badness for GDB which expects to find a definition in any compile_unit that has an entry for the variable in its pubnames. llvm-svn: 234915	2015-04-14 18:08:25 +00:00
David Blaikie	37a6c0bd2a	Update test case to include the original source code & account for some changes in clang's order of emission I'd added some stuff to this test case without adding the original source, which makes updating/adding further stuff rather difficult. So update it first (& it seems in the interim Clang's changed its output order a bit, so adjust the CHECK lines to account for that - rather than hand hacking the IR order which just makes it harder to maintain/change next time) llvm-svn: 234911	2015-04-14 17:17:04 +00:00
Lang Hames	2051723280	[RuntimeDyld] Make SectionEntry's Name field a std::string. StringRef is unsafe here, since SectionEntry instances can outlive the ObjectFile instances they are created from. llvm-svn: 234910	2015-04-14 17:13:10 +00:00
Lang Hames	1f8eae7989	[Orc] Reapply r234815, outputting via stdout instead. llvm-svn: 234908	2015-04-14 16:58:05 +00:00
Zachary Turner	8e996791d5	[msvc] Strip /W[0-4] before appending /W4 Certain versions of CMake specify /W3 as part of CMAKE_CXX_FLAGS by default, before you do anything. Appending /W4 to the end of this and using the Ninja generator results in cl : Command line warning D9025 : overriding '/W3' with '/W4'. It is not possible to suppress this since it is a command line warning and not a compiler warning, so we must fix the command line to contain only one value for /Wn. llvm-svn: 234907	2015-04-14 16:57:54 +00:00
Duncan P. N. Exon Smith	74f370958b	DebugInfo: Add implicit conversion from DISubprogram to DIScope As a follow-up to r234850, add an implicit conversion from `DISubprogram` to `DIScope` to support Kaleidoscope Ch. 8. This also reverts that band-aid from r234890. (/me learns again to build Kaleidoscope before commit...) llvm-svn: 234904	2015-04-14 16:19:44 +00:00
Rafael Espindola	19e464908f	Revert "The code that originally made me discover this is:" This reverts commit r234898. CodeGen/ARM/2013-10-11-select-stalls.ll was faling. llvm-svn: 234903	2015-04-14 15:56:33 +00:00
Chad Rosier	cd13282c4d	Improve RefreshCallGraph to remove invalid call graph edge. With commit r219944, InstCombine can now turn a sqrtl into a llvm.fabs.f64. The call graph edge originally representing the call to sqrtl becomes invalid. This patch modifies CGPassManager::RefreshCallGraph() to remove the invalid call graph edge, which can triggers an assert in CallGraphNode::addCalledFunction(). Phabricator Review: http://reviews.llvm.org/D7705 Patch by Lawrence Hu <lawrence@codeaurora.org>. llvm-svn: 234902	2015-04-14 15:52:57 +00:00
Alexander Kornienko	03b47019a2	Refactor: Simplify boolean expressions in ARM target Simplify boolean expressions using `true` and `false` with `clang-tidy` http://reviews.llvm.org/D8524 Patch by Richard Thomson! llvm-svn: 234901	2015-04-14 15:32:58 +00:00
Krzysztof Parzyszek	2f1ef7b970	Change the testcase mtriple to x86_64-unknown-unknown llvm-svn: 234900	2015-04-14 15:28:42 +00:00
Rafael Espindola	273d012050	Fix MSVC build. llvm-svn: 234899	2015-04-14 15:25:14 +00:00
Daniel Jasper	a598c99e31	The code that originally made me discover this is: if ((a & 0x1) == 0x1) { .. } In this case we don't actually have any branch probability information and should not assume to have any. LLVM transforms this into: %and = and i32 %a, 1 %tobool = icmp eq i32 %and, 0 So, in this case, the result of a bitwise and is compared against 0, but nevertheless, we should not assume to have probability information. llvm-svn: 234898	2015-04-14 15:20:37 +00:00
Bradley Smith	2d2de3c295	[AArch64] Allow non-standard INS/DUP encodings The ARMv8 ARMARM states that for these instructions in A64 state: "Unspecified bits in "imm5" are ignored but should be set to zero by an assembler.", (imm4 for INS). Make the disassembler accept any encoding with these ignored bits set to 1. llvm-svn: 234896	2015-04-14 15:07:26 +00:00

... 2 3 4 5 6 ...

116116 Commits