llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:39:47 +00:00

Author	SHA1	Message	Date
Kristof Beyls	7bee5ceb02	Mark that SpeculativeExecution preserves Globals Alias Analysis. A few benchmarks with lots of accesses to global variables in the hot loops regressed a lot since r266399, which added the SpeculativeExecution pass to the default pipeline. The problem is that this pass doesn't mark Globals Alias Analysis as preserved. Globals Alias Analysis is computed in a module pass, whereas SpeculativeExecution is a function pass, and a lot of passes dependent on the Globals Alias Analysis to optimize these benchmarks are also function passes. As such, the Globals Alias Analysis information cannot be recomputed between SpeculativeExecution and the following function passes needing that information. SpeculativeExecution doesn't invalidate Globals Alias Analysis, so mark it as such to fix those performance regressions. Differential Revision: http://reviews.llvm.org/D19806 llvm-svn: 268370	2016-05-03 08:33:26 +00:00
Igor Breger	ab90b9e166	[AVX512] Fix lowerV4X128VectorShuffle to select correctly input operands . Differential Revision: http://reviews.llvm.org/D19803 llvm-svn: 268368	2016-05-03 08:08:44 +00:00
Davide Italiano	040f3d40e0	[lib/Object] Make this assertion more useful. llvm-svn: 268367	2016-05-03 07:30:56 +00:00
Craig Topper	eb01b2394d	[CodeGen] Add some space optimized forms of EmitNode and MorphNodeTo that implicitly indicate the number of result VTs. This shaves about 16K off the X86 matching table taking it down to about 470K. Overall this reduces the llc binary size with all in-tree targets by about 40K. llvm-svn: 268365	2016-05-03 05:54:13 +00:00
Matthias Braun	cd14842dc0	Fix uppercase typo llvm-svn: 268362	2016-05-03 05:21:53 +00:00
Vedant Kumar	842b69e863	Remove unused header, NFC llvm-svn: 268361	2016-05-03 05:05:38 +00:00
Matthias Braun	4869e8d140	AArch64/optimizeCondBranch: Remove earlier kill flag when forming TBZ This fixes -verify-machineinstrs complaints when compiling test-suite/SingleSource/Benchmarks/Shootout-C++/wordfreq.cpp llvm-svn: 268360	2016-05-03 04:54:16 +00:00
Jack Liu	03ba5bb137	test commit llvm-svn: 268358	2016-05-03 04:06:24 +00:00
David Majnemer	b545ac7cd4	[LoopUnroll] Unroll loops which have exit blocks to EH pads We were overly cautious in our analysis of loops which have invokes which unwind to EH pads. The loop unroll transform is safe because it only clones blocks in the loop body, it does not try to split critical edges involving EH pads. Instead, move the necessary safety check to LoopUnswitch. N.B. The safety check for loop unswitch is covered by an existing test which fails without it. llvm-svn: 268357	2016-05-03 03:57:40 +00:00
Zachary Turner	04c77dbb33	Change operation_not_supported to not_supported. Apparently operation_not_supported is... not supported everywhere. llvm-svn: 268348	2016-05-03 00:53:16 +00:00
Reid Kleckner	a6cfc92d4d	[codeview] Maintain the type enum-to-classname mapping in the .def file This way it will be easy to stamp out something like a type visitor. llvm-svn: 268347	2016-05-03 00:45:14 +00:00
Zachary Turner	60262929bc	Parse the TPI (type information) stream of PDB files. This parses the TPI stream (stream 2) from the PDB file. This stream contains some header information followed by a series of codeview records. There is some additional complexity here in that alongside this stream of codeview records is a serialized hash table in order to efficiently query the types. We parse the necessary bookkeeping information to allow us to reconstruct the hash table, but we do not actually construct it yet as there are still a few things that need to be understood first. Differential Revision: http://reviews.llvm.org/D19840 Reviewed By: ruiu, rnk llvm-svn: 268343	2016-05-03 00:28:21 +00:00
Zachary Turner	6491f0eb80	Move llvm-readobj/StreamWriter to Support. We wish to re-use this from llvm-pdbdump, and it provides a nice way to print structured data in scoped format that could prove useful for many other dumping tools as well. Moving to support and changing name to ScopedPrinter to better reflect its purpose. llvm-svn: 268342	2016-05-03 00:28:04 +00:00
Mehdi Amini	1c6de55063	ThinLTO: do not import function whose linkage prevents inlining. There is not point in importing a "weak" or a "linkonce" function since we won't be able to inline it anyway. We already had a targeted check for WeakAny, this is using the same check on GlobalValue as the inline, i.e. isMayBeOverriddenLinkage() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268341	2016-05-03 00:27:28 +00:00
Matthias Braun	bac4271200	livePhysRegs: Pass MBB by reference in addLive{Ins\|Outs}(); NFC The block must no be nullptr for the addLiveIns()/addLiveOuts() function. llvm-svn: 268340	2016-05-03 00:24:32 +00:00
Wolfgang Pieb	a34c1258af	Moved test case for r268323 to DebugInfo/X86 to unbreak aarch64. llvm-svn: 268339	2016-05-03 00:22:09 +00:00
Douglas Yung	f08336ee69	Adding period to the end of a comment to test out commit access. llvm-svn: 268337	2016-05-03 00:12:59 +00:00
Matthias Braun	d86d0ddbc5	LivePhysRegs: Automatically determine presence of pristine regs. Remove the AddPristinesAndCSRs parameters from addLiveIns()/addLiveOuts(). We need to respect pristine registers after prologue epilogue insertion, Seeing that we got this wrong in at least two commits already, we should rather pay the small price to query MachineFrameInfo for it. There are three cases that did not set AddPristineAndCSRs to true even after register allocation: - ExecutionDepsFix: live-out registers are used as a hint that the register is used soon. This is not true for pristine registers so use the new addLiveOutsNoPristines() to maintain this behaviour. - SystemZShortenInst: Not setting AddPristineAndCSRs to true looks like a bug, should do the right thing automatically now. - StackMapLivenessAnalysis: Not adding pristine registers looks like a bug to me. Added a FIXME comment but maintain the current behaviour as a change may need to get coordinated with GC runtimes. llvm-svn: 268336	2016-05-03 00:08:46 +00:00
NAKAMURA Takumi	2697fd426e	MITests: Update libdeps. llvm-svn: 268335	2016-05-03 00:04:07 +00:00
Adrian McCarthy	9b867bb7ee	NFC: An iterator for stepping through CodeView type stream in llvm-readobj This is a small refactoring step toward moving CodeView type stream logic from llvm-readobj to a library. It abstracts the logic of stepping through the stream into an iterator class and updates llvm-readobj to use that iterator. This has no functional change; llvm-readobj produces identical output. The next step is to abstract the parsing of the different leaf types and then move that and the iterator into a library. Since this is my first contrib outside LLDB, please let me know if I'm messing up on any of the LLVM style guidelines, idioms, or patterns. Differential Revision: http://reviews.llvm.org/D19746 llvm-svn: 268334	2016-05-02 23:45:03 +00:00
Reid Kleckner	75b594b1e8	[MC] Create unique .pdata sections for every .text section Summary: This adds a unique ID to the COFF section uniquing map, similar to the one we have for ELF. The unique id is not currently exposed via the assembler because we don't have a use case for it yet. Users generally create .pdata with the .seh_* family of directives, and the assembler internally needs to produce .pdata and .xdata sections corresponding to the code section. The association between .text sections and the assembler-created .xdata and .pdata sections is maintained as an ID field of MCSectionCOFF. The CFI-related sections are created with the given unique ID, so if more code is added to the same text section, we can find and reuse the CFI sections that were already created. Reviewers: majnemer, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19376 llvm-svn: 268331	2016-05-02 23:22:18 +00:00
Matthias Braun	b3f499f033	Add unittest for LiveIntervalAnalysis::handleMove() This re-applies r260905. It requires LiveIntervals to not require LiveVariables which was reverted and re-applied in r267954. llvm-svn: 268329	2016-05-02 23:05:48 +00:00
Quentin Colombet	cf0d20f78c	[MachineBlockPlacement] Let the target optimize the branches at the end. After the layout of the basic blocks is set, the target may be able to get rid of unconditional branches to fallthrough blocks that the generic code does not catch. This happens any time TargetInstrInfo::AnalyzeBranch is not able to analyze all the branches involved in the terminators sequence, while still understanding a few of them. In such situation, AnalyzeBranch can directly modify the branches if it has been instructed to do so. This patch takes advantage of that. llvm-svn: 268328	2016-05-02 22:58:59 +00:00
Quentin Colombet	6b53c89899	[X86] Model FAULTING_LOAD_OP as a terminator and branch. This operation may branch to the handler block and we do not want it to happen anywhere within the basic block. Moreover, by marking it "terminator and branch" the machine verifier does not wrongly assume (because of AnalyzeBranch not knowing better) the branch is analyzable. Indeed, the target was seeing only the unconditional branch and not the faulting load op and thought it was a simple unconditional block. The machine verifier was complaining because of that and moreover, other optimizations could have done wrong transformation! In the process, simplify the representation of the handler block in the faulting load op. Now, we directly reference the handler block instead of using a label. This has the benefits of: 1. MC knows how to issue a label for a BB, so leave that to it. 2. Accessing the target BB from its label is painful, whereas it is direct from a MBB operand. Note: The 2 bytes offset in implicit-null-check.ll comes from the fact the unconditional jumps are not removed anymore, as the whole terminator sequence is not analyzable anymore. Will fix it in a subsequence commit. llvm-svn: 268327	2016-05-02 22:58:54 +00:00
Rafael Espindola	541d4bda86	Simplify. NFC. llvm-svn: 268326	2016-05-02 22:53:32 +00:00
Wolfgang Pieb	4f51f08e11	DebugInfo: Avoid propagating incorrect debug locations in SelectionDAG via CSE. Summary: When SelectionDAG performs CSE it is possible that the context's source location is different from that of the selected node. This can lead to incorrect line number records. We update the debug location to the one that occurs earlier in the instruction sequence. This fixes PR21006. Reviewers: echristo, sdmitrouk Subscribers: jevinskie, asl, llvm-commits Differential Revision: http://reviews.llvm.org/D12094 llvm-svn: 268323	2016-05-02 22:50:51 +00:00
Mehdi Amini	d877456f6c	Revert "ThinLTO: do not import function whose linkage prevents inlining." This reverts commit r268315, the tests are not passing. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268317	2016-05-02 22:26:04 +00:00
Zachary Turner	997823b380	[llvm-pdbdump] Fix read past EOF when file is too small. llvm-svn: 268316	2016-05-02 22:16:57 +00:00
Mehdi Amini	4326fb70ef	ThinLTO: do not import function whose linkage prevents inlining. There is not point in importing a "weak" or a "linkonce" function since we won't be able to inline it anyway. We already had a targeted check for WeakAny, this is using the same check on GlobalValue as the inline, i.e. isMayBeOverriddenLinkage() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268315	2016-05-02 22:11:27 +00:00
Kevin Enderby	961a187a7a	Fix llvm-size to exit with non zero when it can’t open a file. rdar://26027819 llvm-svn: 268313	2016-05-02 21:41:03 +00:00
Vedant Kumar	14ce118a05	Remove redundant return, NFC llvm-svn: 268312	2016-05-02 21:33:36 +00:00
Vedant Kumar	e07e00e4e7	Fix a typo, NFC llvm-svn: 268311	2016-05-02 21:32:07 +00:00
Simon Pilgrim	116d17f711	[X86][SSE] Added placeholder for 128/256-bit wide shuffle combines Begun adding placeholder for future support for vperm2f128/vshuff64x2 style 128/256-bit wide shuffles llvm-svn: 268306	2016-05-02 21:12:48 +00:00
Rafael Espindola	e51691ab66	Don't try to create thin bsd archives. Not such variant has been specified yet. llvm-svn: 268305	2016-05-02 21:06:57 +00:00
Frederic Riss	bf4e97e3be	[dsymutil] Create the temporary files in the system temp directory. llvm-dsymutil used to create the temporary files in the output directory. This works fine except when the output directory contains a '%' char, which is then replaced by llvm::sys::fs::createUniqueFile() generating an invalid path. Just use the default temp dir for those files. llvm-svn: 268304	2016-05-02 21:06:14 +00:00
Xinliang David Li	b56ddbf6f6	Code refactoring -- preparation for new PM porting /NFC llvm-svn: 268301	2016-05-02 20:33:59 +00:00
Reid Kleckner	b62457c5eb	[codeview] Isolate type dumping from object file state This isolates the state we use for type dumping from the knowledge of object files. We can use CVTypeDumper to dump types from anywhere in memory now. NFC Reviewers: zturner Differential Revision: http://reviews.llvm.org/D19824 llvm-svn: 268300	2016-05-02 20:30:47 +00:00
Kevin Enderby	f7223e444c	Thread Expected<...> up from libObject’s getType() for symbols to allow llvm-objdump to produce a good error message. Produce another specific error message for a malformed Mach-O file when a symbol’s section index is more than the number of sections. The existing test case in test/Object/macho-invalid.test for macho-invalid-section-index-getSectionRawName now reports the error with the message indicating that a symbol at a specific index has a bad section index and that bad section index value. Again converting interfaces to Expected<> from ErrorOr<> does involve touching a number of places. Where the existing code reported the error with a string message or an error code it was converted to do the same. Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: "// TODO: Actually report errors helpfully" and a call something like consumeError(NameOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. llvm-svn: 268298	2016-05-02 20:28:12 +00:00
Matt Arsenault	dfb613a88d	AMDGPU: Custom lower v2i32 loads and stores This will allow us to split up 64-bit private accesses when necessary. llvm-svn: 268296	2016-05-02 20:13:51 +00:00
Tom Stellard	d541008932	AMDGPU/SI: Use v_readfirstlane_b32 when restoring SGPRs spilled to scratch We were using v_readlane_b32 with the lane set to zero, but this won't work if thread 0 is not active. Differential Revision: http://reviews.llvm.org/D19745 llvm-svn: 268295	2016-05-02 20:11:44 +00:00
Matt Arsenault	7932e530a0	AMDGPU: Make i64 loads/stores promote to v2i32 Now that unaligned access expansion should not attempt to produce i64 accesses, we can remove the hack in PreprocessISelDAG where this is done. This allows splitting i64 private accesses while allowing the new add nodes indexing the vector components can be folded with the base pointer arithmetic. llvm-svn: 268293	2016-05-02 20:07:26 +00:00
John Regehr	b357c876ed	[LVI] Add an API to LazyValueInfo so that it can export ConstantRanges that it computes. Currently this is used for testing and precision tuning, but it might be used by optimizations later. Differential Revision: http://reviews.llvm.org/D19179 llvm-svn: 268291	2016-05-02 19:58:00 +00:00
Simon Pilgrim	6c3bbc1c10	[X86][AVX2] Added 128-bit wide shuffle test Demonstrate missing 128-bit wide shuffle combine support llvm-svn: 268290	2016-05-02 19:46:58 +00:00
Reid Kleckner	34524af63a	Fix instance of -Winconsistent-missing-override in AMDGPU code llvm-svn: 268289	2016-05-02 19:45:10 +00:00
Reid Kleckner	d481f126d3	Revert "[SimplifyCFG] Extend TryToSimplifyUncondBranchFromEmptyBlock for empty block including lifetime intrinsics" This reverts commit r268254. This change causes assertion failures while building Chromium. Reduced test case coming soon. llvm-svn: 268288	2016-05-02 19:43:22 +00:00
Tom Stellard	72fc788f2b	AMDGPU/SI: Set the kill flag on temp VGPRs used to restore SGPRs from scratch Summary: When we restore an SGPR value from scratch, we first load it into a temporary VGPR and then use v_readlane_b32 to copy the value from the VGPR back into an SGPR. We weren't setting the kill flag on the VGPR in the v_readlane_b32 instruction, so the register scavenger wasn't able to re-use this temp value later. I wasn't able to create a lit test for this. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D19744 llvm-svn: 268287	2016-05-02 19:37:56 +00:00
Piotr Padlewski	31fe000000	Fix typos llvm-svn: 268281	2016-05-02 19:06:51 +00:00
Chad Rosier	972c570f4b	Typo. NFC. llvm-svn: 268280	2016-05-02 19:06:04 +00:00
Chad Rosier	583671304d	Use false rather than 0 for a boolean value. NFC. llvm-svn: 268279	2016-05-02 19:06:02 +00:00
Zachary Turner	9733cc18ce	Fix build breakage due to implicit conversion. llvm-svn: 268277	2016-05-02 18:36:58 +00:00

1 2 3 4 5 ...

131072 Commits