llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-15 07:39:31 +00:00

Author	SHA1	Message	Date
Chandler Carruth	dee2e63512	[PM] Simplify how the SFINAE for AnalysisResultModel is applied by factoring it out into the default template argument so clients don't have to even think about it. llvm-svn: 195402	2013-11-22 00:48:49 +00:00
Lang Hames	433095d3fe	Fix a typo where we were creating <def,kill> operands instead of <def,dead> ones. Add an assertion to make sure we catch this in the future. Fixes <rdar://problem/15464559>. llvm-svn: 195401	2013-11-22 00:46:32 +00:00
Chandler Carruth	28195a6d83	[PM] Switch analysis managers to be threaded through the run methods rather than the constructors of passes. This simplifies the APIs of passes significantly and removes an error prone pattern where the same manager had to be given to every different layer. With the new API the analysis managers themselves will have to be cross connected with proxy analyses that allow a pass at one layer to query for the analysis manager of another layer. The proxy will both expose a handle to the other layer's manager and it will provide the invalidation hooks to ensure things remain consistent across layers. Finally, the outer-most analysis manager has to be passed to the run method of the outer-most pass manager. The rest of the propagation is automatic. I've used SFINAE again to allow passes to completely disregard the analysis manager if they don't need or want to care. This helps keep simple things simple for users of the new pass manager. Also, the system specifically supports passing a null pointer into the outer-most run method if your pass pipeline neither needs nor wants to deal with analyses. I find this of dubious utility as while some passes don't care about analysis, I'm not sure there are any real-world users of the pass manager itself that need to avoid even creating an analysis manager. But it is easy to support, so there we go. Finally I renamed the module proxy for the function analysis manager to the more verbose but less confusing name of FunctionAnalysisManagerModuleProxy. I hate this name, but I have no idea what else to name these things. 
I'm expecting in the fullness of time to potentially have the complete cross product of types at the proxy layer: {Module,SCC,Function,Loop,Region}AnalysisManager{Module,SCC,Function,Loop,Region}Proxy (except for XAnalysisManagerXProxy which doesn't make any sense) This should make it somewhat easier to do the next phases which is to build the upward proxy and get its invalidation correct, as well as to make the invalidation within the Module -> Function mapping pass be more fine grained so as to invalidate fewer function analyses. After all of the proxy analyses are done and the invalidation working, I'll finally be able to start working on the next two fun fronts: how to adapt an existing pass to work in both the legacy pass world and the new one, and building the SCC, Loop, and Region counterparts. Fun times! llvm-svn: 195400	2013-11-22 00:43:29 +00:00
Tom Stellard	f02139b6c9	R600: Implement TargetInstrInfo::isLegalToSplitMBBAt() Splitting a basic block will create a new ALU clause, so we need to make sure we aren't moving uses of registers that are local to their current clause into a new one. I had a test case for this, but unfortunately unrelated schedule changes invalidated it, and I wasn't able to come up with another one. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195399	2013-11-22 00:41:08 +00:00
Tom Stellard	c2f05239d7	SelectionDAG: Optimize expansion of vec_type = BITCAST scalar_type The legalizer can now do this type of expansion for more type combinations without loading and storing to and from the stack. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195398	2013-11-22 00:41:05 +00:00
Tom Stellard	439debedd3	Split SETCC if VSELECT requires splitting too. This patch is a rewrite of the original patch committed in r194542. Instead of relying on the type legalizer to do the splitting for us, we now perform the splitting ourselves in the DAG combiner. This is necessary for the case where the vector mask is a legal type after promotion and still wouldn't require splitting. Patch by: Juergen Ributzka NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195397	2013-11-22 00:39:23 +00:00
Eric Christopher	8b3b8ae8cc	In Dwarf 3 (and Dwarf 2) attributes whose value are offsets into a section use the form DW_FORM_data4 whilst in Dwarf 4 and later they use the form DW_FORM_sec_offset. This patch updates the places where such attributes are generated to use the appropriate form depending on the Dwarf version. The DIE entries affected have the following tags: DW_AT_stmt_list, DW_AT_ranges, DW_AT_location, DW_AT_GNU_pubnames, DW_AT_GNU_pubtypes, DW_AT_GNU_addr_base, DW_AT_GNU_ranges_base It also adds a hidden command line option "--dwarf-version=<uint>" to llc which allows the version of Dwarf to be generated to override what is specified in the metadata; this makes it possible to update existing tests to check the debugging information generated for both Dwarf 4 (the default) and Dwarf 3 using the same metadata. Patch (slightly modified) by Keith Walker! llvm-svn: 195391	2013-11-21 23:46:41 +00:00
Ekaterina Romanova	eda4e2e4a7	SHLD/SHRD are VectorPath (microcode) instructions known to have poor latency on certain architectures. While generating SHLD/SHRD instructions is acceptable when optimizing for size, optimizing for speed on these platforms should be implemented using alternative sequences of instructions composed of add, adc, shr, shl, or and lea which are directPath instructions. These alternative instructions not only have a lower latency but they also increase the decode bandwidth by allowing simultaneous decoding of a third directPath instruction. AMD's processors family K7, K8, K10, K12, K15 and K16 are known to have SHLD/SHRD instructions with very poor latency. Optimization guides for these processors recommend using an alternative sequence of instructions. For these AMD's processors, I disabled folding (or (x << c) | (y >> (64 - c))) when we are not optimizing for size. It might be beneficial to disable this folding for some of the Intel's processors. However, since I couldn't find specific recommendations regarding using SHLD/SHRD instructions on Intel's processors, I haven't disabled this peephole for Intel. llvm-svn: 195383	2013-11-21 23:21:26 +00:00
Peter Collingbourne	89b5505b6b	Introduce two command-line flags for the instrumentation pass to control whether the labels of pointers should be ignored in load and store instructions The new command line flags are -dfsan-ignore-pointer-label-on-store and -dfsan-ignore-pointer-label-on-load. Their default value matches the current labelling scheme. Additionally, the function __dfsan_union_load is marked as readonly. Patch by Lorenzo Martignoni! Differential Revision: http://llvm-reviews.chandlerc.com/D2187 llvm-svn: 195382	2013-11-21 23:20:54 +00:00
Eric Christopher	b1461615d9	Move member variable up to where the rest of non-DWARF5 variables reside. llvm-svn: 195380	2013-11-21 22:56:11 +00:00
Hans Wennborg	de158d8dc9	CMake: Some changes to package version names: - Allow overriding PACKAGE_VERSION from the command-line - Use PACKAGE_VERSION to set CPACK_PACKAGE_VERSION (used by the Win installer) - Don't include the version number in the CPack install dir or registry key. Differential revision: http://llvm-reviews.chandlerc.com/D2245 llvm-svn: 195379	2013-11-21 22:47:21 +00:00
Benjamin Kramer	1fbfb1fbd6	Remove dangling documentation. llvm-prof was deleted a while ago. llvm-svn: 195372	2013-11-21 19:32:37 +00:00
Artyom Skrobov	b6c8a8a69f	[ARM] add the overlooked tests for Cortex-A7 build attributes llvm-svn: 195365	2013-11-21 16:22:39 +00:00
Daniel Sanders	0e60951a47	[mips][msa] Fix a corner case in performORCombine() when combining nodes into VSELECT. Mask == ~InvMask asserts if the width of Mask and InvMask differ. The combine isn't valid (with two exceptions, see below) if the widths differ so test for this before testing Mask == ~InvMask. In the specific cases of Mask=~0 and InvMask=0, as well as Mask=0 and InvMask=~0, the combine is still valid. However, there are more appropriate combines that could be used in these cases such as folding x & 0 to 0, or x & ~0 to x. llvm-svn: 195364	2013-11-21 16:11:31 +00:00
Artyom Skrobov	3d8780e502	[ARM] add basic Cortex-A7 support to LLVM backend llvm-svn: 195358	2013-11-21 14:03:21 +00:00
Daniel Sanders	a556d0abd7	Add support for legalizing SETNE/SETEQ by inverting the condition code and the result of the comparison. Summary: LegalizeSetCCCondCode can now legalize SETEQ and SETNE by returning the inverse condition and requesting that the caller invert the result of the condition. The caller of LegalizeSetCCCondCode must handle the inverted CC, and they do so as follows: SETCC, BR_CC: Invert the result of the SETCC with SelectionDAG::getNOT() SELECT_CC: Swap the true/false operands. This is necessary for MSA which lacks an integer SETNE instruction. Reviewers: resistor CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2229 llvm-svn: 195355	2013-11-21 13:24:49 +00:00
Evgeniy Stepanov	405470ce24	[msan] Propagate condition origin in select instruction. llvm-svn: 195349	2013-11-21 12:00:24 +00:00
Daniel Sanders	5e17920764	[mips][msa/dsp] Only do DSP combines if DSP is enabled. Fixes a crash (null pointer dereferenced) when MSA is enabled. llvm-svn: 195343	2013-11-21 11:40:14 +00:00
Evgeniy Stepanov	0ee8c16502	Use multiple filecheck prefixes in msan instrumentation tests. llvm-svn: 195342	2013-11-21 11:37:16 +00:00
NAKAMURA Takumi	656eba2e1e	Whitespace. llvm-svn: 195341	2013-11-21 11:08:31 +00:00
Chandler Carruth	4733811090	[PM] Fix typo and trailing space. llvm-svn: 195340	2013-11-21 11:04:53 +00:00
NAKAMURA Takumi	add94e0548	Revert r195317 (and r195333), "Teach ISel not to optimize 'optnone' functions." It broke, at least, i686 target. It is reproducible with "llc -mtriple=i686-unknown". FYI, it didn't appear to add either "-O0" or "-fast-isel". llvm-svn: 195339	2013-11-21 10:55:15 +00:00
Chandler Carruth	a087921555	[PM] Widen the interface for invalidate on an analysis result now that it is completely optional, and sink the logic for handling the preserved analysis set into it. This allows us to implement the delegation logic desired in the proxy module analysis for the function analysis manager where if the proxy itself is preserved we assume the set of functions hasn't changed and we do a fine grained invalidation by walking the functions in the module and running the invalidate for them all at the manager level and letting it try to invalidate any passes. This in turn makes it blindingly obvious why we should hoist the invalidate trait and have two collections of results. That allows handling invalidation for almost all analyses without indirect calls and it allows short circuiting when the preserved set is all. llvm-svn: 195338	2013-11-21 10:53:05 +00:00
Peter Zotov	770a3bbabb	[OCaml] Unbreak make install by providing ocamldoc target llvm-svn: 195336	2013-11-21 10:10:16 +00:00
Kostya Serebryany	4b3f2e0afc	add 'REQUIRES: asserts' to a test that uses 'llc -debug'; this fixes the no-asserts build llvm-svn: 195333	2013-11-21 09:28:16 +00:00
Chandler Carruth	41afa5d52d	[PM] Add support for using SFINAE to reflect on an analysis's result type and detect whether or not it provides an 'invalidate' member the analysis manager should use. This lets the overwhelming common case of not caring about custom behavior when an analysis is invalidated be the obvious default behavior with no code written by the author of an analysis. Only when they write code specifically to handle invalidation does it get used. Both cases are actually covered by tests here. The test analysis uses the default behavior, and the proxy module analysis actually has custom behavior on invalidation that is firing correctly. (In fact, this is the analysis which was the primary motivation for having custom invalidation behavior in the first place.) llvm-svn: 195332	2013-11-21 09:10:21 +00:00
Ana Pazos	86d72bbede	Implemented Neon scalar vdup_lane intrinsics. Fixed scalar dup alias and added test case. llvm-svn: 195330	2013-11-21 08:16:15 +00:00
Ana Pazos	5ddc31e426	Implemented Neon scalar by element intrinsics. Intrinsics implemented: vqdmull_lane, vqdmulh_lane, vqrdmulh_lane, vqdmlal_lane, vqdmlsl_lane scalar Neon intrinsics. llvm-svn: 195327	2013-11-21 07:37:04 +00:00
Kostya Serebryany	1513e9969b	Don't speculate loads under ThreadSanitizer Summary: Don't speculate loads under ThreadSanitizer. This fixes https://code.google.com/p/thread-sanitizer/issues/detail?id=40 Also discussed here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-November/067929.html Reviewers: chandlerc Reviewed By: chandlerc CC: llvm-commits, dvyukov Differential Revision: http://llvm-reviews.chandlerc.com/D2227 llvm-svn: 195324	2013-11-21 07:29:28 +00:00
Bill Wendling	07a5510fa2	The basic problem is that some mainstream programs cannot deal with the way clang optimizes tail calls, as in this example: int foo(void); int bar(void) { return foo(); } where the call is transformed to: calll .L0$pb .L0$pb: popl %eax .Ltmp0: addl $_GLOBAL_OFFSET_TABLE_+(.Ltmp0-.L0$pb), %eax movl foo@GOT(%eax), %eax popl %ebp jmpl *%eax # TAILCALL However, the GOT references must all be resolved at dlopen() time, and so this approach cannot be used with lazy dynamic linking (e.g. using RTLD_LAZY), which usually populates the PLT with stubs that perform the actual resolving. This patch changes X86TargetLowering::LowerCall() to skip tail call optimization, if the called function is a global or external symbol. Patch by Dimitry Andric! PR15086 llvm-svn: 195318	2013-11-21 07:04:30 +00:00
Paul Robinson	eba6ab82dd	Teach ISel not to optimize 'optnone' functions. Based on work by Andrea Di Biagio. llvm-svn: 195317	2013-11-21 06:33:32 +00:00
Lang Hames	682d6cc95b	Dereference the node iterator when dumping the PBQP graph structure in DOT format. Thanks to Arnaud A. de Grandmaison for the patch! llvm-svn: 195316	2013-11-21 06:30:14 +00:00
Reed Kotler	caba86b795	Add, to constant islands, long jumps similar to ARM far branch. llvm-svn: 195312	2013-11-21 05:13:23 +00:00
Yuchen Wu	5480edb76f	llvm-cov: Don't assume FileChecksum was generated. For cases where emitProfileArcs() was called but emitProfileNotes() was not, set the CfgChecksum to 0. llvm-svn: 195311	2013-11-21 04:53:39 +00:00
Yuchen Wu	ef87eca111	llvm-cov: Formatting change. llvm-svn: 195310	2013-11-21 04:12:10 +00:00
Yuchen Wu	d218a85f8c	llvm-cov: Fixed some bugs related to file checksum. Added call to update CfgChecksum. Made FileChecksum a vector, separate for each source file. llvm-svn: 195309	2013-11-21 04:01:05 +00:00
Chandler Carruth	dbfa25a6b6	[PM] Add a module analysis pass proxy for the function analysis manager. This proxy will fill the role of proxying invalidation events down IR unit layers so that when a module changes we correctly invalidate function analyses. Currently this is a very coarse solution -- any change blows away the entire thing -- but the next step is to make invalidation handling more nuanced so that we can propagate specific amounts of invalidation from one layer to the next. The test is extended to place a module pass between two function pass managers each of which have preserved function analyses which get correctly invalidated by the module pass that might have changed what functions are even in the module. llvm-svn: 195304	2013-11-21 02:11:31 +00:00
Eric Christopher	14dfe16fc9	Move DebugInfoOffset member near the other data member it helps describe. llvm-svn: 195299	2013-11-21 01:29:16 +00:00
Eric Christopher	5cb8bb6bc9	Reflow some documentation and remove whitespace comments. Move DebugInfoOffset data member up with the rest of the data members. llvm-svn: 195298	2013-11-21 01:29:13 +00:00
Eric Christopher	61450632e7	Add more documentation for the lookup tables data members. llvm-svn: 195297	2013-11-21 01:16:31 +00:00
Eric Christopher	8c89e3899c	Reorder language in the CompileUnit description and add a comment. Language may only be a temporary addition. llvm-svn: 195296	2013-11-21 01:14:00 +00:00
Eric Christopher	874a7d424a	Update comment. llvm-svn: 195293	2013-11-21 01:01:30 +00:00
Eric Christopher	d2217c5bea	Constify the DIEs used for pubname and pubtype tables. Propagate through findAttribute etc. llvm-svn: 195290	2013-11-21 00:48:22 +00:00
Nick Kledzik	3e803171af	YAML I/O add support for validate() MappingTrait template specializations can now have a validate() method which performs semantic checking. For details, see <http://llvm.org/docs/YamlIO.html>. llvm-svn: 195286	2013-11-21 00:28:07 +00:00
Nick Kledzik	11ec8eba6c	revert r194655 llvm-svn: 195285	2013-11-21 00:20:10 +00:00
Hal Finkel	fb82ed6bb5	PPC popcnt[dw] do not have record forms The instruction definitions incorrectly specified that popcntd and popcntw have record forms; they do not. This mistake was causing invalid code generation. llvm-svn: 195272	2013-11-20 20:54:55 +00:00
Hal Finkel	b17e18e8f4	Add loop rerolling code owner I am the code owner of the loop reroller. llvm-svn: 195271	2013-11-20 20:54:33 +00:00
Benjamin Kramer	40f6475264	MachineBlockPlacement: Strengthen the source order bias when picking an exit block. We now only allow breaking source order if the exit block frequency is significantly higher than the other exit block. The actual bias is currently under a flag so the best cut-off can be found; the flag defaults to the old behavior. The idea is to get some benchmark coverage over different values for the flag and pick the best one. When we require the new frequency to be at least 20% higher than the old frequency I see a 5% speedup on zlib's deflate when compressing a random file on x86_64/westmere. Hal reported a small speedup on Fhourstones on a BG/Q and no regressions in the test suite. The test case is the full long_match function from zlib's deflate. I was reluctant to add it for previous tweaks to branch probabilities because it's large and potentially fragile, but changed my mind since it's an important use case and more likely to break with all the current work going into the PGO infrastructure. Differential Revision: http://llvm-reviews.chandlerc.com/D2202 llvm-svn: 195265	2013-11-20 19:08:44 +00:00
David Blaikie	bd559db852	DwarfCompileUnit: Initialize DebugInfoOffset. While not strictly necessary (the class has an invariant that "setDebugInfoOffset" is called before "getDebugInfoOffset" - any client that actually gets the default zero offset is buggy/broken) this is consistent with the code as originally written and the removal of the initialization was an accident in r195166. Suggested by Manman Ren. llvm-svn: 195263	2013-11-20 18:52:39 +00:00
David Blaikie	7888a33645	CR feedback for r195166: Add comments regarding type unit mapping and type units disabling cross-CU sharing. Changes suggested by Manman Ren. llvm-svn: 195262	2013-11-20 18:40:16 +00:00
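
The form selection described in r195391 above can be sketched roughly as follows. This is only an illustration, not the code from the patch: the function name `sectionOffsetForm` is hypothetical, and the only real values here are the two form codes, which come from the DWARF specification.

```cpp
#include <cassert>

// Form codes as assigned by the DWARF specification.
enum Form : unsigned {
  DW_FORM_data4 = 0x06,
  DW_FORM_sec_offset = 0x17
};

// Hypothetical helper: pick the form for an attribute whose value is an
// offset into a section, based on the DWARF version being emitted.
Form sectionOffsetForm(unsigned dwarfVersion) {
  assert(dwarfVersion >= 2 && "sketch only models DWARF 2 and later");
  return dwarfVersion >= 4 ? DW_FORM_sec_offset : DW_FORM_data4;
}
```

With this shape, a test can emit the same metadata under two versions and check for `DW_FORM_data4` in one run and `DW_FORM_sec_offset` in the other, which is what the hidden llc option in the commit is meant to enable.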
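
For readers unfamiliar with the pattern named in r195383, the source-level expression that the backend may fold into SHLD looks like the sketch below. The helper name `shldPattern` is hypothetical, and the function only models the arithmetic; it says nothing about which instructions a given compiler will actually select.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper showing the (x << c) | (y >> (64 - c)) pattern.
// The high bits of the result come from x, the low c bits from the top of y.
// Only valid for 0 < c < 64: shifting a 64-bit value by 64 is undefined in C++.
std::uint64_t shldPattern(std::uint64_t x, std::uint64_t y, unsigned c) {
  assert(c > 0 && c < 64 && "shift amount must be in (0, 64)");
  return (x << c) | (y >> (64 - c));
}
```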
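
The ordering fix in r195364 (check the widths first, then compare `Mask == ~InvMask`) can be illustrated with a small stand-in type. `Bits` and `isInverseMask` are hypothetical names used only for this sketch; the real code works on LLVM's arbitrary-precision integers, which this does not attempt to reproduce.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a fixed-width bit pattern (width in 1..64).
struct Bits {
  unsigned width;
  std::uint64_t value;
};

// Returns true only when both masks have the same width and are bitwise
// inverses of each other within that width. The width test comes first so
// that mismatched widths are rejected instead of being compared.
bool isInverseMask(const Bits &mask, const Bits &invMask) {
  assert(mask.width >= 1 && mask.width <= 64);
  assert(invMask.width >= 1 && invMask.width <= 64);
  if (mask.width != invMask.width)
    return false;
  const std::uint64_t all =
      mask.width == 64 ? ~0ULL : ((1ULL << mask.width) - 1);
  return (mask.value & all) == (~invMask.value & all);
}
```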
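
The inversion scheme from r195355 can be modelled in a few lines. This is a toy model under stated assumptions: the target is assumed to support only SETEQ natively, and every name below (`LegalizedCC`, `legalizeCondCode`, `nativeSetCC`, `lowerSetCC`, `lowerSelectCC`) is hypothetical rather than taken from SelectionDAG.

```cpp
#include <cassert>
#include <utility>

enum class CondCode { SETEQ, SETNE };

// Result of legalization: the condition code to use, plus a flag asking the
// caller to invert the outcome.
struct LegalizedCC {
  CondCode cc;
  bool invertResult;
};

// Assumed target: SETEQ is legal, SETNE is not, so SETNE becomes
// "SETEQ, then invert".
LegalizedCC legalizeCondCode(CondCode cc) {
  if (cc == CondCode::SETNE)
    return {CondCode::SETEQ, true};
  return {cc, false};
}

// The only comparison the model target can perform directly.
bool nativeSetCC(CondCode cc, int a, int b) {
  assert(cc == CondCode::SETEQ && "model target only supports SETEQ");
  return a == b;
}

// SETCC case: invert the result of the comparison.
bool lowerSetCC(CondCode cc, int a, int b) {
  LegalizedCC l = legalizeCondCode(cc);
  bool r = nativeSetCC(l.cc, a, b);
  return l.invertResult ? !r : r;
}

// SELECT_CC case: swap the true/false operands instead of inverting.
int lowerSelectCC(CondCode cc, int a, int b, int ifTrue, int ifFalse) {
  LegalizedCC l = legalizeCondCode(cc);
  if (l.invertResult)
    std::swap(ifTrue, ifFalse);
  return nativeSetCC(l.cc, a, b) ? ifTrue : ifFalse;
}
```

Swapping the operands in the select case avoids materializing an extra NOT, which is why the two callers handle the inverted code differently.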
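
The SFINAE detection described in r195332 follows a well-known C++11 idiom, sketched below. The sketch is deliberately simplified and is not the pass manager's code: the zero-argument `invalidate()` signature, the trait name `HasInvalidate`, and the types `PlainResult` and `CustomResult` are all assumptions made for illustration.

```cpp
#include <cassert>
#include <type_traits>
#include <utility>

// Detects whether T has a callable member invalidate() (simplified: no args).
template <typename T> class HasInvalidate {
  // Viable only if the call expression is well-formed for U.
  template <typename U>
  static auto test(int)
      -> decltype((void)std::declval<U &>().invalidate(), std::true_type());
  // Fallback chosen when the overload above is removed by SFINAE.
  template <typename U> static std::false_type test(...);

public:
  static constexpr bool value = decltype(test<T>(0))::value;
};

// Custom behavior: used only when the result type provides invalidate().
template <typename T>
typename std::enable_if<HasInvalidate<T>::value, bool>::type
invalidateResult(T &result) {
  return result.invalidate();
}

// Default behavior: with no invalidate() member, treat the result as stale.
template <typename T>
typename std::enable_if<!HasInvalidate<T>::value, bool>::type
invalidateResult(T &) {
  return true;
}

struct PlainResult {};  // author wrote no invalidation code
struct CustomResult {   // author opted in to custom handling
  bool invalidate() { return false; } // "I am still valid"
};
```

The point of the design is visible in `PlainResult`: an analysis author who does not care about invalidation writes nothing and still gets sensible default behavior.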
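
The source-order bias from r195265 amounts to a percentage threshold on block frequencies. The sketch below shows one plausible way to express "at least N% higher" in integer arithmetic; the function name `preferNewExit` and its parameters are hypothetical, the actual flag and comparison in MachineBlockPlacement may differ, and the frequencies are assumed small enough that multiplying by about 200 does not overflow 64 bits.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical check: break source order only if the candidate exit block's
// frequency exceeds the current one by at least biasPercent percent.
// biasPercent == 0 falls back to a plain "not lower than" comparison.
bool preferNewExit(std::uint64_t newFreq, std::uint64_t oldFreq,
                   unsigned biasPercent) {
  assert(biasPercent <= 100 && "sketch assumes a bias of at most 100%");
  return newFreq * 100 >= oldFreq * (100 + biasPercent);
}
```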

97773 Commits