llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-19 09:36:44 +00:00

Author	SHA1	Message	Date
Saleem Abdulrasool	f01cc77809	[ARM] check bitwidth in PerformORCombine When simplifying a (or (and B A) (and C ~A)) to a (VBSL A B C) ensure that the bitwidth of the second operands to both ands match before comparing the negation of the values. Split the check of the value of the second operands to the ands. Move the cast and variable declaration slightly higher to make it slightly easier to follow. Bug-Id: 16700 Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org> llvm-svn: 187404	2013-07-30 04:43:08 +00:00
Rafael Espindola	0c36f79092	Remove more dead documentation. llvm-svn: 187403	2013-07-30 04:06:06 +00:00
Venkatraman Govindaraju	a1da2b1b3a	[Sparc] Use call's debugloc for the unimp instruction. llvm-svn: 187402	2013-07-30 02:26:29 +00:00
Bill Schmidt	f02898d71f	[PowerPC] Skeletal FastISel support for 64-bit PowerPC ELF. This is the first of many upcoming patches for PowerPC fast instruction selection support. This patch implements the minimum necessary for a functional (but extremely limited) FastISel pass. It allows the table-generated portions of the selector to be created and used, but in most cases selection will fall back to the DAG selector. None of the block terminator instructions are implemented yet, and most interesting instructions require some special handling. Therefore there aren't any new test cases with this patch. There will be quite a few tests coming with future patches. This patch adds the make/CMake support for the new code (including tablegen -gen-fast-isel) and creates the FastISel object for PPC64 ELF only. It instantiates the necessary virtual functions (TargetSelectInstruction, TargetMaterializeConstant, TargetMaterializeAlloca, tryToFoldLoadIntoMI, and FastLowerArguments), but of these, only TargetMaterializeConstant contains any useful implementation. This is present since the table-generated code requires the ability to materialize integer constants for some instructions. This patch has been tested by building and running the projects/test-suite code with -O0. All tests passed with the exception of a couple of long-running tests that time out using -O0 code generation. llvm-svn: 187399	2013-07-30 00:50:39 +00:00
Quentin Colombet	dbdfe6759b	[R600] Replicate old DAGCombiner behavior in target specific DAG combine. build_vector is lowered to REG_SEQUENCE, which is something the register allocator does a good job at optimizing. llvm-svn: 187397	2013-07-30 00:27:16 +00:00
Quentin Colombet	94c7d4af34	[DAGCombiner] insert_vector_elt: Avoid building a vector twice. This patch prevents the following combine when the input vector is used more than once. insert_vector_elt (build_vector elt0, ..., eltN), NewEltIdx, idx => build_vector elt0, ..., NewEltIdx, ..., eltN The reasons are: - Building a vector may be expensive, so try to reuse the existing part of a vector instead of creating a new one (think big vectors). - elt0 to eltN now have two users instead of one. This may prevent some other optimizations. llvm-svn: 187396	2013-07-30 00:24:09 +00:00
Eric Christopher	dc4f22cec8	Move file to X86 and add a triple to fix darwin bots for now. The problem is due to the section name being explicitly mentioned in the IR and differing between the two platforms. llvm-svn: 187394	2013-07-30 00:20:06 +00:00
Eric Christopher	7aa2c8e944	Fix a truly egregious thinko in anonymous namespace check, update testcase to make sure we generate debug info for walrus by adding a non-trivial constructor and verify that we don't emit an ODR signature for the type. llvm-svn: 187393	2013-07-29 23:53:08 +00:00
Eric Christopher	f6e5a0524b	Make sure we don't emit an ODR hash for types with no name and make sure the comments for each testcase are a bit easier to distinguish. llvm-svn: 187392	2013-07-29 23:53:05 +00:00
Eric Christopher	0a98290735	Clarify comments for types contained in anonymous namespaces and odr hashes. llvm-svn: 187391	2013-07-29 23:53:01 +00:00
Eric Christopher	979a1dca9f	Elaborate a bit on the type unit and ODR conditional code. llvm-svn: 187385	2013-07-29 22:24:32 +00:00
Rafael Espindola	197ab4ef5c	Make file_status::getUniqueID const. llvm-svn: 187383	2013-07-29 21:55:38 +00:00
Rafael Espindola	1480c060db	Delete documentation for deleted options. llvm-svn: 187380	2013-07-29 21:35:48 +00:00
Rafael Espindola	2bf7932f99	Include st_dev to make the result of getUniqueID actually unique. This will let us use getUniqueID instead of st_dev directly on clang. llvm-svn: 187378	2013-07-29 21:26:49 +00:00
Manman Ren	46cc5a281a	Debug Info: enable verifier for testing cases. llvm-svn: 187375	2013-07-29 20:18:19 +00:00
Akira Hatanaka	591151f071	[mips] Add comment and simplify function. llvm-svn: 187371	2013-07-29 19:08:34 +00:00
Nadav Rotem	a0826e8e1b	Add the C source code to the test to make it easier to update when debug info changes. Thanks Eric. llvm-svn: 187368	2013-07-29 18:47:36 +00:00
Nadav Rotem	fc24bbce9c	SLPVectorier: update the debug location for the new instructions. llvm-svn: 187363	2013-07-29 18:18:46 +00:00
Manman Ren	7a31996783	Debug Info: update testing cases to pass verifier. llvm-svn: 187362	2013-07-29 18:12:58 +00:00
Nico Rieck	53e36aaed9	Use proper section suffix for COFF weak symbols 32-bit symbols have "_" as global prefix, but when forming the name of COMDAT sections this prefix is ignored. The current behavior assumes that this prefix is always present which is not the case for 64-bit and names are truncated. llvm-svn: 187356	2013-07-29 13:58:39 +00:00
Nico Rieck	552fc262e0	Proper va_arg/va_copy lowering on win64 Win64 uses CharPtrBuiltinVaList instead of X86_64ABIBuiltinVaList like other 64-bit targets. llvm-svn: 187355	2013-07-29 13:07:06 +00:00
Aaron Ballman	c25578b0b6	Re-application of 187310. Re-enabling warning C4275 for MSVC 11 and up, but not MSVC 10 since it is still required there. llvm-svn: 187354	2013-07-29 13:02:08 +00:00
Rafael Espindola	ca0a1505ba	Add support for the 's' operation to llvm-ar. If no other operation is specified, 's' becomes an operation instead of an modifier. The s operation just creates a symbol table. It is the same as running ranlib. We assume the archive was created by a sane ar (like llvm-ar or gnu ar) and if the symbol table is present, then it is current. We use that to optimize the most common case: a broken build system that thinks it has to run ranlib. llvm-svn: 187353	2013-07-29 12:40:31 +00:00
Nico Rieck	d907b86dee	MC: Support larger COFF string tables Single-slash encoded entries do not require a terminating null. This bumps the maximum table size from ~1MB to ~9.5MB. llvm-svn: 187352	2013-07-29 12:30:12 +00:00
NAKAMURA Takumi	f0ebe1e004	ExceptionDemo.cpp: Tweak a @param. [-Wdocumentation] llvm-svn: 187351	2013-07-29 11:03:50 +00:00
Benjamin Kramer	9fe2707840	Some Intel Penryn CPUs come with SSE4 disabled. Detect them as core 2. PR16721. llvm-svn: 187350	2013-07-29 11:02:08 +00:00
Silviu Baranga	5aac9ffdd0	Allow generation of vmla.f32 instructions when targeting Cortex-A15. The patch also adds the VFP4 feature to Cortex-A15 and fixes the DontUseFusedMAC predicate so that we can still generate vmla.f32 instructions on non-darwin targets with VFP4. llvm-svn: 187349	2013-07-29 09:25:50 +00:00
Robert Lytton	524ff70cae	test commit llvm-svn: 187348	2013-07-29 09:23:13 +00:00
Chandler Carruth	4093b4bbce	Teach the AllocaPromoter which is wrapped around the SSAUpdater infrastructure to do promotion without a domtree the same smarts about looking through GEPs, bitcasts, etc., that I just taught mem2reg about. This way, if SROA chooses to promote an alloca which still has some noisy instructions this code can cope with them. I've not used as principled of an approach here for two reasons: 1) This code doesn't really need it as we were already set up to zip through the instructions used by the alloca. 2) I view the code here as more of a hack, and hopefully a temporary one. The SSAUpdater path in SROA is a real sore point for me. It doesn't make a lot of architectural sense for many reasons: - We're likely to end up needing the domtree anyways in a subsequent pass, so why not compute it earlier and use it. - In the future we'll likely end up needing the domtree for parts of the inliner itself. - If we need to we could teach the inliner to preserve the domtree. Part of the re-work of the pass manager will allow this to be very powerful even in large SCCs with many functions. - Ultimately, computing a domtree has gotten significantly faster since the original SSAUpdater-using code went into ScalarRepl. We no longer use domfrontiers, and much of domtree is lazily done based on queries rather than eagerly. - At this point keeping the SSAUpdater-based promotion saves a total of 0.7% on a build of the 'opt' tool for me. That's not a lot of performance given the complexity! So I'm leaving this a bit ugly in the hope that eventually we just remove all of this nonsense. I can't even readily test this because this code isn't reachable except through SROA. When I re-instate the patch that fast-tracks allocas already suitable for promotion, I'll add a testcase there that failed before this change. Before that, SROA will fix any test case I give it. llvm-svn: 187347	2013-07-29 09:06:53 +00:00
Nadav Rotem	931f83b2ef	Don't vectorize when the attribute NoImplicitFloat is used. llvm-svn: 187340	2013-07-29 05:13:00 +00:00
Rafael Espindola	ca86cefe79	Fix -Wdocumentation warnings. llvm-svn: 187336	2013-07-28 23:43:28 +00:00
Chandler Carruth	379281ffcc	Update comments for SSAUpdater to use the modern doxygen comment standards for LLVM. Remove duplicated comments on the interface from the implementation file (implementation comments are left there of course). Also clean up, re-word, and fix a few typos and errors in the commenst spotted along the way. This is in preparation for changes to these files and to keep the uninteresting tidying in a separate commit. llvm-svn: 187335	2013-07-28 22:00:33 +00:00
Craig Topper	b18d7964ca	Remove use of sprintf added to X86 disassembler tablegen code. Send message with instruction name to errs() instead and use a generic message for the llvm_unreachable. Consistent with other places in this file. llvm-svn: 187333	2013-07-28 21:28:02 +00:00
Aaron Ballman	bf546aa194	Partial revert of 187310; it seems MSVC 10 still spits out this warning, but MSVC 11 does not. llvm-svn: 187331	2013-07-28 18:04:26 +00:00
Chandler Carruth	cfd2dee91f	Temporarily revert r187323 until I update SSAUpdater to match mem2reg. I forgot that we had two totally independent things here. :: sigh :: llvm-svn: 187327	2013-07-28 09:05:49 +00:00
Elena Demikhovsky	a7a8eb9496	fixed compilation issue llvm-svn: 187325	2013-07-28 08:45:12 +00:00
Elena Demikhovsky	505373db43	Added encoding prefixes for KNL instructions (EVEX). Added 512-bit operands printing. Added instruction formats for KNL instructions. llvm-svn: 187324	2013-07-28 08:28:38 +00:00
Chandler Carruth	d3834e5bef	Now that mem2reg understands how to cope with a slightly wider set of uses of an alloca, we can pre-compute promotability while analyzing an alloca for splitting in SROA. That lets us short-circuit the common case of a bunch of trivially promotable allocas. This cuts 20% to 30% off the run time of SROA for typical frontend-generated IR sequneces I'm seeing. It gets the new SROA to within 20% of ScalarRepl for such code. My current benchmark for these numbers is PR15412, but it fits the general pattern of IR emitted by Clang so it should be widely applicable. llvm-svn: 187323	2013-07-28 08:27:12 +00:00
Chandler Carruth	ca3736c604	Thread DataLayout through the callers and into mem2reg. This will be useful in a subsequent patch, but causes an unfortunate amount of noise, so I pulled it out into a separate patch. llvm-svn: 187322	2013-07-28 06:43:11 +00:00
Bill Schmidt	f400d5844e	[PowerPC] Add comment explaining preprocessor directive. llvm-svn: 187320	2013-07-28 03:23:32 +00:00
Bill Schmidt	795f27eb7f	Revert 187318 llvm-svn: 187319	2013-07-28 02:13:24 +00:00
Bill Schmidt	0c824086d1	[PowerPC] Remove unnecessary preprocessor checking. The tests !defined(__ppc__) && !defined(__powerpc__) are not needed or helpful when verifying that code is being compiled for a 64-bit target. The simpler test provided by this revision is sufficient to tell if the target is 64-bit. llvm-svn: 187318	2013-07-28 02:08:13 +00:00
Nadav Rotem	b6c365bd70	Update the comment llvm-svn: 187316	2013-07-27 23:28:47 +00:00
Michael Gottesman	f899e8827f	[APFloat] Make all arithmetic operations with NaN produce positive NaNs. IEEE-754R 1.4 Exclusions states that IEEE-754R does not specify the interpretation of the sign of NaNs. In order to remove an irrelevant variable that most floating point implementations do not use, standardize add, sub, mul, div, mod so that operating anything with NaN always yields a positive NaN. In a later commit I am going to update the APIs for creating NaNs so that one can not even create a negative NaN. llvm-svn: 187314	2013-07-27 21:49:25 +00:00
Michael Gottesman	8ccc9506c7	[APFloat] Move setting fcNormal in zeroSignificand() to calling code. Zeroing the significand of a floating point number does not necessarily cause a floating point number to become finite non zero. For instance, if one has a NaN, zeroing the significand will cause it to become +/- infinity. llvm-svn: 187313	2013-07-27 21:49:21 +00:00
Michael Gottesman	a84f57e3e2	[APFloat] Removed nextafter from missing operations since it is implemented in APFloat::next. llvm-svn: 187312	2013-07-27 21:49:19 +00:00
Aaron Ballman	d188c8d939	Re-enabling some more MSVC warnings; all of these compile cleanly with no further changes required. llvm-svn: 187310	2013-07-27 20:20:28 +00:00
Matt Arsenault	124fe61a07	Minor code simplification suggested by Duncan llvm-svn: 187309	2013-07-27 19:22:28 +00:00
Benjamin Kramer	2b93c9d1a8	DwarfDebug: MD5 is always little endian, bswap on big endian platforms. This makes LLVM emit the same signature regardless of host and target endianess. llvm-svn: 187304	2013-07-27 14:14:43 +00:00
Chandler Carruth	4602399982	Create a constant pool symbol for the GOT in the ARMCGBR the same way we do in the SDag when lowering references to the GOT: use ARMConstantPoolSymbol rather than creating a dummy global variable. The computation of the alignment still feels weird (it uses IR types and datalayout) but it preserves the exact previous behavior. This change fixes the memory leak of the global variable detected on the valgrind leak checking bot. Thanks to Benjamin Kramer for pointing me at ARMConstantPoolSymbol to handle this use case. llvm-svn: 187303	2013-07-27 11:58:26 +00:00

... 4 5 6 7 8 ...

94588 Commits