llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-14 15:19:33 +00:00

Author	SHA1	Message	Date
Bob Wilson	d433cf7463	Avoid illegal integer promotion in fastisel Stop folding constant adds into GEP when the type size doesn't match. Otherwise, the adds' operands are effectively being promoted, changing the conditions of an overflow. Results are different when: sext(a) + sext(b) != sext(a + b) Problem originally found on x86-64, but also fixed issues with ARM and PPC, which used similar code. <rdar://problem/15292280> Patch by Duncan Exon Smith! llvm-svn: 194840	2013-11-15 19:09:27 +00:00
Cameron McInally	cae8bdeb82	Add AVX512 unmasked FMA intrinsics and support. llvm-svn: 194824	2013-11-15 17:01:14 +00:00
Daniel Sanders	0ebbe1d56c	Fix illegal DAG produced by SelectionDAG::getConstant() for v2i64 type Summary: When getConstant() is called for an expanded vector type, it is split into multiple scalar constants which are then combined using appropriate build_vector and bitcast operations. In addition to the usual big/little endian differences, the case where the element-order of the vector does not have the same endianness as the elements themselves is also accounted for. For example, for v4i32 on big-endian MIPS, the byte-order of the vector is <3210,7654,BA98,FEDC>. For little-endian, it is <0123,4567,89AB,CDEF>. Handling this case turns out to be a nop since getConstant() returns a splatted vector (so reversing the element order doesn't change the value) This fixes a number of cases in MIPS MSA where calling getConstant() during operation legalization introduces illegal types (e.g. to legalize v2i64 UNDEF into a v2i64 BUILD_VECTOR of illegal i64 zeros). It should also handle bigger differences between illegal and legal types such as legalizing v2i64 into v8i16. lowerMSASplatImm() in the MIPS backend no longer needs to avoid calling getConstant() so this function has been updated in the same patch. For the sake of transparency, the steps I've taken since the review are: * Added 'virtual' to isVectorEltOrderLittleEndian() as requested. This revealed that the MIPS tests were falsely passing because a polymorphic function was not actually polymorphic in the reviewed patch. * Fixed the tests that were now failing. This involved deleting the code to handle the MIPS MSA element-order (which was previously doing an byte-order swap instead of an element-order swap). This left isVectorEltOrderLittleEndian() unused and it was deleted. * Fixed build failures caused by rebasing beyond r194467-r194472. These build failures involved the bset, bneg, and bclr instructions added in these commits using lowerMSASplatImm() in a way that was no longer valid after this patch. Some of these were fixed by calling SelectionDAG::getConstant() instead, others were fixed by a new function getBuildVectorSplat() that provided the removed functionality of lowerMSASplatImm() in a more sensible way. Reviewers: bkramer Reviewed By: bkramer CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1973 llvm-svn: 194811	2013-11-15 12:56:49 +00:00
Matt Arsenault	084675c776	Add target hook to prevent folding some bitcasted loads. This is to avoid this transformation in some cases: fold (conv (load x)) -> (load (conv*)x) On architectures that don't natively support some vector loads efficiently casting the load to a smaller vector of larger types and loading is more efficient. Patch by Micah Villmow. llvm-svn: 194783	2013-11-15 04:42:23 +00:00
Peter Zotov	d805fbbee2	[llvm-c] Add missing const qualifiers to LLVMCreateTargetMachine llvm-svn: 194770	2013-11-15 02:51:12 +00:00
Peter Zotov	9d1fb89548	[llvm-c] Simplify signature of LLVMGetTargetFromName LLVMGetTargetFromName was not yet present in an LLVM release, so this does not break compatibility. llvm-svn: 194769	2013-11-15 02:51:01 +00:00
Matt Arsenault	9921608896	Add addrspacecast instruction. Patch by Michele Scandale! llvm-svn: 194760	2013-11-15 01:34:59 +00:00
Rui Ueyama	0a09bce84f	Include raw_ostream.h. Including only Debug.h did not cause a compilation error, but you couldn't do anything (like writing something with <<) to raw_ostreams returned by llvm::dbgs() or llvm::errs() without including raw_ostream.h. So including it from Debug.h should make sense. Differential Revision: http://llvm-reviews.chandlerc.com/D2183 llvm-svn: 194759	2013-11-15 01:25:34 +00:00
Chandler Carruth	46b00ab145	Fix the header comment of the new pass manager stuff to not claim to be the legacy stuff. =] llvm-svn: 194689	2013-11-14 10:55:14 +00:00
Kevin Qin	7409a29609	[AArch64 neon] support poly64 and relevant intrinsic functions. llvm-svn: 194659	2013-11-14 03:27:58 +00:00
Kevin Qin	47a3b639e3	Implement aarch64 neon instruction class SIMD misc. llvm-svn: 194656	2013-11-14 02:44:13 +00:00
Nick Kledzik	5701cc8c01	Add dyn_cast<> support to YAML I/O's IO class llvm-svn: 194655	2013-11-14 02:38:07 +00:00
Michael Gottesman	37877d5d63	Added BlockFrequencyInfo::view for displaying the block frequency propagation graph via graphviz. This is useful for debugging issues in the BlockFrequency implementation since one can easily visualize where probability mass and other errors occur in the propagation. llvm-svn: 194654	2013-11-14 02:27:46 +00:00
Jiangning Liu	5a9b5605ba	Implement AArch64 NEON instruction set AdvSIMD (table). llvm-svn: 194648	2013-11-14 01:57:32 +00:00
Nick Kledzik	cbecad4abd	Add simple support for tags in YAML I/O llvm-svn: 194644	2013-11-14 00:59:59 +00:00
Yuchen Wu	68c54ebed9	llvm-cov: Slightly improved error checking. - readInt() should check all 4 bytes can be read, not just 1. - In the event of false data in the gcno file, it was possible to index into a non-existent index of SmallVector, causing assertion error. llvm-svn: 194639	2013-11-14 00:38:41 +00:00
Yuchen Wu	97ff0f44ba	llvm-cov: Removed StringMap holding GCOVLines. According to the hazy gcov documentation, it appeared to be technically possible for lines within a block to belong to different source files. However, upon further investigation, gcov does not actually support multiple source files for a single block. This change removes a level of separation between blocks and lines by replacing the StringMap of GCOVLines with a SmallVector of ints representing line numbers. This also means that the GCOVLines class is no longer needed. This paves the way for supporting the "-a" option, which will output block information. llvm-svn: 194637	2013-11-14 00:32:00 +00:00
Yuchen Wu	fe9ba771b2	llvm-cov: Replaced asserts with proper error handling. Unified the interface for read functions. They all return a boolean indicating if the read from file succeeded. Functions that previously returned the read value now store it into a variable that is passed in by reference instead. Callers will need to check the return value to detect if an error occurred. Also added a new test which ensures that no assertions occur when file contains invalid data. llvm-cov should return with error code 1 upon failure. llvm-svn: 194635	2013-11-14 00:07:15 +00:00
Chad Rosier	fae5b22550	[AArch64] Add support for legacy AArch32 NEON scalar shift by immediate instructions. This patch does not include the shift right and accumulate instructions. A number of non-overloaded intrinsics have been remove in favor of their overloaded counterparts. llvm-svn: 194598	2013-11-13 20:05:37 +00:00
Benjamin Kramer	3bed3b6cb7	Make sure LLVMLoadLibraryPermanently gets an extern "C" symbol. Otherwise it's impossible to use it. Also don't include C++ headers in a C header. llvm-svn: 194581	2013-11-13 15:35:13 +00:00
Rafael Espindola	3c0d3eb4a9	Remove AllowQuotesInName and friends from MCAsmInfo. Accepting quotes is a property of an assembler, not of an object file. For example, ELF can support any names for sections and symbols, but the gnu assembler only accepts quotes in some contexts and llvm-mc in a few more. LLVM should not produce different symbols based on a guess about which assembler will be reading the code it is printing. llvm-svn: 194575	2013-11-13 14:01:59 +00:00
Diego Novillo	7b4e2dda6b	SampleProfileLoader pass. Initial setup. This adds a new scalar pass that reads a file with samples generated by 'perf' during runtime. The samples read from the profile are incorporated and emmited as IR metadata reflecting that profile. The profile file is assumed to have been generated by an external profile source. The profile information is converted into IR metadata, which is later used by the analysis routines to estimate block frequencies, edge weights and other related data. External profile information files have no fixed format, each profiler is free to define its own. This includes both the on-disk representation of the profile and the kind of profile information stored in the file. A common kind of profile is based on sampling (e.g., perf), which essentially counts how many times each line of the program has been executed during the run. The SampleProfileLoader pass is organized as a scalar transformation. On startup, it reads the file given in -sample-profile-file to determine what kind of profile it contains. This file is assumed to contain profile information for the whole application. The profile data in the file is read and incorporated into the internal state of the corresponding profiler. To facilitate testing, I've organized the profilers to support two file formats: text and native. The native format is whatever on-disk representation the profiler wants to support, I think this will mostly be bitcode files, but it could be anything the profiler wants to support. To do this, every profiler must implement the SampleProfile::loadNative() function. The text format is mostly meant for debugging. Records are separated by newlines, but each profiler is free to interpret records as it sees fit. Profilers must implement the SampleProfile::loadText() function. Finally, the pass will call SampleProfile::emitAnnotations() for each function in the current translation unit. This function needs to translate the loaded profile into IR metadata, which the analyzer will later be able to use. This patch implements the first steps towards the above design. I've implemented a sample-based flat profiler. The format of the profile is fairly simplistic. Each sampled function contains a list of relative line locations (from the start of the function) together with a count representing how many samples were collected at that line during execution. I generate this profile using perf and a separate converter tool. Currently, I have only implemented a text format for these profiles. I am interested in initial feedback to the whole approach before I send the other parts of the implementation for review. This patch implements: - The SampleProfileLoader pass. - The base ExternalProfile class with the core interface. - A SampleProfile sub-class using the above interface. The profiler generates branch weight metadata on every branch instructions that matches the profiles. - A text loader class to assist the implementation of SampleProfile::loadText(). - Basic unit tests for the pass. Additionally, the patch uses profile information to compute branch weights based on instruction samples. This patch converts instruction samples into branch weights. It does a fairly simplistic conversion: Given a multi-way branch instruction, it calculates the weight of each branch based on the maximum sample count gathered from each target basic block. Note that this assignment of branch weights is somewhat lossy and can be misleading. If a basic block has more than one incoming branch, all the incoming branches will get the same weight. In reality, it may be that only one of them is the most heavily taken branch. I will adjust this assignment in subsequent patches. llvm-svn: 194566	2013-11-13 12:22:21 +00:00
Chandler Carruth	e238c58a05	Add another (perhaps better) video for Sean's talk. (Thanks Marshall!) llvm-svn: 194549	2013-11-13 02:49:38 +00:00
Chandler Carruth	7c64c22064	Fix a null pointer dereference when copying a null polymorphic pointer. This bug only bit the C++98 build bots because all of the actual uses really do move. ;] But not quite ready to do the whole C++11 switch yet, so clean it up. Also add a unit test that catches this immediately. llvm-svn: 194548	2013-11-13 02:48:20 +00:00
Chandler Carruth	4b8976e254	Give folks a reference to some material on the fundamental design pattern in use here. Addresses review feedback from Sean (thanks!) and others. llvm-svn: 194541	2013-11-13 01:51:36 +00:00
Chandler Carruth	4e1d27ef68	Introduce an AnalysisManager which is like a pass manager but with a lot more smarts in it. This is where most of the interesting logic that used to live in the implicit-scheduling-hackery of the old pass manager will live. Like the previous commits, note that this is a very early prototype! I expect substantial changes before this is ready to use. The core of the design is the following: - We have an AnalysisManager which can be used across a series of passes over a module. - The code setting up a pass pipeline registers the analyses available with the manager. - Individual transform passes can check than an analysis manager provides the analyses they require in order to fail-fast. - There is no implicit registration or scheduling. - Analysis passes are different from other passes: they produce an analysis result that is cached and made available via the analysis manager. - Cached results are invalidated automatically by the pass managers. - When a transform pass requests an analysis result, either the analysis is run to produce the result or a cached result is provided. There are a few aspects of this design that I know will change in subsequent commits: - Currently there is no "preservation" system, that needs to be added. - All of the analysis management should move up to the analysis library. - The analysis management needs to support at least SCC passes. Maybe loop passes. Living in the analysis library will facilitate this. - Need support for analyses which are both module and function passes. - Need support for pro-actively running module analyses to have cached results within a function pass manager. - Need a clear design for "immutable" passes. - Need support for requesting cached results when available and not re-running the pass even if that would be necessary. - Need more thorough testing of all of this infrastructure. There are other aspects that I view as open questions I'm hoping to resolve as I iterate a bit on the infrastructure, and especially as I start writing actual passes against this. - Should we have separate management layers for function, module, and SCC analyses? I think "yes", but I'm not yet ready to switch the code. Adding SCC support will likely resolve this definitively. - How should the 'require' functionality work? Should that be the only way to request results to ensure that passes always require things? - How should preservation work? - Probably some other things I'm forgetting. =] Look forward to more patches in shorter order now that this is in place. llvm-svn: 194538	2013-11-13 01:12:08 +00:00
Aaron Ballman	247357cde7	Removing llvm::huge_vald and llvm::huge_vall because they are not currently used, and HUGE_VALD does not appear to be supported everywhere anyways. llvm-svn: 194535	2013-11-13 00:20:43 +00:00
Aaron Ballman	7c6e917033	Replacing HUGE_VALF with llvm::huge_valf in order to work around a warning triggered in MSVC 12. Patch reviewed by Reid Kleckner and Jim Grosbach. llvm-svn: 194533	2013-11-13 00:15:44 +00:00
Rafael Espindola	cd6c69e3df	Remove always true flag. llvm-svn: 194530	2013-11-12 23:27:08 +00:00
Sebastian Pop	c8eb6dbd80	delinearization of arrays llvm-svn: 194527	2013-11-12 22:47:20 +00:00
Sebastian Pop	3f70012168	remove virtual methods in SCEVApplyRewriter and SCEVParameterRewriter llvm-svn: 194526	2013-11-12 22:47:05 +00:00
Justin Bogner	f42cbce2da	Protect user-supplied runtime library functions in LTO Add user-supplied C runtime and compiler-rt library functions to llvm.compiler.used to protect them from premature optimization by passes like -globalopt and -ipsccp. Calls to (seemingly unused) runtime library functions can be added by -instcombine and instruction lowering. Patch by Duncan Exon Smith, thanks! Fixes <rdar://problem/14740087> llvm-svn: 194514	2013-11-12 21:44:01 +00:00
Weiming Zhao	fcaf85bb3d	Export intrinsics:__builtin_arm_{dmb,dsb} to frontend llvm-svn: 194505	2013-11-12 19:57:43 +00:00
Andrew Trick	92750f0da5	GraphViz CFGPrinter: wrap long lines. llvm-svn: 194496	2013-11-12 18:06:09 +00:00
Andrew Trick	9e55e72253	whitespace llvm-svn: 194495	2013-11-12 18:06:06 +00:00
Rafael Espindola	0cc236292f	Revert "Remove unused variable." This reverts commit r194485. The variable is unused in some macro instantiations, but not others. We should probably fix clang to not warn on this. llvm-svn: 194486	2013-11-12 16:37:31 +00:00
Rafael Espindola	77c22a3137	Remove unused variable. llvm-svn: 194485	2013-11-12 16:31:59 +00:00
Wan Xiaofei	b1eca50fd8	Change data structure to memorize computed result in ScalarEvolution Replace std::map with SmallVector to memorize the cached result since SCEV usually belongs to little Loop/BB Linear scan on SmallVector is faster than std::map. Code reviewer : Andrew Trick. Test result : Pass Unit Test & LLVM Test Suite 401.bzip2 0.425721 0.419981 101.37% 403.gcc 24.53855 24.2667 101.12% 429.mcf 0.060847 0.059944 101.51% 433.milc 0.646009 0.636119 101.55% 444.namd 1.383928 1.370614 100.97% 445.gobmk 5.836575 5.800225 100.63% 450.soplex 1.911257 1.895963 100.81% 456.hmmer 1.039565 1.032534 100.68% 458.sjeng 0.897401 0.885567 101.34% 464.h264ref 3.645908 3.577991 101.90% 470.lbm 0.049456 0.048398 102.19% 471.omnetpp 5.638575 5.60435 100.61% bitmnp01 0.045738 0.045291 100.99% cjpegv2data 0.304359 0.302833 100.50% idctrn01 0.046433 0.045763 101.46% quake2 4.534416 4.4952 100.87% quake 2.688566 2.659208 101.10% xcsoar 12.42545 12.30385 100.99% linpack 0.038739 0.03803 101.86% matrix01 0.053564 0.0528 101.45% nbench 0.402867 0.395803 101.78% tblook01 0.021265 0.021015 101.19% ttsprk01 0.066384 0.065566 101.25% llvm-svn: 194459	2013-11-12 09:40:41 +00:00
Arnaud A. de Grandmaison	26d846f560	CalcSpillWeights: allow overidding the spill weight normalizing function This will enable the PBQP register allocator to provide its own normalizing function. No functionnal change. llvm-svn: 194417	2013-11-11 19:56:14 +00:00
Chad Rosier	8d7ebe36dd	[AArch64] The shift right/left and insert immediate builtins expect 3 source operands, a vector, an element to insert, and a shift amount. llvm-svn: 194406	2013-11-11 19:11:11 +00:00
Arnaud A. de Grandmaison	8c40e45072	CalcSpillWeights: give a better describing name to calculateSpillWeights Besides, this relates it more obviously to the VirtRegAuxInfo::calculateSpillWeightAndHint. No functionnal change. llvm-svn: 194404	2013-11-11 19:04:45 +00:00
Chad Rosier	4848250116	[AArch64] Add support for NEON scalar floating-point convert to fixed-point instructions. llvm-svn: 194394	2013-11-11 18:04:07 +00:00
Peter Zotov	91167f1db8	[llvm-c] Remove dead typedef llvm-svn: 194379	2013-11-11 14:47:01 +00:00
Pete Cooper	28d2c72258	Don't universally enable initialiser lists on GCC. Thanks for catching this Chandler llvm-svn: 194365	2013-11-11 05:14:42 +00:00
Pete Cooper	ef6cf2aa2f	Add LLVM_HAS_INITIALIZER_LISTS for upcoming C++11 support. Use it in ArrayRef llvm-svn: 194362	2013-11-11 03:58:00 +00:00
Arnaud A. de Grandmaison	6b862708a7	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was though that CalcSpillWeights does not need to be a pass. This change will enable to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functionnal change. llvm-svn: 194356	2013-11-10 17:46:31 +00:00
Chandler Carruth	62e299ec37	[PM] Start sketching out the new module and function pass manager. This is still just a skeleton. I'm trying to pull together the experimentation I've done into committable chunks, and this is the first coherent one. Others will follow in hopefully short order that move this more toward a useful initial implementation. I still expect the design to continue evolving in small ways as I work through the different requirements and features needed here though. Keep in mind, all of this is off by default. Currently, this mostly exercises the use of a polymorphic smart pointer and templates to hide the polymorphism for the pass manager from the pass implementation. The next step will be more significant, adding the first framework of analysis support. llvm-svn: 194325	2013-11-09 13:09:08 +00:00
Chandler Carruth	f2e7a23acb	Move the old pass manager infrastructure into a legacy namespace and give the files a legacy prefix in the right directory. Use forwarding headers in the old locations to paper over the name change for most clients during the transitional period. No functionality changed here! This is just clearing some space to reduce renaming churn later on with a new system. Even when the new stuff starts to go in, it is going to be hidden behind a flag and off-by-default as it is still WIP and under development. This patch is specifically designed so that very little out-of-tree code has to change. I'm going to work as hard as I can to keep that the case. Only direct forward declarations of the PassManager class are impacted by this change. llvm-svn: 194324	2013-11-09 12:26:54 +00:00
Filip Pizlo	821ddf0c78	This exposes the new calling conventions (WebKit_JS and AnyReg) via the C API by adding them to the enumeration in Core.h. llvm-svn: 194323	2013-11-09 06:00:03 +00:00
Chandler Carruth	17436ff36f	Switch to allow implicit construction. In many cases, we're wrapping a derived type and this makes it much easier to write this code. llvm-svn: 194321	2013-11-09 05:55:03 +00:00

1 2 3 4 5 ...

19126 Commits