llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2025-02-28 22:37:18 +00:00

Author	SHA1	Message	Date
Tobias Grosser	07c1c2fcc9	Make prevectorization width configurable Polly uses 'prevectorization' to enable outer loop vectorization. When vectorizing an outer loop, we strip-mine <number-of-prevec-dims> loop iterations which are than interchanged to the innermost level such that LLVM's inner loop vectorizer (or Polly's simple vectorizer) can easily vectorize this loop. The number of loop iterations to strip-mine is now configurable with the option -polly-prevect-width=<number-of-prevec-dims>. This is mostly a debugging option. We should probably add a heuristic that derives the number of prevectorization dimensions from the target data and the data types used. llvm-svn: 245424	2015-08-19 08:46:11 +00:00
Tobias Grosser	161c9081e5	Do not use negative option name Instead of -polly-no-tiling, we use -polly-tiling=false to disable tiling. llvm-svn: 245423	2015-08-19 08:22:06 +00:00
Michael Kruse	d568a3e20d	Update test case multidim_indirect_access.ll This test was written to check the workings of IndependentBlocks on arrays which doesn't do such transformations anymore. The test itself is still useful to check that the region is rejected as SCoP. llvm-svn: 245353	2015-08-18 21:08:41 +00:00
Roman Gareev	f2bd72e00d	Use isl_set_is_subset instead of isl_set_is_equal It helps to detect correct strides in case of parametric constraints of Stride in MemoryAccess::isStrideX. Reviewers: grosser llvm-svn: 245303	2015-08-18 16:12:05 +00:00
Tobias Grosser	c0f8452592	Fix test cases which fail due to changes in isl's set representation llvm-svn: 245301	2015-08-18 15:28:02 +00:00
Roman Gareev	079968e4cf	test commit revert test commit revert llvm-svn: 245299	2015-08-18 14:56:50 +00:00
Roman Gareev	6753df4bb6	test commit test commit llvm-svn: 245298	2015-08-18 14:54:27 +00:00
Michael Kruse	d2b0360197	Fix Codegen adding a second exit out of region executeScopConditionally would destroy a predecessor region if it the scop's entry was the region's exit block by forking it to polly.start and thus creating a secnd exit out of the region. This patch "shrinks" the predecessor region s.t. polly.split_new_and_old is not the region's exit anymore. llvm-svn: 245294	2015-08-18 13:14:42 +00:00
Johannes Doerfert	e69e1141d9	Introduce the ScopExpander as a SCEVExpander replacement The SCEVExpander cannot deal with all SCEVs Polly allows in all kinds of expressions. To this end we introduce a ScopExpander that handles the additional expressions separatly and falls back to the SCEVExpander for everything else. Reviewers: grosser, Meinersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12066 llvm-svn: 245288	2015-08-18 11:56:00 +00:00
Johannes Doerfert	e1fa6da356	[FIX] Create location if a needed value was not yet demoted This allows the code generation to continue working even if a needed value (that is reloaded anyway) was not yet demoted. Instead of failing it will now create the location for future demotion to memory and load from that location. The stores will use the same location and by construction execute before the load even if the textual order in the generated AST is otherwise. Reviewers: grosser, Meinersbur Subscribers: #polly Differential Revision: http://reviews.llvm.org/D12072 llvm-svn: 245203	2015-08-17 09:38:46 +00:00
Tobias Grosser	3278b7cd7c	Add 2nd test case for sdiv/srem instructions in a SCEV llvm-svn: 245186	2015-08-16 19:53:21 +00:00
Johannes Doerfert	eca5282dd0	[FIX] Add XFAIL to crashing test case llvm-svn: 245180	2015-08-16 14:54:16 +00:00
Johannes Doerfert	45545ff782	Build the ScopStmt domain in-place. This will build the statement domains in-place, hence using the ScopStmt::Domain member instead of some intermediate isl_set. llvm-svn: 245179	2015-08-16 14:36:01 +00:00
Johannes Doerfert	c594dc9ed0	Add a crashing test case for the scalar code generation This test case crashes the scalar code generation as we are not consistent with the usage of the assumed context. To be precise, we use the assumed context for the dependence analysis but not to restrict the domains of the statements. A step by step explanation of the problem is given in the test case. llvm-svn: 245176	2015-08-16 11:12:22 +00:00
Tobias Grosser	8a9c2353f9	Add -polly-context option to provide additional context information This option allows the user to provide additional information about parameter values as an isl_set. To specify that N has the value 1024, we can provide the context -polly-context='[N] -> {: N = 1024}'. llvm-svn: 245175	2015-08-16 10:19:29 +00:00
Tobias Grosser	bccd1b0af0	Fix test case after recent LLVM changes llvm-svn: 244954	2015-08-13 21:08:15 +00:00
Tobias Grosser	7e584168ab	Manuallt simplify test case llvm-svn: 244907	2015-08-13 16:33:32 +00:00
Michael Kruse	2da3872a99	Add test case for SCEV synthesizing CodeGenerator currently tries to generate code for a parameter using values values that are computed later. llvm-svn: 244903	2015-08-13 15:53:53 +00:00
Tobias Grosser	0164b8ff70	Enable code generation of scalar dependences from function arguments This change extends the BlockGenerator to not only allow Instructions as base elements of scalar dependences, but any llvm::Value. This allows us to code-generate scalar dependences which reference function arguments, as they arise when moddeling read-only scalar dependences. llvm-svn: 244874	2015-08-13 08:07:39 +00:00
Tobias Grosser	d46fd5ed95	Make the dimension sizes of in ScopArrayInfo available as isl_pw_affs This makes it easier to reason about the size of an array dimension with isl. llvm-svn: 244757	2015-08-12 15:27:16 +00:00
Tobias Grosser	a77cea49d1	Always model PHI nodes in scop (if not in same nonaffine subregion) Before we only modeled PHI nodes if at least one incoming basic block was itself part of the region, now we always model them except if all of their operands are part of a single non-affine subregion which we model as a black-box. This change only affects PHI nodes in the entry block, that have exactly one incoming edge. Before this change, we did not model them and as a result code generation would not know how to code generate them. With this change, code generation can code generate them like any other PHI node. This issue was exposed by r244606. Before this change simplifyRegion would have moved these PHI nodes out of the SCoP, so we would never have tried to code generate them. We could implement this behavior again, but changing the IR after the scop has been modeled and transformed always adds a risk of us invalidating earlier analysis results. It seems more save and overall also more consistent to just model and handle this one-entry-edge PHI nodes like any other PHI node in the scop. Solution proposed by: Michael Kruse <llvm@meinersbur.de> llvm-svn: 244721	2015-08-12 07:48:54 +00:00
Michael Kruse	fba24b3775	Add another test case with trival PHI in entry BB This one was extracted from the test-suite's pifft and caused a miscompilation because a scalar was not written to its alloca address. llvm-svn: 244720	2015-08-12 07:34:55 +00:00
Michael Kruse	4f9caf2b28	Add test case for entry node with trivial PHI This is a break-down from the test-suite's oggenc where Polly currently crashes. llvm-svn: 244692	2015-08-11 23:09:19 +00:00
Michael Kruse	22370884c4	Revise the simplification of regions The previous code had several problems: For newly created BasicBlocks it did not (always) call RegionInfo::setRegionFor in order to update its analysis. At the moment RegionInfo does not verify its BBMap, but will in the future. This is fixed by determining the region new BBs belong to and set it accordingly. The new executeScopConditionally() requires accurate getRegionFor information. Which block is created by SplitEdge depends on the incoming and outgoing edges of the blocks it connects, which makes handling its output more difficult than it needs to be. Especially for finding which block has been created an to assign a region to it for the setRegionFor problem above. This patch uses an implementation for splitEdge that always creates a block between the predecessor and successor. simplifyRegion has also been simplified by using SplitBlockPredecessors instead of SplitEdge. Isolating the entries and exits have been refectored into individual functions. Previously simplifyRegion did more than just ensuring that there is only one entering and one exiting edge. It ensured that the entering block had no other outgoing edge which was necessary for executeScopConditionally(). Now the latter uses the alternative splitEdge implementation which can handle this situation so simplifyRegion really only needs to simplify the region. Also, executeScopConditionally assumed that there can be no PHI nodes in blocks with one incoming edge. This is wrong and LCSSA deliberately produces such edges. However, previous passes ensured that there can be no such PHIs in exit nodes, but which will no longer hold in the future. The new code that the property that it preserves the identity of region block (the property that the memory address of the BasicBlock containing the instructions remains the same; new blocks only contain PHI nodes and a terminator), especially the entry block. As a result, there is no need to update the reference to the BasicBlock of ScopStmt that contain its instructions because they have been moved to other basic blocks. Reviewers: grosser Part of Differential Revision: http://reviews.llvm.org/D11867 llvm-svn: 244606	2015-08-11 14:39:21 +00:00
Tobias Grosser	6e3ba33b07	Update isl to isl-0.15-117-ge42acfe Besides other changes this version of isl contains a fundamental fix to memory corruption issues we have seen with imath-32 backed isl_ints. This update also contains a fix that ensures that the schedule-tree based version of isl's dependence analysis takes the domain of the schedule into account. llvm-svn: 244585	2015-08-11 11:31:18 +00:00
Michael Kruse	874b5c2197	Correct non-existing past participle of split in filename llvm-svn: 244478	2015-08-10 18:37:34 +00:00
Michael Kruse	4adb0279a8	Add test case with PHI node in exit block The PHI node with multiple incoming edges from inside the region. Thanks Tobias for coming up with the example. llvm-svn: 244456	2015-08-10 16:17:59 +00:00
Tobias Grosser	dcc3b435ab	Optionally model read-only scalars Even though read-only accesses to scalars outside of a scop do not need to be modeled to derive valid transformations or to generate valid sequential code, but information about them is useful when we considering memory footprint analysis and/or kernel offloading. llvm-svn: 243981	2015-08-04 13:54:20 +00:00
Duncan P. N. Exon Smith	20b50f2b2a	Update testcases after LLVM r243885 llvm-svn: 243887	2015-08-03 17:28:43 +00:00
Tobias Grosser	6213913244	Use the branch instruction to define the location of a PHI-node write We use the branch instruction as the location at which a PHI-node write takes place, instead of the PHI-node itself. This allows us to identify the basic-block in a region statement which is on the incoming edge of the PHI-node and for which the write access was originally introduced. As a result we can, during code generation, avoid generating PHI-node write accesses for basic blocks that do not preceed the PHI node without having to look at the IR again. This change fixes a bug which was introduced in r243420, when we started to explicitly model PHI-node reads and writes, but dropped some additional checks that where still necessary during code generation to not emit PHI-node writes for basic-blocks that are not on incoming edges of the original PHI node. Compared to the code before r243420 the new code does not need to inspect the IR any more and we also do not generate multiple redundant writes. llvm-svn: 243852	2015-08-02 16:17:41 +00:00
Tobias Grosser	d2d15a8c65	Dependences: Zero pad the schedule map The schedule map we derive from a schedule tree map may map statements into schedule spaces of different dimensionality. This change adds zero padding to ensure just a single schedule space is used and the translation from a union_map to an isl_multi_union_pw_aff does not fail. llvm-svn: 243849	2015-08-02 13:30:33 +00:00
Tobias Grosser	45e7944bcf	Only use instructions as insert locations for SCEVExpander SCEVExpander, which we are using during code generation, only allows instructions as insert locations, but breaks in case BasicBlock->end() iterators are passed to it due to it trying to obtain the basic block in which code should be generated by calling Instruction->getParent(), which is not defined for ->end() iterators. This change adds an assert to Polly that ensures we only pass valid instructions to SCEVExpander and it fixes one case, where we used IRBuilder->SetInsertBlock() to set an ->end() insert location which was later passed to SCEVExpander. In general, Polly is always trying to build up the CFG first, before we actually insert instructions into the CFG sceleton. As a result, each basic block should already have at least one branch instruction before we start adding code. Hence, always requiring the IRBuilder insert location to be set to a real instruction should always be possible. Thanks Utpal Bora <cs14mtech11017@iith.ac.in> for his help with test case reduction. llvm-svn: 243830	2015-08-01 09:07:57 +00:00
Duncan P. N. Exon Smith	c51714a0c6	Fix polly tests after LLVM IR change in r243774 llvm-svn: 243801	2015-07-31 23:58:50 +00:00
Tobias Grosser	80e237bd53	Do not detect scops that are delinearized to arrays with "undef" size Such codes are not interesting to optimize and most likely never appear in the normal compilation flow. However, they show up during test case reduction with bugpoint and trigger -- without this change -- an assert in polly::MemoryAccess::foldAccess(). It is better to detect them in ScopDetection itself and just bail out. Contributed-by: Utpal Bora <cs14mtech11017@iith.ac.in> Reviewers: grosser Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D11425 llvm-svn: 243515	2015-07-29 13:52:05 +00:00
Tobias Grosser	b241d928bd	Rewrite getPrevectorMap using schedule trees operations Schedule trees are a lot easier to work with, for both humans and machines. For humans the more structured schedule representation is easier to reason about. Together with the more abstract isl programming interface this can result in a lot cleaner code (see this changeset). For machines, the structured schedule and the fact that we now use explicit piecewise affine expressions instead of integer maps makes it easier to generate code from this schedule tree. As a result, we can already see a slight compile-time improvement -- for 3mm from 0m0.593s to 0m0.551s seconds (-7 %). More importantly, future optimizations such as full-partial tile separation will most likely result in more streamlined code to be generated. Contributed-by: Roman Gareev <gareevroman@gmail.com> llvm-svn: 243458	2015-07-28 18:03:36 +00:00
Tobias Grosser	922452285a	Keep track of ScopArrayInfo objects that model PHI node storage Summary: When translating PHI nodes into memory dependences during code generation we require two kinds of memory. 'Normal memory' as for all scalar dependences and 'PHI node memory' to store the incoming values of the PHI node. With this patch we now mark and track these two kinds of memories, which we previously incorrectly marked as a single memory object. Being aware of PHI node storage makes code generation easier, as we do not need to guess what kind of storage a scalar reference requires. This simplifies the code nicely. Reviewers: jdoerfert Subscribers: pollydev, llvm-commits Differential Revision: http://reviews.llvm.org/D11554 llvm-svn: 243420	2015-07-28 14:53:44 +00:00
Tobias Grosser	3b10c94062	Prevectorize the schedule of the band (or the point loop in case of tiling) Contributed-by: Roman Gareev <gareevroman@gmail.com> llvm-svn: 243214	2015-07-25 12:28:56 +00:00
Michael Kruse	be16d22f04	Normalize whitespace in makefiles Tabs for rules and space for line continuations. llvm-svn: 243179	2015-07-24 23:30:31 +00:00
Michael Kruse	1bbe346cef	Make the lit configuration Python 3 compatible by using the same techniques as LLVM's lit configuration. llvm-svn: 243154	2015-07-24 20:33:22 +00:00
Michael Kruse	5e9f249c3e	Add LICM test cases These test cases check whether Polly still gives the same results if LICM runs before. Currently, it does not and therefore these cases are expected fails. llvm-svn: 243037	2015-07-23 20:05:11 +00:00
Johannes Doerfert	338b42c329	Removed redundant alias checks generated during run time. As specified in PR23888, run-time alias check generation is expensive in terms of compile-time. This reduces the compile time by computing minimal/maximal access only once for each base pointer Contributed-by: Pratik Bhatu <cs12b1010@iith.ac.in> llvm-svn: 243024	2015-07-23 17:04:54 +00:00
Michael Kruse	6362f5aa0b	Unify FOLDER property of Polly targets Put all Polly targets into a single "Polly" category (i.e. solution folder). Previously there was no recognizable scheme and most categories contained just one or two targets or targets didn't belong to any category. Reviewers: grosser llvm-svn: 242779	2015-07-21 12:40:01 +00:00
Tobias Grosser	808cd69a92	Use schedule trees to represent execution order of statements Instead of flat schedules, we now use so-called schedule trees to represent the execution order of the statements in a SCoP. Schedule trees make it a lot easier to analyze, understand and modify properties of a schedule, as specific nodes in the tree can be choosen and possibly replaced. This patch does not yet fully move our DependenceInfo pass to schedule trees, as some additional performance analysis is needed here. (In general schedule trees should be faster in compile-time, as the more structured representation is generally easier to analyze and work with). We also can not yet perform the reduction analysis on schedule trees. For more information regarding schedule trees, please see Section 6 of https://lirias.kuleuven.be/handle/123456789/497238 llvm-svn: 242130	2015-07-14 09:33:13 +00:00
Tobias Grosser	16c4403a91	Make non-affine statement names isl compatible Named isl sets can generally have any name if they remain within Polly, but only certain strings can be parsed by isl. The new names we create ensure that we can always copy-past isl strings from Polly to other isl tools, e.g. for debugging. llvm-svn: 241787	2015-07-09 07:31:45 +00:00
Tobias Grosser	1b13ddea50	Add first support to delinearize A[t%2][i][j] This is very preliminary support, but it seems to work for the most common case. When observing more/different test cases, we can work on generalizing this. llvm-svn: 240955	2015-06-29 14:44:22 +00:00
Tobias Grosser	af4e809ca6	Remove code for scalar and PHI to array translation This removes old code that has been disabled since several weeks and was hidden behind the flags -disable-polly-intra-scop-scalar-to-array=false and -polly-model-phi-nodes=false. Earlier, Polly used to translate scalars and PHI nodes to single element arrays, as this avoided the need for their special handling in Polly. With Johannes' patches adding native support for such scalar references to Polly, this code is not needed any more. After this commit both -polly-prepare and -polly-independent are now mostly no-ops. Only a couple of simple transformations still remain, but they are scheduled for removal too. Thanks again to Johannes Doerfert for his nice work in making all this code obsolete. llvm-svn: 240766	2015-06-26 07:31:18 +00:00
Tobias Grosser	50165ffdee	Add support for srem instruction Remainder operations with constant divisor can be modeled as quasi-affine expression. This patch adds support for detecting and modeling them. We also add a test that ensures they are correctly code generated. This patch was extracted from a larger patch contributed by Johannes Doerfert in http://reviews.llvm.org/D5293 llvm-svn: 240518	2015-06-24 04:13:29 +00:00
Tobias Grosser	a608569856	Replace srem by function call in nonaffine test cases This makes the test cases nonaffine even if Polly some days gains support for the srem instruction, an instruction which is currently not modeled but which can clearly be modeled statically. A call to a function without definition will always remain non-affine, as there is just insufficient static information for it to be modeled more precisely. llvm-svn: 240458	2015-06-23 20:55:05 +00:00
Tobias Grosser	aa9f575ae1	Adjust to personality function change in 239940 llvm-svn: 239992	2015-06-18 05:02:11 +00:00
Tobias Grosser	8199c722c7	Disable output for test case that does not need output llvm-svn: 239060	2015-06-04 17:59:51 +00:00

1 2 3 4 5 ...

465 Commits