llvm/lib at 65b3f67e1c49825f0c24c3c2240a23e03ddbfa4a - llvm

RPCSX/llvm

mirror of https://github.com/RPCSX/llvm.git synced 2024-11-27 21:50:40 +00:00

History

Dorit Nuzman 65b3f67e1c [AVX2] [TTI CostModel] Add cost of interleaved loads/stores for AVX2 The cost of an interleaved access was only implemented for AVX512. For other X86 targets an overly conservative Base cost was returned, resulting in avoiding vectorization where it is actually profitable to vectorize. This patch starts to add costs for AVX2 for most prominent cases of interleaved accesses (stride 3,4 chars, for now). Note1: Improvements of up to ~4x were observed in some of EEMBC's rgb workloads; There is also a known issue of 15-30% degradations on some of these workloads, associated with an interleaved access followed by type promotion/widening; the resulting shuffle sequence is currently inefficient and will be improved by a series of patches that extend the X86InterleavedAccess pass (such as D34601 and more to follow). Note 2: The costs in this patch do not reflect port pressure penalties which can be very dominant in the case of interleaved accesses since most of the shuffle operations are restricted to a single port. Further tuning, that may incorporate these considerations, will be done on top of the upcoming improved shuffle sequences (that is, along with the abovementioned work to extend X86InterleavedAccess pass). Differential Revision: https://reviews.llvm.org/D34023 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@306238 91177308-0d34-0410-b5e6-96231b3b80d8		2017-06-25 08:26:25 +00:00
..
Analysis	[SCEV] Avoid copying ConstantRange just to get the min/max value	2017-06-24 23:34:50 +00:00
AsmParser	Add dependency from AsmParser to BinaryFormat.	2017-06-07 04:24:33 +00:00
BinaryFormat	Move Object format code to lib/BinaryFormat.	2017-06-07 03:48:56 +00:00
Bitcode	Fix build warning on 32-bit targets where sizeof(size_t) < sizeof(long long).	2017-06-16 00:32:11 +00:00
CodeGen	[SelectionDAG] set dereferenceable flag when expanding memcpy/memmove	2017-06-24 15:17:38 +00:00
DebugInfo	[pdb] Fix reading of llvm-generated PDBs by cvdump.	2017-06-25 03:51:42 +00:00
Demangle	[ItaniumDemangle] Fix a exponential string copying bug	2017-05-28 23:24:52 +00:00
ExecutionEngine	[ORC] Re-apply r306166 and r306168 with fix for regression test.	2017-06-23 23:25:28 +00:00
Fuzzer	[libFuzzer] change the default max_len from 64 to 4096. This will affect cases where libFuzzer is run w/o initial corpus or with a corpus of very small items.	2017-06-15 22:43:40 +00:00
IR	Restrict the definition of loop preheader to avoid EH blocks	2017-06-22 23:27:16 +00:00
IRReader
LineEditor
Linker	Support for taking the max of module flags when linking, use for PIE/PIC	2017-05-23 00:08:00 +00:00
LTO	[ThinLTO] Remove unnecessary include of Linker.h (NFC)	2017-06-22 16:18:48 +00:00
MC	Simplify the processFixupValue interface. NFC.	2017-06-24 06:00:03 +00:00
Object	Updated llvm-objdump symbolic disassembly with x86_64 Mach-O MH_KEXT_BUNDLE	2017-06-22 17:41:22 +00:00
ObjectYAML	[codeview] respect signedness of APSInts when printing to YAML	2017-06-21 22:31:52 +00:00
Option	[GSoC] Flag value completion for clang	2017-06-20 16:31:31 +00:00
Passes	[PGO] Implementate profile counter regiser promotion	2017-06-25 00:26:43 +00:00
ProfileData	[ProfileData, Support] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC).	2017-06-21 23:19:47 +00:00
Support	Add support for Ananas platform	2017-06-25 08:19:37 +00:00
TableGen	[BinaryFormat, Option, TableGen] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC).	2017-06-16 00:43:26 +00:00
Target	[AVX2] [TTI CostModel] Add cost of interleaved loads/stores for AVX2	2017-06-25 08:26:25 +00:00
Testing	Mark LLVMTestingSupport as not installed in LLVMBuild.	2017-06-19 22:01:50 +00:00
ToolDrivers	[GSoC] Flag value completion for clang	2017-06-20 16:31:31 +00:00
Transforms	[PGO] Implementate profile counter regiser promotion	2017-06-25 00:26:43 +00:00
XRay	Sort the remaining #include lines in include/... and lib/....	2017-06-06 11:49:48 +00:00
CMakeLists.txt	[gtest] Create a shared include directory for gtest utilities.	2017-06-14 16:41:50 +00:00
LLVMBuild.txt	[gtest] Create a shared include directory for gtest utilities.	2017-06-14 16:41:50 +00:00