llvm/lib at 9819bcf7f1a3d985fc1283e76280d8d730854f9d - llvm

RPCS3/llvm

mirror of https://github.com/RPCS3/llvm.git synced 2024-12-23 12:40:17 +00:00

History

Hal Finkel 9819bcf7f1 Treat the WorkSet used to find ephemeral values as double-ended We need to make sure that we visit all operands of an instruction before moving deeper in the operand graph. We had been pushing operands onto the back of the work set, and popping them off the back as well, meaning that we might visit an instruction before visiting all of its uses that sit in between it and the call to @llvm.assume. To provide an explicit example, given the following: %q0 = extractelement <4 x float> %rd, i32 0 %q1 = extractelement <4 x float> %rd, i32 1 %q2 = extractelement <4 x float> %rd, i32 2 %q3 = extractelement <4 x float> %rd, i32 3 %q4 = fadd float %q0, %q1 %q5 = fadd float %q2, %q3 %q6 = fadd float %q4, %q5 %qi = fcmp olt float %q6, %q5 call void @llvm.assume(i1 %qi) %q5 is used by both %qi and %q6. When we visit %qi, it will be marked as ephemeral, and we'll queue %q6 and %q5. %q6 will be marked as ephemeral and we'll queue %q4 and %q5. Under the old system, we'd then visit %q4, which would become ephemeral, %q1 and then %q0, which would become ephemeral as well, and now we have a problem. We'd visit %rd, but it would not be marked as ephemeral because we've not yet visited %q2 and %q3 (because we've not yet visited %q5). This will be covered by a test case in a follow-up commit that enables ephemeral-value awareness in the SLP vectorizer. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@219815 91177308-0d34-0410-b5e6-96231b3b80d8		2014-10-15 17:34:48 +00:00
..
Analysis	Treat the WorkSet used to find ephemeral values as double-ended	2014-10-15 17:34:48 +00:00
AsmParser	Make CallingConv::ID an alias of "unsigned".	2014-09-10 18:00:17 +00:00
Bitcode	Use 'auto' for easier reading; no functional change intended.	2014-10-15 16:21:37 +00:00
CodeGen	Simplify handling of --noexecstack by using getNonexecutableStackSection.	2014-10-15 16:12:52 +00:00
DebugInfo	Add couple of missing 'override' keyword. NFC.	2014-10-10 17:34:30 +00:00
ExecutionEngine	[MCJIT] Replace memcpy with readBytesUnaligned in RuntimeDyldMachOI386.	2014-10-10 23:07:09 +00:00
IR	DI: Make comments "brief"-er, NFC	2014-10-15 17:01:28 +00:00
IRReader	Pass a && to getLazyBitcodeModule.	2014-09-03 17:31:46 +00:00
LineEditor	[CMake] Use LINK_LIBS instead of target_link_libraries().	2014-02-26 06:41:29 +00:00
Linker	Merge alignment of common GlobalValue.	2014-09-09 17:48:18 +00:00
LTO	LTO: Document the Boolean argument from r218784	2014-10-02 21:11:04 +00:00
MC	[MC] Make bundle alignment mode setting idempotent and support nested bundles	2014-10-15 17:10:04 +00:00
Object	Object, COFF: Move the VirtualSize/SizeOfRawData logic to getSectionSize	2014-10-09 08:42:31 +00:00
Option	Add an overload of getLastArgNoClaim taking two OptSpecifiers.	2014-09-12 19:42:53 +00:00
ProfileData	Reduce double set lookups. NFC.	2014-10-10 15:32:50 +00:00
Support	Revert "r216914 - Revert: [APFloat] Fixed a bug in method 'fusedMultiplyAdd'"	2014-10-14 19:23:07 +00:00
TableGen	Eliminate some deep std::vector copies. NFC.	2014-10-03 18:33:16 +00:00
Target	Simplify handling of --noexecstack by using getNonexecutableStackSection.	2014-10-15 16:12:52 +00:00
Transforms	No need to cache this unused variable.	2014-10-14 23:58:51 +00:00
CMakeLists.txt	ProfileData: Introduce the InstrProfReader interface and a text reader	2014-03-21 17:24:48 +00:00
LLVMBuild.txt	ProfileData: Introduce the InstrProfReader interface and a text reader	2014-03-21 17:24:48 +00:00
Makefile	ProfileData: Introduce the InstrProfReader interface and a text reader	2014-03-21 17:24:48 +00:00