mirror of
https://github.com/RPCS3/llvm.git
synced 2024-12-30 16:34:03 +00:00
4177e6fff5
This was done through the aid of a terrible Perl creation. I will not paste any of the horrors here. Suffice to say, it required multiple staged rounds of replacements, state carried between, and a few nested-construct-parsing hacks that I'm not proud of. It happens, by luck, to be able to deal with all the TCL-quoting patterns in evidence in the LLVM test suite.

If anyone is maintaining large out-of-tree test trees, feel free to poke me and I'll send you the steps I used to convert things, as well as answer any painful questions etc. IRC works best for this type of thing I find.

Once converted, switch the LLVM lit config to use ShTests the same as Clang.

In addition to being able to delete large amounts of Python code from 'lit', this will also simplify the entire test suite and some of lit's architecture.

Finally, the test suite runs 33% faster on Linux now. ;]
For my 16-hardware-thread (2x 4-core xeon e5520): 36s -> 24s

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@159525 91177308-0d34-0410-b5e6-96231b3b80d8
30 lines
1.2 KiB
LLVM
; RUN: opt < %s -instcombine -S | grep "align 32" | count 1

; It's tempting to have an instcombine in which the src pointer of a
; memcpy is aligned up to the alignment of the destination, however
; there are pitfalls. If the src is an alloca, aligning it beyond what
; the target's stack pointer is aligned at will require dynamic
; stack realignment, which can require functions that don't otherwise
; need a frame pointer to need one.
;
; Abstaining from this transform is not the only way to approach this
; issue. Some late phase could be smart enough to reduce alloca
; alignments when they are greater than they need to be. Or, codegen
; could do dynamic alignment for just the one alloca, and leave the
; main stack pointer at its standard alignment.

@dst = global [1024 x i8] zeroinitializer, align 32

define void @foo() nounwind {
entry:
  %src = alloca [1024 x i8], align 1
  %src1 = getelementptr [1024 x i8]* %src, i32 0, i32 0
  call void @llvm.memcpy.p0i8.p0i8.i32(i8* getelementptr inbounds ([1024 x i8]* @dst, i32 0, i32 0), i8* %src1, i32 1024, i32 1, i1 false)
  call void @frob(i8* %src1) nounwind
  ret void
}

declare void @frob(i8*)

declare void @llvm.memcpy.p0i8.p0i8.i32(i8* nocapture, i8* nocapture, i32, i32, i1) nounwind