llvm/test/CodeGen/X86/pr26350.ll
Sanjay Patel cc9614d291 [x86] don't blindly transform SETB into SBB
I noticed unnecessary 'sbb' instructions in D30472 and while looking at 'ptest' codegen recently. 
This happens because we were transforming any 'setb' - even when we only wanted a single-bit result.

This patch moves those transforms under visitAdd/visitSub, so we're only creating sbb/adc when it
is a win. I don't know why we need a SETCC_CARRY node type, but I'm not proposing to change that
existing behavior in this patch.

Also, I'm skeptical that sbb/adc are a win for all micro-arches, so I added comments to the test files
where this transform still fires.

The test changes here are all cases where we no longer produce sbb/adc. Avoiding partial register
stalls (generating an xor to clear a register) is not handled in some cases, but that's a separate
issue.

Differential Revision: https://reviews.llvm.org/D30611


git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@297586 91177308-0d34-0410-b5e6-96231b3b80d8
2017-03-12 18:28:48 +00:00


; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
; RUN: llc -disable-constant-hoisting < %s | FileCheck %s
target datalayout = "e-m:e-p:32:32-f64:32:64-f80:32-n8:16:32-S128"
target triple = "i386-unknown-linux-gnu"
@d = global i32 8, align 4
define i32 @main() {
; CHECK-LABEL: main:
; CHECK: # BB#0: # %entry
; CHECK-NEXT: movl d, %eax
; CHECK-NEXT: movl %eax, %ecx
; CHECK-NEXT: shrl $31, %ecx
; CHECK-NEXT: addl %eax, %eax
; CHECK-NEXT: andl $16, %eax
; CHECK-NEXT: cmpl $-1, %eax
; CHECK-NEXT: sbbl $0, %ecx
; CHECK-NEXT: setb %al
; CHECK-NEXT: movzbl %al, %eax
; CHECK-NEXT: retl
entry:
  %load = load i32, i32* @d, align 4
  %conv1 = zext i32 %load to i64
  %shl = shl i64 %conv1, 1
  %mul = and i64 %shl, 4294967312
  %cmp = icmp ugt i64 4294967295, %mul
  %zext = zext i1 %cmp to i32
  ret i32 %zext
}
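
The CHECK lines above lower a 64-bit unsigned compare on a 32-bit target: cmpl/sbbl propagate a borrow through the low and high halves, and setb/movzbl then materialize the carry flag as a single-bit result, with no sbb used to produce the boolean. The following is a minimal C sketch of that arithmetic, not the compiler's implementation. The function names (reference, lowered, check_agreement) are hypothetical and chosen only for illustration, and the carry flag is modeled as a plain integer.

```c
#include <assert.h>
#include <stdint.h>

/* What the IR computes: 4294967295 >u ((zext(d) << 1) & 4294967312). */
static uint32_t reference(uint32_t d) {
    uint64_t mul = ((uint64_t)d << 1) & 4294967312ULL;
    return 4294967295ULL > mul;
}

/* Model of the expected 32-bit instruction sequence (hypothetical helper). */
static uint32_t lowered(uint32_t d) {
    uint32_t hi = d >> 31;           /* shrl $31, %ecx: high half of the shifted value */
    uint32_t lo = (d + d) & 16u;     /* addl %eax, %eax; andl $16, %eax: low half */
    uint32_t cf = lo < 0xFFFFFFFFu;  /* cmpl $-1, %eax: CF = borrow of lo - 0xFFFFFFFF */
    cf = hi < cf;                    /* sbbl $0, %ecx: CF = borrow of hi - 0 - CF */
    return cf;                       /* setb %al; movzbl %al, %eax */
}

/* Compare both versions on a few inputs, including @d's initial value 8. */
static void check_agreement(void) {
    const uint32_t inputs[] = {0u, 8u, 0x7FFFFFFFu, 0x80000000u,
                               0x80000008u, 0xFFFFFFFFu};
    for (unsigned i = 0; i < sizeof inputs / sizeof inputs[0]; ++i)
        assert(reference(inputs[i]) == lowered(inputs[i]));
}
```

Under this model, the initial value of @d (8) gives a result of 1, while any input with bit 31 set gives 0, because the high half of the masked value is then nonzero.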