mirror of
https://github.com/RPCS3/llvm.git
synced 2025-01-21 01:06:46 +00:00
8f85685860
OpenCL 2.0 introduces the notion of memory scopes in atomic operations to global and local memory. These scopes restrict how synchronization is achieved, which can result in improved performance. This change extends existing notion of synchronization scopes in LLVM to support arbitrary scopes expressed as target-specific strings, in addition to the already defined scopes (single thread, system). The LLVM IR and MIR syntax for expressing synchronization scopes has changed to use *syncscope("<scope>")*, where <scope> can be "singlethread" (this replaces *singlethread* keyword), or a target-specific name. As before, if the scope is not specified, it defaults to CrossThread/System scope. Implementation details: - Mapping from synchronization scope name/string to synchronization scope id is stored in LLVM context; - CrossThread/System and SingleThread scopes are pre-defined to efficiently check for known scopes without comparing strings; - Synchronization scope names are stored in SYNC_SCOPE_NAMES_BLOCK in the bitcode. Differential Revision: https://reviews.llvm.org/D21723 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@307722 91177308-0d34-0410-b5e6-96231b3b80d8
18 lines
895 B
LLVM
18 lines
895 B
LLVM
; RUN: llvm-dis -o - %s.bc | FileCheck %s
|
|
|
|
; Backwards compatibility test: make sure we can process bitcode without
|
|
; synchronization scope names encoded in it.
|
|
|
|
; CHECK: load atomic i32, i32* %x unordered, align 4
|
|
; CHECK: load atomic volatile i32, i32* %x syncscope("singlethread") acquire, align 4
|
|
; CHECK: store atomic i32 3, i32* %x release, align 4
|
|
; CHECK: store atomic volatile i32 3, i32* %x syncscope("singlethread") monotonic, align 4
|
|
; CHECK: cmpxchg i32* %x, i32 1, i32 0 syncscope("singlethread") monotonic monotonic
|
|
; CHECK: cmpxchg volatile i32* %x, i32 0, i32 1 acq_rel acquire
|
|
; CHECK: cmpxchg i32* %x, i32 42, i32 0 acq_rel monotonic
|
|
; CHECK: cmpxchg weak i32* %x, i32 13, i32 0 seq_cst monotonic
|
|
; CHECK: atomicrmw add i32* %x, i32 10 seq_cst
|
|
; CHECK: atomicrmw volatile xchg i32* %x, i32 10 monotonic
|
|
; CHECK: fence syncscope("singlethread") release
|
|
; CHECK: fence seq_cst
|