mirror of
https://github.com/capstone-engine/llvm-capstone.git
synced 2025-01-07 16:42:34 +00:00
b420ef75fb
document llvm-svn: 170762
362 lines
13 KiB
ReStructuredText
362 lines
13 KiB
ReStructuredText
|
|
.. role:: block-term
|
|
|
|
=================================
|
|
Language Specification for Blocks
|
|
=================================
|
|
|
|
.. contents::
|
|
:local:
|
|
|
|
Revisions
|
|
=========
|
|
|
|
- 2008/2/25 --- created
|
|
- 2008/7/28 --- revised, ``__block`` syntax
|
|
- 2008/8/13 --- revised, Block globals
|
|
- 2008/8/21 --- revised, C++ elaboration
|
|
- 2008/11/1 --- revised, ``__weak`` support
|
|
- 2009/1/12 --- revised, explicit return types
|
|
- 2009/2/10 --- revised, ``__block`` objects need retain
|
|
|
|
Overview
|
|
========
|
|
|
|
A new derived type is introduced to C and, by extension, Objective-C,
|
|
C++, and Objective-C++
|
|
|
|
The Block Type
|
|
==============
|
|
|
|
Like function types, the :block-term:`Block type` is a pair consisting
|
|
of a result value type and a list of parameter types very similar to a
|
|
function type. Blocks are intended to be used much like functions with
|
|
the key distinction being that in addition to executable code they
|
|
also contain various variable bindings to automatic (stack) or managed
|
|
(heap) memory.
|
|
|
|
The abstract declarator,
|
|
|
|
.. code-block:: c
|
|
|
|
int (^)(char, float)
|
|
|
|
describes a reference to a Block that, when invoked, takes two
|
|
parameters, the first of type char and the second of type float, and
|
|
returns a value of type int. The Block referenced is of opaque data
|
|
that may reside in automatic (stack) memory, global memory, or heap
|
|
memory.
|
|
|
|
Block Variable Declarations
|
|
===========================
|
|
|
|
A :block-term:`variable with Block type` is declared using function
|
|
pointer style notation substituting ``^`` for ``*``. The following are
|
|
valid Block variable declarations:
|
|
|
|
.. code-block:: c
|
|
|
|
void (^blockReturningVoidWithVoidArgument)(void);
|
|
int (^blockReturningIntWithIntAndCharArguments)(int, char);
|
|
void (^arrayOfTenBlocksReturningVoidWithIntArgument[10])(int);
|
|
|
|
Variadic ``...`` arguments are supported. [variadic.c] A Block that
|
|
takes no arguments must specify void in the argument list [voidarg.c].
|
|
An empty parameter list does not represent, as K&R provide, an
|
|
unspecified argument list. Note: both gcc and clang support K&R style
|
|
as a convenience.
|
|
|
|
A Block reference may be cast to a pointer of arbitrary type and vice
|
|
versa. [cast.c] A Block reference may not be dereferenced via the
|
|
pointer dereference operator ``*``, and thus a Block's size may not be
|
|
computed at compile time. [sizeof.c]
|
|
|
|
Block Literal Expressions
|
|
=========================
|
|
|
|
A :block-term:`Block literal expression` produces a reference to a
|
|
Block. It is introduced by the use of the ``^`` token as a unary
|
|
operator.
|
|
|
|
.. code-block:: c
|
|
|
|
Block_literal_expression ::= ^ block_decl compound_statement_body
|
|
block_decl ::=
|
|
block_decl ::= parameter_list
|
|
block_decl ::= type_expression
|
|
|
|
where type expression is extended to allow ``^`` as a Block reference
|
|
(pointer) where ``*`` is allowed as a function reference (pointer).
|
|
|
|
The following Block literal:
|
|
|
|
.. code-block:: c
|
|
|
|
^ void (void) { printf("hello world\n"); }
|
|
|
|
produces a reference to a Block with no arguments with no return value.
|
|
|
|
The return type is optional and is inferred from the return
|
|
statements. If the return statements return a value, they all must
|
|
return a value of the same type. If there is no value returned the
|
|
inferred type of the Block is void; otherwise it is the type of the
|
|
return statement value.
|
|
|
|
If the return type is omitted and the argument list is ``( void )``,
|
|
the ``( void )`` argument list may also be omitted.
|
|
|
|
So:
|
|
|
|
.. code-block:: c
|
|
|
|
^ ( void ) { printf("hello world\n"); }
|
|
|
|
and:
|
|
|
|
.. code-block:: c
|
|
|
|
^ { printf("hello world\n"); }
|
|
|
|
are exactly equivalent constructs for the same expression.
|
|
|
|
The type_expression extends C expression parsing to accommodate Block
|
|
reference declarations as it accommodates function pointer
|
|
declarations.
|
|
|
|
Given:
|
|
|
|
.. code-block:: c
|
|
|
|
typedef int (*pointerToFunctionThatReturnsIntWithCharArg)(char);
|
|
pointerToFunctionThatReturnsIntWithCharArg functionPointer;
|
|
^ pointerToFunctionThatReturnsIntWithCharArg (float x) { return functionPointer; }
|
|
|
|
and:
|
|
|
|
.. code-block:: c
|
|
|
|
^ int ((*)(float x))(char) { return functionPointer; }
|
|
|
|
are equivalent expressions, as is:
|
|
|
|
.. code-block:: c
|
|
|
|
^(float x) { return functionPointer; }
|
|
|
|
[returnfunctionptr.c]
|
|
|
|
The compound statement body establishes a new lexical scope within
|
|
that of its parent. Variables used within the scope of the compound
|
|
statement are bound to the Block in the normal manner with the
|
|
exception of those in automatic (stack) storage. Thus one may access
|
|
functions and global variables as one would expect, as well as static
|
|
local variables. [testme]
|
|
|
|
Local automatic (stack) variables referenced within the compound
|
|
statement of a Block are imported and captured by the Block as const
|
|
copies. The capture (binding) is performed at the time of the Block
|
|
literal expression evaluation.
|
|
|
|
The compiler is not required to capture a variable if it can prove
|
|
that no references to the variable will actually be evaluated.
|
|
Programmers can force a variable to be captured by referencing it in a
|
|
statement at the beginning of the Block, like so:
|
|
|
|
.. code-block:: c
|
|
|
|
(void) foo;
|
|
|
|
This matters when capturing the variable has side-effects, as it can
|
|
in Objective-C or C++.
|
|
|
|
The lifetime of variables declared in a Block is that of a function;
|
|
each activation frame contains a new copy of variables declared within
|
|
the local scope of the Block. Such variable declarations should be
|
|
allowed anywhere [testme] rather than only when C99 parsing is
|
|
requested, including for statements. [testme]
|
|
|
|
Block literal expressions may occur within Block literal expressions
|
|
(nest) and all variables captured by any nested blocks are implicitly
|
|
also captured in the scopes of their enclosing Blocks.
|
|
|
|
A Block literal expression may be used as the initialization value for
|
|
Block variables at global or local static scope.
|
|
|
|
The Invoke Operator
|
|
===================
|
|
|
|
Blocks are :block-term:`invoked` using function call syntax with a
|
|
list of expression parameters of types corresponding to the
|
|
declaration and returning a result type also according to the
|
|
declaration. Given:
|
|
|
|
.. code-block:: c
|
|
|
|
int (^x)(char);
|
|
void (^z)(void);
|
|
int (^(*y))(char) = &x;
|
|
|
|
the following are all legal Block invocations:
|
|
|
|
.. code-block:: c
|
|
|
|
x('a');
|
|
(*y)('a');
|
|
(true ? x : *y)('a')
|
|
|
|
The Copy and Release Operations
|
|
===============================
|
|
|
|
The compiler and runtime provide :block-term:`copy` and
|
|
:block-term:`release` operations for Block references that create and,
|
|
in matched use, release allocated storage for referenced Blocks.
|
|
|
|
The copy operation ``Block_copy()`` is styled as a function that takes
|
|
an arbitrary Block reference and returns a Block reference of the same
|
|
type. The release operation, ``Block_release()``, is styled as a
|
|
function that takes an arbitrary Block reference and, if dynamically
|
|
matched to a Block copy operation, allows recovery of the referenced
|
|
allocated memory.
|
|
|
|
|
|
The ``__block`` Storage Qualifier
|
|
=================================
|
|
|
|
In addition to the new Block type we also introduce a new storage
|
|
qualifier, :block-term:`__block`, for local variables. [testme: a
|
|
__block declaration within a block literal] The ``__block`` storage
|
|
qualifier is mutually exclusive to the existing local storage
|
|
qualifiers auto, register, and static. [testme] Variables qualified by
|
|
``__block`` act as if they were in allocated storage and this storage
|
|
is automatically recovered after last use of said variable. An
|
|
implementation may choose an optimization where the storage is
|
|
initially automatic and only "moved" to allocated (heap) storage upon
|
|
a Block_copy of a referencing Block. Such variables may be mutated as
|
|
normal variables are.
|
|
|
|
In the case where a ``__block`` variable is a Block one must assume
|
|
that the ``__block`` variable resides in allocated storage and as such
|
|
is assumed to reference a Block that is also in allocated storage
|
|
(that it is the result of a ``Block_copy`` operation). Despite this
|
|
there is no provision to do a ``Block_copy`` or a ``Block_release`` if
|
|
an implementation provides initial automatic storage for Blocks. This
|
|
is due to the inherent race condition of potentially several threads
|
|
trying to update the shared variable and the need for synchronization
|
|
around disposing of older values and copying new ones. Such
|
|
synchronization is beyond the scope of this language specification.
|
|
|
|
|
|
Control Flow
|
|
============
|
|
|
|
The compound statement of a Block is treated much like a function body
|
|
with respect to control flow in that goto, break, and continue do not
|
|
escape the Block. Exceptions are treated *normally* in that when
|
|
thrown they pop stack frames until a catch clause is found.
|
|
|
|
|
|
Objective-C Extensions
|
|
======================
|
|
|
|
Objective-C extends the definition of a Block reference type to be
|
|
that also of id. A variable or expression of Block type may be
|
|
messaged or used as a parameter wherever an id may be. The converse is
|
|
also true. Block references may thus appear as properties and are
|
|
subject to the assign, retain, and copy attribute logic that is
|
|
reserved for objects.
|
|
|
|
All Blocks are constructed to be Objective-C objects regardless of
|
|
whether the Objective-C runtime is operational in the program or
|
|
not. Blocks using automatic (stack) memory are objects and may be
|
|
messaged, although they may not be assigned into ``__weak`` locations
|
|
if garbage collection is enabled.
|
|
|
|
Within a Block literal expression within a method definition
|
|
references to instance variables are also imported into the lexical
|
|
scope of the compound statement. These variables are implicitly
|
|
qualified as references from self, and so self is imported as a const
|
|
copy. The net effect is that instance variables can be mutated.
|
|
|
|
The :block-term:`Block_copy` operator retains all objects held in
|
|
variables of automatic storage referenced within the Block expression
|
|
(or form strong references if running under garbage collection).
|
|
Object variables of ``__block`` storage type are assumed to hold
|
|
normal pointers with no provision for retain and release messages.
|
|
|
|
Foundation defines (and supplies) ``-copy`` and ``-release`` methods for
|
|
Blocks.
|
|
|
|
In the Objective-C and Objective-C++ languages, we allow the
|
|
``__weak`` specifier for ``__block`` variables of object type. If
|
|
garbage collection is not enabled, this qualifier causes these
|
|
variables to be kept without retain messages being sent. This
|
|
knowingly leads to dangling pointers if the Block (or a copy) outlives
|
|
the lifetime of this object.
|
|
|
|
In garbage collected environments, the ``__weak`` variable is set to
|
|
nil when the object it references is collected, as long as the
|
|
``__block`` variable resides in the heap (either by default or via
|
|
``Block_copy()``). The initial Apple implementation does in fact
|
|
start ``__block`` variables on the stack and migrate them to the heap
|
|
only as a result of a ``Block_copy()`` operation.
|
|
|
|
It is a runtime error to attempt to assign a reference to a
|
|
stack-based Block into any storage marked ``__weak``, including
|
|
``__weak`` ``__block`` variables.
|
|
|
|
|
|
C++ Extensions
|
|
==============
|
|
|
|
Block literal expressions within functions are extended to allow const
|
|
use of C++ objects, pointers, or references held in automatic storage.
|
|
|
|
As usual, within the block, references to captured variables become
|
|
const-qualified, as if they were references to members of a const
|
|
object. Note that this does not change the type of a variable of
|
|
reference type.
|
|
|
|
For example, given a class Foo:
|
|
|
|
.. code-block:: c
|
|
|
|
Foo foo;
|
|
Foo &fooRef = foo;
|
|
Foo *fooPtr = &foo;
|
|
|
|
A Block that referenced these variables would import the variables as
|
|
const variations:
|
|
|
|
.. code-block:: c
|
|
|
|
const Foo block_foo = foo;
|
|
Foo &block_fooRef = fooRef;
|
|
Foo *const block_fooPtr = fooPtr;
|
|
|
|
Captured variables are copied into the Block at the instant of
|
|
evaluating the Block literal expression. They are also copied when
|
|
calling ``Block_copy()`` on a Block allocated on the stack. In both
|
|
cases, they are copied as if the variable were const-qualified, and
|
|
it's an error if there's no such constructor.
|
|
|
|
Captured variables in Blocks on the stack are destroyed when control
|
|
leaves the compound statement that contains the Block literal
|
|
expression. Captured variables in Blocks on the heap are destroyed
|
|
when the reference count of the Block drops to zero.
|
|
|
|
Variables declared as residing in ``__block`` storage may be initially
|
|
allocated in the heap or may first appear on the stack and be copied
|
|
to the heap as a result of a ``Block_copy()`` operation. When copied
|
|
from the stack, ``__block`` variables are copied using their normal
|
|
qualification (i.e. without adding const). In C++11, ``__block``
|
|
variables are copied as x-values if that is possible, then as l-values
|
|
if not; if both fail, it's an error. The destructor for any initial
|
|
stack-based version is called at the variable's normal end of scope.
|
|
|
|
References to ``this``, as well as references to non-static members of
|
|
any enclosing class, are evaluated by capturing ``this`` just like a
|
|
normal variable of C pointer type.
|
|
|
|
Member variables that are Blocks may not be overloaded by the types of
|
|
their arguments.
|