Rust’s main draw is its powerful static guarantees about behavior. But safety
checks are conservative by nature: there are some programs that are actually
safe, but the compiler is not able to verify this is true. To write these kinds
of programs, we need to tell the compiler to relax its restrictions a bit. For
this, Rust has a keyword, unsafe
. Code using unsafe
has fewer restrictions
than normal code does.
Let’s go over the syntax, and then we’ll talk semantics. unsafe
is used in
four contexts. The first one is to mark a function as unsafe:
unsafe fn danger_will_robinson() { // scary stuff }
All functions called from FFI must be marked as unsafe
, for example.
The second use of unsafe
is an unsafe block:
unsafe { // scary stuff }
The third is for unsafe traits:
fn main() { unsafe trait Scary { } }unsafe trait Scary { }
And the fourth is for impl
ementing one of those traits:
unsafe impl Scary for i32 {}
It’s important to be able to explicitly delineate code that may have bugs that
cause big problems. If a Rust program segfaults, you can be sure the cause is
related to something marked unsafe
.
Safe, in the context of Rust, means ‘doesn’t do anything unsafe’. It’s also important to know that there are certain behaviors that are probably not desirable in your code, but are expressly not unsafe:
Rust cannot prevent all kinds of software problems. Buggy code can and will be
written in Rust. These things aren’t great, but they don’t qualify as unsafe
specifically.
In addition, the following are all undefined behaviors in Rust, and must be
avoided, even when writing unsafe
code:
&mut T
and &T
follow LLVM’s scoped noalias model, except if
the &T
contains an UnsafeCell<U>
. Unsafe code must not violate these
aliasing guarantees.UnsafeCell<U>
std::ptr::offset
(offset
intrinsic), with
the exception of one byte past the end which is permitted.std::ptr::copy_nonoverlapping_memory
(memcpy32
/memcpy64
intrinsics) on overlapping buffersfalse
(0) or true
(1) in a bool
enum
not included in its type definitionchar
which is a surrogate or above char::MAX
str
In both unsafe functions and unsafe blocks, Rust will let you do three things that you normally can not do. Just three. Here they are:
That’s it. It’s important that unsafe
does not, for example, ‘turn off the
borrow checker’. Adding unsafe
to some random Rust code doesn’t change its
semantics, it won’t start accepting anything. But it will let you write
things that do break some of the rules.
You will also encounter the unsafe
keyword when writing bindings to foreign
(non-Rust) interfaces. You're encouraged to write a safe, native Rust interface
around the methods provided by the library.
Let’s go over the basic three abilities listed, in order.
static mut
Rust has a feature called ‘static mut
’ which allows for mutable global state.
Doing so can cause a data race, and as such is inherently not safe. For more
details, see the static section of the book.
Raw pointers let you do arbitrary pointer arithmetic, and can cause a number of different memory safety and security issues. In some senses, the ability to dereference an arbitrary pointer is one of the most dangerous things you can do. For more on raw pointers, see their section of the book.
This last ability works with both aspects of unsafe
: you can only call
functions marked unsafe
from inside an unsafe block.
This ability is powerful and varied. Rust exposes some compiler intrinsics as unsafe functions, and some unsafe functions bypass safety checks, trading safety for speed.
I’ll repeat again: even though you can do arbitrary things in unsafe blocks and functions doesn’t mean you should. The compiler will act as though you’re upholding its invariants, so be careful!