Newtype
TODO: we've already used newtype twice by now (fixed point values and volatile addresses), so we need to adjust how we start this section.
There's a great Zero Cost abstraction that we'll be using a lot that you might not already be familiar with: we're talking about the "Newtype Pattern"!
Now, I told you to read the Rust Book before you read this book, and I'm sure you're all good students who wouldn't sneak into this book without doing the required reading, so I'm sure you all remember exactly what I'm talking about, because they touch on the newtype concept in the book twice, in two very long named sections:
- Using the Newtype Pattern to Implement External Traits on External Types
- Using the Newtype Pattern for Type Safety and Abstraction
...Yeah... The Rust Book doesn't know how to make a short sub-section name to save its life. Shame.
Newtype Basics
So, we have all these pieces of data, and we want to keep them separated, and we don't wanna pay the cost for it at runtime. Well, we're in luck, we can pay the cost at compile time.
# #![allow(unused_variables)] #fn main() { pub struct PixelColor(u16); #}
TODO: we've already talked about repr(transparent) by now
Ah, except that, as I'm sure you remember from The
Rustonomicon
(and from the RFC too, of course), if we have a single field struct that's
sometimes different from having just the bare value, so we should be using
#[repr(transparent)]
with our newtypes.
# #![allow(unused_variables)] #fn main() { #[repr(transparent)] pub struct PixelColor(u16); #}
And then we'll need to do that same thing for every other newtype we want.
Except there's only two tiny parts that actually differ between newtype declarations: the new name and the base type. All the rest is just the same rote code over and over. Generating piles and piles of boilerplate code? Sounds like a job for a macro to me!
Making It A Macro
If you're going to do much with macros you should definitely read through The Little Book of Rust Macros, but we won't be doing too much so you can just follow along here a bit if you like.
The most basic version of a newtype macro starts like this:
# #![allow(unused_variables)] #fn main() { #[macro_export] macro_rules! newtype { ($new_name:ident, $old_name:ident) => { #[repr(transparent)] pub struct $new_name($old_name); }; } #}
The #[macro_export]
makes it exported by the current module (like pub
kinda), and then we have one expansion option that takes an identifier, a ,
,
and then a second identifier. The new name is the outer type we'll be using, and
the old name is the inner type that's being wrapped. You'd use our new macro
something like this:
# #![allow(unused_variables)] #fn main() { newtype! {PixelColorCurly, u16} newtype!(PixelColorParens, u16); newtype![PixelColorBrackets, u16]; #}
Note that you can invoke the macro with the outermost grouping as any of ()
,
[]
, or {}
. It makes no particular difference to the macro. Also, that space
in the first version is kinda to show off that you can put white space in
between the macro name and the grouping if you want. The difference is mostly
style, but there are some rules and considerations here:
- If you use curly braces then you must not put a
;
after the invocation. - If you use parentheses or brackets then you must put the
;
at the end. - Rustfmt cares which you use and formats accordingly:
- Curly brace macro use mostly gets treated like a code block.
- Parentheses macro use mostly gets treated like a function call.
- Bracket macro use mostly gets treated like an array declaration.
Upgrade That Macro!
We also want to be able to add derive
stuff and doc comments to our newtype.
Within the context of macro_rules!
definitions these are called "meta". Since
we can have any number of them we wrap it all up in a "zero or more" matcher.
Then our macro looks like this:
# #![allow(unused_variables)] #fn main() { #[macro_export] macro_rules! newtype { ($(#[$attr:meta])* $new_name:ident, $old_name:ident) => { $(#[$attr])* #[repr(transparent)] pub struct $new_name($old_name); }; } #}
So now we can write
# #![allow(unused_variables)] #fn main() { newtype! { /// Color on the GBA gives 5 bits for each channel, the highest bit is ignored. #[derive(Debug, Clone, Copy)] PixelColor, u16 } #}
And that's about all we'll need for the examples.
As a reminder: remember that macro_rules
macros have to appear before
they're invoked in your source, so the newtype
macro will always have to be at
the very top of your file, or if you put it in a module within your project
you'll need to declare the module before anything that uses it.
Potential Homework
If you wanted to keep going and get really fancy with it, you could potentially add a lot more:
- Make a
pub const fn new() -> Self
method that outputs the base value in a const way. Combine this with builder style "setter" methods that are also const and you can get the compiler to do quite a bit of the value building work at compile time. - Making the macro optionally emit a
From
impl to unwrap it back into the base type. - Allow for visibility modifiers to be applied to the inner field and the newly generated type.
- Allowing for generic newtypes. You already saw the need for this once in the
volatile section. Unfortunately, this particular part gets really tricky if
you're using
macro_rules!
, so you might need to move up to a fullproc_macro
. Having aproc_macro
isn't bad except that they have to be defined in a crate of their own and they're compiled before use. You can't ever use them in the crate that defines them, so we won't be using them in any of our single file examples. - Allowing for optional
Deref
andDerefMut
of the inner value. This takes away most all the safety aspect of doing the newtype, but there may be times for it. As an example, you could make a newtype with a different form of Display impl that you want to otherwise treat as the base type in all places.