You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Talking with some folks who would like to pass around an Unstructured through Cranelift to implement a "chaos mode" that does semantics-preserving random mutations like shuffle basic blocks, but don't want to thread Unstructured's lifetimes through the whole code base. Would instead like to have an owned version of Unstructured that they can attach to existing context structures.
Could do this with a wrapper struct and self borrows but would like to make sure this is UB-free and in a place where it can be reused by anyone else who has similar needs:
structOwnedUnstructured{bytes:Vec<u8>,// self-borrow of `bytes`; needs MIRI to tell us// if we need `ManuallyDrop`s in here and all thatu:Unstructured<'static>,}impl{pubfnu<'a>(&'amutself) -> &'amutUnstructured<'a>{unsafe{ mem::transmute(&mutself.u)}}}// Unfortunately, `Deref[Mut]` doesn't work because we need to// constrain the `Unstructured`'s lifetime to the `self` borrow// but we can't do that without GATs in the `Deref` trait.implDerefforOwnedUnstructured{typeTarget = Unstructured<'what_lifetime_to_put_here>;// ...}
Another approach could be to have a Cow<'a, [u8]> in Unstructured itself, although then you'd end up with an Unstructured<'static> but the way that Arbitrary is defined, this would let you create arbitrary &'static [u8]s which is not correct.
Not even sure if this is the right approach, might be better to do something like
structOwnedUnstructured{bytes:Vec<u8>
cursor:usize,}implOwnedUnstructured{// Get a non-owned `Unstructured` of these bytes.pubfnu<'a>(&'amutself) -> implDerefMut<Target = Unstructured<'a>>{structU<'a>{cursor:&'amutusize,initial_len:usize,u:Unstructured<'a>}implDropforU<'_>{fndrop(&mutself){// Advance cursor by number of bytes consumed. This is buggy// because it assumes bytes are only taken from the from of// the input, never from the back, which is not true. Don't// know how to work around this without having a double-borrow// of `self.bytes`.self.cursor += self.initial_len - self.u.len();}}implDerefforU<'_>{/* ... */}implDerefMutforU<'_>{/* ... */}let u = Unstructured::new(&self.bytes[self.cursor..]);U{cursor:&mutself.cursor,initial_len: u.len(),
u
}}}
Out of curiosity, I made a little POC in the cranelift control plane with yoke and it pretty much does exactly what we need. The diff can be found here.
The code complexity did increase in my opinion. Solving this problem within arbitrary would shift that complexity from the user to the library. I guess what the best tradeoff is depends on how common this use case is. And I think @fitzgen mentioned that it's quite niche.
Talking with some folks who would like to pass around an
Unstructured
through Cranelift to implement a "chaos mode" that does semantics-preserving random mutations like shuffle basic blocks, but don't want to threadUnstructured
's lifetimes through the whole code base. Would instead like to have an owned version ofUnstructured
that they can attach to existing context structures.Could do this with a wrapper struct and self borrows but would like to make sure this is UB-free and in a place where it can be reused by anyone else who has similar needs:
Another approach could be to have a
Cow<'a, [u8]>
inUnstructured
itself, although then you'd end up with anUnstructured<'static>
but the way thatArbitrary
is defined, this would let you create arbitrary&'static [u8]
s which is not correct.Not even sure if this is the right approach, might be better to do something like
Just brainstorming at this point.
Thoughts? Ideas?
cc @cfallin
The text was updated successfully, but these errors were encountered: