Initial Public FS and WebAssembly Work #6

appcypher · 2022-04-08T18:09:07Z

Summary

This PR implements the following features

Initial Public FS implementation that immutable core, filetree merging, etc. will be based on.
Rust unit tests
A wasm-bindgen interface implementation.

This is the first of a few PRs that will implement the WNFSv2 Public file system.

Test plan (required)

Testing the Rust core.
```
cargo test -p wnfs
```

Testing the wasm binding.

cd crates/wasm

wasm-pack test --chrome

open http://127.0.0.1:8000

Closing issues

Fixes #4

Ongoing Issues

Plan for Wasming WNFS incrementally oddsdk/ts-odd#371

matheus23 · 2022-04-21T07:35:02Z

crates/fs/common/blockstore.rs

+/// For types that implement getting a block from a CID.
+#[async_trait(?Send)]
+pub trait BlockStoreLookup {
+    async fn get_block<'a>(&'a self, cid: &Cid) -> Result<Cow<'a, Vec<u8>>>;


This may just be me learning rust:
Can we return a slice [u8] here instead of a Vec<u8>?
Why do we return something clone-on-write-able? I was thinking a user of get_block would never want to change the result, so if there's no need for writes, we won't need the Cow.

matheus23 · 2022-04-21T07:39:38Z

crates/fs/common/blockstore.rs

+#[async_trait(?Send)]
+pub trait BlockStoreCidLoad {
+    /// Loads a decodable object from the store with provided CID.
+    async fn load<T: Decode<C>, C: Codec>(&self, cid: &Cid, decoder: C) -> Result<T>;


This is cool!
I'm thinking it might be better if this were a normal function instead of a trait though.
Something akin to async fn load<T: Decode<C>, C: Codec, B: BlockStoreLookup>(blockStore: &B, cid: &Cid, decoder: c) -> Result<T>

This way we wouldn't have to re-implement load for all the different structs that implement BlockStore. They all should work the same.

You are right. We don't need to make it a trait here.

matheus23 · 2022-04-21T07:54:20Z

crates/fs/public/directory.rs

+
+/// A directory in a WNFS public file system.
+#[derive(Debug, Clone, PartialEq, Eq)]
+pub struct PublicDirectory(Shared<PublicDirectoryInner>);


I'm highly in favor of going with only Rc<...> in here for now, instead of Rc<RefCell<...>>, and remove mutation from our algorithms currently.
@appcypher and I talked about this, I think mutation is used in the upsert function. It may be slightly more complex to do immutably, but it's definitely possible.
We need the immutability for some of our upcoming algorithms, namely the base-history-on stuff to construct previous links and the file tree merge algorithm. They like to early-return when they detect that the two trees they compare have the same reference. These early returns are important, since we don't want to recurse into the file tree deeply so we don't fetch deeper nodes.

matheus23 · 2022-04-21T08:03:11Z

crates/fs/public/directory.rs

+
+    /// Looks up a node by its path name in the current directory.
+    ///
+    /// TODO(appcypher): What is a valid path segment identifier?


Yeah, that's a great question. I can only link an old internal discussion about this: https://talk.fission.codes/t/valid-paths-in-wnfs/2015

My thinking is, let's assume UTF8. Technically any byte array is valid (even ones that utf8-decoded include a slash), because of how the encoding into IPLD works out.

This is something we'd need to think about a little harder if we want to implement the pretty tree maybe.

matheus23 · 2022-04-21T08:27:20Z

crates/fs/public/directory.rs

+pub struct OpResult<T> {
+    // The root node. It is the same as the previous root node if the directory has not been diverged.
+    pub root_node: Shared<PublicNode>,
+    // Implementation dependent but it usually the last leaf node operated on.
+    pub result: T,
+    /// Whether this is a divergence or not.
+    pub diverged: bool,
+}


I think we might be able to go without the OpResult type.

get_node returns an OpResult, but that means you get back a PublicNode in root_node and result, where I'd pretty much expect root_node to be the same thing that that's used to call get_node on.

read returns an OpResult, but like get_node, it returns back the root_node you originally called read on.

write returns the parent of the written node. That result isn't used anywhere at the moment.

diverged may not be needed when stuff is immutable, but I may just not really understand it yet! 😅

matheus23

First end-to-end public filesystem implementation! 🎉

Stephen and I talked about the comments I left above: The crux of it would be a change somewhat deeper down & technically only a refactor, the surface API would stay the same.
So let's capture that refactor (figuring out how/whether we can remove the RefCell, and make all algorithms only make use of mutation locally without needing a refcell and non-recursive) in an issue and we'll tackle that as the next PR. 👍
Technically we can continue poking the wasm blob this generates 😄

appcypher added 2 commits April 6, 2022 08:56

Set up wasm env and basic fs impl

f8ce25e

Add more fs stuff

ff1d167

appcypher force-pushed the appcypher/initial-fs-wasm branch from 2b1de45 to ff1d167 Compare April 8, 2022 18:12

Implement dagcbor encoding and tests

a48f72f

appcypher force-pushed the appcypher/initial-fs-wasm branch from a3d5a20 to a48f72f Compare April 11, 2022 14:41

Implement initial wasm-bindgen api

c624021

appcypher force-pushed the appcypher/initial-fs-wasm branch 5 times, most recently from a4ec85f to 8e1b90e Compare April 13, 2022 23:57

Fix wasm build

2547c39

appcypher force-pushed the appcypher/initial-fs-wasm branch from 9acdc7d to 2547c39 Compare April 14, 2022 00:03

Implement unix fs ops

14d8eb4

appcypher marked this pull request as ready for review April 14, 2022 12:19

appcypher force-pushed the appcypher/initial-fs-wasm branch 7 times, most recently from d4d94ee to c711c58 Compare April 14, 2022 20:20

More documentation

16b07ba

appcypher force-pushed the appcypher/initial-fs-wasm branch from c711c58 to 16b07ba Compare April 14, 2022 20:35

Add more wasm api

3d0608f

appcypher requested a review from matheus23 April 19, 2022 08:58

appcypher added 3 commits April 19, 2022 10:49

Refactor PublicDirectory to use interior mut

80529b1

Remove unnecessary tests

9ee8b0e

Add example

c5d4e58

walkah assigned appcypher Apr 19, 2022

Fix path divergence issue in

53094c5

matheus23 reviewed Apr 21, 2022

View reviewed changes

Update examples and make blockstore::load a function

624c876

matheus23 approved these changes Apr 21, 2022

View reviewed changes

appcypher merged commit 7a89949 into main Apr 21, 2022

zeeshanlakhani deleted the appcypher/initial-fs-wasm branch October 19, 2022 19:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial Public FS and WebAssembly Work #6

Initial Public FS and WebAssembly Work #6

appcypher commented Apr 8, 2022 •

edited

Loading

matheus23 Apr 21, 2022

matheus23 Apr 21, 2022

appcypher Apr 21, 2022

matheus23 Apr 21, 2022

matheus23 Apr 21, 2022

matheus23 Apr 21, 2022

matheus23 left a comment

Initial Public FS and WebAssembly Work #6

Initial Public FS and WebAssembly Work #6

Conversation

appcypher commented Apr 8, 2022 • edited Loading

Summary

Test plan (required)

Closing issues

Ongoing Issues

matheus23 Apr 21, 2022

Choose a reason for hiding this comment

matheus23 Apr 21, 2022

Choose a reason for hiding this comment

appcypher Apr 21, 2022

Choose a reason for hiding this comment

matheus23 Apr 21, 2022

Choose a reason for hiding this comment

matheus23 Apr 21, 2022

Choose a reason for hiding this comment

matheus23 Apr 21, 2022

Choose a reason for hiding this comment

matheus23 left a comment

Choose a reason for hiding this comment

appcypher commented Apr 8, 2022 •

edited

Loading