feat: use fixed number of db transactions for storage proofs #14860

Rjected · 2025-03-06T05:57:53Z

Creates a task that manages a fixed number of db transactions, specifically for calculating many storage proofs

mattsse

this looks good already

mattsse · 2025-03-06T06:55:56Z

crates/trie/parallel/src/proof_task.rs

+        if self.pending_targets.front().is_none() {
+            return;
+        }
+
+        let next_input = self.pending_targets.pop_front().unwrap();
+


can be let Some else return?

mattsse

some questions about message passing

mattsse · 2025-03-10T20:34:03Z

crates/trie/parallel/src/proof_task.rs

+        // initialize proof task txs
+        let mut proof_task_txs = Vec::with_capacity(max_concurrency);
+        for item in &mut proof_task_txs {
+            let provider_ro = view.provider_ro()?;
+            let tx = provider_ro.into_tx();
+            *item = ProofTaskTx::new(tx, task_ctx.clone());


we're currently paying for this upfront before twe spawn this, right?

which doesn't seem reasonable especially because we spawn this and can just do all of this in the background or on demand.

Yeah, we could do this on the spawned run function

We now create transactions on-demand

mattsse · 2025-03-10T20:39:27Z

crates/trie/parallel/src/proof_task.rs

+                let RunningProofTask { task_receiver, sender } = running_task;
+                match task_receiver.try_recv() {
+                    Ok(ProofTaskOutput { tx, result }) => {
+                        let _ = sender.send(result);


this feels a bit weird, why are we awaiting both the result and the sender here only to send the result through that sender

I guess we could send the result directly in the spawned storage proof task, and only pass the tx back in the ProofTaskOutput

now we send the result directly to the sender in storage_proof

mattsse · 2025-03-10T20:46:43Z

crates/trie/parallel/src/proof_task.rs

+        loop {
+            // TODO: condvar for
+            // * either running or proof tasks have stuff
+            // * otherwise yield thread
+            let message = match self.proof_task_rx.try_recv() {
+                Ok(message) => match message {


hmm, this task now has do advance more than one channel, making this loop a bit complex with try_recv.

isn't the flow of messages just
<- incoming request for storage proof with a tx
if < concurrency: pick available db tx or spawn that task without one and create one on the task
handle the storageproof request on the task and return the db tx

then this only has to loop over a single receiver

or are we then running into issues with generics if we need to carry the db tx in the prooftaskmessage enum

The design currently uses 2 channels, one for incoming proof requests, and one for returning txs from finished storage proof tasks. If we want only one channel in this loop, we would need another way (other than message passing) to keep track of txs in-use / not in-use

Rjected · 2025-03-10T23:59:03Z

Note, I still have this:

let max_concurrency = 32;

any suggestions on the value / how to derive this?

shekhirin

makes sense to me, only nits

shekhirin · 2025-03-13T10:21:44Z

crates/trie/parallel/src/proof_task.rs

+    /// Creates a new [`ProofTaskManager`] with the given max concurrency, creating that number of
+    /// cursor factories.
+    ///
+    /// Returns an error if the consistent view provider fails to create a read-only transaction.


outdated comment?

shekhirin · 2025-03-13T10:22:16Z

crates/trie/parallel/src/proof_task.rs

+    /// Spawns the proof task on the executor, with the input multiproof targets.
+    ///
+    /// If a task cannot be spawned immediately, this will be queued for completion later.


this always queues and the spawn happens in the main loop

shekhirin · 2025-03-13T10:23:29Z

crates/trie/parallel/src/proof_task.rs

+        let Some(proof_task_tx) = self.get_or_create_tx()? else { return Ok(()) };
+
+        let Some((pending_proof, sender)) = self.pending_proofs.pop_front() else {
+            // if there are no targets to do anything with put the tx back
+            self.proof_task_txs.push(proof_task_tx);
+            return Ok(())
+        };


should we swap these, so that txs aren't created if they won't be used?

Rjected added the A-trie Related to Merkle Patricia Trie implementation label Mar 6, 2025

Rjected force-pushed the dan/fixed-db-tx-multiproof branch 4 times, most recently from 64136eb to ab4eb0f Compare March 6, 2025 06:09

mattsse reviewed Mar 6, 2025

View reviewed changes

Rjected force-pushed the dan/fixed-db-tx-multiproof branch 5 times, most recently from b68a291 to 565b7fc Compare March 10, 2025 17:18

Rjected marked this pull request as ready for review March 10, 2025 18:49

Rjected requested review from rkrasiuk, shekhirin, fgimenez and gakonst as code owners March 10, 2025 18:49

Rjected force-pushed the dan/fixed-db-tx-multiproof branch 7 times, most recently from 6018b0c to 2d9dc24 Compare March 10, 2025 20:42

mattsse reviewed Mar 10, 2025

View reviewed changes

Rjected force-pushed the dan/fixed-db-tx-multiproof branch 3 times, most recently from bac17cc to d7397bb Compare March 10, 2025 22:49

Rjected added 2 commits March 11, 2025 17:16

feat: use fixed number of db transactions for storage proofs

184f83a

fix pending proof / task handling

0b07932

Rjected added 6 commits March 11, 2025 17:18

chore: make the io task longer living

c4325eb

remove todo comment

9d01117

chore: add handling for TryRecvError::Disconnected

b8e5a66

chore: send result directly when calculating proof

d777dca

chore: fix incorrect running task removal

bf5bddc

fix: pass down with_branch_node_masks

e039441

Rjected force-pushed the dan/fixed-db-tx-multiproof branch from 6a3cd28 to e039441 Compare March 11, 2025 21:22

shekhirin approved these changes Mar 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: use fixed number of db transactions for storage proofs #14860

feat: use fixed number of db transactions for storage proofs #14860

Rjected commented Mar 6, 2025

mattsse left a comment

mattsse Mar 6, 2025

mattsse left a comment

mattsse Mar 10, 2025

Rjected Mar 10, 2025

Rjected Mar 10, 2025

mattsse Mar 10, 2025

Rjected Mar 10, 2025

Rjected Mar 10, 2025

mattsse Mar 10, 2025

Rjected Mar 10, 2025

Rjected commented Mar 10, 2025

shekhirin left a comment

shekhirin Mar 13, 2025

shekhirin Mar 13, 2025

shekhirin Mar 13, 2025

feat: use fixed number of db transactions for storage proofs #14860

Are you sure you want to change the base?

feat: use fixed number of db transactions for storage proofs #14860

Conversation

Rjected commented Mar 6, 2025

mattsse left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattsse left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Rjected commented Mar 10, 2025

shekhirin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment