perf: improve extend_sorted_vec & batch write hashed_state #19990
Conversation
mattsse left a comment:
pedantic doc nit
```rust
        target.sort_unstable_by(|a, b| a.0.cmp(&b.0));
    }
})
.collect();
```
mediocregopher left a comment:

The previous implementation was specifically designed to avoid a big `collect` like this; the memory allocation from the `collect` dwarfs any ostensible speedup you get from not having to sort. I just ran a bench comparing your implementation against the previous one, and the new one is about 2x slower on synthetic datasets.

You can see the bench here if you're curious.
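For context, here is a minimal sketch of what a merge-based `extend_sorted_vec` can look like without allocating a fresh buffer for the merged result: grow the target once, then merge the two sorted runs backward in place. The function name, generic bounds, and element type are assumptions for illustration, not reth's actual code.

```rust
// Hypothetical sketch: extend a sorted vec with another sorted slice by
// merging backward in place, so no second full-size buffer is allocated.
fn extend_sorted<T: Ord + Copy>(target: &mut Vec<T>, other: &[T]) {
    if other.is_empty() {
        return;
    }
    let mut i = target.len(); // unmerged elements left in the original run
    target.extend_from_slice(other); // grow once; the tail gets overwritten
    let mut j = other.len(); // unmerged elements left in `other`
    let mut k = target.len(); // next slot to fill, walking from the back
    while i > 0 && j > 0 {
        k -= 1;
        if target[i - 1] > other[j - 1] {
            i -= 1;
            target[k] = target[i];
        } else {
            j -= 1;
            target[k] = other[j];
        }
    }
    // Any leftover `other` elements belong at the front of the merged tail;
    // leftover original elements are already in place.
    while j > 0 {
        k -= 1;
        j -= 1;
        target[k] = other[j];
    }
}

fn main() {
    let mut target = vec![1, 3, 5, 7];
    extend_sorted(&mut target, &[2, 4, 8]);
    assert_eq!(target, vec![1, 2, 3, 4, 5, 7, 8]);
}
```

The single `extend_from_slice` is the only allocation-sensitive step, which is the property the previous implementation was preserving.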
Very useful bench @mediocregopher.

When I first benchmarked the old version with the aggregated hashed state, the results were not good, which is why I thought something was wrong with `extend_ref` or `extend_sorted_vec`.

When I ran your bench against my own custom merge version (not using `merge_join_by`), the new version only shines when the target size is smaller than the other size; across the board, the old version still wins. That gives a hint for using this function well: keep the target size similar to or larger than the other size, which favors the old version.

But overall (sizes similar, or target size > other size), the old version is still better.
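Since `merge_join_by` comes up above, here is a hedged std-only sketch of the same idea: a single forward pass that merges two key-sorted vecs, with `other` winning on equal keys. The function name, key/value types, and the overwrite rule are assumptions for illustration.

```rust
// Hypothetical sketch of a merge-based combine of two key-sorted vecs
// (plain std, no itertools): one forward pass over both inputs.
fn merge_sorted(target: Vec<(u64, u64)>, other: Vec<(u64, u64)>) -> Vec<(u64, u64)> {
    let mut out = Vec::with_capacity(target.len() + other.len());
    let mut t = target.into_iter().peekable();
    let mut o = other.into_iter().peekable();
    loop {
        match (t.peek(), o.peek()) {
            (Some(a), Some(b)) => {
                use std::cmp::Ordering::*;
                match a.0.cmp(&b.0) {
                    Less => out.push(t.next().unwrap()),
                    Greater => out.push(o.next().unwrap()),
                    Equal => {
                        t.next(); // drop the target entry; `other` wins
                        out.push(o.next().unwrap());
                    }
                }
            }
            (Some(_), None) => out.push(t.next().unwrap()),
            (None, Some(_)) => out.push(o.next().unwrap()),
            (None, None) => break,
        }
    }
    out
}

fn main() {
    let merged = merge_sorted(vec![(1, 10), (3, 30)], vec![(2, 20), (3, 99)]);
    assert_eq!(merged, vec![(1, 10), (2, 20), (3, 99)]);
}
```

This version does allocate the output buffer up front, which is the trade-off the benchmark discussion is about: the cost of that allocation versus the cost of sorting after a bulk extend.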

I dumped the raw data of `HashedPostStateSorted` while benchmarking with native-transfer.
Here are the bench results for `extend_ref`; the new version is better on both in this test case.
You can double-check the bench here; the hashed state raw data is already attached.
Could it be that the raw data has properties the synthetic benchmark doesn't fully cover 🤔?

As discussed in #19739 (comment), batch writing of `hashed_state` is safe, so I created this PR to cherry-pick the old reverted commit.

Changes
- `extend_sorted_vec` uses a merge (avoiding the sort at the end)
- batch write `hashed_state`

Before
- erc20 transfers spam: ~100ms
- native transfers spam: ~50ms

After
- erc20 transfers spam: ~80ms
- native transfers spam: ~30ms
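The batch-write half of the change can be illustrated abstractly: instead of one storage call per hashed-state entry, accumulate the sorted entries and hand them to the writer in a single call. The trait, types, and counting writer below are hypothetical stand-ins, not reth's actual DB API.

```rust
// Hypothetical illustration of batching writes: one call for the whole
// sorted batch instead of one call per entry.
trait HashedStateWriter {
    fn write_batch(&mut self, entries: &[(u64, u64)]);
}

// A counting writer stands in for the real storage layer so the call
// pattern is observable.
struct CountingWriter {
    calls: usize,
    written: usize,
}

impl HashedStateWriter for CountingWriter {
    fn write_batch(&mut self, entries: &[(u64, u64)]) {
        self.calls += 1;
        self.written += entries.len();
    }
}

fn main() {
    let entries: Vec<(u64, u64)> = (0..1000).map(|k| (k, k * 2)).collect();
    let mut writer = CountingWriter { calls: 0, written: 0 };
    // One call covers the entire batch rather than 1000 per-entry calls.
    writer.write_batch(&entries);
    assert_eq!((writer.calls, writer.written), (1, 1000));
}
```

Fewer, larger writes amortize per-call overhead (cursor positioning, locking, transaction bookkeeping), which is where the ~20ms improvements above plausibly come from.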