Remove Mustermann leaf matcher to improve performance by dcr8898 · Pull Request #284 · hanami/hanami-router

dcr8898 · 2025-03-27T20:27:16Z

Router Improvement

This will be a long-running, exploratory spike intended to remove the router's dependency on Mustermann and improve performance overall. Router performance is defined by two metrics: start up time and requests-per-second, both measured using the r10k benchmarking tool. I feel that this tool, r10k, is problematic for a number of reasons, but it is readily available and offers a general sense of performance changes as the spike progresses.

I invite any and all comments as this goes along. I am tagging the following contributors to request their input:

@kyleplump @cllns @timriley

Goals

Mustermann is a powerful string matching library that offers many features, most of which our router does not use. In fact, some of the work done by Mustermann is duplicated by the work the current router does in splitting route and path strings into segments in order to utilize trie data structures. Since Mustermann pattern objects are complex and resource intensive to create, eliminating this dependency should improve router startup time, performance in production, and reduced memory usage. I hope. 🙄

I believe that further improvements may be possible on these same metrics through a more general refactor of the router's operation. I would like to explore these ideas as well.

Checklists

Following are a series of checklists to guide progress on this spike. They should be considered "living lists" that will change during the process. Please feel free to edit the lists (or suggest edits) if they are not complete or detailed enough.

Current features

The Hanami Routing guide commits to supporting the following features (checked items have been re-implemented so far without Mustermann):

Match segment-length dynamic variables without constraints.
Constraints on dynamic variables (expressed as regular expressions in the route definition).
Optional route clauses, like get "/books(/:id)".
Globbing & Catch all routes - this is unchanged from earlier versions (2.0 - 2.1). It is implemented through special handling in the router (uses Mustermann, but separate from routes stored in tries).

Features NOT implemented

These are undocumented--so far as the Hanami guides and the router README are concerned--features of the 2.1 and 2.2 routers that are powered by Mustermann and technically possible, but possibly can't be duplicated with 1-1 parity using my current approach.

Constraints using POSIX-like notation.
Escaped characters in route definition, like "/test\\:variable"
Sub-segment-length dynamic variables, like "/test.:format"
Sub-segment-length dynamic variables that are also delimited by . character, like "/hey.:greed.html" (captures :greed).

Edge cases

I am aware of the following edge cases that aren't specifically covered in the guides, but merit discussion. It would be good to establish explicit rules for them.

Trailing slashes
- the 2.1 router did not handle trailing slashes correctly, leading to ambiguous results.
- the 2.2 router does handle trailing slashes correctly and treats /books/:id and books/:id/ as distinct, matchable routes (because it uses Mustermann to match the complete, original route definition).
- the 2.2c+ routers developed during this spike no longer handle trialing slashes correctly. This is because we use String#split on path strings, which ignores the final character if it is the #split parameter.

Stretch goals

I believe these goals are possible with my current approach. I will implement them if there is consensus on their value.

Conversion of URI-encoded values in path strings (not mentioned in the routes, but a basic necessity).
User defined constraint objects (must respond to #match?, for example, and receive a string argument).

Starting Point

The current 2.2 router introduced a performance regression compared to 2.1 (see #278 ). This commit (2.2a) corrects the core issue causing that regression, as identified and fixed by @kyleplump (see #279) and restores near parity to 2.1's speed, while increasing start up time (primarily due to Mustermann object creation during start up). This is our starting point, as shown in the following graphs.

Benchmark machine: AMD Ryzen 7 laptop with 24GB of RAM and no attempt to restrict background processes (in other words, YMMV).

Commit label: 2.2a
Test status: 544 examples, 0 failures, 37 pending

expand to see graphs

dcr8898 · 2025-03-27T20:42:24Z

This commit addresses a second issue identified and fixed by @kyleplump in #279: using a regular expression to split route and path strings is much slower than using a string ("/").

With this commit, operating speed is faster than 2.1, but start up time and memory usage are unchanged from the previous commit.

Commit label: 2.2b
Test status: 544 examples, 0 failures, 37 pending

expand to see graphs

dcr8898 · 2025-03-29T02:50:34Z

This commit is the proof of concept for removing Mustermann. Basic functionality is implemented (see below). Performance is improved on all metrics. Some tests are failing, as expected at this point.

But this proof of concept also comes with a serious reality check.

Discussion

The graphs below show the performance benefits of eliminating Mustermann:

Requests per second is increased by about 10% across the board, now plainly outpacing 2.1.
Start up time is now half that of 2.1, and a seventh of 2.2a.
Memory usage is virtually identical to 2.1

The price of these increases, however, is features. This commit implements "segment-sized" dynamic variables, such as :id in "/books/:id/new". It does not implement constraints. While this commit probably covers 80% (or more) of the common usage of the router, it is not yet on 1-1 parity with the 2.2a router (as documented in the Hanami guides).

However, I now see from most of the currently failing tests that the specs do document more exotic usage patterns, not documented in the guides or README, that are available through the use of Mustermann. This includes, for example, dynamic variables embedded within a segment, such as: "/hey.:greed.html" (captures :greed). While these usage patterns are not discussed in the guides, their presence in the specs is evidence that they may be in use in the wild. Removing them could be considered a breaking change.

I'm not sure if I can duplicate all of these usage patterns using my current approach, but I want to do more thinking about this.

This raises the question of what features we want to support? Are we willing to sacrifice some of the flexibility of Mustermann for better performance?

These are our current options, as I see it:

2.1 router - fast but not "correct" (unless we define correct differently--always an option).
2.2b router - correct and fast, except for startup time.
2.2c+ router - fast and correct, based on a smaller feature set (my current strategy).
Something else?

I am happy to elaborate on my current approach, if anyone would like to help me think through alternatives.

Unless the discussion here pushes me in a different directions, I plan to implement constraints next (via regular expressions), and URI decoding of all variable segments (interestingly, a core feature of Mustermann, but not exercised in the current test suite).

Commit label: 2.2c
Test status: 544 examples, 35 failures, 41 pending

expand to see graphs

dcr8898 · 2025-03-29T18:14:38Z

@kyleplump @cllns @timriley I am tagging you all for a temperature check as to the direction I am going. No rush to respond. I will be traveling for nine days starting Tuesday, so I thought this would be a good time for reflection and conversation. It may also be a good time to broadcast this exploration to the wider community for feedback.

I doubt I will do any coding while traveling, but I may be able to comment. It might be possible to add basic constraints before I go, but no promises. Constraints would have no effect on this benchmark, since the benchmark doesn't exercise them.

Way Point No. 1

Let's see where we stand so far. Here are three graphs comparing:

Hanami 2.1
Hanami 2.2c
Roda 3.90.0 (current)
Rails 8.0.2 (current)

I will say again that r10k is not a great tool for reflecting real-world usage (at all), but it may be useful for gross comparisons.

It is clear from these graphs that Hanami router in any configuration occupies a sweet spot in terms of performance and developer experience: developer experience on par with Rails, and performance approaching that of Roda. This is Luca's achievement and should be recognized and appreciated. 🙇

Rails routing performance isn't close on any measure. This is food for thought.

Questions for the community

What are the features we need for the Hanami router to be successful?
What are our performance goals for Hanami router?

expand to see graphs

dcr8898 · 2025-03-30T23:58:38Z

This commit adds basic constraints using regular expressions. There is a variation on regular expression constraints available with Mustermann that allows POSIX-like notations. I added this as a feature in the Not Implemented list. Let me know if this should be a stretch goal.

At this point, we have essentially matched the feature set described in the Hanami router guide. However, I don't think we have full parity until we have URI decoding in place (a feature of Mustermann). Even though this is not mentioned in the guide, I see it as basic functionality. I will try to implement that next.

As of now, only 16 tests are failing (down from 35 in the prior commit). Four of these are unit tests that I need to re-work, remove, or skip. The other 12 are "exotic" applications of Mustermann, like optional segments, multiple dynamic variables in a segment, and escaped characters.

Benchmark performance is indistinguishable from 2.2c.

Commit label: 2.2d
Test status: 544 examples, 16 failures, 41 pending

expand to see graphs

~~Next up is URI decoding.~~

Next is fixing unit tests and considering skipping remaining failing tests as undocumented behavior that is not representative of current requirements.

timriley · 2025-03-31T12:17:30Z

Thank you for your work on this, @dcr8898! It is exemplary as always.

I'll get to looking at the code soon, but I did want to answer this question of yours:

This raises the question of what features we want to support? Are we willing to sacrifice some of the flexibility of Mustermann for better performance?

If I look at the README overview for the Rails flavour of Mustermann, I'm interested in us supporting the features exercised in all three of the example routes:

pattern = Mustermann.new('/:example', type: :rails)
pattern === "/foo.bar"     # => true
pattern === "/foo/bar"     # => false
pattern.params("/foo.bar") # => { "example" => "foo.bar" }
pattern.params("/foo/bar") # => nil

pattern = Mustermann.new('/:example(/:optional)', type: :rails)
pattern === "/foo.bar"     # => true
pattern === "/foo/bar"     # => true
pattern.params("/foo.bar") # => { "example" => "foo.bar", "optional" => nil   }
pattern.params("/foo/bar") # => { "example" => "foo",     "optional" => "bar" }

pattern = Mustermann.new('/*example', type: :rails)
pattern === "/foo.bar"     # => true
pattern === "/foo/bar"     # => true
pattern.params("/foo.bar") # => { "example" => "foo.bar" }
pattern.params("/foo/bar") # => { "example" => "foo/bar" }

:name (capture everything except /) is a mainstay in routing and it's obvious we want that.

*name (capture everything including /) is an important escape hatch and a helpful counterpart to :name.

As for (/:optional), I personally wished for that when building the new Hanami site earlier this year, but I think(?) we lost it in the move to the Hanami-router 2.0 release, so I had to define duplicate routes to handle this case:

get "/guides/:org/:version/:slug", to: "guides.show"
get "/guides/:org/:version/:slug/*path", to: "guides.show"

A route of /guides/:org/:version/:slug(/*path) would have felt more elegant and certainly have been pleasing.

If we can support these three things, then I think we'll be able to express most routes well enough.

It also means we'll largely maintain compatibility with the routing support we've offered historically.

I'm not sure if this throws a spanner in your works, but I'm keen to hear your response to the above, @dcr8898 😄

dcr8898 · 2025-03-31T18:36:37Z

@timriley Thanks for the guidance! It's invaluable!

Quick overview of the router's function:

The Router object holds several Tries.
Each Trie is made up of some number of Nodes.
Any Node that is an endpoint holds a leaf for each route that ends there (usually one leaf, but more are possible).

To address your three feature requests out of order:

Typical usage. This is good to go now. 👍 This is implemented in the Tries and their child objects.

pattern = Mustermann.new('/:example', type: :rails)
pattern === "/foo.bar"     # => true
pattern === "/foo/bar"     # => false
pattern.params("/foo.bar") # => { "example" => "foo.bar" }
pattern.params("/foo/bar") # => nil

Globbing and catch-all routes. This is described in the guides. This is implemented in the Router object. I have not looked at or changed this in any way from 2.1, so I believe this should also work now. 👍

pattern = Mustermann.new('/*example', type: :rails)
pattern === "/foo.bar"     # => true
pattern === "/foo/bar"     # => true
pattern.params("/foo.bar") # => { "example" => "foo.bar" }
pattern.params("/foo/bar") # => { "example" => "foo/bar" }

Optional route parts. This is not mentioned in the guides, but may have been partially doable in 2.1 as long as the optional part was within a route string segment and included a captured parameter. This may have worked because each segment with a dynamic variable defined within it got it's own Mustermann matcher. However, optional parts that included a forward slash or spanned two segments may have broken the 2.1 router (I think).

In any event, I think optional parts are totally doable by automating your "duplicate route" strategy right in the Router object (that's been my plan). However, defining routes for all permutations of optional parts could get crazy if there are multiple optional parts or nested optional parts. I suppose we have to allow for that and advise users to use sparingly. 🤔

pattern = Mustermann.new('/:example(/:optional)', type: :rails)
pattern === "/foo.bar"     # => true
pattern === "/foo/bar"     # => true
pattern.params("/foo.bar") # => { "example" => "foo.bar", "optional" => nil   }
pattern.params("/foo/bar") # => { "example" => "foo",     "optional" => "bar" }

Takeaways

My takeaway, since two of your requests should be working now.

Move optional parts from stretch goals to main goals.

Further thoughts

The current test suite includes what I've been calling "exotic" usages of Mustermann. For example, multiple dynamic variables in a segment or dynamic variables that are a portion of a segment. I believe that these usage patterns could be compatible with my current approach by . . . using Mustermann for these specific cases. We should be able to detect these "exotic" usages and only use Mustermann for the segments that are affected.

I like this approach of matching power to demand. The majority of use cases would be unaffected. This was the primary performance problem of the 2.2 router: invoking the full power of Mustermann for all routes, when it is not needed for the most common use cases.

dcr8898 · 2025-03-31T21:18:33Z

@timriley

I'm not sure what you mean by this bit:

*name (capture everything including /) is an important escape hatch and a helpful counterpart to :name.

The splat doesn't capture the leading / in ("/*name"), but it will capture any character after that, including any further /`s.

dcr8898 · 2025-04-16T17:39:02Z

This commit refactors tests to gain clarity before moving forward.

At the start of the spike there were 37 skipped tests.

Prior to this commit, 16 tests were failing and 41 were skipped (I skipped a few more unit tests along the way).

This commit does the following:

Refactor unit tests for Trie, Node, and Leaf classess. Tests now match interface changes made during the spike. I started by un-skipping any tests I had skipped during the spike, then some tests were added and some updated or removed. (Wish I had TDDed this. 🫤 ) This refactor also led to two small changes in the Node and Leaf classes.
Marked the remaining failing tests (12) as skipped, with a FIXME: not supported in 2.2.1 comment on each. These tests all exercise presently unsupported behavior, such as escaped characters in route definitions, or sub-segment level dynamic variables (see my comments on "exotic" Mustermann usages above). These features are possible, but are not supported at this point in the spike.

Thankfully, CI failures should now be gone too. 🥳

Benchmark performance is indistinguishable from 2.2d.

Commit label: 2.2e
Test status: 549 examples, 0 failures, 49 pending

Next up is support for optional clauses.

expand to see graphs

…ters

…tional_routes

dcr8898 · 2025-04-18T21:02:03Z

This commit implements optional route clauses. Nested optional clauses and consecutive optional clauses are supported. This is implemented by defining routes for every possible permutation of optional clauses.

This concludes the basic features that @timriley requested, although some more polish is probably needed, like automatic URI decoding. This is a good place for another way point and a meaningful conversation about where to go from here, especially as pertains to Mustermann. However, before we do that, I am curious to pursue a few more performance tweaks.

Benchmark performance is indistinguishable from 2.2e.

Commit label: 2.2f
Test status: 555 examples, 0 failures, 46 pending

Next up is some performance experiments.

expand to see graphs

dcr8898 · 2025-04-24T03:25:14Z

I have exhausted my ideas for performance improvements. I feel that this commit concludes this spike. I will declare a way point below and invite a discussion as to whether this spike should be pursued as an actual PR, and what further changes would be needed.

Improvements

The changes I made here centered mostly around replacing calls to Regexp#match? with String#inlcude?, since the String method is demonstrably faster. Making this change to detect "variable" routes in Router, and segments in Node, gave a noticeable (but small) boost to start-up time and requests per second. I also removed some minor indirection in Node (an unnecessary local variable).

Next, I converted the "globbed route" and "optional route" detection in Router from a Regexp to a String method. However, this did not seem to have much effect. I find this a little odd, even though these types of routes are not exercised by this benchmark, because these checks are made for every route when compiling the Tries. In fact, it seems to me that the rps benchmarks may have been slightly lower after this change for the 10, 100, and 1000 route cases. Weird.

Another change I attempted was to use String#split with a block directly, instead of String#split#each. My tests show that passing a block directly to #split is far faster than tacking on a call to #each. However, the router was unequivocally slower with this implementation. I don't know why.

When using #each, we throw away the first segment because it is always empty (since path strings always start with /). When using #split directly, I operated on the substring path[1..] to achieve the same effect. Again, tests show that this should not make a difference speed-wise, but the benchmark showed differently and I abandoned this change completely.

A keen eye will show that RPS and Runtime are slightly improved from 2.2f.

Commit label: 2.2g
Test status: 555 examples, 0 failures, 46 pending

expand to see graphs

cllns

First of all, thank you so much for continuing work on this @dcr8898! This is a huge lift.

I made a couple tiny suggestions, but overall this is good with me, based on your description of the changes. I'm fine with removing support for features that were only documented via specs. If those features breaking is a problem for users, they can revert to an earlier version for now, and we can figure out how to add support for them, or provide a workaround.

I renamed this PR to be more accurate, since we're not actually removing all uses of Mustermann, just one important use of it.

Curious what @kyleplump and @timriley have to say as well!

cllns · 2025-04-24T18:34:32Z

+        [
+          +EMPTY_STRING << match_data.pre_match << match_data[1] << match_data.post_match,
+          +EMPTY_STRING << match_data.pre_match << match_data.post_match
+        ].each do |new_path|


Instead of creating this intermediary array to iterate over it, could we repeat the conditional below for each new_path we create? Less DRY but possibly more performant?

I will try this. It makes sense.

This point also highlights the difference between writing speed-sensitive library code and writing apps: apps are optimized for maintainability, while the library code is optimized for performance. That often requires abandoning a lot of code hygiene practices that we usually promote (like DRY in this case). Many of those good practices are based on indirection, like using constants to label magic values, but that indirection impacts performance. This was the basis of my question above: How much performance is enough? I'm not sure it's worth in-lining all of the constants in the Router, for example, but others might disagree.

It's also hard to know if these micro-optimizations have an effect when we are using gross measurements like r10k. With that said, I'm curious to try this one and see what happens. 🤓

Taking a fresh look at this, since it only applies to optional routes, which are uncommon and not exercised by this benchmark at all, I wonder if this change is worthwhile. Thoughts?

According to Fast Ruby, interpolation should be slightly faster than the shovel method. Do you think this change is worth making?

[ -"#{match_data.pre_match}#{match_data[1]}#{match_data.post_match}", -"#{match_data.pre_match}#{match_data.post_match}" ].each do |new_path|

This is what I was getting at with the "Question for All." I like searching for performance tweaks, and I think a router should be as performant as possible, but I agonize over the loss of readability and maintainability. In this case, I think switching to interpolation is probably okay, because the code remains understandable (I think). But I hesitate to un-DRY the code for what is a rare usage (optional clauses).

What do you all think?

kyleplump

this is honestly an insane amount of work @dcr8898, and all of the commendations are well deserved 🎉 💪

(made a few code level comments)

in terms of philosophy, I agree with a lot of what you brought up. I think about this problem from two points of view and I think you've landed at ideal solutions for both (in my opinion):

'Hanami in a bubble': if Hanami was the only framework in the world, I think the important metric to capture with this change is to make the common use case as fast as possible, while providing clear instructions on the 'exotic' cases. The 'common case' is something Tim defined, and you've achieved 'as fast as possible' so big win there ❤️ . for the 'edge / exotic' use cases I agree with what Sean said: we can provide workarounds / documentation on how to accomplish these cases, and even support them using a slower Mustermann based approach. all good on this point, 10/10
'The real world': unfortunately Hanami does not exist in a bubble and will be constantly compared to rails, whether that's something we collectively like or not. Since that comparison will be at the forefront of developers (especially newcomers) minds, we should honestly lean into the comparison. your observation here: "... offer a feature set and developer experience that rival that of Rails, while achieving performance comparable to Roda", I think is a huge selling point of this approach / the Hanami router generally. it's easy to get stuck optimizing against yourself, but it's important to remember the broader context in which the framework exists. your work widens the gap with Rails and I think makes this spike super appealing

in summary: 'LGTM 👍'

excellent as always @dcr8898 , I'm very much in favor of the change as long as @timriley and @cllns are

kyleplump · 2025-04-24T20:04:31Z

+        [
+          +EMPTY_STRING << match_data.pre_match << match_data[1] << match_data.post_match,
+          +EMPTY_STRING << match_data.pre_match << match_data.post_match
+        ].each do |new_path|


dcr8898 · 2025-04-25T02:24:42Z

I suspect this is because calling path[1..] creates a new array, so it negates the optimization of calling split without each that avoids an allocation. I'm not sure where that's used but maybe you could do path.shift immediately after creation, or even perhaps shift the path string itself before it's split, so the leading slash isn't included?

@cllns path[1..] here is operating on and returning a string. You can find the benchmark code I used for the different strategies here. The "ArraySegmenter" (what we are using) uses #split and throws away the first array member. (Do you think this effectively creates two arrays?)

The fastest strategy I found was the one you suggested. I called it "SneakyArray." To keep it faithful to our use case, I still used string[1..] before calling #split. It is decisively faster in the benchmark, but was noticeably slower when I tried it here. It's a bit confounding. 🧐

It may be related to the fact that re-implementing Trie#find in this way requires us to manually implement the functionality of #all? within the block passed to #split. I did this by calling break false if node.get returned nil on any segment. Something like this:

return unless path[1..].split("/") do |segment|
  node = node.get(segment, param_values)

  break false if node.nil?
end

This made the benchmark about 10% slower in the 10K route case. 🤔

dcr8898 · 2025-04-25T15:05:48Z

return unless path[1..].split("/") do |segment|
  node = node.get(segment, param_values)

  break false if node.nil?
end

This made the benchmark about 10% slower in the 10K route case. 🤔

Correction. If I inline the splitting exactly as above in Trie#find, there is a small performance boost. I will commit this.

However, when I do the same in Trie#add, the situation is not clear cut. It is significantly slower with 10K routes, and faster with 10, 100 and 1K routes. Still weird. I won't change this for now.

Question for all

Is this performance increase (about 2% with 10K routes, less with fewer routes) worth the increased complexity of this change?

…l_routes

…test setup

cllns · 2025-04-25T17:52:55Z

Question for all

Is this performance increase (about 2% with 10K routes, less with fewer routes) worth the increased complexity of this change?

To answer your question in general: we should optimize performance for 10-100 routes, never 10k routes. The reason I had an issue with 10k routes in a previous implementation (what started this saga 😅 ) is that it was causing the startup time to be so high (~10 seconds) that it was essentially un-usable (since it would add so much time to starting the server + for integration tests). It makes sense that performance degrades at the upper limit, in terms of execution time. We need to be careful to not let it explode, but a small increase is fine.

Sorry, I didn't realize you were already operating on a string. Since we use frozen string literals, we'll need to get copies instead of mutating the object anyway. Also, based on a quick benchmark, String#slice is ~7% faster than String#[]. Mind experimenting with that?

This code below is really hard for me to parse. Is there a way to simplify it? return unless split iterate break false. Could we just iterate over the segments and return if the node is nil?

return unless path[1..].split("/") do |segment|
  node = node.get(segment, param_values)

  break false if node.nil?
end

dcr8898 · 2025-04-25T17:59:52Z

With the above changes there is a noticeable performance increase--about 5% in the 10K route case. Memory use and start-up time are not noticeably affected.

I don't know how far we want to go in pursuing further performance gains. This implementation is already markedly better than 2.1, 2.2, or 2.2b (which was the starting baseline for this exploration). Performance isn't the current goal, per se, but more of a baseline. But it is fun to see how far we can go. 🚀

The Big Questions

It seems like this is a viable path. Some "features" that were exercised in tests are not currently implemented, but I am confident most or all of them could be implemented in the future (although some them could impact performance).

With that said:

Do we want to implement this approach?
If so, do we merge it now or are there further changes required?

Commit label: 2.2h
Test status: 555 examples, 0 failures, 46 pending

expand to see graphs

dcr8898 · 2025-04-25T18:05:51Z

Sorry, I didn't realize you were already operating on a string. Since we use frozen string literals, we'll need to get copies instead of mutating the object anyway. Also, based on a quick benchmark, String#slice is ~7% faster than String#[]. Mind experimenting with that?

Interesting! I will try it! 🤓 The docs say #slice is an alias for []. I'm surprised there's a difference.

timriley · 2025-04-26T13:48:32Z

Hi @dcr8898, I just want to thank you so much for this fantastic work. Whatever direction we take, this has been a hugely informative investigation that I'm sure we'll refer back to in the future.

This coming week I'm in the midst of talk writing (Baltic Ruby keynote coming up). This will take most of my focus. As soon as that's done I'll get onto responding to this PR :)

dcr8898 · 2025-04-26T20:10:08Z

Hi @dcr8898, I just want to thank you so much for this fantastic work. Whatever direction we take, this has been a hugely informative investigation that I'm sure we'll refer back to in the future.

Thank you! I agree, this has been an important learning journey, especially since Luca has stepped back and this was totally his creation. I look forward to whatever path we choose! It's all upwards from here for Hanami! 💮

This coming week I'm in the midst of talk writing (Baltic Ruby keynote coming up). This will take most of my focus. As soon as that's done I'll get onto responding to this PR :)

No rush! I need to get back to looking for a job anyway! 🤓

dcr8898 · 2025-04-27T01:49:54Z

This code below is really hard for me to parse. Is there a way to simplify it? return unless split iterate break false. Could we just iterate over the segments and return if the node is nil?
return unless path[1..].split("/") do |segment|
  node = node.get(segment, param_values)

  break false if node.nil?
end

Yeah, that's awful. 🙄 I'm going to chalk this up to tunnel-vision refactoring. I changed it to this:

def find(path)
  node = @root
  param_values = []

  path[1..].split(SEGMENT_SEPARATOR) do |segment|
    node = node.get(segment, param_values)

    break if node.nil?
  end

  node&.match(param_values)&.then { |found| [found.to, found.params] }
end

Is that better? Any more indirection will reverse the performance gain I think.

dcr8898 · 2025-04-29T16:32:08Z

This seems like a good time to pause and wait for @timriley's input.

I believe that the results so far show that this is a viable path. It's faster than using Mustermann because this approach shrinks the problem space. Mustermann is a powerful library for string pattern recognition, extraction, and expansion. However, our purpose is limited to path string parsing, not all string parsing.

This is a smaller problem space. In this space, we can take advantage of the fact that path strings have a defined structure: segments, separated by / characters. This simple realization powers the speed of the trie data structures and simplifies our parsing work for parameter extraction.

I want to reiterate that we remain reliant on Mustermann for both globbed routes and all named route expanders (all or most routes in most apps), neither of which are exercised with this benchmark tool. This means our start-up time concerns remain unless we use an alternative for path string expansion. I think this is doable.

The graphs below include 2.1 and 2.2b as reference points for our progress.

Commit label: 2.2i
Test status: 555 examples, 0 failures, 46 pending

expand to see graphs

dcr8898 · 2025-10-11T21:34:35Z

Comments following the 2.3 release

I was glad to see @kyleplump's performance fix get merged. In light of that and the 2.3 release, I thought this would be a good time to renew this conversation.

This PR

I started this spike to investigate ways to speed up the router, both at start up and in production. My main focus was on exploring the removal of the Mustermann dependency. I think the work here shows that Mustermann can be avoided for the most common routing use cases, and suggests that Mustermann can be avoided for all routing with just a little more work.

However, routing is only half of the issue. The other half is path "expanding": providing properly constructed paths for link helpers. This is still currently done with Mustermann (Mustermann objects perform both route matching and path expanding). I believe we can gradually replace Mustermann for this purpose as well by creating lighter objects for each use case, starting with the most common ones (the way I approached refactoring routing here).

With that said, that's a lot of stuff to put in one PR. Therefore, I think it is probably preferable to close this PR as a successful research spike and set out a roadmap to incorporate these ideas in a series of smaller PRs.

Some suggested items for roadmap

Refactoring of our earlier fix. I would like to do some basic refactoring of the fix @kyleplump and I did for the earlier bug fix. Based on my better understanding of the way Luca designed the router, I would do some things differently, mostly around testing.
Refactoring of the router logic. Again, based on my further learning, I think the basic router implementation could be simplified. I think this (and the refactoring above) will make a better starting point for the improvements investigated in this branch.
Re-introduce the changes implemented in this branch incrementally (multiple PRs). @cllns' point that we need to improve start up time for the router is well taken. The only realistic way to do that is to move from our use of Mustermann to something else. This branch explored rolling our own solution. I believe this is the best way to go, but I would love to hear other ideas.
Explore ideas for replacing Mustermann expanders. I would keep Mustermann for now, and follow the same approach as this branch: roll the simplest solution for the most common uses cases, followed by adding complexity when justified by further needed use cases.
Consider semantics. For example, what should be the correct handling of trailing slashes ("/")?
Consider further improvements. One of the things that made this exploration successful (IMO), was "shrinking the problem space." We did not attempt to recreate everything that is possible with Mustermann or Rails. Moving forward, what additional features do we want?

Thoughts?

@kyleplump @cllns @timriley and everyone else, I would love to hear other thoughts and ideas.

Thank you all!

rkh · 2026-04-18T06:29:29Z

Not sure what the status is, and I don't have a grasp of how the router works at the moment, but I felt like chiming in here, as Mustermann being so complex is my fault.

I did initially explore writing a trie-based router as part of Mustermann. The internal AST structure used for compiling patterns is actually pretty ideal for that. At the time, I discarded that idea, as the main reason for Mustermann was to replace the somewhat messy pattern compilation in Sinatra (we kept having edge cases around regex capture greediness that people struggled with – this is something you might want to keep an eye on when building your own solution, if you want to support sub-segment matching). Shipping custom code might also be tricky if you want to support features like the pipe operator, as Rails does. They've shipped broken versions of it in stable releases – I think this is part of the reason no one has touched their route compilation since Rails 5.0. You would probably end up partially reimplementing an AST-based parser (like Journey or Mustermann).

Turns out, at least back then, that the potential performance gains from trie-based routing were negligible or even negative for the typical Sinatra application, given that the number of routes is usually in the tens rather than the thousands.

I have been revisiting Mustermann and am open to giving trie-based routing another try if you're interested. It would not help at all with the startup/compilation time. To be honest, I'm not really sure what to do about that, other than adding a shortcut compilation for simple patterns. It might be an option, given that Rails patterns are much simpler than others.

This is a smaller problem space. In this space, we can take advantage of the fact that path strings have a defined structure: segments, separated by / characters. This simple realization powers the speed of the trie data structures and simplifies our parsing work for parameter extraction.

FWIW, Mustermann does realize this, too. They have a special AST node. They are also used for atomic grouping in regular expressions to speed up matching (i.e., you don't need to backtrack beyond them). Splitting by them just isn't necessary from a Mustermann perspective, as we're not building a trie. Mustermann is written with path-based matching and expansion as the main use case.

That said, if you only ever have placeholders that span exactly one segment, or multiple segments, then Mustermann is probably way too complex.

dcr8898 added 2 commits March 27, 2025 13:04

store Mustermann matchers (instead of routes) in leaves

fd355d2

split routes and paths using string instead of regular expression

b82b44d

basic proof of concept for Mustermann removal

6c87d3f

implement basic regexp constraints

891d752

dcr8898 added 7 commits April 14, 2025 22:52

refactor unit tests for leaf to match updated leaf interface

dcb35a3

remove unnecessary guard clause in leaf#match

3962c53

refactor unit tests for node to match updated node interface

ecf0f93

remove deprecated attr_reader for :to in node

f650ebd

refactor unit tests for trie to match use standard regex (not posix)

3ca7d5c

skip remaining failing tests as not presently supported

900b6f5

appease rubocop

1aea397

dcr8898 added 6 commits April 18, 2025 14:59

unskip generation_spec test for route with optional clause

a134c19

add tests for routes with optional clauses in recognition_spec

af8d51d

create InvalidRouteDefinitionError to use with broken optional clauses

3125d49

update Leaf#match to ignore constraints placed on non-existent parama…

3840939

…ters

implement optional route definition clauses

e89a025

remove namesaces when raising InvalidRouteDefinition in Router#add_op…

65dd7a1

…tional_routes

dcr8898 added 3 commits April 21, 2025 21:43

replace regex test for variable segments with string test

8110cb0

remove some indirection in Trie when parsing segments

3b44f9c

convert globbed route detection from regexp to string method

43ef3d1

cllns changed the title ~~remove Mustermann and improve performance~~ Remove Mustermann leaf matcher to improve performance Apr 24, 2025

cllns reviewed Apr 24, 2025

View reviewed changes

kyleplump reviewed Apr 24, 2025

View reviewed changes

inline path splitting in Trie#find

4eedc9c

dcr8898 force-pushed the router-saga branch from 6b9ff6e to 4eedc9c Compare April 25, 2025 15:19

dcr8898 added 5 commits April 25, 2025 11:50

remove unnecessary local variable derived_paths in Router#add_optiona…

f1c9bda

…l_routes

update Node#put unit test to expect key without ':' prefix

8f3e8e9

update Node to transform param keys on the fly, instead of in Leaf

8de377b

do not process param_keys in Leaf

183639a

update Leaf#match unit test to remove leading ':' in test param_keys …

e96b123

…test setup

dcr8898 added 4 commits April 26, 2025 21:51

simplify Trie#find method

bc81af1

improved the simplification of Trie#find method :)

4286baf

use #slice instead of #[] in Trie#find

341bb81

use interpolation instead of << in Router#add_optional_routes

a61b405

timriley added this to Hanami 2.3 Sep 28, 2025

timriley moved this to Todo in Hanami 2.3 Oct 1, 2025

timriley mentioned this pull request Oct 9, 2025

speed up new router implementation #279

Merged

timriley removed this from Hanami 2.3 Oct 9, 2025

rkh mentioned this pull request Apr 18, 2026

Proposal for 4.0: Trie-based string matching and more sinatra/mustermann#154

Closed

31 tasks

Uh oh!

Conversation

dcr8898 commented Mar 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Router Improvement

Goals

Checklists

Current features

Features NOT implemented

Edge cases

Stretch goals

Starting Point

expand to see graphs

Uh oh!

dcr8898 commented Mar 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

expand to see graphs

Uh oh!

dcr8898 commented Mar 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Discussion

expand to see graphs

Uh oh!

dcr8898 commented Mar 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Way Point No. 1

Questions for the community

expand to see graphs

Uh oh!

dcr8898 commented Mar 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

expand to see graphs

Uh oh!

timriley commented Mar 31, 2025

Uh oh!

dcr8898 commented Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Takeaways

Further thoughts

Uh oh!

dcr8898 commented Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dcr8898 commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

expand to see graphs

Uh oh!

dcr8898 commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

expand to see graphs

Uh oh!

dcr8898 commented Apr 24, 2025

Improvements

expand to see graphs

Uh oh!

cllns left a comment

Choose a reason for hiding this comment

Uh oh!

cllns Apr 24, 2025

Choose a reason for hiding this comment

Uh oh!

kyleplump Apr 24, 2025

Choose a reason for hiding this comment

Uh oh!

dcr8898 Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

dcr8898 Apr 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dcr8898 Apr 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

kyleplump left a comment

Choose a reason for hiding this comment

Uh oh!

dcr8898 commented Mar 27, 2025 •

edited

Loading

dcr8898 commented Mar 27, 2025 •

edited

Loading

dcr8898 commented Mar 29, 2025 •

edited

Loading

dcr8898 commented Mar 29, 2025 •

edited

Loading

dcr8898 commented Mar 30, 2025 •

edited

Loading

dcr8898 commented Mar 31, 2025 •

edited

Loading

dcr8898 commented Mar 31, 2025 •

edited

Loading

dcr8898 commented Apr 16, 2025 •

edited

Loading

dcr8898 commented Apr 18, 2025 •

edited

Loading

dcr8898 Apr 25, 2025 •

edited

Loading

dcr8898 Apr 29, 2025 •

edited

Loading

dcr8898 commented Apr 25, 2025 •

edited

Loading

dcr8898 commented Apr 25, 2025 •

edited

Loading

dcr8898 commented Apr 27, 2025 •

edited

Loading

dcr8898 commented Apr 29, 2025 •

edited

Loading

dcr8898 commented Oct 11, 2025 •

edited

Loading

rkh commented Apr 18, 2026 •

edited

Loading