Query: remove nav expansion as a way to tackle the pending selector problem #32957

maumar · 2024-01-30T00:20:06Z

See #20291 for additional context.

Basically the idea is to retire nav expansion as a way of dealing with navigations in our query pipeline. Instead, the work would be done in translation phase. This is good because:

nav expansion is located in core without means of extending it for provider specific functionality (see the hack we added for temporal tables - NavigationExpansionExtensibilityHelper),
during translation we have more information about the query that's being processed (e.g. we know which functions are translatable and which should be client-evaled),
some of the processing can potentially be streamlined, e.g. we process includes right away (add JOIN and generate final client projection), rather than converting them to LeftJoin and then processing that again.

On the flipside, translation phase is already very complex, and this will add even more complexity, UNLESS some things can be simplified.
Also, nav expansion currently performs other tasks, e.g. it reasons about primitive collections - this is because it's the only place in pre-processing where we have access to the model. So all preprocessing needing model information is performed there.

We either need to keep some pre-processing based on model where all those tasks can be performed (and maybe make it extensible for providers?) or move some of that to translation. @roji


roji · 2024-01-30T07:16:21Z

Just to note that a main reason to solve the pending selector problem is performance: because we defer Select(), we end up duplicating the same SQL (e.g. subqueries) across multiple operators rather than having it just once in the query. It may be possible to keep nav expansion in its current form but to stop deferring selectors, but we already know we want to try to get rid of nav expansion as a separate phase anyway (for the other reasons listed above).

> nav expansion is located in core without means of extending it for provider specific functionality (see the hack we added for temporal tables - NavigationExpansionExtensibilityHelper),

Examples of these include the inability to properly process ExecuteUpdate/Delete since they're relational (see #32493). Similarly, provider-specific LINQ operators such as DistinctBy (directly translatable on PostgreSQL) cannot be handled.

Finally, it's worth noting that nav expansion was written in a perf-suboptimal way: in order to avoid all visitor state, it visits the tree multiple times, first wrapping everything in "state nodes", doing its job, and then unwrapping them back again to get a normal LINQ expression tree. This adversely affects query compilation performance, which we're starting to pay a bit more attention to.

> On the flipside, translation phase is already very complex, and this will add even more complexity, UNLESS some things can be simplified.

I'm personally not really worried about this - I think it's a question of factoring the logic correctly into the translation phase. In fact, I believe that the splitting up of navigation handling into the separate pre-processing phase adds much more complexity than it saves, and doing it in the right way inside translation could potentially make it much simpler. Breaking a thing into two passes really isn't necessarily a great way to make that thing simpler.

> We either need to keep some pre-processing based on model where all those tasks can be performed (and maybe make it extensible for providers?) or move some of that to translation.

IMHO we should not have any sort of model awareness in preprocessing - preprocessing really should be concerned with basic normalization and operations working only on the LINQ expression tree shape; all model knowledge should happen at the translation phase only. This is because of provider extensibility, and also because tracking which node corresponds to which model thing (e.g. binding properties) is very non-trivial, and we shouldn't need to do it twice (both in preprocessing and translation).

So at least in my ideal mental model, any processing that needs to be aware of the model should move to translation, just like nav expansion - compared to the complexity of nav expansion, I don't necessarily foresee a huge amount of complexity there.

One motivating factor for having nav expansion in pre-processing was that providers get this logic for free rather than needing to implement it (e.g. conversion of enumerable LINQ operators to queryable, query filter integration...). We should try to move this universal logic to core via other means during translation (e.g. do enumerable->queryable in QueryableRelationalExpressionVisitor), rather than as a preprocessing pass as we currently do.

maumar · 2024-05-08T07:43:12Z

When this is done, see if #33621 is also solved

roji · 2024-06-13T14:36:28Z

Another problematic query tree mangling that nav expansion performs...

Am trying to make the following test pass; the important part is ElementAt(0) over a list of Orders (structural types):

public virtual Task ElementAt_over_owned_collection(bool async)  
    => AssertQuery(  
        async,
        ss => ss.Set<OwnedPerson>().Where(p => p.Orders.ElementAt(0).Id == -11));

p.Orders itself translates well, to a ShapedQueryExpression that wraps what I call a "bare array"; at this point of the translation, no operation (e.g. Where) has been applied to the array, which means that we can do various specialized translations. For example, the above should translate to WHERE p.Orders[0].Id (the specialized translation is the array indexer); but if before the ElementAt() we had a Where(), we'd have to have a full scalar subquery instead (WHERE (SELECT t FROM t IN c.Orders WHERE ...) - the precise SQL is more complicated).

But of course, nav expansion helpfully decides to move the .Id before the ElementAt() (or the ElementAt() after the Id?), so the fragment coming out of nav expansion is: (Property(o, "Orders").AsQueryable().Select(o0 => o0.Id).ElementAt(0) == -11).

Since Select() has been applied before ElementAt(), we no longer have a bare array.
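
To make the reordering concrete, here is a minimal LINQ to Objects sketch. It is not EF Core code: the `orders` array, its values, and the `Details` member are made up for illustration. It only shows that the two shapes return the same value in memory, which is presumably why the rewrite is considered safe, even though it hides the bare-array shape from the translator.

```csharp
using System;
using System.Linq;

// Hypothetical stand-in for an owned Orders collection; not EF Core types.
var orders = new[]
{
    new { Id = -11, Details = "first" },
    new { Id = -10, Details = "second" },
};

// Shape as written in the test: index into the collection, then read the member.
var indexThenMember = orders.ElementAt(0).Id;

// Shape after nav expansion: project the member first, then index.
var memberThenIndex = orders.Select(o => o.Id).ElementAt(0);

Console.WriteLine(indexThenMember == memberThenIndex); // prints True
```

The results match, but only the first shape keeps ElementAt() directly on top of the untouched collection, which is the precondition for the array-indexer translation described above.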

roji · 2024-06-15T22:33:04Z

Another one... Given a simple SelectMany without a result selector:

public virtual Task Column_collection_SelectMany(bool async)
    => AssertQuery(
        async,
        ss => ss.Set<PrimitiveCollectionsEntity>().SelectMany(c => c.Ints));

Nav expansion produces the following tree:

[Microsoft.EntityFrameworkCore.Query.EntityQueryRootExpression]
    .SelectMany(p => Property(p, "Ints").AsQueryable(), (p, c) => new TransparentIdentifier`2(Outer = p, Inner = c))
    .Select(ti => ti.Inner)

In other words, a result selector is added, projecting out to a TransparentIdentifier, only to then compose a Select() which then gets rid of the TransparentIdentifier. It's a transformation that produces a mathematically equivalent but more complex tree.
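
As a rough illustration of that equivalence, the following LINQ to Objects sketch compares the two shapes in memory. It makes several assumptions: the `entities` data is invented, and an anonymous type with `Outer` and `Inner` members stands in for TransparentIdentifier; none of this is EF Core code.

```csharp
using System;
using System.Linq;

// Hypothetical in-memory data standing in for PrimitiveCollectionsEntity rows.
var entities = new[]
{
    new { Id = 1, Ints = new[] { 1, 2 } },
    new { Id = 2, Ints = new[] { 3 } },
};

// Shape as written: SelectMany without a result selector.
var direct = entities.SelectMany(e => e.Ints).ToList();

// Shape after nav expansion: pair outer and inner (the anonymous type plays
// the role of TransparentIdentifier), then project the inner back out.
var expanded = entities
    .SelectMany(e => e.Ints, (e, i) => new { Outer = e, Inner = i })
    .Select(ti => ti.Inner)
    .ToList();

Console.WriteLine(direct.SequenceEqual(expanded)); // prints True
```

Both shapes yield the same sequence; the second simply carries an extra pairing step and an extra projection that later phases have to see through.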

The above can be pattern-matched and simplified back, but of course there's further interference. For example:

public virtual Task SelectMany_without_result_selector_over_owned_collection(bool async)
    => AssertQuery(
        async,
        ss => ss.Set<OwnedPerson>().SelectMany(p => p.Orders).Where(o => o.Id > -30));

Gets transformed into:

[Microsoft.EntityFrameworkCore.Query.EntityQueryRootExpression]
    .SelectMany(o => Property(o, "Orders").AsQueryable(), (o, c) => new TransparentIdentifier`2(Outer = o, Inner = c))
    .Where(ti => (ti.Inner.Id > -30))
    .Select(ti => [Microsoft.EntityFrameworkCore.Query.IncludeExpression])

i.e. the Select() is moved forward, making it impossible to simplify back.

maumar added needs-design area-query labels Jan 30, 2024

ajcvickers added this to the 9.0.0 milestone Feb 1, 2024

ajcvickers assigned maumar Feb 1, 2024

ajcvickers added the type-enhancement label Feb 1, 2024

This was referenced Feb 27, 2024

[9.0] Query pipeline architecture improvements #31327

Closed

Translate Select() with index using ROW_NUMBER #24218

Open

roji mentioned this issue Apr 1, 2024

query: reuse complex projection in orderby etc rather than defining it again #16038

Open

maumar mentioned this issue Apr 7, 2024

EF Core 8 Query Translation Bug: Array operations with union #33258

Open

viniciuschiele mentioned this issue May 25, 2024

FirstOrDefault in subquery produces a wrong SQL AutoMapper/AutoMapper#4454

Closed

roji mentioned this issue Jun 12, 2024

Cosmos: handle fake LeftJoins in query tree #33969

Closed

maumar modified the milestones: 9.0.0, Backlog Aug 6, 2024

maumar added the consider-for-next-release label Aug 6, 2024

roji mentioned this issue Aug 24, 2024

[10.0] Query pipeline architecture improvements #34524

Open

22 tasks
