Fix struct type metadata preservation in query results and continuations #3753

arnaud-lacurie · 2025-11-14T14:08:13Z

This commit implements a solution for GitHub issue #3743 where struct type
metadata (type names like "STRUCT_1", "STRUCT_2") was getting lost during
query execution, especially in continuations.

Problem:
When executing queries that return struct types, ResultSetMetaData would return
UUID-based names (like "id...") instead of proper struct type names. This affected:

- Star expansion queries (SELECT * FROM table)
- Nested star queries (SELECT (*) FROM table)
- Direct struct projections (SELECT struct_column FROM table)
- Query continuations (EXECUTE CONTINUATION)

Root Cause:

1. Semantic analysis produces correct DataTypes with struct names ("STRUCT_1", etc.)
2. Cascades planner Type.Record loses struct names during optimization (becomes null)
3. executePhysicalPlan() previously relied only on planner types → UUID generation
4. Continuations had no semantic type info → always generated UUIDs

Solution - Hybrid Approach:

Merge semantic type structure with RecordMetaData descriptors:

- Type structure from semantic DataTypes (preserves "STRUCT_1", "STRUCT_2")
- Additional enrichment from RecordMetaData descriptors for nested types
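
The root cause and the direction of the fix can be illustrated with a toy model. This is a minimal sketch, not the Record Layer code: `StructNameSketch`, `PlannerRecord`, `SemanticStruct`, and both helper methods are hypothetical names, and the generated fallback name only imitates the "id..." pattern mentioned above.

```java
import java.util.List;
import java.util.Optional;
import java.util.UUID;

// Toy model only: these types are hypothetical stand-ins, not the Record Layer classes.
final class StructNameSketch {
    // Stand-in for a planner record type whose name may have been dropped (null).
    record PlannerRecord(String name, List<String> fieldNames) { }

    // Stand-in for a semantic struct type that still carries its declared name.
    record SemanticStruct(String name, List<String> fieldNames) { }

    // Behaviour described under "Root Cause": only the planner type is consulted,
    // so a missing name falls back to a generated identifier.
    static String nameFromPlannerOnly(PlannerRecord planned) {
        return planned.name() != null
                ? planned.name()
                : "id" + UUID.randomUUID().toString().replace("-", "");
    }

    // Direction of the fix: prefer the semantic name whenever one is available.
    static String nameWithSemanticInfo(PlannerRecord planned, Optional<SemanticStruct> semantic) {
        return semantic.map(SemanticStruct::name)
                .orElseGet(() -> nameFromPlannerOnly(planned));
    }

    public static void main(String[] args) {
        PlannerRecord planned = new PlannerRecord(null, List.of("A", "B"));
        SemanticStruct semantic = new SemanticStruct("STRUCT_1", List.of("A", "B"));
        // Generated name, different on every run.
        System.out.println(nameFromPlannerOnly(planned));
        // prints STRUCT_1
        System.out.println(nameWithSemanticInfo(planned, Optional.of(semantic)));
    }
}
```

Passing `Optional.empty()` models the continuation case in item 4, where no semantic information is available and the generated name is the only option.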

This commit implements a solution for GitHub issue FoundationDB#3743 where struct type metadata (type names like "STRUCT_1", "STRUCT_2") was getting lost during query execution, especially in continuations.

**Problem:**
When executing queries that return struct types, ResultSetMetaData would return UUID-based names (like "id...") instead of proper struct type names. This affected:

- Star expansion queries (SELECT * FROM table)
- Nested star queries (SELECT (*) FROM table)
- Direct struct projections (SELECT struct_column FROM table)
- Query continuations (EXECUTE CONTINUATION)

**Root Cause:**

1. Semantic analysis produces correct DataTypes with struct names ("STRUCT_1", etc.)
2. Cascades planner Type.Record loses struct names during optimization (becomes null)
3. executePhysicalPlan() previously relied only on planner types → UUID generation
4. Continuations had no semantic type info → always generated UUIDs

**Solution - Hybrid Approach:**
Merge semantic type structure with planner field names:

- Field names from planner Type.Record (handles aliases, star expansion, "_0" naming)
- Type structure from semantic DataTypes (preserves "STRUCT_1", "STRUCT_2")
- Additional enrichment from RecordMetaData descriptors for nested types

Refactored canReadStructTypeName helper to use integer parameters for controlling base query and continuation reruns. Added tests to cover the withExecutionContext method in ContinuedPhysicalQueryPlan, addressing the Teamscale test gap.

hatyo · 2025-11-28T11:01:30Z

...src/main/java/com/apple/foundationdb/relational/recordlayer/query/visitors/QueryVisitor.java

+     * @return List of DataTypes preserving struct type names (field names are placeholders)
+     */
+    @Nonnull
+    private static List<DataType> captureSemanticTypeStructure(


Expressions is a comprehension that captures operations on an ordered list of Expression items. Please move this method into it; that enables, for example, maintaining and caching the underlying data types. Something like:

Iterable<DataType> getDataTypes() { return .... }

Or, you can wrap the data types of the individual Expression item(s) of Expressions with a single StructType that is structurally equivalent to the output of the RelationalExpression's (and corresponding physical RecordQuery* operator's) RecordType.

I think this will streamline things nicely down the line, especially when you later use it to parse the result set metadata (please see my comment there).

Ohh, this would be much nicer there, thanks for pointing this out!
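
To make the suggestion above concrete, here is a minimal sketch of a list wrapper that derives and caches the data types of its items. All names (`ExpressionsSketch`, the nested `Expression`, `DataType`, and `StructType` records, and `toStructType`) are hypothetical stand-ins; the real `Expressions` class has a different shape.

```java
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical stand-in for an ordered collection of expressions.
final class ExpressionsSketch {
    record DataType(String typeName) { }
    record Expression(String name, DataType dataType) { }
    record StructType(List<DataType> fieldTypes) { }

    private final List<Expression> items;
    // Computed on first use and reused afterwards.
    private List<DataType> cachedDataTypes;

    ExpressionsSketch(List<Expression> items) {
        this.items = List.copyOf(items);
    }

    private List<DataType> dataTypes() {
        if (cachedDataTypes == null) {
            cachedDataTypes = items.stream()
                    .map(Expression::dataType)
                    .collect(Collectors.toUnmodifiableList());
        }
        return cachedDataTypes;
    }

    // First option from the comment: expose the per-item data types in order.
    Iterable<DataType> getDataTypes() {
        return dataTypes();
    }

    // Second option: wrap the same types in a single struct type.
    StructType toStructType() {
        return new StructType(dataTypes());
    }
}
```

Because the wrapper owns the item list, the type list is computed once and callers no longer need a separate capture helper.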

hatyo · 2025-11-28T11:28:56Z

...al-core/src/main/java/com/apple/foundationdb/relational/recordlayer/query/PlanGenerator.java

+        final List<DataType> semanticFieldTypes;
+        if (resultType instanceof Type.Record) {
+            final Type.Record recordType = (Type.Record) resultType;
+            semanticFieldTypes = recordType.getFields().stream()


If the underlying Cascades type doesn’t preserve internal struct names, then the semantic field types based on them will also lose this information, right?

If this is the case, we might want to include the semantic information in the continuation as well, so that their (re)construction is completely independent of the underlying operator typing system.

Having said that, it seems like a good idea to associate the semantic information with the continuation itself or, put differently, to integrate it into the state of the physical plan. Besides being useful for state management, this could help streamline query processing in a world where separate but cooperative runtimes exist for plan generation, optimization, and execution.
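
As a rough illustration of carrying semantic information inside the continuation, the sketch below serializes a cursor position together with struct type names and reads both back on resume. Everything here is an assumption made for illustration: `ContinuationSketch`, its `State` record, and the hand-rolled byte layout are hypothetical, and the actual continuation format in the project is not shown in this thread.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

final class ContinuationSketch {
    // Hypothetical payload: an opaque cursor position plus semantic struct type names.
    record State(long cursorPosition, List<String> structTypeNames) { }

    static byte[] serialize(State state) {
        try (ByteArrayOutputStream bytes = new ByteArrayOutputStream();
             DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeLong(state.cursorPosition());
            out.writeInt(state.structTypeNames().size());
            for (String name : state.structTypeNames()) {
                out.writeUTF(name);
            }
            out.flush();
            return bytes.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // On resume, the names come from the continuation, not from planner types.
    static State deserialize(byte[] continuation) {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(continuation))) {
            long position = in.readLong();
            int count = in.readInt();
            List<String> names = new ArrayList<>(count);
            for (int i = 0; i < count; i++) {
                names.add(in.readUTF());
            }
            return new State(position, List.copyOf(names));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

With this layout a resumed query can rebuild its result set metadata without consulting the operator typing system at all, which is the independence the comment asks for.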

hatyo · 2025-11-28T11:51:10Z

...c/main/java/com/apple/foundationdb/relational/recordlayer/metadata/TypeMetadataEnricher.java

+     * @throws RelationalException if type structures don't match
+     */
+    @Nonnull
+    public static DataType.StructType mergeSemanticTypesWithPlannerNames(


Why is this needed? It seems to me that what we need is to perform the following assertion:

Assert.thatUnchecked(plannerType.isStructurallyEquivalentTo(DataTypeUtils.toRecordLayerType(semanticType)))

This is no longer needed, and was replaced by something that takes semantic information and combines it with RecordMetaData information for nested structs.

hatyo · 2025-11-28T11:59:40Z

...tional-core/src/main/java/com/apple/foundationdb/relational/recordlayer/query/QueryPlan.java

            final var currentPlanHashMode = OptionsUtils.getCurrentPlanHashMode(options);
-            final var dataType = (DataType.StructType) DataTypeUtils.toRelationalType(type);
+
+            final DataType.StructType resultDataType = TypeMetadataEnricher.mergeSemanticTypesWithPlannerNames(type, semanticFieldTypes, fdbRecordStore.getRecordMetaData());


The semanticFieldTypes (or more precisely, the StructType made of these) must be structurally equivalent to the resulting Type.Record of the top-level physical operator of the plan. Therefore, I don't think you need to do any merging, perhaps we just want to validate the structural equivalence, and then only use the StructType created with the semanticFieldTypes to construct the result set metadata.

Should we need the underlying Type.Record for some reason, if I am not mistaken, DataTypeUtils.toRecordType can be used to construct the Type.Record with the nested field names correctly. (if this is not the case, we can definitely fix it).

Done, this is no longer using Type.Record, but is now coming from semanticStructType
Good suggestion 👍
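
The validation discussed in this thread can be sketched as a recursive shape comparison. This is a toy version under stated assumptions: `StructuralEquivalenceSketch` and `TypeNode` are hypothetical, and the sketch deliberately ignores struct names and compares only field order and primitive kinds, which may differ from what the project treats as structural equivalence.

```java
import java.util.List;

final class StructuralEquivalenceSketch {
    // Hypothetical type tree: primitiveKind is set for primitives, null for structs.
    record TypeNode(String structName, String primitiveKind, List<TypeNode> fields) {
        static TypeNode ofPrimitive(String kind) {
            return new TypeNode(null, kind, List.of());
        }

        static TypeNode ofStruct(String name, TypeNode... fields) {
            return new TypeNode(name, null, List.of(fields));
        }

        boolean isStruct() {
            return primitiveKind == null;
        }
    }

    // Compares shape only: struct names are intentionally not part of the check.
    static boolean structurallyEquivalent(TypeNode left, TypeNode right) {
        if (left.isStruct() != right.isStruct()) {
            return false;
        }
        if (!left.isStruct()) {
            return left.primitiveKind().equals(right.primitiveKind());
        }
        if (left.fields().size() != right.fields().size()) {
            return false;
        }
        for (int i = 0; i < left.fields().size(); i++) {
            if (!structurallyEquivalent(left.fields().get(i), right.fields().get(i))) {
                return false;
            }
        }
        return true;
    }
}
```

A check of this kind visits every node of both trees, which is consistent with the concern that it is simple to write but potentially expensive on deeply nested types.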

Move field name capture to semantic analysis phase and simplify type merging to only enrich nested structs from RecordMetaData.

hatyo · 2025-12-01T10:08:39Z

...c/main/java/com/apple/foundationdb/relational/recordlayer/metadata/TypeMetadataEnricher.java

+ * type metadata for result sets.
+ */
+@API(API.Status.EXPERIMENTAL)
+public final class TypeMetadataEnricher {


Can we remove this class? The DataType.StructType already captures all of the named-fields/named-record types correctly, so when a physical plan is received, the Type.Record object of the top-level operator does not provide any extra information that is needed. And since SQL dictates that the final result given to the user must match the constructed result set as imperatively defined in the query, both objects must align structurally (the one coming from the physical plan operator, and the one that is the result of the plan generator and semantic analysis).

Having said that, I am not sure why we need this metadata enrichment, and as I mentioned in a previous comment maybe (although not necessary) we can add an assertion that verifies the structural equality of both structures, which is very simple to do (but perhaps expensive to compute).

This class does not combine DataType.StructType and Type.Record anymore. But it combines DataType.StructType with RecordMetaData information. DataType.StructType does have access to the structural shape of nested records, but it does not have access to the underlying nested type struct names, so I don't think we can remove this unfortunately.
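
The remaining enrichment step described in the reply above can be pictured as filling in missing nested struct names from a second tree of the same shape. This is a hypothetical sketch: `NestedNameEnrichmentSketch`, `Struct`, `Field`, and `enrich` are illustrative names, and the `descriptor` argument merely stands in for whatever RecordMetaData exposes; it is not the `TypeMetadataEnricher` implementation.

```java
import java.util.ArrayList;
import java.util.List;

final class NestedNameEnrichmentSketch {
    // nested is null for primitive fields.
    record Field(String name, Struct nested) { }

    // typeName may be null when the semantic type did not retain it.
    record Struct(String typeName, List<Field> fields) { }

    // Keeps the semantic shape and names, and only fills in struct type names
    // that are missing, taking them from the descriptor-like tree.
    static Struct enrich(Struct semantic, Struct descriptor) {
        if (semantic.fields().size() != descriptor.fields().size()) {
            throw new IllegalArgumentException("type structures do not match");
        }
        List<Field> enriched = new ArrayList<>();
        for (int i = 0; i < semantic.fields().size(); i++) {
            Field semanticField = semantic.fields().get(i);
            Field descriptorField = descriptor.fields().get(i);
            Struct nested = semanticField.nested();
            if (nested != null && descriptorField.nested() != null) {
                nested = enrich(nested, descriptorField.nested());
            }
            enriched.add(new Field(semanticField.name(), nested));
        }
        String typeName = semantic.typeName() != null ? semantic.typeName() : descriptor.typeName();
        return new Struct(typeName, List.copyOf(enriched));
    }
}
```

Names already present on the semantic side always win in this sketch, so the descriptor is consulted only where the semantic type has a gap.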

arnaud-lacurie added 3 commits November 13, 2025 15:57

Add continuations to struct type name tests

83078c9

Add more tests

64d5a83

arnaud-lacurie added the enhancement label (New feature or request) Nov 14, 2025

Add some tests around named structs

7a808ab

arnaud-lacurie mentioned this pull request Nov 16, 2025

Add validation for dynamic struct type compatibility #3755

Draft

arnaud-lacurie added 2 commits November 17, 2025 09:26

Fix style and remove unnecessary sorting

b12aea9

Add additional tests to cover teamscale coverage gap

21eea73

arnaud-lacurie mentioned this pull request Nov 17, 2025

Cover more Struct MetaData name tests #3750

Closed

arnaud-lacurie marked this pull request as draft November 17, 2025 17:32

arnaud-lacurie added 2 commits November 17, 2025 17:41

Merge branch 'main' into struct_type_metadata_fix

f3307bf

arnaud-lacurie marked this pull request as ready for review November 17, 2025 18:33

arnaud-lacurie added 8 commits November 17, 2025 23:51

Extract type enrichment logic to TypeMetadataEnricher utility class

9435474

Change TypeMetadataEnricher class to final

a694dfc

Remove deprecated WITH CONTINUATION syntax now that `EXECUTE CONTINUATION` syntax is self-contained.

5ce3daf

Merge branch 'remove_with_continuation' into struct_type_metadata_fix

cff2f13

Merge branch 'struct_type_metadata_fix' of github.com:arnaud-lacurie/fdb-record-layer into struct_type_metadata_fix

6a66aaa

Merge remote-tracking branch 'upstream/main' into struct_type_metadata_fix

5760a9d

Address teamscale issues

42bbe3c

Minor fix

9d1ba5e

arnaud-lacurie requested a review from hatyo November 28, 2025 09:48

hatyo requested changes Nov 28, 2025

arnaud-lacurie added 5 commits November 28, 2025 17:45

Move getDataTypes into Expressions

6af1e5c

Refactor semantic type capture to use StructType

76e6542

Move field name capture to semantic analysis phase and simplify type merging to only enrich nested structs from RecordMetaData.

Remove unused import

063bc47

Remove unnecessary file

4c122af

Remove unused method

93b5f87

arnaud-lacurie requested a review from hatyo November 28, 2025 20:44

hatyo reviewed Dec 1, 2025

Merge remote-tracking branch 'upstream/main' into struct_type_metadata_fix

1386af8

alecgrieser mentioned this pull request Dec 1, 2025

Remove recursive type name preservation when parsing meta-data objects #3788

Draft
