[SPARK-54971] Add WITH SCHEMA EVOLUTION syntax for SQL INSERT #53732
Conversation
JIRA issue: SPARK-54971 (Improvement). (Comment automatically generated by GitHub Actions.)
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala

I was thinking it could be interesting to have Spark optionally call `alterTable` when the V2 data source has `TableCapability.AUTOMATIC_SCHEMA_EVOLUTION` (which we introduced for the MERGE INTO schema-evolution implementation in DSv2). That would ease the burden on the data sources, but it can be a future enhancement.
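The capability check suggested here could look roughly like the following sketch. The types and names (`V2Table`, `AutomaticSchemaEvolution`, `sparkShouldAlterTable`) are simplified stand-ins for the DSv2 interfaces, not Spark's actual API:

```scala
// Simplified stand-ins for DSv2 concepts; not Spark's real interfaces.
sealed trait TableCapability
case object AutomaticSchemaEvolution extends TableCapability

final case class V2Table(name: String, capabilities: Set[TableCapability])

// Hypothetical planner-side check: Spark would only call alterTable on
// behalf of the data source when the user asked for schema evolution AND
// the source advertises automatic schema-evolution support.
def sparkShouldAlterTable(table: V2Table, withSchemaEvolution: Boolean): Boolean =
  withSchemaEvolution && table.capabilities.contains(AutomaticSchemaEvolution)
```

Sources without the capability would keep the current behavior and remain responsible for reacting to the `mergeSchema` option themselves.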
Resolved review threads:
- sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala
- sql/core/src/test/scala/org/apache/spark/sql/execution/command/PlanResolutionSuite.scala
On the new test:

```scala
test("SPARK-54971: INSERT WITH SCHEMA EVOLUTION is currently unsupported") {
```

To cover the first case:

```diff
 case InsertIntoStatement(l @ LogicalRelationWithTable(_: InsertableRelation, _),
-    parts, _, query, overwrite, false, _) if parts.isEmpty =>
+    parts, _, query, overwrite, false, _, withSchemaEvolution)
+    if parts.isEmpty && !withSchemaEvolution =>
```
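The updated guard can be illustrated with a reduced stand-in for `InsertIntoStatement` that keeps only the fields relevant to this case (a sketch, not the real plan node):

```scala
// Reduced stand-in for the analyzer's first case: only the fields used in
// the guard (partition spec and the new withSchemaEvolution flag).
final case class InsertStmt(partitionSpec: Map[String, Option[String]],
                            withSchemaEvolution: Boolean)

// Mirrors the updated guard: the InsertableRelation fallback only fires
// when there are no partition specs and no WITH SCHEMA EVOLUTION clause.
def firstCaseApplies(i: InsertStmt): Boolean = i match {
  case InsertStmt(parts, withSchemaEvolution)
      if parts.isEmpty && !withSchemaEvolution => true
  case _ => false
}
```

With `withSchemaEvolution = true` the case no longer matches, so the statement falls through to the path that raises the "currently unsupported" error the test asserts.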
On the new test:

```scala
testPartitionedTable("SPARK-54971: INSERT WITH SCHEMA EVOLUTION is currently unsupported") {
```

To cover the 2nd case:

```scala
case i @ InsertIntoStatement(l @ LogicalRelationWithTable(t: HadoopFsRelation, table),
    parts, _, query, overwrite, _, _, withSchemaEvolution)
    if query.resolved && !withSchemaEvolution =>
```
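Likewise, the `HadoopFsRelation` case's guard can be sketched with a minimal stand-in (illustrative only; the real node carries the full relation and query):

```scala
// Reduced stand-in for the second case: only the guard-relevant fields
// (whether the query is resolved, and the withSchemaEvolution flag).
final case class FsInsertStmt(queryResolved: Boolean, withSchemaEvolution: Boolean)

// Mirrors the updated guard: the HadoopFsRelation rewrite only fires for a
// resolved query without a WITH SCHEMA EVOLUTION clause.
def secondCaseApplies(i: FsInsertStmt): Boolean = i match {
  case FsInsertStmt(resolved, withSchemaEvolution)
      if resolved && !withSchemaEvolution => true
  case _ => false
}
```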
What changes were proposed in this pull request?
Similar to the MERGE WITH SCHEMA EVOLUTION PR, this PR introduces the syntax `WITH SCHEMA EVOLUTION` for the SQL `INSERT` command. Since the syntax is not yet fully implemented for any table format, users receive an exception if they try to use it.

When `WITH SCHEMA EVOLUTION` is specified, schema-evolution-related features must be turned on for that single statement, and only for that statement.

In this PR, Spark is only responsible for recognizing the presence or absence of the `WITH SCHEMA EVOLUTION` syntax; that information is passed down from the Analyzer. When `WITH SCHEMA EVOLUTION` is detected, Spark sets the `mergeSchema` write option to `true` on the respective V2 insert command nodes. Data sources must respect the syntax and react appropriately: turn on the features categorized as "schema evolution" when the `WITH SCHEMA EVOLUTION` syntax is present.

Why are the changes needed?
This intuitive SQL syntax lets users request automatic schema evolution for a specific `INSERT` operation. Some users would like schema evolution for DML commands such as `MERGE` and `INSERT`, where the schemas of the table and the query relation can mismatch.

Does this PR introduce any user-facing change?

Yes, it introduces the `WITH SCHEMA EVOLUTION` syntax for SQL `INSERT`.

How was this patch tested?
Added UTs.
Was this patch authored or co-authored using generative AI tooling?
No.
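As a rough illustration of the `mergeSchema` plumbing described in the PR description above: a minimal sketch of deriving write options from the parsed flag (the helper name is hypothetical, not Spark's API):

```scala
// Hypothetical helper: compute the write options for a V2 insert command
// from the parsed withSchemaEvolution flag. Per the PR description, Spark
// sets mergeSchema=true when WITH SCHEMA EVOLUTION is detected, and the
// data source is expected to react to that option.
def insertWriteOptions(base: Map[String, String],
                       withSchemaEvolution: Boolean): Map[String, String] =
  if (withSchemaEvolution) base + ("mergeSchema" -> "true") else base
```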