test: Run integration tests against v2 as well by nightscape · Pull Request #657 · nightscape/spark-excel

nightscape · 2022-10-16T08:28:16Z

No description provided.

nightscape · 2023-07-10T08:06:06Z

@CodiumAI-Agent

QodoAI-Agent · 2023-07-10T08:06:41Z

PR Analysis

🎯 Main theme: Running integration tests against v2
🔍 Description and title: No
📌 Type of PR: Tests
🧪 Relevant tests added: Yes
✨ Minimal and focused: Yes, the PR is focused on updating the test suite to run against v2 of the application.
🔒 Security concerns: No, the changes made in this PR are related to testing and do not introduce any new security concerns.

PR Feedback

💡 General PR suggestions: The PR lacks a description which is important for understanding the context and the reason for the changes. Please add a detailed description explaining the need for these changes and how they affect the overall functionality of the application.
🤖 Code suggestions:
- relevant file: IntegrationSuite.scala
  suggestion content: Consider using constants for repeated string values such as 'excel' and 'com.crealytics.spark.excel'. This will make it easier to manage and update these values in the future. [medium]
- relevant file: IntegrationSuite.scala
  suggestion content: The 'runTests' function has been modified to include an 'implementation' parameter. It would be beneficial to update the function documentation to reflect this change and explain the purpose and expected values of this new parameter. [medium]

How to use

Tag me in a comment '@CodiumAI-Agent' to ask for a new review after you update the PR.
You can also tag me and ask any question, for example '@CodiumAI-Agent is the PR ready for merge?'

Copilot

Pull request overview

This PR updates the Scala integration test suite to run the same set of tests against multiple Excel datasource implementations (intended: v1 vs v2), enabling comparative coverage across implementations.

Changes:

Refactors IntegrationSuite into an abstract base suite parameterized by a datasource implementation string.
Switches the core write/read helper from .excel(...) helpers to explicit .format(implementation) with options.
Adds two concrete ScalaTest suites (IntegrationSuiteV1, IntegrationSuiteV2) to run the same tests against different implementations.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-30T12:55:16Z

src/test/scala/dev/mauch/spark/excel/IntegrationSuite.scala

+      val writer = df.write
+        .format(implementation)
+        .option("dataAddress", s"'$sheetName'!A1")
+        .option("header", header)
+        .mode(saveMode)
      val configuredWriter =
        Map("dataAddress" -> dataAddress).foldLeft(writer) {
          case (wri, (key, Some(value))) => wri.option(key, value)
          case (wri, _) => wri
        }
      configuredWriter.save(theFileName)

-      val reader = spark.read.excel(dataAddress = s"'$sheetName'!A1", header = header)
+      val reader = spark.read.format(implementation).option("dataAddress", s"'$sheetName'!A1").option("header", header)
      val configuredReader = Map(


The suite is now parameterized by implementation, but there are still direct .excel(...) reads later in this file that hardcode the v1 provider (via the package-level implicit) and will bypass the selected implementation. That means IntegrationSuiteV2 won't consistently exercise the v2 datasource. Consider switching the remaining .excel(...) usages to spark.read.format(implementation) (and setting the same options), so both suites truly test the chosen implementation end-to-end.

Copilot · 2026-03-30T12:55:16Z

src/test/scala/dev/mauch/spark/excel/IntegrationSuite.scala

  runTests(maxRowsInMemory = Some(1), maxByteArraySize = Some(100000000))
 }
+
+class IntegrationSuiteV1 extends IntegrationSuite("com.crealytics.spark.excel")


IntegrationSuiteV1 uses format("com.crealytics.spark.excel"), but this repository doesn't appear to include or depend on the com.crealytics.spark.excel.DefaultSource provider. As written, the V1 suite will fail at runtime when Spark tries to resolve that datasource. Either add the crealytics spark-excel dependency to the test classpath, or change the v1 implementation string to this project’s v1 provider (e.g., "dev.mauch.spark.excel") if the intent is to test the built-in v1 datasource.

Suggested change

class IntegrationSuiteV1 extends IntegrationSuite("com.crealytics.spark.excel")

class IntegrationSuiteV1 extends IntegrationSuite("dev.mauch.spark.excel")

- Fix V1 format name: com.crealytics.spark.excel → dev.mauch.spark.excel - Replace all .excel() convenience method calls with .format(implementation) to properly test both V1 and V2 code paths - Set ignoreNullable=true in assertDataFrameApproximateEquals since V2 returns all fields as nullable - Avoid .rdd conversion in assertEqualAfterInferringTypes which causes encoding errors on Spark 4 - Skip append-to-existing-file tests for V2 since FileDataSourceV2 writes to directories instead of single files Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Add missing "header" option to "reads files with missing cells" test (V2 requires it explicitly) - Treat null and empty string as equivalent in row comparison since V2 reads empty cells as null while V1 reads them as empty string Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

nightscape mentioned this pull request Nov 7, 2022

V2 streaming read (alternative approach) #653

Merged

nightscape force-pushed the v2-integration-tests branch from 00e4ab3 to 5999b76 Compare July 10, 2023 08:06

nightscape force-pushed the main branch from 3d855e8 to e911d0c Compare August 1, 2023 22:14

nightscape force-pushed the main branch from fa067f7 to af8172a Compare November 14, 2023 09:56

nightscape force-pushed the v2-integration-tests branch from 5999b76 to 5929991 Compare November 15, 2023 22:28

nightscape force-pushed the v2-integration-tests branch 2 times, most recently from ea6064d to 3dc5e63 Compare March 6, 2024 22:48

nightscape force-pushed the main branch 3 times, most recently from 6b58ec4 to 6866cb1 Compare December 18, 2024 12:49

test: Run integration tests against v2 as well

d6e5746

nightscape force-pushed the v2-integration-tests branch from 3dc5e63 to d6e5746 Compare March 30, 2026 12:51

Copilot AI review requested due to automatic review settings March 30, 2026 12:51

Copilot started reviewing on behalf of nightscape March 30, 2026 12:52 View session

Copilot AI reviewed Mar 30, 2026

View reviewed changes

nightscape and others added 2 commits March 30, 2026 22:37

nightscape merged commit 87fc409 into main Mar 31, 2026
35 checks passed

nightscape deleted the v2-integration-tests branch March 31, 2026 12:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test: Run integration tests against v2 as well#657

test: Run integration tests against v2 as well#657
nightscape merged 3 commits intomainfrom
v2-integration-tests

nightscape commented Oct 16, 2022

Uh oh!

nightscape commented Jul 10, 2023

Uh oh!

QodoAI-Agent commented Jul 10, 2023

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 30, 2026

Uh oh!

Copilot AI Mar 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	class IntegrationSuiteV1 extends IntegrationSuite("com.crealytics.spark.excel")
	class IntegrationSuiteV1 extends IntegrationSuite("dev.mauch.spark.excel")

Conversation

nightscape commented Oct 16, 2022

Uh oh!

nightscape commented Jul 10, 2023

Uh oh!

QodoAI-Agent commented Jul 10, 2023

PR Analysis

PR Feedback

How to use

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants