feat: [ANSI] Ansi sql error messages #3580

Draft
parthchandra wants to merge 11 commits into apache:main from parthchandra:sql-query-errors

Conversation

@parthchandra
Contributor

Which issue does this PR close?

Closes parts of #551
Closes #2215
Closes #3375

Rationale for this change

With Spark 4.0 (and Spark 3.5 with ANSI mode enabled), Spark produces ANSI-compliant error messages that carry an error code and, in many cases, include the original SQL query. When Comet encounters errors in native code, it throws a SparkException or CometNativeException that does not conform to this error-reporting standard.

What changes are included in this PR?

This PR introduces a framework for reporting ANSI-compliant error messages from native code.

Summary of error propagation:

  1. Spark-side query context serialization: For every serialized expression and aggregate expression, a unique expr_id is generated. If the expression's origin carries a QueryContext (SQL text, line, column, object name), it is extracted and attached to the protobuf. This is done for both Expr and AggExpr.
  2. Native planner (planner.rs): The PhysicalPlanner now holds a QueryContextMap. When planning Expr and AggExpr nodes, if expr_id and query_context are present, the context is registered in the map. When creating physical expressions for Cast, CheckOverflow, ListExtract, SumDecimal, AvgDecimal, and arithmetic binary expressions, the relevant QueryContext is looked up and passed to the constructor.
  3. Native errors: The SparkError enum is extended with new variants for all the Spark ANSI errors (from https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryExecutionErrors.scala). A new SparkErrorWithContext type wraps a SparkError with a QueryContext. All affected expression implementations look up the context and produce a SparkErrorWithContext when available.
    The SparkError implementation also gains new to_json() and exception_class() methods for JNI serialization.
  4. JNI boundary (errors.rs -> CometQueryExecutionException): The throw_exception function now checks for SparkErrorWithContext or SparkError and throws CometQueryExecutionException, which carries the entire SparkErrorWithContext as a JSON message. On the Scala side, CometExecIterator catches this exception and calls SparkErrorConverter.convertToSparkException() to convert it to the appropriate Spark exception. If the JSON message contains the QueryContext, the exception will contain the query; otherwise it will not.
  5. Version-specific implementations: There are two, one for Spark 3.x (falls back to a generic SparkException) and one for Spark 4.0 (calls the exact QueryExecutionErrors.* methods).
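The native-side types in steps 3 and 4 can be sketched roughly as follows. This is a hypothetical, dependency-free sketch, not the PR's actual code: the variant names, field names, and JSON layout are assumptions (the real SparkError covers many more variants and could use serde for serialization).

```rust
// Sketch of a SparkError enum with a QueryContext wrapper and JSON
// serialization for the JNI boundary. Names modeled on the PR description;
// field layout and message format are illustrative assumptions.

#[derive(Debug, Clone)]
pub struct QueryContext {
    pub sql_text: String,
    pub line: u32,
    pub column: u32,
    pub object_name: String,
}

#[derive(Debug)]
pub enum SparkError {
    ArithmeticOverflow { message: String },
    CastInvalidValue { value: String, from_type: String, to_type: String },
}

impl SparkError {
    /// Fully qualified Spark exception class thrown on the JVM side.
    /// (Simplified: the real mapping is per-variant.)
    pub fn exception_class(&self) -> &'static str {
        "org.apache.spark.SparkArithmeticException"
    }

    /// Serialize the error, plus the query context when one was registered,
    /// into the JSON message carried by CometQueryExecutionException.
    pub fn to_json(&self, ctx: Option<&QueryContext>) -> String {
        let kind = match self {
            SparkError::ArithmeticOverflow { .. } => "ArithmeticOverflow",
            SparkError::CastInvalidValue { .. } => "CastInvalidValue",
        };
        match ctx {
            Some(c) => format!(
                "{{\"error\":\"{}\",\"sql\":\"{}\",\"line\":{},\"column\":{}}}",
                kind, c.sql_text, c.line, c.column
            ),
            None => format!("{{\"error\":\"{}\"}}", kind),
        }
    }
}

/// A SparkError paired with the query context it occurred in, if any.
pub struct SparkErrorWithContext {
    pub error: SparkError,
    pub context: Option<QueryContext>,
}

fn main() {
    let err = SparkErrorWithContext {
        error: SparkError::ArithmeticOverflow { message: "integer overflow".to_string() },
        context: Some(QueryContext {
            sql_text: "SELECT a + b FROM t".to_string(),
            line: 1,
            column: 8,
            object_name: "t".to_string(),
        }),
    };
    // This JSON string is what crosses the JNI boundary; the Scala side
    // parses it and rebuilds the matching Spark exception.
    println!("{}", err.error.to_json(err.context.as_ref()));
}
```

On the Scala side, a converter would parse this JSON and dispatch to the matching QueryExecutionErrors factory method (Spark 4.0) or a generic SparkException (Spark 3.x).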

Notes: Not all expressions have been updated. Only the expressions that failed unit tests as a result of incorrect error messages have been updated (Cast, CheckOverflow, ListExtract, SumDecimal, AvgDecimal, and binary arithmetic expressions). Binary arithmetic expressions are now represented as CheckedBinaryExpr, which also includes the query context.
Most errors in QueryExecutionErrors are reproduced as-is on the native side. However, some errors, such as INTERVAL_ARITHMETIC_OVERFLOW, have one version with a user suggestion and one without; in such cases there are two variants on the native side.
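The with/without-suggestion split can be illustrated as a pair of enum variants. This is a hypothetical sketch; the actual variant names and message text in the PR (and in Spark's error classes) may differ.

```rust
// Illustrative sketch: one Spark error condition that exists in two forms,
// INTERVAL_ARITHMETIC_OVERFLOW with and without a user suggestion, mirrored
// as two native variants. Names and wording are assumptions.

enum IntervalOverflowError {
    /// Plain overflow, no remediation hint.
    WithoutSuggestion,
    /// Overflow with a hint, e.g. suggesting a try_* function.
    WithSuggestion { suggestion: String },
}

fn message(err: &IntervalOverflowError) -> String {
    match err {
        IntervalOverflowError::WithoutSuggestion => {
            "[INTERVAL_ARITHMETIC_OVERFLOW] Interval arithmetic overflow.".to_string()
        }
        IntervalOverflowError::WithSuggestion { suggestion } => {
            format!("[INTERVAL_ARITHMETIC_OVERFLOW] Interval arithmetic overflow. {suggestion}")
        }
    }
}

fn main() {
    let e = IntervalOverflowError::WithSuggestion {
        suggestion: "Use 'try_add' to tolerate overflow and return NULL instead.".to_string(),
    };
    println!("{}", message(&e));
}
```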

How are these changes tested?

New unit tests, plus the previously failing tests listed in #551, #2215, and #3375.

This PR was produced with the generous assistance of Claude Code

@parthchandra
Contributor Author

@coderfender, fyi

@coderfender
Contributor

Thank you @parthchandra . This is awesome

// context eagerly so it displays the call site at the
// line of code where the cast method was called, whereas spark grabs the context
// lazily and displays the call site at the line of code where the error is checked.
assert(sparkMessage.startsWith(cometMessage.substring(0, 40)))
Member


is 40 just an arbitrary number to get a decent sized chunk of the error message or does it have a specific meaning in the context of this test?

Contributor Author


I used 40 as an arbitrary number to get enough of the error message. There is no special significance.

case "BinaryArithmeticOverflow" =>
Some(
QueryExecutionErrors.binaryArithmeticCauseOverflowError(
params("value1").toString.toShort,
Member


Could this overflow if the value is larger than a short?

Contributor Author


I believe that should not happen. The original Spark exception has these values declared as short.


// Extract the problematic fragment
let fragment = if start_idx < self.sql_text.len() && stop_idx <= self.sql_text.len() {
&self.sql_text[start_idx..stop_idx]
Member


The earlier docs say that start_idx is a character index, but it is being used as a byte index here, I think. Perhaps you could add tests for non-ASCII cases, if that makes sense?

Contributor Author


Good catch. Fixed to use a char index, and added a unit test.
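For reference, the char-index version of the fragment extraction can be sketched as follows. This is a minimal standalone sketch, not the PR's actual code; the function name and bounds handling are assumptions.

```rust
// Extract a SQL fragment by *character* index rather than byte index, so
// multi-byte (non-ASCII) text neither panics nor splits a character.
// start_idx/stop_idx are character positions, per the earlier docs.

fn fragment_by_chars(sql_text: &str, start_idx: usize, stop_idx: usize) -> Option<String> {
    let n_chars = sql_text.chars().count();
    if start_idx < n_chars && stop_idx <= n_chars && start_idx < stop_idx {
        Some(sql_text.chars().skip(start_idx).take(stop_idx - start_idx).collect())
    } else {
        None
    }
}

fn main() {
    // Non-ASCII check: 'é' is 2 bytes in UTF-8, so byte slicing at these
    // offsets would point at the wrong text (or panic mid-character).
    let sql = "SELECT café + 1";
    assert_eq!(fragment_by_chars(sql, 7, 11), Some("café".to_string()));
    println!("{:?}", fragment_by_chars(sql, 7, 11));
}
```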

@parthchandra parthchandra marked this pull request as draft February 24, 2026 01:02
@parthchandra
Contributor Author

Changed to draft to figure out backward compatibility



Development

Successfully merging this pull request may close these issues.

ANSI mode array access error messages don't match Spark format
[ANSI] Include original SQL in error messages
