Rewrite VARBINARY to BINARY in SparkSqlRewriter by c-h-afzal · Pull Request #593 · linkedin/coral

c-h-afzal · 2026-04-08T23:46:32Z

Calcite's type system uses VARBINARY for binary data, but Spark's SQL parser does not recognize VARBINARY as a valid cast target — it uses BINARY instead. When Hive views call base64() on string columns, Calcite inserts an implicit CAST(... AS VARBINARY) which produces unparseable Spark SQL, causing CoralSpark translation failures and downstream function registry poisoning via the DaliSpark Hive fallback path.

This follows the same pattern already used by TrinoSqlRewriter, which rewrites BINARY/VARBINARY to Trino's VARBINARY in its convertTypeSpec method. Here we do the equivalent for Spark: rewrite VARBINARY to BINARY in SparkSqlRewriter.visit(SqlDataTypeSpec).

Verified existing test cases pass and added a new test case.

coral-spark/src/test/java/com/linkedin/coral/spark/CoralSparkTest.java

Calcite's type system uses VARBINARY for binary data, but Spark's SQL parser does not recognize VARBINARY as a valid cast target — it uses BINARY instead. When Hive views call base64() on string columns, Calcite inserts an implicit CAST(... AS VARBINARY) which produces unparseable Spark SQL, causing CoralSpark translation failures and downstream function registry poisoning via the DaliSpark Hive fallback path. This follows the same pattern already used by TrinoSqlRewriter, which rewrites BINARY/VARBINARY to Trino's VARBINARY in its convertTypeSpec method. Here we do the equivalent for Spark: rewrite VARBINARY to BINARY in SparkSqlRewriter.visit(SqlDataTypeSpec). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

c-h-afzal changed the title ~~Rewrite VARBINARY to BINARY in SparkSqlRewriter (incident-10814)~~ Rewrite VARBINARY to BINARY in SparkSqlRewriter Apr 9, 2026

ruolin59 reviewed Apr 10, 2026

View reviewed changes

coral-spark/src/test/java/com/linkedin/coral/spark/CoralSparkTest.java Outdated Show resolved Hide resolved

ruolin59 reviewed Apr 10, 2026

View reviewed changes

coral-spark/src/test/java/com/linkedin/coral/spark/CoralSparkTest.java Show resolved Hide resolved

c-h-afzal force-pushed the afzal/fix-varbinary-cast-spark-dialect branch from a4e1adf to f137382 Compare April 11, 2026 03:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rewrite VARBINARY to BINARY in SparkSqlRewriter#593

Rewrite VARBINARY to BINARY in SparkSqlRewriter#593
c-h-afzal wants to merge 1 commit intolinkedin:masterfrom
c-h-afzal:afzal/fix-varbinary-cast-spark-dialect

c-h-afzal commented Apr 8, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

c-h-afzal commented Apr 8, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants