[SPARK-28587][SQL] Route JDBC partition bound literals through JdbcDialect.compileValue #55727
Status: Open
AliaksandrAleksiukCollibra wants to merge 3 commits into apache:master from
…alect.compileValue Spark generates partition WHERE clauses like `col < '2024-01-01'` for date/timestamp columns, which strict-typing engines (Athena, Phoenix) reject with a type mismatch, since bare quoted strings are VARCHAR, not DATE/TIMESTAMP. The fix passes the dialect to `toBoundValueInWhereClause` and calls `dialect.compileValue` for date/timestamp bounds. The base implementation returns the same bare-quoted string as before, so existing dialects are unaffected.
What changes were proposed in this pull request?
`toBoundValueInWhereClause` in `JDBCRelation` now accepts the resolved `JdbcDialect` and calls `dialect.compileValue` for date and timestamp partition bounds instead of hardcoding bare quoted string literals.

Why are the changes needed?

When partitioning a JDBC table by a date or timestamp column, Spark generates WHERE clauses like `col < '2024-01-01'`. Strict-typing engines such as Athena and Phoenix reject this with a type mismatch error (`Cannot apply operator: date < varchar`) because the bare quoted string is treated as `VARCHAR`, not `DATE`/`TIMESTAMP`. `JdbcDialect.compileValue` already exists for dialect-specific value formatting and is used in filter pushdown, but it was never wired into partition bound generation. Dialects for strict-typing engines can now override `compileValue` to emit typed literals such as `DATE '2024-01-01'` or `TIMESTAMP '2024-01-01 00:00:00'`.

Does this PR introduce any user-facing change?
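The intended pattern can be sketched with a simplified stand-in. The trait below is an illustrative reduction, not Spark's actual `JdbcDialect` API, and `AthenaLikeDialect` is a hypothetical name; only the delegation of partition bounds to `compileValue` mirrors the change:

```scala
import java.sql.{Date, Timestamp}

// Illustrative reduction of the dialect hook; Spark's real JdbcDialect
// has a much richer interface. The default compileValue keeps the old
// bare-quoted behavior, so existing dialects see no change.
trait JdbcDialect {
  def compileValue(value: Any): Any = value match {
    case d: Date      => s"'$d'"
    case t: Timestamp => s"'$t'"
    case other        => other
  }
}

object DefaultDialect extends JdbcDialect

// Hypothetical dialect for a strict-typing engine such as Athena:
// override compileValue to emit typed literals.
object AthenaLikeDialect extends JdbcDialect {
  override def compileValue(value: Any): Any = value match {
    case d: Date      => s"DATE '$d'"
    case t: Timestamp => s"TIMESTAMP '$t'"
    case other        => super.compileValue(other)
  }
}

// Sketch of the new wiring: the partition bound is routed through the
// dialect instead of being hardcoded as a bare quoted string.
def toBoundValueInWhereClause(value: Any, dialect: JdbcDialect): String =
  dialect.compileValue(value).toString
```

With `DefaultDialect`, a bound of `Date.valueOf("2024-01-01")` still renders as `'2024-01-01'` (unchanged behavior), while `AthenaLikeDialect` renders `DATE '2024-01-01'`, which strict engines accept.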
Yes. Users with a custom `JdbcDialect` that overrides `compileValue` will now see that override applied to partition WHERE clauses as well as filter pushdown. The base `JdbcDialect.compileValue` returns the same bare-quoted string as before, so all built-in dialects and users without a custom dialect are unaffected.

How was this patch tested?
Added `SPARK-28587: columnPartition should use dialect.compileValue for date/timestamp bounds` in `JDBCSuite`. The test registers a custom dialect that emits `DATE '...'`/`TIMESTAMP '...'` typed literals, calls `JDBCRelation.columnPartition` directly, and asserts that the generated WHERE clauses contain typed literals rather than bare quoted strings. Existing tests SPARK-34843 and SPARK-22814 continue to pass, confirming backward compatibility.
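The shape of the assertion can be illustrated with a stand-in predicate builder; `JDBCRelation.columnPartition` itself is not reproduced here, and the helper name is hypothetical:

```scala
import java.sql.Date

// Stand-in for the predicates columnPartition would emit with a
// typed-literal dialect: a lower and an upper bound per partition.
def boundedPredicates(column: String, bound: Date): Seq[String] = Seq(
  s"""$column < DATE '$bound'""",
  s"""$column >= DATE '$bound'"""
)

val clauses = boundedPredicates("\"d\"", Date.valueOf("2024-01-02"))

// Every clause should carry a typed literal, never a bare quoted date.
val allTyped = clauses.forall(_.contains("DATE '2024-01-02'"))
```

The test in `JDBCSuite` performs the analogous check against the real output of `columnPartition` under the custom dialect.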
Was this patch authored or co-authored using generative AI tooling?
Generated-by: Claude Sonnet 4.6