Commit 9becdb0
fix: [df52] route timestamp timezone mismatches through spark_parquet_convert (apache#3494)
INT96 Parquet timestamps are coerced to Timestamp(us, None) by DataFusion
but the logical schema expects Timestamp(us, Some("UTC")). The schema
adapter was routing this mismatch through Spark's Cast expression, which
incorrectly treats None-timezone values as TimestampNTZ (local time) and
applies a timezone conversion. This caused results to be shifted by the
session timezone offset (e.g., -5h45m for Asia/Kathmandu).
Route Timestamp->Timestamp mismatches through CometCastColumnExpr which
delegates to spark_parquet_convert, handling this as a metadata-only
timezone relabel without modifying the underlying values.
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>1 parent 9a047ce commit 9becdb0
1 file changed
Lines changed: 12 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
194 | 194 | | |
195 | 195 | | |
196 | 196 | | |
197 | | - | |
198 | | - | |
199 | | - | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
200 | 208 | | |
201 | 209 | | |
202 | 210 | | |
203 | 211 | | |
204 | 212 | | |
| 213 | + | |
205 | 214 | | |
206 | 215 | | |
207 | 216 | | |
| |||
0 commit comments