Open
Description
Using the attached GeoJSON boundary for Ontario, a job run against roughly 1 billion points failed with a broadcast timeout. The points dataset is stored as Parquet.
Final app status: FAILED, exitCode: 15, (reason: User class threw exception: org.apache.spark.sql.catalyst.errors.package$TreeNodeException: execute, tree: Exchange SinglePartition
This appears to be a broadcast timeout in Spark. Increasing spark.sql.autoBroadcastJoinThreshold and spark.sql.broadcastTimeout did not help.
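For anyone trying to reproduce this, here is a minimal sketch of how the two settings can be passed to spark-submit. The helper name `spark_conf_args` and the values are assumptions for illustration, not the values used in the failing run. spark.sql.broadcastTimeout is in seconds and spark.sql.autoBroadcastJoinThreshold is in bytes; setting the threshold to -1 disables automatic broadcast joins, which may be worth trying since raising it did not help. Whether that avoids the failure here is untested.

```python
def spark_conf_args(conf):
    """Turn a dict of Spark settings into spark-submit --conf arguments."""
    args = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


# Illustrative values only (assumptions, not taken from the report).
conf = {
    "spark.sql.autoBroadcastJoinThreshold": -1,  # -1 disables automatic broadcast joins
    "spark.sql.broadcastTimeout": 1200,          # seconds
}

print(" ".join(spark_conf_args(conf)))
```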
We did notice that the conversion process used to create the GeoJSON structure produced coordinates with very high precision (>15).
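To test whether the coordinate precision matters, the boundary could be rounded before it is loaded. The following is a rough sketch using only the Python standard library. The function names and the choice of 6 decimal places are assumptions, and it has not been confirmed that reducing precision resolves the timeout.

```python
import json


def round_coords(node, ndigits=6):
    """Recursively round every number inside a GeoJSON coordinates array."""
    if isinstance(node, (list, tuple)):
        return [round_coords(child, ndigits) for child in node]
    if isinstance(node, (int, float)) and not isinstance(node, bool):
        return round(node, ndigits)
    return node


def reduce_precision(geojson, ndigits=6):
    """Return a copy of a GeoJSON object with its coordinates rounded."""
    if isinstance(geojson, dict):
        return {
            key: round_coords(value, ndigits) if key == "coordinates"
            else reduce_precision(value, ndigits)
            for key, value in geojson.items()
        }
    if isinstance(geojson, list):
        return [reduce_precision(item, ndigits) for item in geojson]
    return geojson


# Made-up sample feature; the real input would be the attached boundary file.
feature = {
    "type": "Feature",
    "properties": {"name": "Ontario"},
    "geometry": {
        "type": "Polygon",
        "coordinates": [[
            [-79.1234567890123456, 43.9876543210987654],
            [-79.0, 44.0],
            [-79.1234567890123456, 43.9876543210987654],
        ]],
    },
}

print(json.dumps(reduce_precision(feature)))
```

Rounding can collapse neighbouring vertices into duplicates, so the result should be checked for validity before use.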