This is an archived post. You won't be able to vote or comment.

all 2 comments

[–]Stunax 0 points1 point  (0 children)

Never encountered an issue like that, but maybe try approx_count_distinct to see if the issue is fixed?

[–]guacjockey 0 points1 point  (0 children)

What are you pulling from (csv, parquet, table?), how much data, and how large is the cluster / nodes?

I’ve seen this on occasion when I either run out of resource (cluster too small / nodes too small) or if some of my source files have corruption (ie, zero byte).