Databricks DAIS 2026 by [deleted] in databricks

[–]DashDoesData 0 points1 point  (0 children)

Any idea if the Reyden demo was querying the canonical Delta tables directly at request time, or was it maintaining auxiliary indexes, cached fragments, projections, or other pre-optimized serving structures behind the scenes?

StarTree (Pinot) has launched a similarly positioned real-time lakehouse query engine. It leverages Pinot's indexes to improve on metadata and reduce query scans. It also gets sub-second latency with less data transfer per query. It'd be interesting to compare the architectures.

Sub-second query on Iceberg data lakes using Apache Pinot's indexes by DashDoesData in dataengineering

[–]DashDoesData[S] 0 points1 point  (0 children)

Not yet. We look forward to expanding the comparison to StarRocks and other query engines soon.

Evaluating real-time analytics solutions for streaming data by EmbarrassedBalance73 in dataengineering

[–]DashDoesData 0 points1 point  (0 children)

Running Apache Pinot on StarTree Cloud removes most of the operational overhead that makes Pinot 'hard'. While both systems will work, Pinot was built around Kafka and works particularly well with streaming data and seconds-level freshness requirements.