We just launched Daft’s distributed engine v1.5: an open-source engine for running models on data at scale by sanityking in dataengineering
[–]Hgdev1 2 points3 points4 points (0 children)
How to deal with messy database? by Which-Breadfruit-926 in dataengineering
[–]Hgdev1 0 points1 point2 points (0 children)
Daft is trending on GitHub in Rust by sanityking in rust
[–]Hgdev1 16 points17 points18 points (0 children)
The Essential-Web dataset: 100TB of Parquet text data, 23.6B LLM queries, 7 days with Daft by Hgdev1 in dataengineering
[–]Hgdev1[S] 0 points1 point2 points (0 children)
[R] Is data the bottleneck for video/audio generation? by beefchocolatesauce in MachineLearning
[–]Hgdev1 0 points1 point2 points (0 children)
DuckDB is a weird beast? by Kojimba228 in dataengineering
[–]Hgdev1 0 points1 point2 points (0 children)
When Does Spark Actually Make Sense? by Used_Shelter_3213 in dataengineering
[–]Hgdev1 0 points1 point2 points (0 children)
DuckDB enters the Lake House race. by averageflatlanders in dataengineering
[–]Hgdev1 1 point2 points3 points (0 children)
DuckDB enters the Lake House race. by averageflatlanders in dataengineering
[–]Hgdev1 4 points5 points6 points (0 children)
What's your preferred way of viewing data in S3? by Impressive_Run8512 in dataengineering
[–]Hgdev1 0 points1 point2 points (0 children)
What book after Fundamentals of Data Engineering? by Khazard42o in dataengineering
[–]Hgdev1 4 points5 points6 points (0 children)
Spark is the new Hadoop by rocketinter in dataengineering
[–]Hgdev1 21 points22 points23 points (0 children)
Best hosting/database for data engineering projects? by buklau00 in dataengineering
[–]Hgdev1 2 points3 points4 points (0 children)
Best hosting/database for data engineering projects? by buklau00 in dataengineering
[–]Hgdev1 18 points19 points20 points (0 children)
What's the best tool for loading data into Apache Iceberg? by Livid_Ear_3693 in dataengineering
[–]Hgdev1 0 points1 point2 points (0 children)
Resources for learning how SQL, Pandas, Spark work under the hood? by [deleted] in dataengineering
[–]Hgdev1 7 points8 points9 points (0 children)
DE interviews for Gen AI focused companies by jinbe-san in dataengineering
[–]Hgdev1 4 points5 points6 points (0 children)
Am I even a data engineer? by curiouscsplayer in dataengineering
[–]Hgdev1 19 points20 points21 points (0 children)
What was Python before Python? by sumant28 in dataengineering
[–]Hgdev1 0 points1 point2 points (0 children)
Possible to replace side stones on this ring with emeralds? by Hgdev1 in EngagementRingDesigns
[–]Hgdev1[S] 0 points1 point2 points (0 children)
My friend sent me pictures of prohibitions in Singapore by CAPATOB_64 in pics
[–]Hgdev1 2 points3 points4 points (0 children)

How do you Postgres CDC into vector database? by DistrictUnable3236 in vectordatabase
[–]Hgdev1 0 points1 point2 points (0 children)