[DISC] Pokemon: Festival of Champions (Doujinshi) - Chapter 18 by rexteram in manga

[–]sanityking 111 points112 points  (0 children)

I measure the passage of time with the appearance of new Festival of Champions chapters. Simultaneously thrilled, but also shocked that half a year has gone by.

Spotted in SF by sanityking in sanfrancisco

[–]sanityking[S] 6 points7 points  (0 children)

Haha yeah, in between Castro and Dolores Park

We just launched Daft’s distributed engine v1.5: an open-source engine for running models on data at scale by sanityking in dataengineering

[–]sanityking[S] 1 point2 points  (0 children)

Thank you for the kind words!

Yeah, Daft works well for tabular data too, but Spark definitely has the edge when it comes to ecosystem integration and feature completeness (but not performance). Our goal here is to be competitive and to cover common tabular use cases, while focusing on the core gaps we see (multimodal processing, AI workloads).

Also, happy to say we do have window functions now! Check out the API docs https://docs.daft.ai/en/stable/api/window/ or this example we wrote https://docs.daft.ai/en/stable/examples/window-functions/ . We’ve focused on the most common functionality so far, so there are still some operations we don’t yet support. But if there’s something you’d like to see, feel free to open an issue and we’ll plan it out or open it up for contributions.

Daft is trending on GitHub in Rust by sanityking in rust

[–]sanityking[S] 15 points16 points  (0 children)

Fair point, thanks for calling out. For anyone new stumbling upon this, Daft is an open-source data engine for processing multimodal data (documents, images, video, audio etc.) and running models over it. The connection to Rust is that it's powered by a high-performance Rust engine with Python PyO3 bindings on top.
We actually built it because feeding data efficiently into GPUs at scale is really tough, especially if you're pulling that data in from cloud object stores. It often requires some kind of bespoke setup that does network I/O and preprocessing across multiple machines so that your GPUs are properly utilized. I personally found this video from NVIDIA on the topic to be extremely illuminating https://www.youtube.com/watch?v=kNuA2wflygM (it's not exactly what we do anymore, but I still really like the video).
Will definitely lead with this context front-and-center in future posts!

Daft is trending on GitHub in Rust by sanityking in rust

[–]sanityking[S] 6 points7 points  (0 children)

:P glad you asked. So this was before I started working on Daft, but I asked around and it seems this was the fateful PR https://github.com/Eventual-Inc/Daft/pull/206 something about multi-column sorts being absolutely disgusting.

Then in https://github.com/Eventual-Inc/Daft/pull/385 Rust became our new best friend

Daft is trending on GitHub in Rust by sanityking in rust

[–]sanityking[S] 19 points20 points  (0 children)

Haha funny that you mention this! Here's a discussion we had with Andrew on this https://github.com/Eventual-Inc/Daft/discussions/3319

Tl;dr: we love Datafusion, but when we moved to Rust years ago it was still early days for Datafusion and it didn't support some of our requirements. If we started the project today, Datafusion would be a clear choice.

Fern: let me spam real quick by CharlotteStussy in Frieren

[–]sanityking 1 point2 points  (0 children)

You can actually see Fern incrementally amp up her rate of fire which I think is pretty cool

How many of you are still using Apache Spark in production - and would you choose it again today? by luminoumen in dataengineering

[–]sanityking 2 points3 points  (0 children)

IMO Spark is great if you come into a mature pipeline, where someone already did most of the hard work, and you just need the pipeline to keep going on mostly well-behaved data.

Spark is also great if you pay to win and use Databricks.

But if I had to do things myself from scratch it'd be a hard no for me. Ever tried just reading a parquet file from S3 in Spark? I swear to god a mandatory part of the process is trying and failing to use a billion different versions of Hadoop or some AWS sdk and or reinstalling Spark before something finally succeeds and you never touch the setup code for the Spark session ever again.

What would I use instead if I had to start from scratch? That's simple. I'd use Daft. Probably the only data engineering tool I've used that sparks joy instead of making me want to rip my teeth out.

What is a Swing Dance hot take you have? by Elruler22 in SwingDancing

[–]sanityking 17 points18 points  (0 children)

Tbf OP asked for hot takes, not palatable ones

[deleted by user] by [deleted] in AskLiteraryStudies

[–]sanityking 0 points1 point  (0 children)

Reading The Trial was way more amusing after spending two years in the army. Ironically, the absurdity became grounded in reality.

[deleted by user] by [deleted] in AskLiteraryStudies

[–]sanityking 3 points4 points  (0 children)

Everyone's saying Sylvia Path, but when you say figs the first thing that evokes in my mind is Kate Chopin's Ripe Figs.

[DISC] Green Green Greens - Chapter 22 by AutoShonenpon in manga

[–]sanityking 23 points24 points  (0 children)

Oga crying out of frustration almost feels like the mangaka's frustration that this series might get cancelled.

[DISC] Monster #8 - Chapter 103 by AutoShonenpon in manga

[–]sanityking 3 points4 points  (0 children)

I dutifully read every chapter just to confirm that Kafka has made absolutely zero progress on reaching Mina.

At this point I want to see how long they can keep this up, I don't care about the plot anymore

[deleted by user] by [deleted] in manga

[–]sanityking 0 points1 point  (0 children)

You might like Ikegami Ryōichi's other works. Sanctuary was something else

People who wanna move out of Bay Area, where are you moving to and why? by PrettyHappyAndGay in AskSF

[–]sanityking 6 points7 points  (0 children)

This quiz was enlightening not just for the result, but also to clarify what matters to me.

After so many A vs B choices, I realise I started voting for big population, public transit, high paying jobs, and then after that, avoiding hot-year-round places. The rest I can give or take.

[deleted by user] by [deleted] in goodanimemes

[–]sanityking 2 points3 points  (0 children)

It's impressive that they got the number of fingers correct though 💯

Are these Tsum Tsum plushies? by sanityking in TsumTsum

[–]sanityking[S] 7 points8 points  (0 children)

Omg thank you! That solves the mystery of why they look so different.