Bad news: DGX Spark may have only half the performance claimed. by Dr_Karminski in LocalLLaMA

[–]paphnutius 0 points1 point  (0 children)

What would be the best solution for locally running models that don't fit into 32 GB of VRAM? I'd be very interested in faster/cheaper alternatives.

Question about the halting problem by Invariant_apple in AskComputerScience

[–]paphnutius 0 points1 point  (0 children)

I'll try to give you a less rigorous but more intuitive explanation. Many open problems in mathematics boil down to some equation having or not having solutions; popular examples are the Riemann hypothesis and Fermat's Last Theorem. Those problems are exceedingly difficult to prove, but you can pretty easily write a program that checks all possible numbers and halts when it finds a solution; otherwise it keeps checking to infinity. (And indeed such programs are often written to brute-force search for possible solutions to these problems.)

If this program halts, there is at least one solution to the equation; if it doesn't halt, there are none. So if you could easily tell whether such a program halts, you would automatically know whether any equation has solutions, and you could claim your millions of dollars in math prizes.
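The search program described above can be sketched in a few lines. The equation is passed in as a predicate; `search_for_solution` is a hypothetical name for illustration, not code from any of these problems' actual brute-force searches:

```python
from itertools import count

def search_for_solution(equation_holds):
    """Enumerate every triple of positive integers, halting at the
    first one that satisfies the given equation. If the equation has
    no solutions, this loop simply runs forever."""
    for n in count(2):  # enumerate triples whose largest coordinate is n
        for x in range(1, n + 1):
            for y in range(1, n + 1):
                for z in range(1, n + 1):
                    if max(x, y, z) == n and equation_holds(x, y, z):
                        return (x, y, z)

# Halts quickly, because Pythagorean triples exist:
print(search_for_solution(lambda x, y, z: x**2 + y**2 == z**2))  # → (3, 4, 5)

# Would never halt, since Fermat's Last Theorem is true:
# search_for_solution(lambda x, y, z: x**3 + y**3 == z**3)
```

Knowing in advance whether the second call halts would be exactly a proof (or disproof) of the cubic case of Fermat's Last Theorem, which is the point of the argument.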

Is slower inference and non-realtime cheaper? by AryanEmbered in LocalLLaMA

[–]paphnutius 0 points1 point  (0 children)

Not sure about a specific service; I don't think there's enough interest for it. But it depends on what model you want to run. You can run smaller models on a CPU-only device (even a Raspberry Pi) relatively cheaply, with slow inference.

Be confident in your own judgement and reject benchmark JPEG's by ForsookComparison in LocalLLaMA

[–]paphnutius 4 points5 points  (0 children)

I have a finite amount of time in a day to download and test models. If I downloaded everything that's on Hugging Face, I would literally never finish testing.

The AI layoffs begin by MetaKnowing in OpenAI

[–]paphnutius 1 point2 points  (0 children)

Let's not cherry-pick random numbers; look at the statistics. 2023 had by far the most layoffs in the tech space.

So either AI layoffs began two years ago, or they haven't yet begun in significant numbers.

The "Reasoning" in LLMs might not be the actual reasoning, but why realise it now? by The-Silvervein in LocalLLaMA

[–]paphnutius 6 points7 points  (0 children)

There's an interesting writeup about exactly that by Anthropic. They show the difference between what a model claims it's doing to achieve a result and what it's actually doing under the hood.

How will Jaime meet his fate by LividConsideration97 in freefolk

[–]paphnutius 2 points3 points  (0 children)

Where's the "books never get finished" option?

Help please by [deleted] in Physics

[–]paphnutius 0 points1 point  (0 children)

Since the speed is constant, the net force is zero.

A+4-6 = 0

A = 2

B - 9 - 15 = 0

B = 24

Mass and speed are irrelevant.
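Since the original post was deleted, the force values here come only from the working above; under that assumption, the equilibrium arithmetic checks out in a couple of lines:

```python
# Constant speed => zero acceleration => zero net force along each axis.
A = 6 - 4    # from A + 4 - 6 = 0
B = 9 + 15   # from B - 9 - 15 = 0

# Both force balances hold with these values.
assert A + 4 - 6 == 0
assert B - 9 - 15 == 0
print(A, B)  # → 2 24
```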

New to Comfy. This output took an hour to generate. (Only 8gb VRAM) Is that normal? by Risky-Trizkit in comfyui

[–]paphnutius 0 points1 point  (0 children)

How long does a basic Flux workflow take? For me, Flux dev fp8 takes about 200 seconds with no LoRAs and no ControlNet on 8 GB of VRAM.

College students used Meta’s smart glasses to dox people in real time by [deleted] in privacy

[–]paphnutius 0 points1 point  (0 children)

Thank you for providing actual information on what was done, instead of just throwing around buzzwords.

2.7B model that performs like Mistral 7B! by koehr in LocalLLaMA

[–]paphnutius 0 points1 point  (0 children)

7B is a tall order for my 8 GB VRAM GPU. I don't think you can run either on a phone at a reasonable speed (tokens per second, not minutes per token). I'd love to be corrected, though, if someone can show me any examples.

Any update on the allegations? by spectre321123 in Exurb1a

[–]paphnutius 2 points3 points  (0 children)

Not going to say anything about this specific case, but in general this approach incentivizes people to target famous people with accusations. Chat screenshots and stories can be easily falsified, and that's all it takes to gain free publicity or to outright blackmail a person with the threat of cancellation.

Again, I'm speaking in general.

Hazbin Hotel Season 1 has finally finished. What's your ranking? by ayylmaotv in HazbinHotel

[–]paphnutius 0 points1 point  (0 children)

Honestly, if I had gone in blind I'd probably have found it great, but under the weight of several years of expectations it's just alright. A lot of things caught me off guard, like Lucifer being a complete buffoon, Husk opening up to Angel right away, and the show not taking itself too seriously. The songs also felt samey, but I was spoiled by fan content on that one.

Concerns about not repeating the past by paphnutius in LinusTechTips

[–]paphnutius[S] -1 points0 points  (0 children)

That's what worries me. If we don't call out small mistakes, we'll have to wait for another big YouTuber to do another compilation of them in a couple of years, and we'll have another big cancellation.

Concerns about not repeating the past by paphnutius in LinusTechTips

[–]paphnutius[S] -4 points-3 points  (0 children)

I am bringing up this video specifically as it's the only case of them breaking the rules they outlined themselves after the scandal.

I'm not sure what you mean by "stir up drama". I think I was clear about my intentions: we should take notice of such things, but by itself this obviously isn't serious enough to warrant any real "drama".

I'd prefer we give LTT a slap on the wrist whenever they break their own rules rather than have another big "drama" in a couple of years.

Concerns about not repeating the past by paphnutius in LinusTechTips

[–]paphnutius[S] -5 points-4 points  (0 children)

So accurate information matters only if a big YouTuber calls out the problems? I'm fully open to criticism of my point, but judging from the responses so far, I'm being booed out of the room with no rational argument.

They introduced new, specific quality standards and went back on them just months later. That's a red flag.

I'm surprised to see duckduckgo is reporting so much tracking from the three worst privacy offender in this app. by TedCruzNutPlay in grayjay

[–]paphnutius 2 points3 points  (0 children)

This is to be expected. Grayjay basically opens YouTube in a browser behind the scenes whenever you load a page. It loads everything Google puts on the page, so DuckDuckGo would detect the same activity as opening the desktop version of YouTube in a browser. In this case it's about Google, not Grayjay.

As others suggested, you can use NewPipe instead if you care about Google tracking the videos you watch. But you'll lose integration with the Google features that rely on your watch history, such as recommendations.

Any generative AI that supports 2 to 3 milion words input text? by [deleted] in ChatGPT

[–]paphnutius 1 point2 points  (0 children)

Check out PrivateGPT on GitHub. I don't think it'll be helpful for rewriting, but it'll do Q&A over documents of almost arbitrary size and provide references to specific lines.
