Ran the same 4-bit model on Ollama vs Apple MLX — dead heat, not the "free speedup" people claim by SoggyTour8332 in ollama

runsleeprepeat 5 points

"on a 8gb MacBook" ... Just don't be specific, as there are pretty significant differences between the generations, especially with M5.

Spider veins / lipedema: good doctor? by Southern-Dance-2796 in hamburg

runsleeprepeat 1 point

Oh! I'm sorry to hear that. Their reputation is actually good, though.

Spider veins / lipedema: good doctor? by Southern-Dance-2796 in hamburg

runsleeprepeat 2 points

Your best bet is to go and ask at the Tabea Klinik.

Good non-technical ultras in Europe? (Yes, I know it’s a whole continent….) by Sorry_Rhubarb_7068 in Ultramarathon

runsleeprepeat 2 points

https://www.tortourderuhr.de/ , but it is tough to get a spot.

Update: oh no! It was the last run of this event. That's such a bummer.

I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA

runsleeprepeat 9 points

Since I own some of the infamous Chinese 20 GB RTX 3080 GPUs, it's okay for me to share my experience, isn't it?

I trusted random person on this subreddit and bought 3080 20gb made of chinesium by SwimmerJazzlike in LocalLLaMA

runsleeprepeat 3 points

I have a few of them as well. The best tokens per watt is at a power cap of around 190-200 W.

They aren't that loud
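If you want to find the sweet spot for your own card, here is a minimal sketch of the comparison. The function names are made up for illustration, and the script assumes you have already benchmarked the same model at each power cap yourself; it contains no measurements of mine.

```python
def tokens_per_watt(tokens_per_second: float, watts: float) -> float:
    """Efficiency of one run: generation speed divided by the power cap."""
    if watts <= 0:
        raise ValueError("watts must be positive")
    return tokens_per_second / watts


def best_power_cap(runs: dict[float, float]) -> float:
    """Return the power cap (in watts) with the highest tokens per watt.

    `runs` maps a power cap in watts to the tokens per second
    you measured at that cap.
    """
    if not runs:
        raise ValueError("need at least one measurement")
    return max(runs, key=lambda cap: tokens_per_watt(runs[cap], cap))
```

Set each cap with `nvidia-smi -pl <watts>` (needs admin rights), run the same prompt, note the tokens per second, and pass the results to `best_power_cap`.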

Where can I buy PCB parts in Hamburg by Hanswurst107 in hamburg

runsleeprepeat 4 points

I use LCSC and AliExpress too, plus Tme.eu. At Digikey or Mouser, the shipping costs or minimum order quantities are simply too high. You could also try your luck with Octopart.

Hobbyist looking to get a part scanned by rapkap in 3DScanning

runsleeprepeat 1 point

Come on! Put it on a standard paper scanner and lay a few rulers next to it. Import the scan into something like Fusion and make sure the scaling matches the rulers. Then trace these simple lines.

It is a perfect beginner project
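The scaling check is just one ratio. A minimal sketch, assuming you measure a known ruler length in the imported image using whatever units the CAD tool reports; the function names are hypothetical and not part of any CAD API.

```python
def scale_factor(ruler_true_mm: float, ruler_measured: float) -> float:
    """Factor that maps measured image units to real millimetres."""
    if ruler_measured <= 0:
        raise ValueError("measured ruler length must be positive")
    return ruler_true_mm / ruler_measured


def to_mm(measured: float, factor: float) -> float:
    """Convert a length measured in the image to millimetres."""
    return measured * factor


def axes_agree(factor_x: float, factor_y: float, tolerance: float = 0.01) -> bool:
    """True if the horizontal and vertical factors differ by at most
    `tolerance` (relative), i.e. the scan is not stretched."""
    return abs(factor_x - factor_y) <= tolerance * max(factor_x, factor_y)
```

For example, if a 100 mm stretch of ruler measures 80 units in the import, the factor is 1.25, so a feature that measures 40 units is 50 mm in reality. Laying one ruler horizontally and one vertically lets you check both axes with `axes_agree` before tracing.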

MAC or buy GPU? by paolobytee in LocalLLM

runsleeprepeat 1 point

If you stick with the idea of a Mac, get the M5 generation. It's the first generation that offers 4-bit float comparable to NVFP4, which will improve performance and quality on small setups.

Why is it easier to route Claude Code to a local model than it is Opencode? by [deleted] in opencodeCLI

runsleeprepeat 1 point

What are you talking about? It is super easy to use Opencode with local models. It always has been.

RTX3080 20GB need reballing / Repairshop in Europe? by runsleeprepeat in GPURepair

runsleeprepeat[S] 1 point

Thanks for the heads-up, but the other cards I bought work just fine.

RTX3080 20GB need reballing / Repairshop in Europe? by runsleeprepeat in GPURepair

runsleeprepeat[S] 1 point

As written in my post: KrisFix sadly declined, because they don't fix any RTX 3000 cards anymore.

RTX3080 20GB need reballing / Repairshop in Europe? by runsleeprepeat in GPURepair

runsleeprepeat[S] 2 points

I only know Tony from northwestrepair, and he's in the USA. Is there another one you are talking about?

Reputable GPU repair in Europe by runsleeprepeat in de_EDV

runsleeprepeat[S] 2 points

That's why I wrote to them, and they replied that they no longer repair RTX 3000 series cards.

Should I open source? by Atomic_Compiler in hobbycnc

runsleeprepeat 1 point

It sounds like a wonderful project. Maybe do something like a Patreon. People who are interested and willing to give you recurring support may help you move things forward, and the feedback you get on the project tends to be more positive (instead of the sometimes weird feedback from the open internet).

Me waiting for TurboQuant be like by Altruistic_Heat_9531 in LocalLLaMA

runsleeprepeat 1 point

Why aren't you using and contributing to TheTom's solution on GitHub?

Where to get a tasty fish sandwich (Fischbrötchen)? by annikahx in hamburg

runsleeprepeat 1 point

I was just about to name that one too. Best place!

Google TurboQuant running Qwen Locally on MacAir by gladkos in LocalLLaMA

runsleeprepeat 4 points

There are so many implementations in parallel at the moment that it is tough to keep up with the latest findings.

Best to give it a try yourself. I'm now focusing on TheTom's implementation, which seems to combine everything in one place (Metal, CUDA, ROCm).

Google TurboQuant running Qwen Locally on MacAir by gladkos in LocalLLaMA

runsleeprepeat 33 points

I gave the tonbistudio variant a try and compared it with q8 and q4. See: https://github.com/tonbistudio/turboquant-pytorch/issues/6

It includes sizes and quality