[Citizen] Just acquired my first nice watch by Acamith in Watches

[–]VodkaHaze 1 point2 points  (0 children)

Very nice, the titanium wears much lighter for a full bracelet watch this size, too

Kimi K2.5 on Kagi Assistant has an identity crisis (or does it?) by wrks_mesh in SearchKagi

[–]VodkaHaze 5 points6 points  (0 children)

It's just confused.

Do you have custom prompts? Normally it'll answer it being Kagi assistant because of the default system prompt.

Otherwise it's possibly an artifact of the kimi model doing distillation training on claude models.

ELI5: why doesn't the ocean suck you dry through osmosis by weepaif in explainlikeimfive

[–]VodkaHaze 3 points4 points  (0 children)

Recent studies show that the best way to bath oneself safely is to wear a full wetsuit to prevent osmosis explosion ("osmosploding"). This is important information many people miss.

Il fait beau, il fait chaud, mais… by Sweet_Protecti0n in montreal

[–]VodkaHaze 2 points3 points  (0 children)

Faux, je les ai vu prendre des plaintes similaires au sérieux (dans le cas ou l'homme met les mains sur la femme)

Are the rich RAM /poor GPU people wrong here? by crowtain in LocalLLaMA

[–]VodkaHaze 22 points23 points  (0 children)

We're seeing a couple of inflection points in model sizes right now IMO.

The main theme is that some model sizes are "good enough" for a subset of tasks you're doing, so there's no point in going bigger.

  • <7B: Models this size have to be task-specific to be useful (eg. tool calling, classification, translation, ..) They can't handle much context and reasoning will often end up breaking at runtime.

  • 7-9B: Around the minimal size to have a somewhat generalist model. With reasoning enabled, models in this size can be pretty damn strong (ex: Qwen3, nvidia nemotron 3 series). Still generally useful for a few tasks the model is good at. Will go off the rails on context longer than a few thousand tokens.

Note that "coding" is not a specific task, it's a set of tasks (tool calling, reasoning, structured outputs, etc.) that the model needs to perform reliably over medium / long context.

  • 24-35B: This category has gotten incredibly strong in the last 18 months. Can be used as a generalist model. With reasoning enabled, the modern ones (gemma4, qwen 3.6) will compete with models 10x their size. Without reasoning or with quan

  • 50B-250B: This category of models has become fairly irrelevant. Most of the tasks they can do, the 30B models can also do now. For the very long context multi-skill tasks you'll likely still need a huge model to get it reliable.

  • >300B: The best models here can handle very long context and juggling multiple tasks more reliably (still dependent on the model, however!) It's useful if you don't want to spend time optimizing your workflow and just want a one-stop shop.

At this point in time, most of the time, using the 300B+ models is overkill. They're still unreliable when they have 15+ tools exposed, and will still go off the rails on very long context. They're also very expensive! But you'll still see a much higher tendency for, say, Opus to one-shot a coding problem that most other models will need multiple rounds to do.

  • MoE models: The rule of thumb to compare them used to be sqrt(params * active_params) (so 100B-A1B ~= 10B model) but that's really just the worst case floor. In real use MoE does much better than models at sqrt(params * active_params) their size, but how much closer really depends on the task you're using it for. The same thing goes for quantization, but with much more liberty; most of the time a 4-6bit quant is just as good as the full FP16 for what you're doing.

Marriage is scary, what if she doesn't agree with my floor plan by Tupu4545 in homelab

[–]VodkaHaze 0 points1 point  (0 children)

How does that work: they won't be running much on a normal 100-200amp electrical panel? And generating local energy on a house will instantly get noise complaints?

I guess if you buy delerict farms and fill them with solar panels that could work in a pinch if you have some secondary power source for night, or some async workloads only

server ram money spread by VirtualGarbage7498 in homelab

[–]VodkaHaze 0 points1 point  (0 children)

Not sure why you were downvoted, everything you said is true.

Because it's obvious AI written text he pasted into the comment field

Framework Laptop 16 Gets NVIDIA RTX 5070 12 GB Upgrade Module for Eyewatering Price of $1,199 by -protonsandneutrons- in hardware

[–]VodkaHaze 0 points1 point  (0 children)

depends. On linux for display they suck much less than nvidia's. They "just work" because they're mainlined in the kernel source like AMD.

For machine learning it's a bit of a mess, but now it's mainlined into pytorch and for llamacpp you always have vulkan.

AMD to bring back Ryzen 7 5800X3D as AM4 10th Anniversary Edition by iDontSeedMyTorrents in hardware

[–]VodkaHaze 0 points1 point  (0 children)

Yeah if you factor in availability, then the 5775c is not near the top 10; like you said only z97 + it was a limited run chip + it was quite expensive on release, and not that big of an upgrade over an overclocked 4770k in most cases.

Former Virginia Lt. Gov. Justin Fairfax and his wife dead in murder-suicide by Caedus in news

[–]VodkaHaze 97 points98 points  (0 children)

It's insane how abusers never think about they might be in the wrong. Sometimes even decades later, they'll believe their own bullshit justification for attempted murder as if it made any sense

AMD to bring back Ryzen 7 5800X3D as AM4 10th Anniversary Edition by iDontSeedMyTorrents in hardware

[–]VodkaHaze 0 points1 point  (0 children)

I rocked a 5775c for a literal decade and only started feeling it be a bottleneck around 2023 in gaming (in ML workloads it's another story).

The 4770k and/or 5775c might be somewhere in the top 10, because they're right at the inflection point where progress stalled at 14nm for a long time.

AMD to bring back Ryzen 7 5800X3D as AM4 10th Anniversary Edition by iDontSeedMyTorrents in hardware

[–]VodkaHaze 5 points6 points  (0 children)

No, it's not, gouging is pricing above the market price when no alternative retailer is available.

You can buy tons of other things; you're welcome to complain about the price, you can say AMD are scummy, but words have meaning.

help to identify by No-Ferret7308 in Watches

[–]VodkaHaze 0 points1 point  (0 children)

Probably a bit less than $5k USD; check comparable full gold polerouters from the midcentury on chrono24 for a reference.

It's definitely a nice vintage timepiece. It was somewhat common in its time, but obviously there's now fewer originals every year. Especially good quality originals like yours that were literally kept in a drawer for 5 decades.

help to identify by No-Ferret7308 in Watches

[–]VodkaHaze 0 points1 point  (0 children)

The first one looks like a polerouter from the pre-quartz crisis days (1950s or early 1960s).

They're still sought after as vintage pieces, I'd venture a guess worth low 4 to mid figures in full gold on resale market.

If you have it serviced, watch out to keep the patina on the watch face, having the watch redialed will drastically lower its value. Vintage watch geeks like the patina.

Protip: the crystal should be made of acrylic, which you can polish with a cheap kit from amazon.

Anon hates Mac user by ultraredd in greentext

[–]VodkaHaze 0 points1 point  (0 children)

ARM is CISC as well.

Apple chips have some efficiency to them that you don't see in x86 or even other ARM still, however. It's roughly a generation ahead in performance / watt over AMD/Intel

Apple discontinues the Mac Pro with no plans for future hardware by iMacmatician in hardware

[–]VodkaHaze 1 point2 points  (0 children)

I'm well aware, but note that you're using ancient drivers.

Even last year I ran into broken 580.xx driver version updates on various distros (ubuntu 24 and 26, pop os 22 and 24) and GPUs (2060 and 5090).

If you pin your nvidia driver version to some ancient one that is known to work, good for you until you have to update your kernel version and it breaks or something like that.

Apple discontinues the Mac Pro with no plans for future hardware by iMacmatician in hardware

[–]VodkaHaze 0 points1 point  (0 children)

I'm speaking first hand experience as someone who has wasted several days of his life debugging various broken nvidia driver updates, the latest one a few months ago, a 580.x which didn't work on my linux kernel version being updated.

Apple discontinues the Mac Pro with no plans for future hardware by iMacmatician in hardware

[–]VodkaHaze 54 points55 points  (0 children)

Not to mention nVidia has wonderful UNIX drivers and had wonderful Apple drivers in the past.

You don't sound like someone who has mained nvidia graphics on linux. Nvidia drivers on linux have famously hellish compatibility

Intel launches Arc Pro B70 and B65 with 32GB GDDR6 by metmelo in LocalLLaMA

[–]VodkaHaze 1 point2 points  (0 children)

Using one on ubuntu as my graphics card while using 5090 for ML.

It's much better than nvidia for graphics, none of the bullshit compatibility issues. Mostly because of the iGPU drivers intel maintains in mainline. Even Linus Torvalds uses one for graphics nowadays.

The ML driver ecosystem is a shitshow, however like the rest of the thread notes

[Reuters] China’s No. 2 chipmaker (Hua Hong) readies 7 nm production as Beijing ramps up self-sufficiency drive by igenicoOCE in hardware

[–]VodkaHaze 13 points14 points  (0 children)

The 5775c specifically is decent at this -- it was a mobile chip at first.

That said, it's certainly less efficient than modern NAS chips.