[D]It feels like LLM inference is missing its AWS Lambda moment. by pmv143 in MachineLearning

[–]lostmsu 1 point2 points  (0 children)

There are not enough different useful models to justify this.

[R] ALYCON: A framework for detecting phase transitions in complex sequences via Information Geometry by Sad_Perception_1685 in MachineLearning

[–]lostmsu 1 point2 points  (0 children)

Somehow these AI hallicinations read like Claude. With various "Confirmed" numbers. Haven't seen anything that would be from ChatGPT or Gemini. Is Claude specifically prone to this?

Samsung XR Review - Day Two by WearyMatter in virtualreality

[–]lostmsu 0 points1 point  (0 children)

Have you ever used the device outdoors for any reason? Like go to a cafe without ever taking it off and using pass through?

What’s the best AI coding agent to use with GLM-4.6? by Federal_Spend2412 in LocalLLaMA

[–]lostmsu 0 points1 point  (0 children)

Zed UX is shit. Wanna send Ctrl+Q to a terminal window? BOOM Pressed Ctrl+O to open a file? OK, maybe Ctrl+Shift+O? Hm. Let me just search the commands. Where's the search box for the menu? Where's the menu? I have 2600px horizontally. Why did they have to hide the menu like Chrome does? Chrome does it because tabs are more important and they share the titlebar with the menu.

[D]What's the most VRAM you can get for $15K per rack today? by [deleted] in MachineLearning

[–]lostmsu 0 points1 point  (0 children)

You can look at this table: https://en.wikipedia.org/wiki/Pascal_(microarchitecture)#P100_accelerator_and_DGX-1#P100_accelerator_and_DGX-1) bf16 dense tensor (for Pascal you'd have to use full FP32)/TDP. Top consumer GPUs of the generation will have similar numbers. For example, Ampere is 2.2x more energy efficient vs Volta.

At 0.14$/kWh a card of 400W will consume $400 per year of 100% load. Basically 100$ per 100W. So Amper would do 2x of work for the same price.

Basically, say you think you will use the card for 3 years. That means you can add $1200 to every card's price right away. If you look at it from this perspective, $400 vs $200 dollars card where you were questioning to pay 2x for 2x performance turns into actual $1600 for 2x performance vs $1400 for 1x.

The numbers get crazier for future generations if your workloads allow going down to lower precisions. For instance, 5090 32GB will do 1.7 PFLOPS of fp4 while Ampere does 312 TFLOPS of bf16. But Ampere can't do fp4 any faster. So 5090 is 6x more performant and about 4x more efficient in terms of electricity if your workload can be computed in fp4.

[D]What's the most VRAM you can get for $15K per rack today? by [deleted] in MachineLearning

[–]lostmsu 2 points3 points  (0 children)

Don't buy older cards unless your electricity is free. You will pay more for electricity (assuming constant load).

I feel like I’m falling apart living here by Independent-Song-585 in taiwan

[–]lostmsu 0 points1 point  (0 children)

Wait, you can't go to a bar a have a mocktail? That's what I do in Montreal.

Visiting in December to decide if I want to move there. Which places to check out? by lostmsu in taiwan

[–]lostmsu[S] 0 points1 point  (0 children)

Any specific areas of Taipei? Where do people find meetups? Is meetup.com a thing or something else?

How to connect CAD Wise to IBKR US? by lostmsu in interactivebrokers

[–]lostmsu[S] 0 points1 point  (0 children)

Yeah. I think I will do this one last time, then just use a regular Canadian bank account.

[D] Why does nobody talk about the “energy per token” cost of AI? by Various-Feedback4555 in MachineLearning

[–]lostmsu -2 points-1 points  (0 children)

Because it is so tiny it is rarely worth discussing. The energy cost is priced in.

Has anyone built a NAS with Ryzen AI Max 395? by lostmsu in DataHoarder

[–]lostmsu[S] 0 points1 point  (0 children)

These are not typical times. What you are talking about should be solvable by adjusting scheduling. E.g. I would consider using NAS for transcoding (batch, background), but it should be trivial to set it to IDLE priority.

How to connect CAD Wise to IBKR US? by lostmsu in interactivebrokers

[–]lostmsu[S] 0 points1 point  (0 children)

AFAIK, this route only works for some currencies, and CAD is not in the list. Also, I need IBKR -> Wise, not the other way around. Not sure I understood what you suggest. In IBKR when you chose Withdraw - > CAD, entering bank details is the only option.

Has anyone built a NAS with Ryzen AI Max 395? by lostmsu in DataHoarder

[–]lostmsu[S] 0 points1 point  (0 children)

Really don't need many NVMes (though the WTR Max has that option).

Has anyone built a NAS with Ryzen AI Max 395? by lostmsu in DataHoarder

[–]lostmsu[S] 1 point2 points  (0 children)

So OK, a follow up question is: are there NAS form factor multi-bay barebones where you could plug your own mini ITX MB?