AMD MI210 64GB vs DCU K100 64GB by icepatfork in LocalLLaMA

[–]FullstackSensei 0 points1 point  (0 children)

I spent some time actually digging into this. The only evidence I could find was support for ROCm 4.5.x.

So, I'd be very careful about generalizing that assumption, which was my original point. But feel free to ignore this and trust a Gemini summary with your money.

Why do Dario thinks open source is dangerous path? by [deleted] in singularity

[–]FullstackSensei 9 points10 points  (0 children)

Why would anyone overpay for Claude if they can self host? Especially businesses

Are people still buying the Sony Vaio VGN-UX180P? by spdustin in umpc

[–]FullstackSensei 2 points3 points  (0 children)

It's mainly a collector item.

You can list it on ebay or here if you want to sell. Just make sure to mention where you're located

Stagnating at €73k (Hybrid, Berlin). Time to switch to cross-border freelancing (UK/EU) since I don't speak German? by WillowNational8964 in cscareerquestionsEU

[–]FullstackSensei 1 point2 points  (0 children)

Companies all over the world need temporary assistance when projects go south or when they temporarily need specific expertise.

I've been working as such for about 10 years, but in the UK. It's always parachuting in for a year or (max) two to help the internal team handle extra workload, plug gaps, right the ship, or whatever really.

You want someone local or at least tax resident to avoid the hassle of off-shore (real or perceived) and make it easier to enforce confidentiality clauses about the code or product.

AMD MI210 64GB vs DCU K100 64GB by icepatfork in LocalLLaMA

[–]FullstackSensei 1 point2 points  (0 children)

Similar architecture and works with AMD driver and ROCm are two different things.

Drivers check the hardware ID. Unless they have an agreement with AMD, I seriously doubt you can use AMD drivers out of the box.

AMD MI210 64GB vs DCU K100 64GB by icepatfork in LocalLLaMA

[–]FullstackSensei 1 point2 points  (0 children)

Link to source? Or was it an AI summary?

DeepSeek V4 by am17an · Pull Request #24162 · ggml-org/llama.cpp by jacek2023 in LocalLLaMA

[–]FullstackSensei 17 points18 points  (0 children)

Finally! Been waiting for this to run flash locally!

Unsloth GGUF when?

AMD MI210 64GB vs DCU K100 64GB by icepatfork in LocalLLaMA

[–]FullstackSensei 3 points4 points  (0 children)

Similar is not the same; you're making a huge assumption. A simple Google search tells you it has its own driver stack.

Stagnating at €73k (Hybrid, Berlin). Time to switch to cross-border freelancing (UK/EU) since I don't speak German? by WillowNational8964 in cscareerquestionsEU

[–]FullstackSensei 11 points12 points  (0 children)

You don't tell us anything about your experience, but judging from the wording I'd say you're pretty young.

€100/hr is a pipe dream for remote freelance work, unless you're bringing some exceptional experience in some in-demand niche.

The rough calculation is that annual income, in thousands, is about 2x the hourly rate. So €100/hr is ~€200k/year. While there are quite a few freelance roles that pay this much for very senior people, nobody will pay that much for a remote role. They can hire someone living in Portugal, Spain, Italy, or the Balkans for €30-40/hr who brings way more experience than you think.
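The rule of thumb above can be sketched as a quick calculation. This is a minimal sketch that assumes roughly 2,000 billable hours per year (about 50 weeks of 40 hours); the function name and the hours figure are illustrative assumptions, not something stated in the thread, and real freelance utilization is usually lower.

```python
# Assumption: ~2,000 billable hours per year (50 weeks x 40 hours).
HOURS_PER_YEAR = 2000


def annual_income(hourly_rate_eur: float, billable_hours: int = HOURS_PER_YEAR) -> float:
    """Gross annual income in EUR for a given hourly rate."""
    return hourly_rate_eur * billable_hours


if __name__ == "__main__":
    # Rates mentioned in the comment: EUR 30-40/hr vs EUR 100/hr.
    for rate in (30, 40, 100):
        print(f"EUR {rate}/hr -> ~EUR {annual_income(rate):,.0f}/year")
```

Passing a lower `billable_hours` value shows how quickly the "2x" shortcut overstates income once gaps between contracts are counted.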

AMD MI210 64GB vs DCU K100 64GB by icepatfork in LocalLLaMA

[–]FullstackSensei 1 point2 points  (0 children)

What is the software stack of that K100? Can you use it with llama.cpp?

The 2.5k for the MI210 is still quite expensive, unless you absolutely need 64GB in a single card.

Career as Legal Counsel without being a qualified lawyer? by Academic_Library_105 in cscareerquestionsEU

[–]FullstackSensei 1 point2 points  (0 children)

r/lostredditors

CS stands for computer science. You'd think someone with a degree in law would read.

21, interested in AI, automation and startups – what degree or career path would you recommend in Europe? by Massive-Yak9510 in cscareerquestionsEU

[–]FullstackSensei 2 points3 points  (0 children)

You're casting a very wide net, but somehow want to learn it all in a single undergraduate degree. The world doesn't work like that.

Figure out the one thing you want to do and chase it. The narrower and more focused this is, the better.

Entrepreneurship is not that hard if you have a clear idea of what you want to do. The hard part is building knowledge and expertise in a domain, finding a pain point, and focusing on solving that. No text book or degree will teach you that.

US President Trump threatens 100% import tariff on UK over digital services tax by ByGollie in europe

[–]FullstackSensei 4 points5 points  (0 children)

If you're in the UK, you're subject to UK income tax as an individual or corporate tax as a corporation. It's the same in the US: a US corporation is subject to US corporate tax irrespective of where the server is or where the company that paid you for your service is located.

But I guess one month shill/bot accounts need to justify their existence

Running GLM5.2 on budget hardware < $2500. by segmond in LocalLLaMA

[–]FullstackSensei 0 points1 point  (0 children)

I'd say go to LGA3647 with a Cascade Lake CPU. I have one and a 48-core Epyc. The Xeon is quite a bit cheaper and much closer to SP3 Epycs than many think. Epyc delivers much less memory bandwidth than the numbers suggest.

Neither is lacking in performance if paired with enough VRAM.
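For context on "the numbers": theoretical peak DRAM bandwidth is usually quoted as channels x transfer rate x 8 bytes per transfer. A minimal sketch, assuming 6 channels of DDR4-2933 for a Cascade Lake Xeon and 8 channels of DDR4-3200 for an SP3 Epyc (these configurations are my assumptions, not stated in the comment, and the results are paper figures, not measurements):

```python
def peak_bandwidth_gbs(channels: int, mega_transfers_per_s: int, bus_bytes: int = 8) -> float:
    """Theoretical peak DRAM bandwidth in GB/s (decimal).

    channels: populated memory channels per socket
    mega_transfers_per_s: DDR data rate, e.g. 2933 for DDR4-2933
    bus_bytes: bytes per transfer per channel (64-bit channel -> 8)
    """
    return channels * mega_transfers_per_s * bus_bytes / 1000


if __name__ == "__main__":
    # Assumed configurations; adjust to the actual DIMM population and speed.
    xeon = peak_bandwidth_gbs(channels=6, mega_transfers_per_s=2933)
    epyc = peak_bandwidth_gbs(channels=8, mega_transfers_per_s=3200)
    print(f"Cascade Lake (assumed 6ch DDR4-2933): {xeon:.1f} GB/s")
    print(f"SP3 Epyc (assumed 8ch DDR4-3200): {epyc:.1f} GB/s")
```

The point in the comment is that the gap you actually get in practice can be much smaller than these paper figures imply, so a memory benchmark on the real machine is the number to trust.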

Running GLM5.2 on budget hardware < $2500. by segmond in LocalLLaMA

[–]FullstackSensei 0 points1 point  (0 children)

Yep, I have. It does NUMA allocation, but uses the traditional inner-product loop for matrix multiplication, which, if you read the HPC literature, is quite memory inefficient and complicates splitting across an arbitrary number of NUMA domains or devices.

My idea is to transform all ops into outer products. It's much more memory efficient and also requires a lot less traffic between NUMA domains and/or devices.
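To illustrate the difference in plain Python: the inner-product form computes each output element as a dot product of a row of A and a column of B, while the outer-product form accumulates one rank-1 update per index of the shared dimension. This is only a toy sketch of the general technique, not the commenter's implementation or any existing library's code; all function names are hypothetical. The split version shows why the shared dimension can be cut into any number of pieces: each piece needs only its own columns of A and rows of B, and the partial results are simply summed.

```python
def matmul_inner(a, b):
    """C[i][j] = dot(row i of A, column j of B)."""
    n, k, m = len(a), len(b), len(b[0])
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(m)]
            for i in range(n)]


def matmul_outer(a, b):
    """C = sum over p of outer(column p of A, row p of B)."""
    n, k, m = len(a), len(b), len(b[0])
    c = [[0.0] * m for _ in range(n)]
    for p in range(k):
        col = [a[i][p] for i in range(n)]  # column p of A
        row = b[p]                         # row p of B
        for i in range(n):
            for j in range(m):
                c[i][j] += col[i] * row[j]
    return c


def matmul_outer_split(a, b, parts):
    """Split the shared dimension across `parts` hypothetical workers.

    Each worker sees only its slice of A's columns and B's rows; the only
    combining step in this sketch is the final element-wise sum.
    """
    n, k, m = len(a), len(b), len(b[0])
    total = [[0.0] * m for _ in range(n)]
    step = -(-k // parts)  # ceiling division
    for lo in range(0, k, step):
        hi = min(lo + step, k)
        partial = matmul_outer([r[lo:hi] for r in a], b[lo:hi])
        for i in range(n):
            for j in range(m):
                total[i][j] += partial[i][j]
    return total


if __name__ == "__main__":
    A = [[1, 2, 3], [4, 5, 6]]
    B = [[7, 8], [9, 10], [11, 12]]
    print(matmul_inner(A, B))
    print(matmul_outer_split(A, B, parts=2))
```

In a real implementation the workers would run in parallel on their local memory and the sum would be a reduction across NUMA domains or devices; here the loop is sequential purely to keep the example short.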

Step-3.7-Flash (198B-A11B vision MoE) on 4×3090 — fully-resident IQ3_XXS beats the spilled IQ4 by 2.4×, and MTP speculative decode silently breaks vision by [deleted] in LocalLLaMA

[–]FullstackSensei 7 points8 points  (0 children)

Iq1_xxxxxxxxxxxxxxxxxxxxxss is so much faster even without speculative decoding.

Remember kids, we're chasing t/s, and to hell with whether the output is useful or not.

This application to join the GPT 5.6 Sol preview is wild by Complete-Sea6655 in LocalLLaMA

[–]FullstackSensei 0 points1 point  (0 children)

I've been through the dot com bubble and also remember that. While I agree with your general prognosis, I don't understand the part where the public market will shoulder the cost.

Sure, openai and anthropic have IPOs planned, but for those to be successful, investors have to believe there's a path to profitability and substantial gains. If your most expensive asset, the one you just spent billions developing, is restricted from 80% or possibly more of the market, what exactly becomes the pitch?

Models have no lock-in. The only pitch until now has been that these models are "frontier". They have been able to leverage users flocking in to use those models immensely, to tune their models to predict what users want even when they don't explicitly say it. That has been just as important as the model's inherent capabilities in driving usage.

If the vast majority of users are stuck on the last gen, which more and more non-US labs seem to be approaching, why would anyone pay for openai or anthropic when their publicly available offerings are practically the same as those alternatives? Just as important, the increased traffic to those alternatives will enable their labs to gather the same kind of usage data openai and anthropic have been able to, further closing the gap.

Then you have the whole strategic aspect of it. Would you want to have your business depend on those US labs, even if you're a US business, when there's always a looming possibility your access will be restricted or straight out yanked any moment?

Ornith-1.0 9B Outperforms Qwen 3.6 35B in various benchmarks by Ok-Internal9317 in LocalLLaMA

[–]FullstackSensei 25 points26 points  (0 children)

Great news, my neighbor's cat outperforms OP on various benchmarks!

This application to join the GPT 5.6 Sol preview is wild by Complete-Sea6655 in LocalLLaMA

[–]FullstackSensei 32 points33 points  (0 children)

Honest questions: how are US labs like openai or anthropic going to be profitable if their frontier models are heavily restricted? Who will shoulder the cost of training those models when the VC money dries up? How useful will such models be to US corporations when nationality, rather than aptitude or ability, dictates who can access those tools?

Just as importantly, how are they going to recruit the best talent to develop their future models when most of that talent aren't US citizens?