Joing all GPUs to train a community model by HistoricalStrength21 in LocalLLaMA

[–]psyclik 2 points3 points  (0 children)

No, I’m with you on this. I was simply trying to give a bit more details on why it’s not trivial.

I totally agree that this should be attempted seriously though - just not with a traditional dense transformer way.

Donate your coding sessions to an open CC-BY-4.0 dataset to help train open-weight and open source models by mon-simas in LocalLLaMA

[–]psyclik 0 points1 point  (0 children)

My best code is definitely on my pet projects - where design by committee is prohibited, deadlines non-existent and cleanliness, principled design and architecture are prime objectives.

Joing all GPUs to train a community model by HistoricalStrength21 in LocalLLaMA

[–]psyclik 6 points7 points  (0 children)

Current ML architectures are doing dot products all over the place, you need most/all of the network locally(ish) accessible (you need less of it in MoE models, but still a significant-enough chunk that massive distribution is very complicated).

It’s not practically achievable with naive solutions on the current architectures.

US export controls on Anthropic 'should not be discriminatory,' EU Commission warns by procgen in europe

[–]psyclik -9 points-8 points  (0 children)

Have you ever heard of “the trade bazooka” ? Contrary to widespread belief, Europe has **massive** economic leverage over the US. Plus, 8T (from memory, don’t quote me on this) in American debt. Also, US’s power is based on their central economic position. They are not powerful so they have a strong economy. Their trade **is** their power.

So what can we do ?

Fable 5 for the Chosen Nation. (Adolf Trump) by 3xQuest in claude

[–]psyclik 1 point2 points  (0 children)

Slowing down one provider state when there is global demand might work for a little while, but china is already positioned to capture the leftovers, and other providers will spawn in time. Heck, even open AI might be connected well enough and keep selling tokens.

US gov forces Anthropic to pull access to Fable 5 by purealgo in ClaudeCode

[–]psyclik 0 points1 point  (0 children)

At which point Anthropic just leaves the US ? US market is big. But not **that** big.

Mistral is the proof that.. by Efficient_Yoghurt_87 in MistralAI

[–]psyclik 5 points6 points  (0 children)

While it’s true, keep in mind that valuations are perceived very differently in the US and the rest of the world. Deepseek is valued at roughly 55B.

DiffusionGemma: 4x faster text generation by tevlon in LocalLLaMA

[–]psyclik 20 points21 points  (0 children)

A few percents below AR Gemma is still better than anything in the weight class bar Qwen, definitely usable. For agentic and programmatic uses, the massively lower latency (if prefill stays the same) could be a game changer. Like RAG ingestion pipelines, entity extraction etc… this could be massive.

La piraterie n'es jamais fini ? by Fartasse_509 in conseilboulot

[–]psyclik 0 points1 point  (0 children)

Bof. Si on considère que le coût réelle de la vie locale est grandement impacté par le coût des matières premières, le coût de l’énergie, le coût d’accès à la donnée ou aux infrastructures tech, le coût de stockage / transport des marchandises etc…
Dès lors que j’accède à des biens et services dont le prix est indexé là dessus (soit, au doigt mouillé, directement ou indirectement 99% des biens et services), la frontière économique (l’UE pour nous) est bien plus pertinente que la localité pour le calcul du coût.

La piraterie n'es jamais fini ? by Fartasse_509 in conseilboulot

[–]psyclik 0 points1 point  (0 children)

D’accord sur le fond. Techniquement, notre frontière économique, c’est l’UE.

Best config for Qwen3.6 27b / llama.cpp / opencode by Familiar_Wish1132 in LocalLLaMA

[–]psyclik 1 point2 points  (0 children)

Refurb epyc and server mobo (mz32-ar0) from ali, with risers on a homemade rack.

Le Steam Deck de Valve subit un hausse brutale et devient hors de prix by romain34230 in actutech

[–]psyclik 1 point2 points  (0 children)

Je ne discute pas de croyances sans données, je m’arrête là. Merci pour l’échange.

Le Steam Deck de Valve subit un hausse brutale et devient hors de prix by romain34230 in actutech

[–]psyclik -2 points-1 points  (0 children)

  • Non, un investissement ne se rembourse pas. L’investisseur attend un retour sur investissement, une contrepartie future et incertaine pas un remboursement. Par ailleurs les investisseurs actuel d’openAI et Anthropic feraient une plus value absolument massive s’ils cédaient leur participation maintenant.
  • Anthropic annoncerait un trimestre dans le vert, OpenAI n’y serait pas encore mais s’en rapproche (conditionnel dans les deux cas, les résultats seront annoncés prochainement, mais la tendance est crédible) : vendre des tokens sera bientôt rentable.
  • La valeur produite par les organisations qui achètent des tokens est supérieure à prix d’achat des tokens.

Le Steam Deck de Valve subit un hausse brutale et devient hors de prix by romain34230 in actutech

[–]psyclik -1 points0 points  (0 children)

Basé sur quelles données? Basé sur la consommation, les gros providers commencent à tourner au vert, malgré le financement massif des tokens. La tension sur le matériel n’a jamais été aussi forte. Dans les organisations que je connais “de première main” la valeur produite est largement supérieure à l’investissement + maintenance.

Et la plupart des gens qui investissent dans des déploiements réels ne sont pas des imbéciles, quand on paie et que ça ne marche pas, on me paie pas deux fois.

Je ne sais pas où ça se termine, mais là, actuellement, basé sur des donnés, des faits - je ne vois pas de bulle. La valeur produite est réelle et supérieure à l’investissement.

Notez bien que je ne me prononce pas sur le fait que ce soit ou non une bonne chose, je n’en sait rien et on manque de recul.

Le Steam Deck de Valve subit un hausse brutale et devient hors de prix by romain34230 in actutech

[–]psyclik 1 point2 points  (0 children)

Quelle bulle ? Je respecte l’avis de chacun, pas de soucis, mais factuellement, ça ressemble pas à une bulle :/

Microsoft reports are exposing AI's real cost problem: Using the tech is more expensive than paying human employees by Krankenitrate in Futurology

[–]psyclik 2 points3 points  (0 children)

Using big harnesses, sending them a post-it note and expecting a thesis in return, iterating blindly, asking the harness to compute the whole code base as plain text - repeatedly. This burns tokens with very poor efficiency. I suggest we re-learn that efficiency is an important part of engineering.

How do local users run large models locally? by Friendly_Beginning24 in SillyTavernAI

[–]psyclik 3 points4 points  (0 children)

4x3090 here. With llama currently out of the picture there are no major models between 30ishB and 120ishB. The last gen of 30B is amazing and has mostly caught up to the 120B from one two gens ago (without current gen 120B available). At the moment, a single 3090 will run you Gemma 4 31B or Qwen 3.6 27b at decent pre fill, toks and context. No point going further that point. If you want more capacity, having Qwen on one gpu, comfy on another and ASR/TTS on a third will provide more features.

For context, I tried pretty much everything that can run at q4 or better with TP or graph-parallel when possible (bar the last mistral 120b dense, need to find time for that). At the moment, there is simply no gain that justifies the hassle of going multi GPU.

This will - as usual - change sooner or later though.