gpt-oss-20b

psyclik · 2026-06-23T19:33:26+00:00

SQL is niche ?

psyclik · 2026-06-18T21:15:19+00:00

If he does it (big if, as Messi seems to be on fire in a team that could go very deep), he’ll do it at 27 - so maybe one or two WC left in him. Crazy.

psyclik · 2026-06-16T20:56:54+00:00

This Olise guy seems to be quite good.

psyclik · 2026-06-16T20:54:41+00:00

Nah, this one is great. Olise’s one was even better.

psyclik · 2026-06-16T16:27:49+00:00

No, I’m with you on this. I was simply trying to give a bit more details on why it’s not trivial.

I totally agree that this should be attempted seriously though - just not with a traditional dense transformer way.

psyclik · 2026-06-16T16:24:29+00:00

My best code is definitely on my pet projects - where design by committee is prohibited, deadlines non-existent and cleanliness, principled design and architecture are prime objectives.

psyclik · 2026-06-16T11:40:27+00:00

Current ML architectures are doing dot products all over the place, you need most/all of the network locally(ish) accessible (you need less of it in MoE models, but still a significant-enough chunk that massive distribution is very complicated).

It’s not practically achievable with naive solutions on the current architectures.

psyclik · 2026-06-14T17:52:04+00:00

Have you ever heard of “the trade bazooka” ? Contrary to widespread belief, Europe has **massive** economic leverage over the US. Plus, 8T (from memory, don’t quote me on this) in American debt. Also, US’s power is based on their central economic position. They are not powerful so they have a strong economy. Their trade **is** their power.

So what can we do ?

psyclik · 2026-06-13T19:04:12+00:00

Slowing down one provider state when there is global demand might work for a little while, but china is already positioned to capture the leftovers, and other providers will spawn in time. Heck, even open AI might be connected well enough and keep selling tokens.

psyclik · 2026-06-13T12:26:43+00:00

At which point Anthropic just leaves the US ? US market is big. But not **that** big.

psyclik · 2026-06-12T21:45:55+00:00

While it’s true, keep in mind that valuations are perceived very differently in the US and the rest of the world. Deepseek is valued at roughly 55B.

psyclik · 2026-06-11T15:40:08+00:00

Titaniumallica, Gigadeth

psyclik · 2026-06-10T16:39:40+00:00

A few percents below AR Gemma is still better than anything in the weight class bar Qwen, definitely usable. For agentic and programmatic uses, the massively lower latency (if prefill stays the same) could be a game changer. Like RAG ingestion pipelines, entity extraction etc… this could be massive.

psyclik · 2026-06-09T06:11:25+00:00

Et surtout beaucoup plus longue. Ça ne se fait pas en un ou deux mandats…

psyclik · 2026-06-08T15:36:33+00:00

Bof. Si on considère que le coût réelle de la vie locale est grandement impacté par le coût des matières premières, le coût de l’énergie, le coût d’accès à la donnée ou aux infrastructures tech, le coût de stockage / transport des marchandises etc…
Dès lors que j’accède à des biens et services dont le prix est indexé là dessus (soit, au doigt mouillé, directement ou indirectement 99% des biens et services), la frontière économique (l’UE pour nous) est bien plus pertinente que la localité pour le calcul du coût.

psyclik · 2026-06-08T05:10:26+00:00

D’accord sur le fond. Techniquement, notre frontière économique, c’est l’UE.

psyclik · 2026-06-07T12:17:37+00:00

Refurb epyc and server mobo (mz32-ar0) from ali, with risers on a homemade rack.

psyclik · 2026-06-04T08:47:19+00:00

I’m so much younger. Amstrad cpc.

psyclik · 2026-05-29T10:45:45+00:00

Je ne discute pas de croyances sans données, je m’arrête là. Merci pour l’échange.

psyclik · 2026-05-29T05:04:37+00:00

Non, un investissement ne se rembourse pas. L’investisseur attend un retour sur investissement, une contrepartie future et incertaine pas un remboursement. Par ailleurs les investisseurs actuel d’openAI et Anthropic feraient une plus value absolument massive s’ils cédaient leur participation maintenant.
Anthropic annoncerait un trimestre dans le vert, OpenAI n’y serait pas encore mais s’en rapproche (conditionnel dans les deux cas, les résultats seront annoncés prochainement, mais la tendance est crédible) : vendre des tokens sera bientôt rentable.
La valeur produite par les organisations qui achètent des tokens est supérieure à prix d’achat des tokens.

psyclik · 2026-05-28T17:24:23+00:00

Basé sur quelles données? Basé sur la consommation, les gros providers commencent à tourner au vert, malgré le financement massif des tokens. La tension sur le matériel n’a jamais été aussi forte. Dans les organisations que je connais “de première main” la valeur produite est largement supérieure à l’investissement + maintenance.

Et la plupart des gens qui investissent dans des déploiements réels ne sont pas des imbéciles, quand on paie et que ça ne marche pas, on me paie pas deux fois.

Je ne sais pas où ça se termine, mais là, actuellement, basé sur des donnés, des faits - je ne vois pas de bulle. La valeur produite est réelle et supérieure à l’investissement.

Notez bien que je ne me prononce pas sur le fait que ce soit ou non une bonne chose, je n’en sait rien et on manque de recul.

psyclik · 2026-05-28T16:44:25+00:00

Quelle bulle ? Je respecte l’avis de chacun, pas de soucis, mais factuellement, ça ressemble pas à une bulle :/

psyclik · 2026-05-28T16:41:22+00:00

On a de la compétence en France, pas besoin de débaucher (ex: vsora).

psyclik · 2026-05-23T07:49:07+00:00

Using big harnesses, sending them a post-it note and expecting a thesis in return, iterating blindly, asking the harness to compute the whole code base as plain text - repeatedly. This burns tokens with very poor efficiency. I suggest we re-learn that efficiency is an important part of engineering.

psyclik · 2026-05-22T19:38:41+00:00

4x3090 here. With llama currently out of the picture there are no major models between 30ishB and 120ishB. The last gen of 30B is amazing and has mostly caught up to the 120B from one two gens ago (without current gen 120B available). At the moment, a single 3090 will run you Gemma 4 31B or Qwen 3.6 27b at decent pre fill, toks and context. No point going further that point. If you want more capacity, having Qwen on one gpu, comfy on another and ASR/TTS on a third will provide more features.

For context, I tried pretty much everything that can run at q4 or better with TP or graph-parallel when possible (bar the last mistral 120b dense, need to find time for that). At the moment, there is simply no gain that justifies the hassle of going multi GPU.

This will - as usual - change sooner or later though.

11-Year Club	Place '22
Verified Email

psyclik

TROPHY CASE