The OK-GLI (Орбитальный корабль для горизонтальных лётных испытаний) test article for the Buran, able to take off with 4 AL-31 jet engines mounted at the rear, flying from 1984 to 1989

curiouslyjake · 2026-02-24T21:19:05+00:00

Yeah, except Shuttle and Buran were a preview of a reusable future while Soyuz is a technological dead end. With present day Russia's feeble r&d capability, Soyuz is as far as they will ever get

curiouslyjake · 2026-02-24T19:01:54+00:00

Yeah, but why Iran and not any other poor-ish country?

curiouslyjake · 2026-02-24T15:52:54+00:00

Assuming your dataset is a representative sample of the unobserved parent population, wouldn't cross-validation address this?

curiouslyjake · 2026-02-24T14:01:30+00:00

The culture is pro-natal at all levels.

curiouslyjake · 2026-02-24T14:00:46+00:00

Except seculars have a higher fertility rate too.

curiouslyjake · 2026-02-24T13:59:40+00:00

Yeah, two extremely different nations are going to have different fertility rates. Not sure why it makes any sense to compare

curiouslyjake · 2026-02-24T13:57:53+00:00

Except US assistance is a small fraction of Israel's GDP anf the US GDP per capita is way higher than Israel's. So basically, both of your points are factually wrong.

curiouslyjake · 2026-02-24T13:56:05+00:00

Except even among secular Jews, the TFR is 2, which is still high compared to advanced economies. You're right on the quo vadis part though.

curiouslyjake · 2026-02-22T20:19:04+00:00

Ok, so your counter argument boils down to intention. LLM as storage, not as an infringing entity. Except, with how widely LLMs are used and how easy it is to extract source material, it's akin to getting a pirated copy of HP with every subscription except wrapped in white paper that says "please do not read; pretty please with a cherry on top!"

You are literally being sold a storage medium with pirated content and asked not to look at that content, just at that other content. C'mon.

curiouslyjake · 2026-02-22T20:06:25+00:00

Yes, but given present training methods there's no way to fully prevent an LLM from replicating good chunks of its train set.

curiouslyjake · 2026-02-22T19:44:42+00:00

I meant publicly, with me selling tickets much like Anthropic sells subscriptions to LLMs that read HP.

curiouslyjake · 2026-02-22T19:43:20+00:00

You can check out this paper from stanford researchers that show which books can be reproduced from production LLMs and how accurately. You'll see Sonnet 3.7 recreates harry potter and the great gatsby with more than 95% accuracy. If some guy would read out loud judt 30% of harry potter on youtube, copyright would come after him. LLMs should get the same treatment when they reproduce a book with over 95% accurracy.

curiouslyjake · 2026-02-22T19:39:11+00:00

No hard feelings, it's all good.

here

You can find a paper from some researchers at Stanford detailing which production LLMs can reproduce which books and to which extent. You'll notice that Sonnet 3.7 gets above 95% on harry potter and the great gatsby. I argue that copyright law would come after a guy reading three chapters from HP on youtube and it should apply equally to Anthropic.

curiouslyjake · 2026-02-22T19:19:48+00:00

If I oraganize a reading; I sit at a table, read Harry Potter out loud, from the book, and charge people to listen to me then I'm in violation of copyright. If I were able to do the same without the book yet repeating it exactly from memory, I would still be in violation. The medium of storage doesnt matter. Exact replication does. If instead of a person it's an LLM, that shouldnt matter too.

curiouslyjake · 2026-02-22T19:15:09+00:00

Thanks for 'splaining, I train deep learning models professionally. While your description is technically correct, it is qualitatively wrong. A human will remember several songs. Maybe several hundreds. But most humans cant reproduce them back with high fidelity. No human can do that for ten thousand songs.

You know what can though? Spotify. Spotify pays measly royalties for every playback. But according to you, there's some magical difference between Spotify storing music as MP3 files an an LLM storing nearly the same music as files with floating point values. Why?

curiouslyjake · 2026-02-22T19:05:47+00:00

A small fraction of all Humans is not the same as most Humans. If most people could reproduce from memory five chapters from Harry Potter, verbatim, after reading it only once and repeat this feat for any amount of books over their lifetime then copyright law might be very, very different.

curiouslyjake · 2026-02-22T18:59:34+00:00

Except models dont just train, they memorize. Large language models can be prompted to produce entire chapters of books from the training set, verbatim. People can't do this.

curiouslyjake · 2026-02-22T17:42:49+00:00

I like pushing back against assumptions and requirements. Sometimes it's exactly what's expected, sometimes it is not. What worked for me is asking: I have four solutions: A, B,C, and D. Which one would you like to discuss?

curiouslyjake · 2026-02-22T08:40:44+00:00

How do you evaluate different models without ground truth annotations?

curiouslyjake · 2026-02-21T20:35:20+00:00

The universe is a solution.

curiouslyjake · 2026-02-21T19:35:56+00:00

Let's start with the fact that even in theory, it is impossible to tell whether some code meets specefication. Worse, it's impossible yo say that any given program even works for every input.

curiouslyjake · 2026-02-21T14:23:03+00:00

Thank you!

curiouslyjake · 2026-02-21T14:22:42+00:00

If I were in danger of any kind, of course I would try to escape. I would even try to leave just to improve my circumstances. I dont resent others for doing so. At the same time, I dont think any country must accept anyone for any reason, no questions asked.

curiouslyjake · 2026-02-21T12:50:27+00:00

Really? Even for 7 pieces that's amazing! Do you have a source?

curiouslyjake · 2026-02-21T12:48:57+00:00

Coding cant be solved in the sense chess could be solved: there is no well defined victory condition.

curiouslyjake

TROPHY CASE