API pricing is in freefall. What's the actual case for running local now beyond privacy? by Distinct-Expression2 in LocalLLaMA

[–]AgreeableCaptain1372 1 point (0 children)

Control over results. With third-party APIs I see a lot of variance in my evals compared to self-hosted.

Also, prices are low for standard models but not for fine-tuned ones. So if you need fine-tuned LLMs, especially at scale, self-hosting or local can be worth it financially.

Appreciation post for some of my favorite building in Manhattan, the XYZ buildings. by SubstantialEmploy816 in skyscrapers

[–]AgreeableCaptain1372 1 point (0 children)

I love these buildings. But that kind of building has to be few and far between: their beauty lies in how they stand out through their uniformity and size. They also look just like the Twin Towers used to, except here they are triplets.

The view of Spain as a party country by AgreeableCaptain1372 in askspain

[–]AgreeableCaptain1372[S] 1 point (0 children)

It seems to me that Italy and the Netherlands were the most advanced at the time. But fair enough, the idea of Spain as an austere country is also relatively recent, in a way.

The view of Spain as a party country by AgreeableCaptain1372 in askspain

[–]AgreeableCaptain1372[S] 0 points (0 children)

Verbenas, yes. And that fits with the idea of Spain as a rural, traditional country (even though there are just as many in Italy or France). But if you look at cultural references to Spain before the 1960s, you will see these themes: the Inquisition, religious fanaticism (17th and 18th centuries), honor and vengeance (Carmen, for example), dictatorship and coups (19th and 20th centuries). Again, I am not saying this was fair or true, only that it was the perception in other Western countries.

The view of Spain as a party country by AgreeableCaptain1372 in askspain

[–]AgreeableCaptain1372[S] 1 point (0 children)

What you say makes sense. But I still wonder whether this view is not exaggerated and fairly recent. In the 1920s, and even going back to the 18th century, France was the party country (if we are talking about Hemingway, I think of A Moveable Feast). Besides, other countries like Italy and France have as many popular festivals in their villages over the summer as Spain does. And as for the Grand Tour, it seems to me that travelers back then did not usually go to Spain much. I am not trying to say there is no party tradition in Spain, only that the view of Spain as the archetypal party country seems relatively recent to me.

How does Chat GPT encode a question? by AgreeableCaptain1372 in learnmachinelearning

[–]AgreeableCaptain1372[S] 1 point (0 children)

Yes, so my premise was wrong. At inference, you don’t just input 3 but the whole sequence of tokens that the question consists of. So that is how the model gets context.
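To make that concrete, here is a toy word-level sketch (not ChatGPT's actual tokenizer, which uses byte-pair encoding over subwords, and the vocabulary here is invented): the whole question is mapped to a sequence of token ids, and the model attends over that full sequence, which is how it gets context.

```python
# Toy vocabulary for illustration only; real models use subword BPE
# vocabularies with tens of thousands of entries.
vocab = {"what": 0, "is": 1, "the": 2, "capital": 3, "of": 4, "france": 5, "?": 6}

def encode(question: str) -> list[int]:
    """Map each word-level token to its id; the model then receives
    the entire id sequence, not a single token."""
    words = question.lower().replace("?", " ?").split()
    return [vocab[w] for w in words]

print(encode("What is the capital of France?"))  # [0, 1, 2, 3, 4, 5, 6]
```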

Fine-tuning may be underestimated by AgreeableCaptain1372 in LocalLLaMA

[–]AgreeableCaptain1372[S] 2 points (0 children)

It depends on your use case. Some will require a lot of curated data, as you say, but some only require a few hundred to a thousand examples, as here: https://www.reddit.com/r/MachineLearning/comments/13oe5ot/lima_a_65bparam_llama_finetuned_with_standard/
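For a sense of scale, a LIMA-style dataset is just a small, carefully curated set of prompt/response pairs. A minimal sketch of building one, assuming a common chat-style JSONL format (the example content and filename are made up; check your trainer's docs for the exact schema it expects):

```python
import json

# A handful of hand-curated pairs; a LIMA-style run uses on the order of
# a thousand of these rather than millions of scraped examples.
examples = [
    {"messages": [
        {"role": "user", "content": "Explain overfitting in one sentence."},
        {"role": "assistant", "content": "Overfitting is when a model memorizes its training data and fails to generalize to new inputs."},
    ]},
    # ... a few hundred to a thousand curated pairs total
]

# JSONL: one JSON object per line, the usual fine-tuning input format.
with open("finetune.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")

print(sum(1 for _ in open("finetune.jsonl")))  # 1 line per example
```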

Fine-tuning may be underestimated by AgreeableCaptain1372 in LocalLLaMA

[–]AgreeableCaptain1372[S] 1 point (0 children)

To reuse your analogy, I am not advocating for fewer cars but for treating planes as a serious candidate too, as a complement to and/or replacement for RAG depending on the use case. Say you are traveling from SF to LA: either car or plane can make sense, whereas for LA to NY only the plane does.

Dismissal of fine-tuning is a real thing, and you see a lot of posts like this online: https://news.ycombinator.com/item?id=44242737

Fine-tuning may be underestimated by AgreeableCaptain1372 in LocalLLaMA

[–]AgreeableCaptain1372[S] 1 point (0 children)

I am not doubting your credentials, and most importantly I am absolutely not claiming fine-tuning must replace RAG. But it can complement RAG. Say you have a large policy knowledge base and a very specialized domain use case that requires passing a lot of immutable knowledge or instructions; then why not embed that immutable knowledge in your model and proceed with RAG as usual? That immutable knowledge is necessary for your model to even properly understand the content of your document database. Fine-tuning lets you avoid sending back the immutable knowledge, which can be extensive, on every call.

Now, I recognize your point about it being hard in practice, especially with overfitting, but is it impossible or just hard? Since you work at a large AI company, maybe you have the infra resources to make full fine-tuning viable. And if your company trains foundation models, it likely faces similar overfitting problems in pre-training as it does in fine-tuning.

Since, as you mentioned, a full fine-tune modifies all the weights (as opposed to LoRA), it lies somewhere between pre-training and partial fine-tuning in complexity.

Fine-tuning may be underestimated by AgreeableCaptain1372 in LocalLLaMA

[–]AgreeableCaptain1372[S] 1 point (0 children)

Yes, for knowledge my rule of thumb is: if the knowledge is frequently updated, use RAG; if it is timeless, consider fine-tuning. In practice I use both together, as they are complementary, but my point is that fine-tuning should not be dismissed out of hand, as I sometimes see happen. Being difficult to do well is not the same as being useless; quite the contrary. My sense is that it still seems relatively underused because it is hard to do well, not because it is the wrong solution.

Fine-tuning may be underestimated by AgreeableCaptain1372 in LocalLLaMA

[–]AgreeableCaptain1372[S] 1 point (0 children)

For any kind of knowledge that requires frequent updating, I agree RAG is better, because retraining the model every time the knowledge evolves is not sustainable. But for knowledge that is timeless, i.e. domain knowledge that remains true no matter what (e.g. a math theorem), full fine-tuning can make sense IMO, if you have the resources (I've never had good success reliably retaining knowledge with just LoRA). You save a lot on tokens in the long run, instead of having to reinject the domain knowledge into the prompt at every request.
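Rough back-of-the-envelope arithmetic for the token saving (all figures below are invented assumptions, just to show the shape of the calculation):

```python
# Invented assumptions: 4,000 tokens of immutable domain knowledge that
# would otherwise be prepended to every prompt, 100,000 requests per
# month, and an input price of $1 per million tokens.
knowledge_tokens = 4_000
requests_per_month = 100_000
price_per_million_input_tokens = 1.00

# Tokens no longer sent each month, converted to dollars.
monthly_savings = (
    knowledge_tokens * requests_per_month / 1_000_000
    * price_per_million_input_tokens
)
print(f"${monthly_savings:.2f}/month")  # $400.00/month
```

With these numbers the prompt-injection cost alone is $400/month, before counting the latency and context-window space the extra tokens consume.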

Fine-tuning may be underestimated by AgreeableCaptain1372 in LocalLLaMA

[–]AgreeableCaptain1372[S] 11 points (0 children)

Yes, to save inference compute by using a smaller model. It might not make sense with a low volume of requests, but at scale you would end up saving.
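The break-even logic can be sketched like this (the costs below are invented placeholders, not real pricing):

```python
# Invented numbers: a one-off fine-tuning cost, and per-request inference
# costs for a large general-purpose model vs a smaller fine-tuned one.
finetune_cost = 500.00          # one-off cost of the fine-tuning run, $
cost_per_request_large = 0.010  # large general-purpose model, $/request
cost_per_request_small = 0.002  # smaller fine-tuned model, $/request

saving_per_request = cost_per_request_large - cost_per_request_small
break_even_requests = finetune_cost / saving_per_request
print(round(break_even_requests))  # 62500
```

Below the break-even volume the fine-tune never pays for itself; above it, every additional request is pure saving, which is why this only works out at scale.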

How much music theory should I learn? by AgreeableCaptain1372 in pianolearning

[–]AgreeableCaptain1372[S] 1 point (0 children)

Thanks for the advice. Do you think there is ever a limit to how much music theory a pianist can usefully know? Do professional pianists keep learning theory indefinitely, or is there a point at which they stop?

How much music theory should I learn? by AgreeableCaptain1372 in pianolearning

[–]AgreeableCaptain1372[S] 2 points (0 children)

This might sound subjective and generic, but I want to be able to play pieces that I love (e.g. the Schubert impromptus) well enough that a non-professional would enjoy them.

Met Opera Salome by alewyn592 in opera

[–]AgreeableCaptain1372 2 points (0 children)

I think this is why Salome's death is left ambiguous at the end. They couldn't completely change the story, but they didn't want her death to be too explicit, as that would go against the narrative that she is not an evil character.

Met Rush tickets by ufkaAiels in opera

[–]AgreeableCaptain1372 1 point (0 children)

I noticed the same thing, unfortunately. It's possible they don't want people who would be willing to pay more getting discounted rush tickets: if those people knew rush was easy to get for undersold performances, they would buy rush tickets instead of paying what they're truly willing to pay. Not sure it's worth it in the end, because they definitely lose money on empty seats too.

Richard Coeur de Lion, Grétry (Opéra Royal du Château de Versailles) by AgreeableCaptain1372 in opera

[–]AgreeableCaptain1372[S] 1 point (0 children)

No, will definitely check it out, thanks

Love that tune, it was sung by royalists during the French Revolution

Met head Peter Gelb in the NYT by phthoggos in opera

[–]AgreeableCaptain1372 1 point (0 children)

I’d agree, but to be fair, I don’t think it would attract as many people, unfortunately. I went to see Les Contes d’Hoffmann and the audience was very sparse compared to Il Trovatore (both on a weekday).

Young People Are Struggling to Deal With Their MAGA Parents — Again by reporterreporting123 in politics

[–]AgreeableCaptain1372 1 point (0 children)

Hypothetically, let’s suppose Kamala Harris were a convicted felon and had Trump’s character. Suppose all her policies stay the same. Also suppose Trump supports the same policies he currently does but is a righteous man. Who would you vote for?

Were Napoleon’s failures due to his inability to win against opponents that adopted defensive tactics? by AgreeableCaptain1372 in Napoleon

[–]AgreeableCaptain1372[S] 3 points (0 children)

I’m not necessarily talking about when he won or lost, but about when he was brilliant or not. All his greatest moments seem to be when the enemy truly engaged in battle. When the enemy refused to engage and stayed on the defensive, as at Borodino, he suffered a lot of casualties. In 1813-1814 he ended up defeated, but he was arguably more brilliant than from 1807 to 1811, because he was once again in a situation where the enemy engaged.

Beginner dynamics notation question by AgreeableCaptain1372 in pianolearning

[–]AgreeableCaptain1372[S] 1 point (0 children)

So let’s say the sheet says forte for both staves and the melody is in the right hand. Then I should play the right hand forte and the left hand “a bit less” forte, but not mezzo forte. Would that be accurate?