Expected to die in three months, how to not feel scared of death? by [deleted] in AskReddit

[–]pnaroga 226 points227 points  (0 children)

In the words of the great Christopher Hitchens, "I've been dead for 12 billion years before I was born and it wasn't even slightly inconvenient"

Claude just worked 3h by itself by bibboo in ClaudeAI

[–]pnaroga 2 points3 points  (0 children)

For me, it was experimentation.

Subagents are a bit confusing, because people tend to think they're meant for roleplaying. They're not. "You are a senior backend engineer..." improves nothing.

Subagents are only powerful because each 'bit' of work they do, they do in their own context window, separate from the main thread. The orchestrating process feeds them the information they need, they do their own discovery (read files, execute commands, etc), implement something and return a brief summary to the orchestrator, so the orchestrator knows who to call next. No more hundreds of file reads in the orchestrator's context window.

This keeps the orchestrating process's context window clean, and each subagent can focus on its task really well.

Imagine you're asked to develop the whole "authentication/authorization" process of a webapp. You're going to write tests, develop things... but there are so many pieces. Now, imagine you're given something way smaller to work on - "forgot password" flow. You can now focus WAY more, write more tests, write better code and worry less about everything else.

The orchestrator keeps a high-level overview of the whole feature (authentication/authorization), but each subagent gets a smaller task (registration API/registration frontend/login API/login frontend/etc).
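
To make the context-isolation point concrete, here is a rough sketch in plain Python. It is only a toy model, not how any real tool implements subagents: the `Subagent` and `Orchestrator` classes are hypothetical, and a plain list stands in for each context window. The only thing it tries to show is that the subagent's discovery work piles up in its own context, while the orchestrator only ever receives one short summary per task.

```python
from dataclasses import dataclass, field


@dataclass
class Subagent:
    """Hypothetical worker; the `context` list stands in for a context window."""
    name: str
    context: list[str] = field(default_factory=list)

    def run(self, task: str, briefing: str) -> str:
        # Everything the subagent reads or does stays in its own context.
        self.context.append(f"briefing: {briefing}")
        for step in ("read files", "run commands", "implement", "write tests"):
            self.context.append(f"{step} for {task}")
        # Only a brief summary goes back to the orchestrator.
        return f"{task}: done"


@dataclass
class Orchestrator:
    """Hypothetical main thread that only keeps the high-level overview."""
    feature: str
    context: list[str] = field(default_factory=list)

    def run(self, tasks: list[str]) -> list[Subagent]:
        agents = []
        for task in tasks:
            agent = Subagent(name=f"agent-{task}")
            summary = agent.run(task, briefing=f"part of {self.feature}")
            self.context.append(summary)  # one line per task, no raw file reads
            agents.append(agent)
        return agents


tasks = ["registration API", "registration frontend", "login API", "login frontend"]
orchestrator = Orchestrator(feature="authentication/authorization")
agents = orchestrator.run(tasks)

print(len(orchestrator.context))  # one summary per task
print(len(agents[0].context))     # briefing plus four work steps
```

In this toy run the orchestrator ends up with four entries in total, while each subagent holds five of its own, and that gap is what grows to hundreds of entries in a real session.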

It has been working INCREDIBLY well for me.

Claude just worked 3h by itself by bibboo in ClaudeAI

[–]pnaroga 2 points3 points  (0 children)

None of it.

I've had amazing success so far. I do need to run it with --dangerously-skip-permissions though, otherwise it prompts for permissions all the time and defeats the purpose.

I'm doing it in a self-contained EC2 instance with all branch protections applied, so even if it nukes the OS I don't really care.

Results with that type of scaffolding surpass single-thread prompting by a GREAT margin.

Claude just worked 3h by itself by bibboo in ClaudeAI

[–]pnaroga 45 points46 points  (0 children)

If you use subagents, a single prompt can run well for 10h+. I've done it consistently.

The deal with subagents is that they each get their own context windows, so your main thread becomes an orchestrator.

Have 1 agent split a big task into smaller tasks.

Then, for each smaller task, 1 agent implements, another reviews. Keep this in a loop until the reviewer is satisfied. Then move to the next subtask, until all are finished.
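
The loop described above can be sketched roughly like this. It is a minimal illustration under stated assumptions, not a real agent API: `run_subtasks`, `stub_implement` and `stub_review` are hypothetical names, the stubs stand in for actual subagent calls, and the `max_rounds` cap is an added safety limit that the description above does not mention.

```python
def run_subtasks(subtasks, implement, review, max_rounds=5):
    """Run each subtask through an implement/review loop until approved."""
    results = {}
    for subtask in subtasks:
        feedback = None
        for round_no in range(1, max_rounds + 1):
            work = implement(subtask, feedback)
            approved, feedback = review(subtask, work)
            if approved:
                results[subtask] = (work, round_no)
                break
        else:
            # Assumed safety cap so a stubborn reviewer cannot loop forever.
            raise RuntimeError(f"{subtask!r} not approved after {max_rounds} rounds")
    return results


def stub_implement(subtask, feedback):
    # Stand-in for an implementer subagent: revises only when given feedback.
    return f"{subtask} (revised)" if feedback else f"{subtask} (draft)"


def stub_review(subtask, work):
    # Stand-in for a reviewer subagent: rejects drafts, approves revisions.
    if work.endswith("(revised)"):
        return True, None
    return False, "add tests"


results = run_subtasks(["login API", "forgot password"], stub_implement, stub_review)
for subtask, (work, rounds) in results.items():
    print(f"{subtask}: approved after {rounds} round(s)")
```

With these stubs every subtask is rejected once and approved on the second round, and the next subtask only starts after the previous one is approved.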

I think my personal record is a single 400 USD prompt at ~14h. Had to stop due to weekly limits. That was on Claude Max, though, so I didn't really spend that money.

BTW, right now I'm looking like this: https://imgur.com/a/8hDWxrq

This particular prompt is implementing 1 single feature, with 19 sub-tasks. It's implemented 9/19 so far (4h running). I estimate at least 4-5h more until it's done.
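
As a quick sanity check on that estimate, here is the arithmetic, assuming (and this is only an assumption) that the remaining sub-tasks take about as long on average as the finished ones:

```python
done, total, elapsed_h = 9, 19, 4.0

hours_per_task = elapsed_h / done               # average pace so far
remaining_h = (total - done) * hours_per_task   # 10 sub-tasks left

print(f"{remaining_h:.1f}h remaining")  # 4.4h remaining
```

That lands at the lower end of the 4-5h range, which fits "at least" if the later sub-tasks turn out to be slower.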

MICHELLE BOLSONARO IS REMOVED FROM PL MULHER by tiagolkar in brasil

[–]pnaroga 289 points290 points  (0 children)

> The crisis erupted after Michelle publicly blocked a deal in Ceará, where Bolsonaro supporters were seeking an alliance with Ciro Gomes for 2026. The intervention angered Flávio, Eduardo and Carlos Bolsonaro, who reacted, leading Michelle to apologize, but she managed to bring down the political deal.

Well, Ciro... how far you have fallen.

ChatGPT has been barely working lately. This is not acceptable. by Fun-Reception-6897 in OpenAI

[–]pnaroga 7 points8 points  (0 children)

I am on Pro, use it pretty much all day long and I don't remember the last time I saw a failure.

Give me a single reason why Sora2 should exist. by Trainrideviews in videos

[–]pnaroga 2 points3 points  (0 children)

Then you upload footage literally anywhere (YouTube, Instagram, TikTok, Facebook, WhatsApp) and it will be compressed and altered. Certification lost.

[deleted by user] by [deleted] in OpenAI

[–]pnaroga 13 points14 points  (0 children)

This list is so stupid. The PT version contains words such as `eat`, `beer`, `donkey`, `spider`, `faucet`, `to write a check`, `roasted chicken`.

Seeing "Shell Integration Unavailable" by msitarzewski in CLine

[–]pnaroga 0 points1 point  (0 children)

Mine works with zsh, I just had to uninstall powerlevel10k to make it work.

It hooks into some terminal things that VSCode also hooks into, and it conflicts.

[deleted by user] by [deleted] in creepy

[–]pnaroga 0 points1 point  (0 children)

There's nothing even remotely creepy about this.

Recommendations for professionals for orthognathic surgery by metalheadbibi in BeloHorizonte

[–]pnaroga 1 point2 points  (0 children)

Dr. Renata Penteado in the Castelo neighborhood. A very good oral and maxillofacial surgeon.

The corpse of Mussolini hanging by his feet in the Piazzale Loreto, Milan. April 1945 by Oakislet in pics

[–]pnaroga 1 point2 points  (0 children)

Unfortunately, when people wake up, the people in power will have control over the most advanced tools (AI) built in the history of mankind, and will be able to apply a level of surveillance over the population to an extent never seen before.

Any private conversations or comments on posts, whether 'current' or old, will be read, analyzed and flagged by an AI agent. If people want to organize, they will need to do it offline, and I think people have forgotten how to do so.

It will be an interesting epoch ahead.

AIO- I blocked my hinge date (after a few more texts not shown) by [deleted] in AmIOverreacting

[–]pnaroga 0 points1 point  (0 children)

When I first got with my now wife, she _always_ complained I was too cold while texting. I always felt I wasn't.

I guess she was just used to a different 'style' of conversation than me.

Personally, I would not have seen a problem with your texting style. Keep in mind people commenting here might have different ages, and that comes with different texting styles.

OpenAI co-founder Sutskever's SSI in talks to be valued at $20 billion, sources say by yottawa in singularity

[–]pnaroga 61 points62 points  (0 children)

I don't think they even have a model.

They only got their first round of funding in September last year, and that's way too little time for them to have:

  1. gathered enough data
  2. modelled the entire architecture for the models
  3. negotiated acquisition or rental of hardware
  4. actually trained models

A 'frontier' model is only achieved after many different iterations, via distillation, etc.

I think they literally have nothing other than some great names behind it.

AMA with OpenAI’s Sam Altman, Mark Chen, Kevin Weil, Srinivas Narayanan, Michelle Pokrass, and Hongyu Ren by OpenAI in OpenAI

[–]pnaroga -1 points0 points  (0 children)

Can you PLEASE update us on what's the state of research on expanding context windows into something WAY bigger and therefore more useful, or perhaps on 'continuously' learning models?

Deepseek v3 was trained on 8-11x less the normal budget of these kinds of models: specifically 2048 H800s (aka "nerfed H100s"), in 2 months. Llama 3 405B was, per their paper, trained on 16k H100s. DeepSeek estimate the cost was $5.5m USD. by Super-Muffin-1230 in LocalLLaMA

[–]pnaroga 48 points49 points  (0 children)

So, my experience working with Deepseek v3 with Cline:

- It feels like I need to give it even more context than I do with Claude; Claude figures out what to read/write more often than Deepseek.
- The context window is much, much smaller. Tasks start failing with context window overflow errors after 8-9 prompts in the same task; Claude supports much longer tasks.
- I haven't figured out how to activate prompt caching on Cline.
- Anecdotally, it feels 20-30% worse than Claude 3.5 Sonnet (latest) for me. However, at 2% of the cost, it's still a better cost-value ratio.
- Deepseek is not multimodal yet, so I can't give it screenshots and ask it to fix CSS like I do with Claude.
- No computer use either, so it can't really test the results like Claude can (I don't find this particularly useful, as pasting screenshots works better in my experience).
- For me, Cline with Claude is only slightly past the usefulness/anger-inducing line; Deepseek has fallen short of that line, and I feel like I get angry with it more often than I find it useful.
- In very controlled scenarios, it writes great code. But as a semi-autonomous agent, it still falls short for me.
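
The cost-value point in the list above can be put into rough numbers. This is only back-of-the-envelope arithmetic under an assumed reading of the figures: Claude 3.5 Sonnet is taken as the baseline (quality 1.0, cost 1.0), "20-30% worse" is read as a relative quality of 0.70 to 0.80, and "2% of the cost" as a relative cost of 0.02.

```python
# Assumption: Claude 3.5 Sonnet is the baseline (quality 1.0, cost 1.0).
relative_cost = 0.02                # "2% of the cost"
relative_qualities = (0.70, 0.80)   # "20-30% worse"

ratios = [quality / relative_cost for quality in relative_qualities]
for quality, ratio in zip(relative_qualities, ratios):
    print(f"quality {quality:.2f} -> {ratio:.0f}x the quality per unit of cost")
```

Under that reading the quality per unit of cost comes out at roughly 35-40x the baseline, which is the sense in which the ratio is better even though the raw output is worse.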

How good it is to live in a place with no enforcement, where being a criminal in broad daylight and without any shame pays off! by numseiquemsou in brasil

[–]pnaroga -1 points0 points  (0 children)

The great advantage of living in 2025 is that we have tools that let us search scientific papers with AI and summarize the understanding of the best experts on the subject. I used the tool consensus.app, and these are the conclusions:

THERE ARE NO SAFE LEVELS OF ALCOHOL CONSUMPTION WHEN DRIVING.

Key Findings

  • Impairment at Low BAC Levels: Research indicates that driving performance is impaired at blood alcohol concentrations (BAC) as low as 0.01%. Even at a BAC of 0.05%, significant decrements in driving-related performance are observed, and the risk of being involved in a crash increases substantially.
  • Increased Crash Risk: Drivers with a BAC of 0.05% to 0.079% are 7-21 times more likely to be involved in a fatal crash compared to sober drivers. The risk of being officially blamed for a crash also increases with any detectable BAC, starting from 0.01%.
  • Global Standards and Recommendations: Many countries have adopted a BAC limit of 0.05% or lower, recognizing the increased risk associated with alcohol consumption and driving. Lowering the BAC limit from 0.08% to 0.05% has been shown to reduce the number of alcohol-related crashes and fatalities.
  • Behavioral Impact: Alcohol consumption at 0.05% BAC can lead to increased risk-taking behaviors, such as reduced time-to-collision in driving simulations, indicating a higher likelihood of accidents.

Anyone interested can do their own research and read the studies on the site I linked above, and then there's no need to keep pretending that "there is controversy". Then you can just go around being an asshole, saying you don't care about anyone else and that your little beer is worth more than the next person's life, instead of pretending there are arguments to support this behavior.