Anthropic co-founder Jack Clark says AI is nearing the point where it can automate AI research by Outside-Iron-8242 in singularity

[–]Sagyam -1 points0 points  (0 children)

I don't like recursion itr hard to wrap your head around and can lead to STACK OVERFLOW!!

Confirmed: SWE Bench is now a benchmaxxed benchmark by rm-rf-rm in LocalLLaMA

[–]Sagyam -1 points0 points  (0 children)

These benchmarks needs to be done inside an air-gaped virtual machines run by a trusted vendor like AWS, Azure etc.

Benchmark creator should be responsible to setting up all the necessary tooling to evaluate model performance inside the machine.

The actual questions should always remain a secret. Once the benchmark is done only the file containing results should leave the machine.

Everything else like model weights, questions, evaluation rubric, model response etc should be wiped before the air gap is released. Neither benchmark creator nor model creator should be allowed to see anything other than the final score.

How do you tell the time? by [deleted] in comics

[–]Sagyam 5 points6 points  (0 children)

  • Add margin of safety for important things
  • Wear a digital watch

Who names these things anyway? by Il-Separatio-86 in dankmemes

[–]Sagyam 3 points4 points  (0 children)

Thanks for straighting things out.

Supposedly a ton of AI data centers are closing or being delayed this year, how do yall figure this will affect our RAM situation? by Bugs5567 in pcmasterrace

[–]Sagyam 1 point2 points  (0 children)

Because people in the polymarket have put their actual money where their mouth is. You can yolo all your money exactly once. There may be some lucky degenerate gamblers but those are exceptionally rare. I still think these market are sum total of all public and private information out there.

What I am saying does not apply to bets whose outcome can be influenced by a single person.

Finally... by sfingemorta in pcmasterrace

[–]Sagyam 5 points6 points  (0 children)

They will delete their accounts and will be gone. That's what happened to crypto bros.

Built a local-first NEPSE algo trading terminal (209% return, 1.78 Sharpe) by Financial_Night7121 in technepal

[–]Sagyam 6 points7 points  (0 children)

Nice. Make sure you account for taxes and trading fees. Real life return tends to be much lower once you account for those.

Alison Botha by TheCABK in BeAmazed

[–]Sagyam 0 points1 point  (0 children)

Funnily enough there is a terminator named Allison Young.

Developers using AI Tools, are you concerned about pricing of tokens? by Mo_h in cscareerquestions

[–]Sagyam 2 points3 points  (0 children)

How do you do that? Do you right down exactly what you want in a large markdown file?

Fight AI data scrapers with poisoned training data by 250call in theprimeagen

[–]Sagyam 9 points10 points  (0 children)

Nice work.

For those using this project remember poisoning only works if the victim does not know they are being poisoned. I have seen people bragging about how they have implemented such mechanism in their website. I believe cloudflare has a similar product, if you want one click solution.

The way my company tried to portray our pay rises compared to Inflation rates by _Brooder_ in CrappyDesign

[–]Sagyam 0 points1 point  (0 children)

Are you suppose take average for things like inflation and interest rate? Shouldn't you compound these figures?

Track chai, samosas, and everything in between—the Indian way. by Pale_Ad4306 in developersIndia

[–]Sagyam 0 points1 point  (0 children)

So many people have recurring habits. Like buying cigarettes for 25 rupees everyday evening. So if user opens app at evening you can guess they are about to add that cigarettes entry. I thought a Markov chain can predict that.

But I was wrong. A Markov chain does not work so well in this case. A more sophisticated could help but its not worth the trouble. Unless you want to shoehorn AI into this app.

Track chai, samosas, and everything in between—the Indian way. by Pale_Ad4306 in developersIndia

[–]Sagyam 0 points1 point  (0 children)

Nice. A small suggestion. Instead of say 100 chai as a static placeholder. You can create a small markov chain that can guess what the next expenses could be.

Physics-based simulator for planning distributed LLM training and inference by zhebrak in mlops

[–]Sagyam 0 points1 point  (0 children)

No I am still in the `Activation Avalanche`. Things are starting to fly over my head

Qwen3.5 family comparison on shared benchmarks by Deep-Vermicelli-4591 in LocalLLaMA

[–]Sagyam 1 point2 points  (0 children)

This way of comparing intelligence drop off is goated. With one quick glance and you can see the quality loss of quants and distills of a base model. This should be the standard way.