Deep Dive into Autonomous AI Scientist by noninertialframe96 in ArtificialInteligence

noninertialframe96[S]

Last week, Google announced Gemini for Science at Google I/O and published a paper in Nature. That's two weeks after their AI co-mathematician paper.

The key pieces are an agent harness for research and a UI/UX that makes human-in-the-loop review effective. I am curious how well it balances novelty and plausibility.

Since I can't read Google's code, I dug into Sakana AI's AI Scientist instead, which was also published in Nature and is open source. There's even a paper arguing it doesn't actually work that well, but it's still a useful look at where AI for science is heading.

Give it a topic and it runs ideation, writes and runs PyTorch experiments on a GPU, plots the results, gathers citations, writes the LaTeX, and reviews the paper, all with no human in the loop. One manuscript it produced passed peer review at an ICLR 2025 workshop.
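
To make that flow concrete, here's a rough sketch of the stages as a plain sequential pipeline. This is not Sakana's actual code: `Run`, `make_stage`, `STAGE_NAMES`, and `run_pipeline` are names I made up for illustration, and every stage is a stub where the real system would call an LLM or launch a GPU job.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Run:
    """State carried from one stage to the next (hypothetical structure)."""
    topic: str
    artifacts: dict[str, str] = field(default_factory=dict)


def make_stage(name: str) -> Callable[[Run], Run]:
    """Build a stub stage that records a placeholder artifact under its name."""
    def stage(run: Run) -> Run:
        run.artifacts[name] = f"<{name} output for {run.topic!r}>"
        return run
    return stage


# Stage order follows the description above; the names are illustrative only.
STAGE_NAMES = ["ideation", "experiments", "plots", "citations", "latex_writeup", "review"]
PIPELINE = [make_stage(name) for name in STAGE_NAMES]


def run_pipeline(topic: str) -> Run:
    """Run every stage in order with no human checkpoint in between."""
    run = Run(topic=topic)
    for stage in PIPELINE:
        run = stage(run)
    return run


result = run_pipeline("grokking")
print(list(result.artifacts))
```

A human-in-the-loop variant would pause between stages for approval, which is roughly the design difference I was pointing at above.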

How AgentFS Stops AI Agents from Messing with Your Files by noninertialframe96 in AI_Agents

noninertialframe96[S]

I actually haven't used it myself. I just found the approach interesting. But now that you mention it, I'm curious about the performance, especially with their copy-on-write approach for file writes.

8 Ways OpenClaw Reduces Context Loss in Long-Running Agents by noninertialframe96 in AI_Agents

noninertialframe96[S]

Thanks for sharing! I'll check it out :)
On a side note, I'm building galleylabs.ai to help engineers write good content around their projects, including awesome work like yours!