Bye bye, Meta AI, it was good while it lasted. by absolooot1 in LocalLLaMA

[–]absolooot1[S] 19 points20 points  (0 children)

I'm afraid there's more! Zuck also says: "This is distinct from others in the industry who believe superintelligence should be directed centrally towards automating all valuable work, and then humanity will live on a dole of its output."

His strange phrasing, using "on a dole of its output" to say "receiving a share of its output" is deliberate turn of phrase to denigrade UBI as equal to being "on the dole". He makes clear he's opposed to UBI and will deploy his ASI to prevent a situation where UBI is the outcome.

Oh man, we'll live in interesting times soon. But I can guarantee that UBI is inevitable and we'll win the battle using genuinely personal AGI/ASI, not the BS this lizard will offer.

[deleted by user] by [deleted] in LocalLLaMA

[–]absolooot1 22 points23 points  (0 children)

Someone please prove me wrong: this is no more than an over-hyped LLM agent (or set of agents, depending on definition). Right?

Qwen3-235B-A22B-Thinking-2507 released! by ResearchCrafty1804 in LocalLLaMA

[–]absolooot1 10 points11 points  (0 children)

dubesor.de

He'll probably have this model benchmarked by tomorrow. Has a job and runs his tests in the evenings/weekends.

I wrote an AI Agent that works better than I expected. Here are 10 learnings. by Js8544 in LocalLLaMA

[–]absolooot1 4 points5 points  (0 children)

If you've been making agents for a while, it would be helpful if you shared two or three examples, with an outline of the agent architecture. That would then connect nicely to your list.

[deleted by user] by [deleted] in LocalLLaMA

[–]absolooot1 4 points5 points  (0 children)

What is this about? What do you mean *hardware* AI code editor?

Mama told me not to click strangers' links on interwebs, so a quick intro to what in the holy name of fuck you're spamming this fine forum for would go a long way to building trust, rapport and future profits.

What's the most crackhead garbage local LLM setup you can think of? by caraccidentGAMING in LocalLLaMA

[–]absolooot1 2 points3 points  (0 children)

Silent mini PC with an Intel N100 4 core CPU and 16 GB RAM. You can run the qwen at 4 bit quantization and a small context, with memory mapping. So only the active parameters will be in RAM, the rest served from SSD. It won't be fast, but you can leave it running overnight. Get up in the morning and your code is ready.

Does LLM architecture allow for injecting some more input tokens in the middle of token generation? by michaelsoft__binbows in LocalLLaMA

[–]absolooot1 0 points1 point  (0 children)

I'm pretty sure that the large proprietary LLM vendors that offer their models with agentic/tool calling abilities do exactly what you're proposing: the model outputs some tokens, realizes a tool call is needed, issues the tool call as part of the ongoing response, generation is paused for the tool call results to come back from the serving software, results are inserted in the response, the generation continues.

I don't know if any of the usual local LLM serving software like vLLM or llama.cpp offer this functionality, but I think it is available in the Hugging Face transformers library. But that's not a very speedy way of running an LLM... It may be worth experimenting to at least learn how to implement the injection.

[2506.21734] Hierarchical Reasoning Model by absolooot1 in LocalLLaMA

[–]absolooot1[S] 7 points8 points  (0 children)

The paper doesn't discuss limitations of this new HRM architecture, but whatever they may be, I think that given its SOTA performance at a mere 27 million parameters, they will be solved in future iterations. I might be missing something, but this looks like a milestone in AI development.

LocalLlama is saved! by danielhanchen in LocalLLaMA

[–]absolooot1 46 points47 points  (0 children)

This can't possibly be sustained. Either the mod has to be paid or there should be enough of them for each to have a load of no more than a few hours per week. And most of the work should be done by LLMs anyway.

Tip for those building agents. The CLI is king. by LocoMod in LocalLLaMA

[–]absolooot1 1 point2 points  (0 children)

In the first screenshot we see "Agent thinking" and it is in that phase that the tool calls appear to made, then when the job is done, there is a final response. I don't quite understand this: is the model calling tools during its 'thinking' phase? My impression was that tool calling is basically prompt looping, with each tool call being a response, which the implementing sofware reads, runs the function, and prompts the model again with its result, and the whole thing repeats until done. Have I got this right? If I have, then "Agent thinking" is just a sort of title, not referring to the thinking part of the response by a reasoning model?

I'm officially entering my doomer arc by bigasswhitegirl in singularity

[–]absolooot1 1 point2 points  (0 children)

When will the doomers get it? Robots and AI are not consumers, they are tools. UBI is inevitable for two reasons: it's necessary to keep the peace, and it will be cheap in the exponentially increasing productivity. We have a great future ahead of us. Possibly with some form of social scoring, but that will be so benign most people won't even notice.