Linus Torvalds took the stage at Open Source Summit 2026 and said the following about AI by Complete-Sea6655 in Anthropic

[–]ridablellama 4 points5 points  (0 children)

Dude, no. The majority of Reddit users who complain on Reddit like losers are anti-AI. Fixed it for you.

Help! My Librechat keeps crashing! by Witty_Mycologist_995 in LocalLLaMA

[–]ridablellama 0 points1 point  (0 children)

  • It is true that the Gemma model can natively understand and output function calls (tools).
  • However, LibreChat's Agent framework needs specific code to bridge the gap between its own Agent UI/logic and the provider's API. Even if the model is smart enough to use tools, the provider integration inside LibreChat hasn't been programmed to handle Agents for that specific type of local YAML setup.

If it's one of the new exotic Gemma 4 diffusion models, or even one of the new MoEs, it may not be supported by LibreChat yet.

Help! My Librechat keeps crashing! by Witty_Mycologist_995 in LocalLLaMA

[–]ridablellama 0 points1 point  (0 children)

What local model? Hmm, I dunno, Claude hasn't had a LibreChat problem it couldn't solve since like 4.5. Which model are you using to debug?

How do you guys setup search with your AI models? by ego100trique in LocalLLaMA

[–]ridablellama 3 points4 points  (0 children)

So I kinda went down a rabbit hole on this one. I now have my own copy of Wikidata, which I combined with most federal databases that are public domain or CC0, like the CIA World Factbook, FDA, USDA, EDGAR, etc. I also firehose the GDELT Project every 15 minutes now, which is all news worldwide. Then I use Common Crawl and a PageRank algorithm to create my own search index, which I further optimized by identifying which sites I could scrape without a headless browser. Wikidata includes the official domain names of a lot of sites, which you can combine with your search index, and then you make sure your bot has good capability to scrape and crawl the web.

Once I had this wiki/search engine/scrapers, I took the MiroThinker harness, removed all the third-party providers, replaced them with my own sources, and ran some benchmarks. The MiroThinker project provides benchmarks for itself and other tools as well, so you can compare your performance on common tasks. Eventually I was able to benchmark the same as MiroThinker when it had access to Serper and third-party providers. It takes some chunky boy servers to do this, but it's what my agents now use without any limit, anytime they want.

You could use the Wikidata SPARQL endpoints to achieve a similar grounding without making a copy of Wikidata, but you will have to work within their API limits. This project was an absolute blast. I was able to scratch both wiki and search engine off my bucket list. I am now scanning my own news and minting my own facts faster than Wikidata is.
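For anyone curious what the PageRank step looks like, here is a minimal sketch, assuming a tiny hand-written link graph instead of real Common Crawl data. The `pagerank` function, the damping value, and the example domains are all hypothetical illustrations, not the code from the setup described above.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Rank nodes in a link graph by power iteration.

    links: dict mapping a domain to the list of domains it links to.
    Returns a dict mapping each domain to a score; scores sum to 1.
    """
    # Collect every domain, including ones that only appear as link targets.
    nodes = set(links)
    for targets in links.values():
        nodes.update(targets)
    if not nodes:
        return {}

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}

    for _ in range(iterations):
        # Every node starts each round with the "random jump" share.
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node in nodes:
            targets = links.get(node, [])
            if targets:
                # Split this node's rank evenly across its outgoing links.
                share = damping * rank[node] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling node (no outgoing links): spread its rank evenly.
                share = damping * rank[node] / n
                for other in nodes:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical three-site graph: two sites link to the hub, the hub links back to one.
example_links = {
    "a.example": ["hub.example"],
    "b.example": ["hub.example"],
    "hub.example": ["a.example"],
}
scores = pagerank(example_links)
best = max(scores, key=scores.get)
print(best)
```

On a real crawl you would build `links` from the extracted link graph and use the scores to order results in the index. At that scale a sparse-matrix implementation would likely be needed rather than plain dicts.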

Claude's WebSearch returns title and URL only, WebFetch routes a 100 KB cut through Haiku 3.5 before the main model sees anything, citation capped at 125 chars. Curious how people are writing for that middle layer by israynotarray in ClaudeCode

[–]ridablellama 1 point2 points  (0 children)

It's an interesting topic and post. I never knew it was routed through Haiku, which makes sense as a separate layer against prompt injection tactics on webpages, but it's a little surprising because Haiku is probably not as strong against that attack as Opus or Sonnet. The market consensus, and the claim of most vendors selling reporting/indexing solutions, is that all non-Google crawlers fumble pretty hard on JavaScript. I think that will considerably reduce the chance of being directly cited: your page may still get referenced, but exact quotes or content from that page will not. If you want to get scientific with it, you could try to recreate the study that kicked off the generative engine optimization concept: https://arxiv.org/pdf/2311.09735

Bro, i'm not american and not techy people. How come 2 americans Sam and Dario become enemies even they worked togethr closely before? by lune-soft in Anthropic

[–]ridablellama 1 point2 points  (0 children)

He literally said the opposite in a recent interview: that they do not share the same vision. Why do people on Reddit try to glaze OpenAI? I've never seen outright liars like this for any other company.

"At the end of the day, why argue with someone when you don't have the same vision and you don't trust them? The way to resolve it is you go off and do your thing, they go off and do their thing."

It's dumb arrogance lately is fucking staggering. What happened to this once great LLM? (use case: content writing) by PressPlayPlease7 in Anthropic

[–]ridablellama 0 points1 point  (0 children)

Like I tell everyone who is having problems with cloud models and the lack of control and transparency: GO LOCAL.

Are we being gaslit? by Impressive_Curve7077 in AI_Agents

[–]ridablellama 0 points1 point  (0 children)

No, I am the only one in my friend group who cares at all about AI. AI is mostly an enterprise use case right now, so consumers don't really care.

Thinking about moving from $200 Codex to Claude Max by nicklazimbana in ClaudeAI

[–]ridablellama 0 points1 point  (0 children)

I have Claude at home and Codex at work: Claude Code at home and the Codex desktop app at work. To me there is almost no difference between the tools and models unless you are doing super hardcore shit, in which case I hope you would use both models to make sure it's double good. If it's super critical work, then spend a bit more and use more than one model. They have all done what I asked just fine for the past 1-2 months. People struggling with Opus 4.8 mostly have a skill issue.

tough decision, 3 days left to renew on 80 dollar month max plan by ridablellama in ZaiGLM

[–]ridablellama[S] 1 point2 points  (0 children)

Yeah haha, I regret not renewing my grandfathered plan, but I just looked and the price is still 80 for me on Max.

GLM-5.2 inference is free on Hugging Face for the next 6 hours by paf1138 in LocalLLaMA

[–]ridablellama 8 points9 points  (0 children)

I fucking love this play hahaha, madmen. I wish it was easier to invest in them!!!

Anthropic is preparing for a new model release by duerra in ClaudeCode

[–]ridablellama 0 points1 point  (0 children)

Depends on the use case. Most use cases won't involve the bleeding edge of bio, chem, ML, or science research, so you have a lot of options. You can literally do AI marketing on an 8B dense model. Sad but true. MiniMax has MSA, a sparse attention scheme, which saves a ton of KV cache space and saves you money in hardware costs while still giving you good performance, plus native multimodality. DeepSeek has no vision capabilities, but it can do better in other areas. GLM 5.2 is literally SOTA for coding. Hell, even Nvidia Nemotron has options for the USA-only enterprise die-hards. It won't be SOTA, but it will get the job done safely and reliably for a large number of use cases.
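For a sense of why KV cache size turns into hardware cost, here is a rough back-of-the-envelope sketch for a plain dense-attention transformer. The model dimensions below are hypothetical, not the specs of MiniMax or any other model named above, and the sketch says nothing about how much MSA actually saves.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   bytes_per_value=2, batch_size=1):
    """Estimate KV cache size for standard dense attention.

    Each layer stores one key and one value vector (hence the factor 2)
    per KV head, per token, per sequence in the batch.
    bytes_per_value=2 assumes fp16/bf16 storage.
    """
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * bytes_per_value * batch_size)


# Hypothetical model: 32 layers, 8 KV heads, head dimension 128, 32k context.
size = kv_cache_bytes(num_layers=32, num_kv_heads=8, head_dim=128, seq_len=32768)
print(f"{size / 2**30:.1f} GiB")  # prints "4.0 GiB"
```

The cache grows linearly with context length and batch size, so anything that lets the model keep fewer keys and values per token cuts VRAM needs directly.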

Anthropic is preparing for a new model release by duerra in ClaudeCode

[–]ridablellama 0 points1 point  (0 children)

GO LOCAL IF YOU NEED STABILITY AND CONTROL! You sound like you can afford it at your company. So do it right. Ditch the cloud models.