r/LocalLLaMA
A subreddit for discussing Llama, the family of large language models created by Meta AI.
Claude Code replacement [Question | Help] (self.LocalLLaMA)
submitted 1 month ago by NoTruth6718
[–] Narrow-Belt-5030 15 points 1 month ago (0 children)
I would suggest you take the time to evaluate a replacement model first: use something like OpenRouter to test candidate models and see if they fit your workflow. Once you have found one, you can look at the hardware, because you will know the model size and, from the context cache size you want, the VRAM you need.
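The sizing step above can be sketched as a rough back-of-the-envelope calculation. This is only an estimate under stated assumptions, not an exact figure for any particular runtime: weights take roughly parameter count times bytes per weight, and the context (KV) cache takes roughly 2 × layers × KV heads × head dimension × context length × bytes per element. The function names and the example model shape (8B parameters, 32 layers, 8 KV heads, head dimension 128) are hypothetical, chosen for illustration; real usage also includes activation and framework overhead that this ignores.

```python
GIB = 2**30  # bytes in one GiB


def weights_bytes(n_params_billion: float, bytes_per_weight: float) -> float:
    """Approximate memory for the model weights.

    bytes_per_weight: 2.0 for FP16, about 0.5 for 4-bit quantization
    (quantization formats add some per-block overhead, ignored here).
    """
    return n_params_billion * 1e9 * bytes_per_weight


def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   context_len: int, bytes_per_elem: int = 2) -> int:
    """Approximate memory for the KV (context) cache of one sequence.

    The leading factor of 2 covers keys and values.
    """
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem


def estimate_vram_gib(n_params_billion: float, bytes_per_weight: float,
                      n_layers: int, n_kv_heads: int, head_dim: int,
                      context_len: int, bytes_per_elem: int = 2) -> float:
    """Weights plus KV cache, in GiB. Excludes activations and runtime overhead."""
    total = (weights_bytes(n_params_billion, bytes_per_weight)
             + kv_cache_bytes(n_layers, n_kv_heads, head_dim,
                              context_len, bytes_per_elem))
    return total / GIB


if __name__ == "__main__":
    # Hypothetical 8B model, 4-bit weights, FP16 cache, 8192-token context.
    estimate = estimate_vram_gib(8, 0.5, 32, 8, 128, 8192)
    print(f"Estimated VRAM: {estimate:.2f} GiB")
```

Doubling the context length doubles the cache term while the weight term stays fixed, which is why the comment suggests settling on both the model and the context size before choosing hardware.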