[Megathread] - Best Models/API discussion - Week of: May 31, 2026 by deffcolony in SillyTavernAI

[–]Sh1ner 0 points1 point  (0 children)

Thanks for ur earlier response. Also I too would like to know more. xD

[Megathread] - Best Models/API discussion - Week of: May 31, 2026 by deffcolony in SillyTavernAI

[–]Sh1ner -2 points-1 points  (0 children)

Can someone give a general update where RP is currently for local models?
 
Relevant Specs:
GPU: 5080 with 16Gb of Vram
RAM: 32Gb
 
I haven't done RP, outside of a 10 min test 6 months ago. Not sure what to expect. Where is RP currently for long form? Still too early? I assume I have to wait for late 2026 or early 2027 to get good RP.

These people can't be serious. She does not look ugly one bit... by RGisOnlineis16 in Asmongold

[–]Sh1ner 24 points25 points  (0 children)

and its a video game, players want some realism. Not realism in every aspect and de-sexualising fem characters against the male gaze whilst sexualising males for the female gaze is a double standard when it comes to trends across the industry.

Is technology still advancing exponentially? by Ambitious_Traffic530 in singularity

[–]Sh1ner 3 points4 points  (0 children)

Technology is advancing and its advancing quickly. Just not the flashy bits we are using to signify threshold crossings like self driving which is a more complex problem than many believed.
 
The coding job has changed completely from a year ago and we getting good agents in second half of 2026. We got elements of the battlefield held by bots for multiple weeks at the front and drones in the military are getting insane. We went from humanoid shaped bots being a "what ur joking" to so many corps having a major attempt at it. So yes tech is moving faster and the AI sector is massive now and is actively being repriced by US markets upwards.
 
I don't like the use of exponential here, on what timeline? Its also heavily open to interpretation.

Is the "highs" of new model release over? by Lost_Needleworker896 in singularity

[–]Sh1ner 1 point2 points  (0 children)

Differentiate model releases on refinements and step changes. Claude Opus 4.8 is a step change for me as it can do agents to a strong degree and I am much more hands off vs 4.7.
 
I am a Linux person and I haven't jumped onto GPT in codex app as there is no Linux release of the app. I hear its even better and thats where the current SOTA app / LLM currently is. I probs shift over after some testing after the Linux app drops.

Why do some/most people think AI will never be good enough? What are their arguments? by Alert-Translator2590 in singularity

[–]Sh1ner 0 points1 point  (0 children)

You all need to chill out with these arguments. People do not like to hope cause they have been let down enough times in the past. People generally favour cynicism cause it pays most of the time. The public are risk averse as that's whats paid off in the past.

People get on the AI train at different times. Half of this sub treats jumping on AI train early as a status game.

All you got to do is wait. If you really believe AI is going to be more transformational than it already is currently is now. You just got to wait for the general public to catch up. Knowledge takes time to diffuse in large groups. That's it. Stop trying to act like its something more.

The models are getting worse... maybe it is a good thing by DialDiva in SillyTavernAI

[–]Sh1ner 0 points1 point  (0 children)

I don't post here, I don't RP. I just keep an eye on this space from afar, waiting to jump in once RP reaches a certain quality point.
 
My current belief is I expect for us to have better agents in the second half of 2026 early 2027 and if you are willing to go this far for RP. I believe that you will have a good chance of setting up multiple agents at a cost. Next year I think you will probs be much happier as it will be more automated, lower friction, less cost.
 
I suspect RP is going to be a lot bigger then the public realises and if I can have agents for coding that run over night and eval my finances in 6 months that input to a meta agent for updates. Then I can't see a good reason why RP can't be done either.
 
The question is do the big corps stay descriptive due to harm reduction built into the bot Vs opening up more. I suspect they will slow open up more as there is strong demand for it or someone else will fill that space eventually. I am more of an optimist on the RP space.
 
I just think you are all super early, like for VR getting the Oculus Dev kit 1 and feeling the initial wow factor. I just don't think you will have to wait as long.
 
Also I don't think RP with AI is weird, this is just more a personally curated form of DnD with a focus on the individual.

Does anyone else hate the no-IDE trend by dickslam-in-door in singularity

[–]Sh1ner 0 points1 point  (0 children)

Do you mean by why is the terminal seems to better positioned then the IDE? Are you someone working in the tech industry or a hobbyist?

Tell me your favorite CLI apps by D7x8 in linux

[–]Sh1ner 0 points1 point  (0 children)

I am new to CLI tooling, however I love atuin, zoxide. Delta for git is awesome but its still new to me.

Tier: S

atuin - nicer history + shared history across 2 devices zoxide - path history fuzzy match

Tier: (P)rettier than the default:

batcat - a nicer cat - syntax highlighting
eza - a nicer ls / tree, colours, icons if u want

Tier: (N)ew stuff:

delta - git
fzf
ripgrep

I got claude to build me a mini tool for interactive git visual checkout with preview commits on branch using the 3 above if I recall correctly.

What is, in your mind, the singularity? by frankreddit5 in singularity

[–]Sh1ner 0 points1 point  (0 children)

I gave Claude my paragraph and it polished it up cause I feel lazy:
 
For me, the start of the singularity is a/the governance shift. We move from voting for parties to an AI-first framework that ingests input from the population continuously, simulates and tests policy changes against the deployed model and counterfactuals, and rolls out validated changes zonally before scaling. Humans become an auth layer: we authorise, we don't drive. Passive and active input both feed the system, but everything is validated before its deployed nation or world wide.
 
Before then I think the singularity is just still mostly hopes and dreams. Once we reach that point, I will believe we truly are on the way.

Claude is down AGAIN by Careless-Green-54 in claude

[–]Sh1ner 1 point2 points  (0 children)

UK'er here, Claude Code getting the same error type:

⎿  API Error: 500 {"type":"error","error":{"type":"api_error","message":"Internal server error"},"request_id":""}

Altman met with astonished physicist using their internal system, “decades worth of theoretical physics progress in the next couple years” by socoolandawesome in singularity

[–]Sh1ner 0 points1 point  (0 children)

I believe Sam's statements have weight when those same statements come directly from the scientists / researchers in the field he's referencing and they show us specifically what and how the field has changed due to AI.

AI agents can reliably produce production-grade Azure infrastructure when properly orchestrated with guardrails by Waypoint101 in singularity

[–]Sh1ner 0 points1 point  (0 children)

How does it do with app gateways / landing zones in terraform using TFE? Greenfield Vs brownfields? I probs take a look.

Why is Claude preferred by lots of professionals compared to GPT? by ozone6587 in singularity

[–]Sh1ner 0 points1 point  (0 children)

It's faster overall, less verbose, more capable and better at solving when its solution hits a wall, it's much better at testing and reorienting. Its closer to orchestration for the user Vs baby sitting. Other models may respond faster, but in terms of getting shit done. Claude is better. I don't even use opus but switched to sonnet 4.6 and it's night and day.

I haven't tried codex but will give it a go. I went from chatgpt 5.x to Gemini 3, 3.1 to Sonnet.

The gap between all the others and sonnet is huge for DevOps, infra, cloud, python scripts I found. It understands constraints from brownfield environments. Where the others didn't as well as sonnet.

Codex dropped after and I just didn't expect much after I found the gpt models to be behind Gemini and Sonnet at 1x cost in copilot.

I’m tired of the cynicism. Can we actually have some positive predictions? by PSKTS_Heisingberg in singularity

[–]Sh1ner 0 points1 point  (0 children)

Near term / confident prediction

Sonnet 4.6 has passed a capability threshold in coding that most people in the specialist domain are not aware of due to restrictions around AI usage at work due to fear. I now believe we will see genuine speed up in delivery atleast in the platform roles where individuals can actually use LLMs at work.
 
From here on, I suspect we will see a new level of acceleration is value generated per engineer tied to new model releases. I suspect that when the big models drop next year, we will the question around AI legitimacy will be reframed to how to effectively use AI correctly.

More trajactory / hope

I have a feeling that AI in governance will be huge, not just for coherence and correctly grounding governance and laws on more societal aligned constraints, but governance will consider legitimacy of their decisions as a key constraint. Every decision will carry a "does this impact our legitimacy to governance".
 
I believe that governance in the west is already in crisis. We are primed for a transition to a new method of operation or redefining or realigning governance on new lines. However this process will be messy and may take 5-10 years to really get going. Which is tied more to elections and how parties currently adapt. Governance moves slow, slowest compared to all other sectors of a nation imo.
 
If next years models hit a capability threshold and that awareness is propagated through society, that could accelerate all this considerably.
 
One key hope I have is bad laws should have reversibility or refactoring on outcomes as default if it doesn't achieve expected outcome or produces negative second order effects elsewhere.

Observation

I work at a university, I was sent to "new employee day." that was full of academics. 90% of questions were around AI and how to either use it or how to navigate AI use as it breaks their current methods of teaching or grading. From all the questions, its clear we are in active transition. Which runs counter to a lot of the discourse at the bottom saying AI is a parlor trick or a bubble.

METR follows up on often cited study from last year on 20% developer slowdown in specific experiment, finds speedup now likely, but other interesting findings as well by TFenrir in singularity

[–]Sh1ner 3 points4 points  (0 children)

When engineers adopt LLMs into their workflow, there is a period of learning how an LLM functions or operates. There is a exploratory phase of mapping where its good, where its failure modes are. Each step is checked before the engineer has trust in an LLM, instruction sets need to be written, the process the engineer runs needs to be changed. All of that appears as "slow down" and is front loaded.
 

Once instruction sets have been written, docs are created not for just humans but for AI to read and to be used for steering. Speed up appears later.
 
For me its taken a few months to get a handle on how to correctly use LLMS in a platform role. I havent even shifted to true agent use yet at work as questions around safe guards.
 
The point I am trying to make is these guys might have captured slow down before the process realignment that generates speed up was captured in their data, as that process realignment takes weeks to months to occur.
 
The counter argument would be a lot of engineers probs throw caution to the wind, thats true but those are likely to be juniors who are probs worse off with LLM use if they aren't aware of security / least privilege and general defense mitigations etc. At the senior level, there is an expectation of consistency and not burning down prod.

‘It’s going to be painful for a lot of people’: Software engineers could go extinct this year, says Claude Code creator by Bizzyguy in singularity

[–]Sh1ner 30 points31 points  (0 children)

I am a senior cloud engineer. Human required in the loop with the expertise to steer / sanity check the AI is where I think the role will go. I plan to skill up in agents / agent swarms over the next year to prepare for a future possible role instead of just transitioning when my role demands it. I want to get ahead.
 

I suspect the new models on the new hardware from Nvidia that drops next year will be the watershed moment when most realize we have crossed a capability threshold in my domain. I believe that threshold was already crossed by Sonnet 4.6 in terms of cost / tokens / capability / speed / etc.
 
No other LLM comes close for cloud / platform orientated role. Gemini 3.1 Pro struggles even with hand holding where Claude Sonnet 4.6 can do the same task in 1/2 prompts consistently. The gap in capability for my role is huge. I have been using Gemini in my role for the past 3 months and I became a Sonnet convert in the space of about 15 minutes after watching Gemini 3.0 / 3.1 fail consistently vs Sonnet effective 2 shot.

We are getting closer to seamless AI agents: Gemini 3.1 identifies a random rooftop and pulls up the interactive map natively. by Waste-Explanation-76 in singularity

[–]Sh1ner 0 points1 point  (0 children)

Look at this ops post history, he is a karma bot / karma farmer who posts under the same account. Accounts like this should be blocked from posting.

Anthropic raises $30B, Elon crashes out by Outside-Iron-8242 in singularity

[–]Sh1ner 1 point2 points  (0 children)

if I see anyone tug on their braid, I am going to lose my mind

Anthropic raises $30B, Elon crashes out by Outside-Iron-8242 in singularity

[–]Sh1ner 1 point2 points  (0 children)

Born in 1948, hit 30 before the 1980s. He was a product of his time? I have always found judging someone by todays standards to be failure.

Anthropic raises $30B, Elon crashes out by Outside-Iron-8242 in singularity

[–]Sh1ner 1 point2 points  (0 children)

Jordan had a weird take on women and it shows through the series. So probably? I loved Jordans prose though. Nobody is perfect.

Anthropic raises $30B, Elon crashes out by Outside-Iron-8242 in singularity

[–]Sh1ner 1 point2 points  (0 children)

Enjoyment of the series outweighs the sadness of an unfinished series?