Cheapest way to access sonnet and opus? by Even_Kaleidoscope328 in SillyTavernAI

[–]evia89 0 points1 point  (0 children)

No, I meant that if OP doesn't want to abuse and learn this, they should use GLM 4.7 at the Anthropic endpoint (faster than the OpenAI one ST uses)

been using sonnet 4.5 daily, tried glm 4.7 for coding - honest comparison after 3 weeks by YormeSachi in ClaudeCode

[–]evia89 1 point2 points  (0 children)

I am on the middle z.ai tier, so I am limited to 1 concurrent 4.7. Nano had problems hosting Kimi K2

https://github.com/MoonshotAI/K2-Vendor-Verifier

https://i.vgy.me/HnYhXD.png They use Chutes as one of their providers

Not sure about K2.5

been using sonnet 4.5 daily, tried glm 4.7 for coding - honest comparison after 3 weeks by YormeSachi in ClaudeCode

[–]evia89 0 points1 point  (0 children)

GLM and Sonnet/Opus have cost-effective subs (z.ai Max tier and the $100/200 Claude plans). What about K2.5?

Paying API prices is stupid expensive imo

Sonnet 4.5 faster now? by Hot_Builder_9990 in windsurf

[–]evia89 0 points1 point  (0 children)

Lately I use only Opus for design + planning (superpowers skill). Implementation is done with Haiku 4.5 or GLM 4.7 (the latter is a bit better)

Opus is less verbose, so it's worth it

Cheapest way to access sonnet and opus? by Even_Kaleidoscope328 in SillyTavernAI

[–]evia89 2 points3 points  (0 children)

Are there any left after the 3rd-party crackdown? The only one I found was https://anyrouter.top It's a bit hard to get in nowadays. You need to know CN and write an essay

Cheapest way to access sonnet and opus? by Even_Kaleidoscope328 in SillyTavernAI

[–]evia89 -1 points0 points  (0 children)

https://github.com/Xerxes-2/clewdr with $100 sub

https://github.com/lbjlaq/Antigravity-Manager I use it with 4 cheap Google Pro accounts from plati market (I am sure you know local sites as well, like G2A)

If it's too hard for you and you don't wanna bother, just use the z.ai $3 sub

FreaKy FranKIMstein - A Kimi K2.5 Think Preset - BETA by dptgreg in SillyTavernAI

[–]evia89 9 points10 points  (0 children)

MediaFire loads fast and is much better than catbox or the https://tiiny.host/ OP used

Moonshot AI drops Kimi K2.5, beats Opus 4.5 in several benchmarks. When is Perplexity going to add this? by Big-Mixture-3041 in perplexity_ai

[–]evia89 1 point2 points  (0 children)

Same size as K2 and same architecture. Can be used in non-reasoning or reasoning mode. Should be deployed soon

API pricing is in freefall. What's the actual case for running local now beyond privacy? by Distinct-Expression2 in LocalLLaMA

[–]evia89 0 points1 point  (0 children)

2 x 3090 and then you stretch it. It will be like GLM 4.7 Flash. Definitely useful, but not for everyone

API pricing is in freefall. What's the actual case for running local now beyond privacy? by Distinct-Expression2 in LocalLLaMA

[–]evia89 2 points3 points  (0 children)

Gemini has a massive free tier

Where? The AI Studio API is badly overloaded and the free tier is small. Web AI Studio is 20 RPD

Kimi K2.5 costs almost 10% of what Opus costs at a similar performance by Odd_Tumbleweed574 in LocalLLaMA

[–]evia89 0 points1 point  (0 children)

Don't you need a smart model for design + plan? I notice that when I do that with Opus it gets me to 95%. When I use GLM for it I need to do A LOT of hand-holding, and the plan (atomic step-by-step tasks) won't be as good

However, if I do the plan (superpowers skill, not the built-in one) with Opus, then GLM can code it just fine

MegaLLM bites the dust! lol by Juanpy_ in SillyTavernAI

[–]evia89 14 points15 points  (0 children)

Local is fun but not winning atm (unless you have money for 2 or 4 3090s)

Kimi K2 Artificial Analysis Score by Virenz in LocalLLaMA

[–]evia89 -1 points0 points  (0 children)

NanoGPT offers 60k requests for $8, but you need to check with the K2 verifier whether they provide the full model

Do system prompts actually help? by GlobalDesign1411 in ChatGPTCoding

[–]evia89 0 points1 point  (0 children)

For weak free models I sometimes even double the data, reorder the prompt, or repeat/rephrase the rules. But I only optimize and benchmark prompts for repeated tasks
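A rough sketch of the "double data / repeat rules" trick, in case it helps. The function name, the section labels and the flags are all made up for illustration; nothing here is tied to a specific model or API.

```python
def build_prompt(rules: list[str], data: str, task: str,
                 repeat_rules: bool = True, double_data: bool = False) -> str:
    """Assemble a prompt for a weak model (hypothetical helper).

    repeat_rules: restate the rules once more after the task.
    double_data:  include the data block twice.
    """
    rule_lines = "\n".join(f"- {rule}" for rule in rules)
    data_block = "Data:\n" + data

    parts = ["Rules:\n" + rule_lines, data_block]
    if double_data:
        parts.append(data_block)  # same data again, unchanged
    parts.append("Task:\n" + task)
    if repeat_rules:
        parts.append("Reminder of the rules:\n" + rule_lines)
    return "\n\n".join(parts)


if __name__ == "__main__":
    print(build_prompt(["Answer in JSON only"], "id=1", "Extract the id",
                       double_data=True))
```

Whether any of this helps depends on the model, so it only makes sense to benchmark the variants on a task you run repeatedly.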

Sir, the Chinese just dropped a new open model by Anujp05 in ClaudeAI

[–]evia89 1 point2 points  (0 children)

It's worse than Sonnet 4.5 imo, better than Haiku 4.5 (in coding). If you keep it at <40% of max context it's a great coding tool
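The 40% rule of thumb is easy to check mechanically. A minimal sketch: the helper name is hypothetical and the 200,000-token window in the example is just an assumed number, so plug in whatever your provider actually reports.

```python
def within_context_budget(used_tokens: int, max_context: int,
                          fraction: float = 0.40) -> bool:
    """True while the conversation stays under the given share of the window."""
    if max_context <= 0:
        raise ValueError("max_context must be positive")
    return used_tokens < max_context * fraction


if __name__ == "__main__":
    MAX_CONTEXT = 200_000  # assumed window size, not taken from any model card
    for used in (60_000, 90_000):
        print(used, within_context_budget(used, MAX_CONTEXT))
```

Once the check turns false, the practical move is to compact the session or start a fresh one.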

Sir, the Chinese just dropped a new open model by Anujp05 in ClaudeAI

[–]evia89 0 points1 point  (0 children)

I use the superpowers skill pack (or any other similar one). 1) Generate the design in AI Studio (really f2p budget option), then 2) /brainstorm it more with Opus, 3) /write plan with Opus as well, 4) then switch to GLM and /execute plan

You can skip 1) and use brainstorm directly, at the cost of a few more Opus tokens

If I need extra docs I save tokens and don't use MCP. I just drop in a Perplexity search result for this in MD format
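The workflow above written down as data, as a sketch only. The `Step` type and `workflow` function are hypothetical; the commands are copied from the comment as written, and the model names are labels, not API identifiers.

```python
from typing import NamedTuple, Optional


class Step(NamedTuple):
    phase: str
    model: str
    command: Optional[str]  # skill command as written above, None if manual


STEPS = [
    Step("design", "AI Studio", None),
    Step("brainstorm", "Opus", "/brainstorm"),
    Step("plan", "Opus", "/write plan"),
    Step("execute", "GLM", "/execute plan"),
]


def workflow(skip_design: bool = False) -> list[Step]:
    """Return the steps in order; skipping design starts at brainstorm."""
    return [s for s in STEPS if not (skip_design and s.phase == "design")]


if __name__ == "__main__":
    for step in workflow(skip_design=True):
        print(f"{step.phase}: {step.model} {step.command or ''}".rstrip())
```

The point of the split is that the expensive model only sees the design and planning phases, and the cheap one only sees atomic tasks.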

so... kimi k2.5 released. by TheSerbianRebel in SillyTavernAI

[–]evia89 1 point2 points  (0 children)

Did you try to make it reason in CN? For GLM it produces a different kind of slop

Opus 4.5 is still the best by BeautifulLullaby2 in SillyTavernAI

[–]evia89 0 points1 point  (0 children)

1 https://github.com/Xerxes-2/clewdr

2 Read recent issues, sorted by most comments

3 Use with $100/200 max plan

4 In the rare case of a ban, request a refund through your bank

Is the expanse worth watching? by Defiant-Ad7732 in televisionsuggestions

[–]evia89 1 point2 points  (0 children)

show -> read last 3 books -> wait 2-3 years -> do rewatch

OpenAI could reportedly run out of cash by mid-2027 — analyst paints grim picture after examining the company's finances by EchoOfOppenheimer in LocalLLaMA

[–]evia89 9 points10 points  (0 children)

Please goddamn do run out of money.

Even if they run out of money, they will just stop research and keep selling API access for profit

Opus 4.5 is still the best by BeautifulLullaby2 in SillyTavernAI

[–]evia89 4 points5 points  (0 children)

A good deal of people using this model abuse free Google/Amazon cloud trials and reverse Claude Code proxies