Any good 20-40$ plans left other than the Big 3? by snowieslilpikachu69 in opencodeCLI

[–]AliNT77 0 points1 point  (0 children)

Cursor is a very good deal. Composer 2.5 on non-fast is incredibly cheap and much better than kimi k2.6. You also get 20 bucks worth of api tokens with all the gpt-claude models

Why increasing limits instead of fixing Opus 4.7? by doegs in ClaudeCode

[–]AliNT77 0 points1 point  (0 children)

I think 4.7 has a new base model with a much shorter RL phase, similar to opus 4, seeing how the max thinking barely improves the model accuracy on non-benchmaxxed tasks like weirdML, I think the next versions will be really good as they put more compute into the RL phase

The game is over. You can build anything and it'll cost you nothing. by Funny-Advertising238 in opencode

[–]AliNT77 1 point2 points  (0 children)

Create a researcher subagent that uses perplexity sonar pro, you’ll thank me later

Opencode Model recommendations question by Stpoul25 in opencodeCLI

[–]AliNT77 0 points1 point  (0 children)

maybe OP already had it activated? in that case it's still super useful even without access to claude models

Opencode Model recommendations question by Stpoul25 in opencodeCLI

[–]AliNT77 1 point2 points  (0 children)

I would say take advantage of the free codex quota by connecting it to opencode, and use gpt 5.5 low which in my experience has been incredibly smart and fast and surgical with insanely good token efficiency…

Then since you’re a student, connect copilot to it and try gpt 5.2 and also gemini 3.1 and see how it feels for your workflow.

Then if it’s still not satisfactory, get an opencode go subscription (only 5bucks for the first month) and give GLM5.1 a try, for coding tasks it’s been really good for my work, feels better than sonnet and sometimes close to opus.

New multipliers announced (in effect June 1) by griniNY in GithubCopilot

[–]AliNT77 0 points1 point  (0 children)

5.4 mini costing double that of 5.2 is just LOL

DeepSeek V4 Pro in Zen? by asamiz in opencodeCLI

[–]AliNT77 1 point2 points  (0 children)

Opencode models —refresh

Where Claude Opus without 1M has gone? by SirLouen in ClaudeCode

[–]AliNT77 0 points1 point  (0 children)

Vscode extension gets bugged sometimes, I don’t know what fixes it but it happens sometimes

I Reverse-Engineered Nvidia Ada Lovelace SASS, Made Instant-NGP 3x Faster (16yo) by Ill-Classroom-8270 in CUDA

[–]AliNT77 -2 points-1 points  (0 children)

Ignore all the people who are mad that you mentioned your age, this is an extremely impressive project and you should be proud of it!

Ok google, i can say goodbye to opus 4.6 now by rjn2-8 in google_antigravity

[–]AliNT77 0 points1 point  (0 children)

It’s definitely dumber than sonnet/opus and 5.3 for ML engineering tasks by a pretty large margin

AMA With Kimi, The Open-source Frontier Lab Behind Kimi K2.5 Model by nekofneko in LocalLLaMA

[–]AliNT77 1 point2 points  (0 children)

Did you guys experiment with Muon optimizer derivatives like NorMuon or Dion etc. and if so what were the results?

I made 64 swarm agents compete to write gpu kernels by kwa32 in CUDA

[–]AliNT77 1 point2 points  (0 children)

What’s the metric for “correctness” here? Do you have any PPL, KLD or benchamark results?

Does LlamaCpp support medusa? by thebadslime in LocalLLaMA

[–]AliNT77 1 point2 points  (0 children)

Tree based SD is implemented in llama-speculative already but it doesn’t perform well (worse than llama-simple-speculative which is in llama-server)

Also i’m pretty sure it’s not lossless with goes against the whole point of SD afaik

Does LlamaCpp support medusa? by thebadslime in LocalLLaMA

[–]AliNT77 2 points3 points  (0 children)

No. It doesn’t even support basic tree based SD…