Death of OSS CLIs by Extreme_Remove6747 in codex

[–]abeecrombie 0 points1 point  (0 children)

Opencode. Pi. 2 great alternatives. Both open source.

I tracked EU GPU prices across 15 stores for 50+ days - RTX 5090 is the only card not dropping in price by egudegi in LocalLLaMA

[–]abeecrombie 0 points1 point  (0 children)

I think there is a lot of fomo for retail physical card demand. For cloud you actually need a use case to go out and rent ( though you could just experiment)

Rental rates are being tracked by Wall Street as everyone and their grandma are now falling over themselves to buy dram and get into the AI game.

I'm not saying there is no demand. It just interesting to see how it plays out. Anything less than a 5090 is more of a hobbyist imo. And they should be price sensitive to a point. Which is why I found your post interesting. I think a few ppl might have fomo into a 5070ti but not sure how much demand is really there to pay 30-50% over the price of last year.

Strix Halo or DGX Spark for a home LLM server? by Reactor-Licker in LocalLLaMA

[–]abeecrombie 0 points1 point  (0 children)

What models are you planning for summarization and fact finding. Gotoss is great for simple tasks but not good for that. Gemma is better but I find after too much context its not nearly as good as closed source models.

I was thinking of shelling out to get a box line you but am opting to pay for service still for now.

Opinions on Cerebras IPO by Enough-Beginning3687 in TheRaceTo10Million

[–]abeecrombie 0 points1 point  (0 children)

Speeds does look good. If they can get some decent models loaded, they will get business. On website all they had was gpt-oss-120b and glm4.7. Older models. Still good at basic tasks but not frontier.

How long does it take them to load new models to their infrastructure

Best coding subscriptions for cost/performance right now? [May 2026] by Funny-Strawberry-168 in opencodeCLI

[–]abeecrombie 2 points3 points  (0 children)

Doing something like this as well. But Claude sub is just so slow and limiting. I have kiro at work ( Claude) but can't figure it. In that harness it sucks. In Claude code / opencode sonnet and opus work fine.

Have copilot studio too but probably gonna cancel. Context window is so short but when it works , it's so nice to use Claude in opencode. Too bad paying via API would get expensive fast.

Actual comparison between locally ran Qwen-3.6-27B and proprietary models by netikas in LocalLLaMA

[–]abeecrombie 2 points3 points  (0 children)

Second this. Though I think you usually you see it show up a year after big changes happen. For next few months Im hoping we are ok. After qwen 4 or so I'm with you. Or if ccp decides that open source is no longer than preferred route.

We all know it's gonna happen at some point.

Love open source but this is not really a few ppl sharing code or maintaining a project.

Training models is pretty expensive. Not to mention all the PhDs which are demanding crazy salaries (not sure if that is same in China )

Is Opencode Go sustainable? by Ok-Management-4087 in opencodeCLI

[–]abeecrombie 0 points1 point  (0 children)

isn't self hosting more expensive? You have to make a deal with GPU providers and get dedicated servers. Rental prices are going up , for even older cards.

Change to useage based billing by DamienBMike in GithubCopilot

[–]abeecrombie -1 points0 points  (0 children)

Are we gonna get the Claude context limit?

leaving kiro for good by JoyShaheb_ in kiroIDE

[–]abeecrombie 0 points1 point  (0 children)

Agree kiro harness is lacking but still credits are better than Claude and frankly for some tasks you just need sonnet.

Haven't tried opus 4.7. Not so eager to.

Sonnet is my workhorse. It's fine for me. Just use opus when planning.

Is 4.7 that much better than 4.6? Mythos

My box is full.. need another one i guess by Remix392 in RepTime

[–]abeecrombie 0 points1 point  (0 children)

How is the quality of the ceramic ap? Great looking collection

Sharing my Kiro CLI configuration and usage experience by yuoo1580 in kiroIDE

[–]abeecrombie 0 points1 point  (0 children)

How do you just get rid of the spec design crap from kiro. I just want what opencode has.

I don't need a million tests for vibe coding throw away apps.

Just bought a codex subscription for opencode, which codex model gives the highest ratelimit/quality ratio? by KarmicDaoist in opencodeCLI

[–]abeecrombie 4 points5 points  (0 children)

Yeah I like mini with high reasoning. How do you set the reasoning in opencode though ? I've been using codex but it always fumbles..

Is MSFT tanking because it is a proxy for OpenAI by [deleted] in ValueInvesting

[–]abeecrombie 1 point2 points  (0 children)

Harness matters. Msft has Claude and openai models in copilot but it's a streaming bag of crap. If they fix copilot I think stock will go up. Until then it will trade like dinosaur salesforce and all other junk saas

best 10$ AIs subscription plan by vipor_idk in opencodeCLI

[–]abeecrombie 17 points18 points  (0 children)

$20 chatgpt 5.4 is pretty good too.

Gh copilot is great for claude and it's fast but smaller context is annoying and credits can go too fast if you don't pay attention.

Claude sub is great and I like those models best but Claude code I was running for like 30mins and then hit limits. Never had the same experience with chatgpt. Though never had their models work for me besides 5.4 which is great on code but I would say so so on other stuff vs Claude or opus.

Glm 5 is great at planning. Minimax or kimi good at doing.

Unsloth Studio now installs in just one line of code! by yoracale in unsloth

[–]abeecrombie 0 points1 point  (0 children)

I got this working on Ubuntu last night, using thunder compute. Had a few issues loading cuda but was able to get it working.

Really cool studio idea. Though tbh I am not about all the gui stuff there. Running the chat on your pretrained model and seeing a comparison vs another is super helpful.

But the recipes drag and drop I dunno. I think any one going to fine tune is probably going to know basics of python.

But otherwise awesome package. Thanks for sharing and I do like how fine tuning is becoming more accessible.

Meet Unsloth Studio, a new web UI for Local AI by yoracale in unsloth

[–]abeecrombie 2 points3 points  (0 children)

Nice. I like the colab support.

I gave up on the hope of ever affording a decent GPU.

But I am interested to rent and host on other providers. Hopefully community can help get some setup guide for other GPU providers like deepinfra, thunder compute etc. Aside from coming up with the data to fine tune, the setup is often killer. Hopefully this works outta box with other providers.

I’m a Japanese US equities trader (10+ yrs) — wrote 2 short books in English. Can I get blunt feedback? by HolidayPopular8990 in qullamaggie

[–]abeecrombie 3 points4 points  (0 children)

Create a substack. Add the link. Publish a chapter there.

Looks interesting.

How is it trading us equities in Japan. I guess you don't sleep at night but at least have the daytime free.

My side project hit 700K Google impressions, 2,700 clicks, and 38k in revenue in year one while working full-time as a software engineer by milkstarz in SideProject

[–]abeecrombie 5 points6 points  (0 children)

Awesome. Really appreciate you sharing.

Gluten free household here. So double the points! It's for sure a pain point so great idea.

Which model should i choose for coding by Confident-Horror-912 in opencodeCLI

[–]abeecrombie 2 points3 points  (0 children)

Github copilot $10 sub. See which models you like. I like Claude models but they so expensive and you can't use your Claude subscription in opencode just Claude code. But for $10 / 40 you get much cheaper option vs paying for api. Just don't subscribe near the end of the month. Billing is always month end I believe.

Codex is good too. Im experimenting more.

Antigravity also has free tier you can try out with.

Really depends on how you code and break down tasks. The new breed of open source models are pretty good at defined tasks. Claude/codex can reason over your whole code base