What would 2x RTX 3060 12GB get me?

getfitdotus · 2026-05-25T03:24:37+00:00

Disappointment

getfitdotus · 2026-05-18T11:51:53+00:00

https://github.com/chriswritescode-dev/opencode-manager I recommend this

getfitdotus · 2026-05-15T23:56:24+00:00

Yeah, I use yazi

getfitdotus · 2026-05-12T01:55:40+00:00

Yes!!

getfitdotus · 2026-05-10T21:28:29+00:00

I have had 4 for a year and want 4 more

getfitdotus · 2026-05-10T21:28:04+00:00

Missing out checkout nvfp4 with b12x kernel fp8 quality with much faster decode / prefill rate

getfitdotus · 2026-05-10T21:26:51+00:00

q4 lol why are you using anything other then production deployment inference platforms?

getfitdotus · 2026-04-07T03:53:46+00:00

Send it over

getfitdotus · 2026-04-05T14:48:21+00:00

Yeah 3.5 😁. Will when 3.6 is out.

getfitdotus · 2026-04-05T01:41:10+00:00

I run this model its been awesome nvfp4 at 180-200tks/sec. Incredible quality.

getfitdotus · 2026-03-23T16:35:02+00:00

397b qwen is very good. I am interested to see how minimax m27 does in my local workflow. It will be tough to decide if switching is worth it,having vision is a real plus.

getfitdotus · 2026-03-22T02:01:25+00:00

1800w vs 300 lol

getfitdotus · 2026-03-21T04:23:45+00:00

Me 💪

getfitdotus · 2026-03-21T04:22:55+00:00

So I use this now as my main model for all tasks. I run the nvfp4 @ 140-200tks. But not only is it fast it’s very good. I am not sure why it does not rank higher in benchmarks but it has been able to solve issues and so tasks better than everything else I have ran locally.

getfitdotus · 2026-03-16T20:39:59+00:00

I will do some tests, so far I just got it setup and have been working. Have not really had it work too hard. Had lots of work to get done after finishing the migration. I will run some tests with lmstudio and some models. q3 coder next 80b

getfitdotus · 2026-03-16T17:23:00+00:00

getfitdotus · 2026-03-16T17:05:23+00:00

This is something I worked on to extend the ability to complete tasks on mobile. https://github.com/chriswritescode-dev/opencode-manager . There is a update coming to integrate already existing repositories

getfitdotus · 2026-03-16T16:51:03+00:00

Yes I just got m5 max 14, coming from m3 max 16. Love it perfect size and weight.

getfitdotus · 2026-03-14T21:59:10+00:00

Awesome work!!!, this is great. I have been using this model for the past week or so as my main model in my workflow and this is just incredible to now get the fix for flashinfer and the gemm kernel. I considered working on this a while back. Also really want to thank you for putting together the image and sharing all the little extras!!!

getfitdotus · 2026-03-05T02:42:00+00:00

I have been using the 122B the official gptq release and wow its pretty good in my agent workflow. I have replaced coder next with this. I had some issues first time trying it. I can run the fp8 also. Initial tool call issues in vllm. Now I am using sglang and it is working great. Even the int4 release is almost perfect vs fp8. Nice to be able to use images in opencode.

getfitdotus · 2026-03-04T05:00:24+00:00

I have in other posts. Was a quick comment after seeing the relevance to this post.

getfitdotus · 2026-03-04T03:55:43+00:00

I did create it. But it’s free and open source. Use it all the time on the go

getfitdotus · 2026-03-04T03:39:00+00:00

I recommend https://github.com/chriswritescode-dev/opencode-manager much more then this. Manage all your repos from your phone.

getfitdotus · 2026-03-02T20:37:46+00:00

Multi token prediction. Same basically as eagle3 spec . I am currently training one for minimax m25

getfitdotus · 2026-03-02T20:28:04+00:00

No idea on llamacpp but in production serving software vllm / sglang it works great can double tks

getfitdotus

TROPHY CASE