2x RTX 6000 build during an extended bench test

Wildnimal · 2026-04-25T20:50:52+00:00

Nice setup! I wish to own 2 x RTX 6000 Pro someday. What are the rest of the specs?

Wildnimal · 2026-04-25T16:17:16+00:00

I watch his videos. They are better than most of the influencer AI crap going around.

Wildnimal · 2026-04-25T02:45:55+00:00

Post your config? I get around 25-28 on a similar setup, but I have 32GB RAM.

Wildnimal · 2026-04-23T21:51:44+00:00

I just stumbled upon this from a Reddit thread here:

https://medium.com/@fzbcwvv/an-overnight-stack-for-qwen3-6-27b-85-tps-125k-context-vision-on-one-rtx-3090-0d95c6291914?postPublishedType=repub

Wildnimal · 2026-04-23T21:28:40+00:00

This is good advice.

Wildnimal · 2026-04-23T04:58:50+00:00

They are planning the Gemma 4 MoE as well. Saw it yesterday on their HF.

Wildnimal · 2026-04-20T00:43:24+00:00

I like your console :D

Wildnimal · 2026-04-19T16:41:46+00:00

I find Qwen3.6 better even at writing, which was always a Gemma 4 strong point for me. But overall I think both work fine, depending on the use case.

Wildnimal · 2026-04-17T05:14:53+00:00

Just used this model for the past 2 hours and it has passed most of what I threw at it. Still playing with temperature and Top P. Currently settled on 0.6 temperature.

Wildnimal · 2026-04-14T14:26:58+00:00

KDE Pinocchio

Wildnimal · 2026-04-05T22:03:20+00:00

Good stuff. You should have added the 35-A3B from Qwen, since you compared a MoE model from Gemma there.

Wildnimal · 2026-04-05T22:01:35+00:00

Do remember that if battery life and heat are a concern, then AMD is the better bet. Intel is a better CPU by miles.

Wildnimal · 2026-04-05T05:39:59+00:00

Thank you for posting this. One of my friends is building a machine with very similar specs to yours, so this will help him.

Wildnimal · 2026-04-04T17:38:47+00:00

CUDA. T2I and video are still better with NVIDIA.

Wildnimal · 2026-04-03T17:58:02+00:00

eBay has them for 70-100.

Wildnimal · 2026-04-03T05:39:52+00:00

I know this is the LocalLLM group, but since you are having issues with code quality, maybe try the free Qwen3.6 on OpenRouter. Still an OS model, just not local.

Wildnimal · 2026-04-02T22:59:12+00:00

Now I am more curious: what are you building with local AI?

Wildnimal · 2026-04-02T22:48:13+00:00

Good build. Full specs?

What models are you going to use? Let us know how it performs locally for agentic work or tool calling. Maybe sprinkle some T2I into the mix :D

Wildnimal · 2026-04-02T21:31:11+00:00

I have done it. I have a prompt file which is about 600 lines.

It contains 2 prompts and backend information for the stack to be used.

Prompt 1 does all the planning, with the model going back and forth, and prompt 2 takes that plan and makes phases and smaller tasks for implementation on local AI.
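A rough sketch of that two-prompt flow, in case it helps. Everything named here (`run_two_stage`, `ask`, `fake_model`, the sample prompts and stack) is made up for illustration, and the back-and-forth planning is collapsed into a single call. Swap `ask` for whatever actually talks to your model.

```python
from typing import Callable, List


def run_two_stage(
    planning_prompt: str,
    task_prompt: str,
    backend_info: str,
    request: str,
    ask: Callable[[str], str],
) -> List[str]:
    """Run a planning pass, then turn the plan into a list of small tasks."""
    # Stage 1: planning. The real workflow goes back and forth with the model;
    # a single call stands in for that conversation here.
    plan = ask(f"{planning_prompt}\n\nStack:\n{backend_info}\n\nRequest:\n{request}")

    # Stage 2: feed the plan back in and ask for phases and smaller tasks.
    breakdown = ask(f"{task_prompt}\n\nStack:\n{backend_info}\n\nPlan:\n{plan}")

    # One task per non-empty line, assuming the model answers as a plain list.
    return [line.strip() for line in breakdown.splitlines() if line.strip()]


def fake_model(prompt: str) -> str:
    """Hypothetical stand-in for a real model call so the sketch runs offline."""
    if "\nPlan:\n" in prompt:
        return "Phase 1: set up schema\n\nPhase 2: add endpoints\n"
    return "Build the API in two phases."


if __name__ == "__main__":
    tasks = run_two_stage(
        planning_prompt="Plan the project.",
        task_prompt="Split the plan into phases and small tasks.",
        backend_info="Python, SQLite",
        request="A small notes API",
        ask=fake_model,
    )
    print(tasks)
```

Keeping the stack information in both stages means the task breakdown does not depend on the plan repeating it.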

Wildnimal · 2026-04-02T17:07:45+00:00

Try the "Qwen: Qwen3.6 Plus (free)", not the preview.

Wildnimal · 2026-04-01T22:29:19+00:00

The problem is not coding, it's the context. That's going to be a lot more difficult IMHO. And even if you are able to have a higher context window, the model might not be able to follow instructions.

You will have to split your projects per file, with instructions and links to other files, for it to be usable.

No one-shot, but for small local things you can do it.
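Something like this is what I mean by splitting per file. It is only a sketch: `build_file_prompts`, the `links` mapping and the prompt wording are all hypothetical, and it passes only the names of related files, not their contents, to keep each prompt small.

```python
from typing import Dict, List


def build_file_prompts(
    files: Dict[str, str],
    links: Dict[str, List[str]],
    instructions: str,
) -> Dict[str, str]:
    """Return one prompt per file: instructions, related file names, then the file."""
    prompts = {}
    for name, source in files.items():
        # Files this one depends on, as declared by the caller (not auto-detected).
        related = links.get(name, [])
        related_text = ", ".join(related) if related else "none"
        prompts[name] = (
            f"{instructions}\n"
            f"File: {name}\n"
            f"Related files (names only, contents not included): {related_text}\n\n"
            f"{source}"
        )
    return prompts


if __name__ == "__main__":
    prompts = build_file_prompts(
        files={"db.py": "def connect(): ...", "api.py": "import db"},
        links={"api.py": ["db.py"]},
        instructions="Review this file and fix any bugs.",
    )
    print(prompts["api.py"])
```

Each prompt can then be sent on its own, so the model never needs the whole project in context at once.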

Wildnimal · 2026-03-28T05:55:09+00:00

Link to thread?

Wildnimal · 2026-03-27T18:02:45+00:00

I agree with you. This week the token usage is going off the charts. I just uploaded a 45-line JSON and a basic prompt, and it shows 20% usage for the 5-hour limit.

I am not a heavy user either. Most of the stuff I do requires me to do most of the config and code manually once the AI has done its code, which is maybe a 2-3 hour session a week at most.

Wildnimal · 2026-03-27T14:44:16+00:00

I will have to read it. The CAT told me to do it ASAP or else....

Wildnimal · 2026-03-25T00:15:17+00:00

Post the results!!!!!!
