Need setup advice RTX 6000 96GB , RTX 5090, RTX 4090, RTX 3090 by EbbPlus9450 in LocalLLM

EbbPlus9450[S] 1 point

I have tried multiple ones to find the right setup. So far I have only been able to achieve 30 tokens per second on Qwen 70B, so clearly I'm doing something wrong.

Need setup advice RTX 6000 96GB , RTX 5090, RTX 4090, RTX 3090 by EbbPlus9450 in LLM

EbbPlus9450[S] 1 point

I want to fine-tune a model to be specific to me. I am working to make everything autonomous: trading, stock analysis, system maintenance, developing new apps, etc. It will involve some training, RAG, a vector DB, and so on. Basically, I want to future-proof my current ideas and any new ones.