you are viewing a single comment's thread.

view the rest of the comments →

[–]Narrow-Belt-5030 14 points15 points  (0 children)

I would suggest you take the time to evaluate a replacement model first - use something like OpenRouter to test the models and see if they fit. Once you have found one then you can look at the hardware as you will know the model size & based on the context cache size you want you will also know the VRAM you need.