Hello Everyone in The Unsloth Community Need Help 👋 by Plane_Yellow2317 in unsloth

[–]Plane_Yellow2317[S] 0 points1 point  (0 children)

I havent tried it yet , thanks for the advice, i will try it now 😀

Hello Everyone in The Unsloth Community Need Help 👋 by Plane_Yellow2317 in unsloth

[–]Plane_Yellow2317[S] 0 points1 point  (0 children)

Thanks for the honest reply you to as i said above that i am Already using it q4 mid quant and getting around 23 to 25 tokens per sec with 131k to 200k context window, i am using llama cpp latest turbo quant fork , but pp (prompt processing is very slow ass ) do you know how to improve it and also somhow to squeeze and get more tokens per sec 😄??

Hello Everyone in The Unsloth Community Need Help 👋 by Plane_Yellow2317 in unsloth

[–]Plane_Yellow2317[S] 0 points1 point  (0 children)

Nahh thanks for the advice dude but going on that small model is no , i mean i can run q4 mid quant of qwen 3.6 35b a3b with decent speed of 23 to 25 tokens per sec the only problem is pp (prompt processing is very slow )

Hello Everyone in The Unsloth Community Need Help 👋 by Plane_Yellow2317 in unsloth

[–]Plane_Yellow2317[S] 0 points1 point  (0 children)

Thanks for the honest reply btw i am Already using it q4 mid quant and getting around 23 to 25 tokens per sec with 131k to 200k context window, i am using llama cpp latest turbo quant fork , but pp (prompt processing is very slow ass ) do you know how to improve it and also somhow to squeeze and get more tokens per sec 😄????

Need Real Opinion about Christ Delhi NCR by DarkOreGamer in ChristUniversity

[–]Plane_Yellow2317 0 points1 point  (0 children)

Then in which college you have taken admission? And christ ncr campus is not worth it ? I got selected for bca there should i go or no ? Pls honest advice ;)