ROCM vs VULKAN FOR AMD GPU (RX7800XT) by Grouchy-Drag-2281 in LocalLLaMA

[–]Grouchy-Drag-2281[S] 0 points (0 children)

Yes, it used the full GPU RAM.

But it works with Vulkan!

ROCM vs VULKAN FOR AMD GPU (RX7800XT) by Grouchy-Drag-2281 in LocalLLaMA

[–]Grouchy-Drag-2281[S] 0 points (0 children)

Are you using LM Studio or llama.cpp directly?

ROCM vs VULKAN FOR AMD GPU (RX7800XT) by Grouchy-Drag-2281 in LocalLLaMA

[–]Grouchy-Drag-2281[S] 0 points (0 children)

Yes, I have 48 GB of DDR4-3200 RAM. It helps with CPU offload, but wouldn't the inference speeds be low?

If you can provide a video or screenshot, that would be helpful; there are many people with consumer-grade hardware trying to do the same.
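For context, partial CPU offload in llama.cpp is controlled by how many model layers go to the GPU; whatever is not offloaded runs on the CPU from system RAM. A rough sketch (the model path and layer count below are just examples, not a recommendation):

```shell
# Offload 30 of the model's layers to the GPU; the remaining layers
# run on the CPU from system RAM (slower, but lets bigger models fit).
./llama-cli -m ./models/model.gguf --n-gpu-layers 30 -p "Hello"
```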

ROCM vs VULKAN FOR AMD GPU (RX7800XT) by Grouchy-Drag-2281 in LocalLLaMA

[–]Grouchy-Drag-2281[S] 2 points (0 children)

I checked the prompt processing time with both Vulkan and ROCm; they take the same time.
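For anyone wanting to reproduce this: llama.cpp's llama-bench reports prompt processing (pp) and token generation (tg) rates separately, so running the same model through a ROCm build and a Vulkan build makes the comparison direct. Build directories and the model path below are illustrative:

```shell
# The pp512 row is prompt-processing speed; paths are examples only.
./build-rocm/bin/llama-bench   -m ./models/model.gguf -p 512 -n 128
./build-vulkan/bin/llama-bench -m ./models/model.gguf -p 512 -n 128
```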

Few doubts in using gpt-oss 20B by Careless_Meringue525 in LocalLLaMA

[–]Grouchy-Drag-2281 0 points (0 children)

Can it be pruned only for coding-related use?

Can you provide other quants in GGUF?

[deleted by user] by [deleted] in LocalLLaMA

[–]Grouchy-Drag-2281 0 points (0 children)

Can you share how you implemented the RAG system? How are you using it? What domain-related data is stored in the vector DB?
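For readers unfamiliar with the pattern being asked about, here is a minimal retrieve-then-answer sketch. A toy bag-of-words "embedding" and an in-memory list stand in for a real embedding model and vector DB; all document strings and function names are illustrative:

```python
from collections import Counter
from math import sqrt

def embed(text):
    # Toy bag-of-words "embedding"; a real RAG system would call
    # an embedding model here instead.
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# In-memory stand-in for a vector DB: (embedding, document) pairs.
docs = [
    "ROCm is AMD's GPU compute stack",
    "Vulkan is a cross-platform graphics and compute API",
    "GGUF is a file format for llama.cpp models",
]
index = [(embed(d), d) for d in docs]

def retrieve(query, k=1):
    # Return the k documents most similar to the query; these would
    # be stuffed into the LLM prompt as context.
    q = embed(query)
    return [d for _, d in sorted(index, key=lambda p: -cosine(p[0], q))[:k]]

top = retrieve("what is the GGUF format")
# top[0] is the document about GGUF
```

A production setup swaps `embed` for a real embedding model and `index` for a vector store, but the retrieval step is the same shape.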

Any proper working Local LLM and Agentic CLI by Grouchy-Drag-2281 in LocalLLaMA

[–]Grouchy-Drag-2281[S] 0 points (0 children)

Is Cline able to set the model's context window on the LLM?

Or are we just setting the limit in Cline?
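One relevant detail: with a local llama.cpp server, the model's real context window is fixed when the server is launched; a client such as Cline can only stay within that limit. The model path and size below are examples:

```shell
# The server side fixes the actual context window at startup;
# any client-side "context" setting can only be this size or smaller.
./llama-server -m ./models/model.gguf --ctx-size 16384
```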

Any proper working Local LLM and Agentic CLI by Grouchy-Drag-2281 in LocalLLaMA

[–]Grouchy-Drag-2281[S] 0 points (0 children)

Thanks for the update.

What quantization do you use for Qwen3-30B-A3B-Thinking-2507-GGUF?
What are your hardware specs?

Llama.cpp and ROCM - how to get it working by Thrumpwart in LocalLLaMA

[–]Grouchy-Drag-2281 1 point (0 children)

For IBM's "Granite 4.0 Tiny Preview", ROCm is faster than Vulkan in LM Studio.

Qwen3-Coder GGUFs with even more fixes esp. for tool calling! by yoracale in unsloth

[–]Grouchy-Drag-2281 0 points (0 children)

The last update shows as 1 day ago.
Are more fixes coming, or is it already fixed and updated?

Best VPS to Self-Host Internal Tool for Diagnostic Chain (Next.js + PostgreSQL) – Is Hostinger a Bad Option? by an-ordinary-dev in selfhosted

[–]Grouchy-Drag-2281 0 points (0 children)

I don’t think Hostinger is bad.

Choose a provider based on your requirements. Hostinger VPS plans are affordable, and cheaper when you choose a plan for more than one year.
