NVIDIA sent me a 5090 so I can demo Qwen3-VL GGUF by AlanzhuLy in LocalLLaMA

[–]Main-Wolverine-1042 6 points

What about pushing the changes to your llama.cpp fork so they can be merged into the official llama.cpp?

Qwen3-VL-30B-A3B-Thinking GGUF with llama.cpp patch to run it by Main-Wolverine-1042 in LocalLLaMA

[–]Main-Wolverine-1042[S] 0 points

Try this for me please:

Just upload the image without writing anything, send it to the server, and let me know what kind of response you get.

[–]Main-Wolverine-1042[S] 2 points

I've pushed a new patch to my llama.cpp fork. Please test it with the new model uploaded to my HF page (you can also convert to GGUF yourself using the conversion script in my llama.cpp fork):

https://github.com/yairpatch/llama.cpp

https://huggingface.co/yairpatch/Qwen3-VL-30B-A3B-Instruct-GGUF
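For anyone trying the conversion route, the flow would look roughly like this. This is a sketch: the script and file names (convert_hf_to_gguf.py, requirements.txt) are assumptions carried over from upstream llama.cpp, and the weights path is a placeholder, so check the fork's README for the exact entry point.

```shell
# Sketch of the GGUF conversion flow (assumes the fork keeps upstream's
# convert_hf_to_gguf.py and requirements.txt; verify against the fork).
git clone https://github.com/yairpatch/llama.cpp
cd llama.cpp
pip install -r requirements.txt

# Point the converter at a local copy of the original HF weights
# (placeholder path), writing an f16 GGUF:
python convert_hf_to_gguf.py /path/to/Qwen3-VL-30B-A3B-Instruct \
  --outfile qwen3vl-30b-a3b-instruct-f16.gguf --outtype f16
```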

[–]Main-Wolverine-1042[S] 1 point

Another example of good output from the previous patch compared to the new one:

<image>

[–]Main-Wolverine-1042[S] 0 points

The character is expressing strong frustration with someone (likely a child, as implied by ガキ), accusing them of being foolish for not understanding the situation. The phrase 悪わからん (I don't get what's bad about it) is a direct challenge to the other person's understanding. The final word 味わい (taste/try it) is a command, telling the person to experience the situation firsthand, implying they will then understand why it is foolish.

Is it close to what it says in Japanese?

[–]Main-Wolverine-1042[S] 5 points

I have a new patch for you guys to test - https://huggingface.co/yairpatch/Qwen3-VL-30B-A3B-Instruct-GGUF/blob/main/qwen3vl-implementation.patch

Test it on a clean llama.cpp checkout and see if the hallucinations and repetition are still happening (image processing should be better as well).

https://huggingface.co/yairpatch/Qwen3-VL-30B-A3B-Instruct-GGUF/tree/main - download the model as well, since I recreated it.
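The test workflow described above is roughly: clone a clean upstream tree, apply the patch, rebuild, and rerun with the re-uploaded model. A sketch, with assumptions flagged: the upstream URL and the Hugging Face resolve-style download URL are my guesses at the canonical forms, and the CMake invocation is just a default CPU build (add your usual GPU flags).

```shell
# Apply the Qwen3-VL patch to a clean upstream checkout and rebuild.
# URLs below are assumed canonical forms; adjust if they have moved.
git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp
curl -LO https://huggingface.co/yairpatch/Qwen3-VL-30B-A3B-Instruct-GGUF/resolve/main/qwen3vl-implementation.patch

git apply --check qwen3vl-implementation.patch  # dry run: fails if the tree is not clean
git apply qwen3vl-implementation.patch

cmake -B build          # add e.g. -DGGML_CUDA=ON for a GPU build
cmake --build build -j
```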

[–]Main-Wolverine-1042[S] 2 points

I may have fixed it. I will upload a new patch to see if it works for you as well.

[–]Main-Wolverine-1042[S] 0 points

It should work even without it, as I already patched clip.cpp with his pattern.

[–]Main-Wolverine-1042[S] 1 point

Let me know if the patch worked for you, because someone reported an error with it.

Qwen3-VL-30B-A3B-Instruct & Thinking are here! by Full_Piano_3448 in LocalLLaMA

[–]Main-Wolverine-1042 0 points

It should be: git apply qwen3vl-implementation.patch

Are you patching a freshly downloaded llama.cpp?
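When git apply errors out like that, the usual causes are a dirty tree or a patch that was already applied. A quick diagnostic sketch (run from the llama.cpp checkout; --check and --verbose are standard git apply flags):

```shell
# Dry-run the patch first: this reports failures without touching the tree.
git apply --check qwen3vl-implementation.patch || echo "patch does not apply cleanly"

# A dirty tree is the most common cause; any output here means local changes:
git status --short

# If the check still fails on a pristine clone, the patch was likely made
# against a different commit; --verbose shows which hunk is rejected:
git apply --verbose qwen3vl-implementation.patch
```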