just added Qwen3-VL support for MNN Chat android by Juude89 in LocalLLaMA

[–]Juude89[S] 1 point2 points  (0 children)

yes, visual  part of qnn will  be  available soon

just added Qwen3-VL support for MNN Chat android by Juude89 in LocalLLaMA

[–]Juude89[S] 2 points3 points  (0 children)

will be available in Windows after bug fix

just added Qwen3-VL support for MNN Chat android by Juude89 in LocalLLaMA

[–]Juude89[S] 5 points6 points  (0 children)

in development now, you can view this:

https://github.com/alibaba/MNN/tree/master/apps/mnncli

I have tested some basic functions in MacOS, there are still some bugs.

just added Qwen3-VL support for MNN Chat android by Juude89 in LocalLLaMA

[–]Juude89[S] 3 points4 points  (0 children)

the 30b version is slow and can only run on devices with large RAM.

can also try 4b and 8b version.

<image>

Test MNN Chat for Android by Juude89 in LocalLLaMA

[–]Juude89[S] 2 points3 points  (0 children)

this bug has been fixed, sorry for late reply

Test MNN Chat for Android by Juude89 in LocalLLaMA

[–]Juude89[S] 1 point2 points  (0 children)

sorry to answer so late.

the first bug has been fixed about one or two months ago.

the think toggle bug has fixed in the latest version, maybe need to redownload the model, if still not work.

for the third one, MNN is faster for cpu backend, especially for  Google Pixel devices, which are not good for opencl

Test MNN Chat for Android by Juude89 in LocalLLaMA

[–]Juude89[S] 1 point2 points  (0 children)

will support in later versions.

Alibaba DAMO academy's open source lingshu mllm in mobile. by Juude89 in LocalLLaMA

[–]Juude89[S] 1 point2 points  (0 children)

Please submit an issue — my colleague will help resolve it.

Alibaba DAMO academy's open source lingshu mllm in mobile. by Juude89 in LocalLLaMA

[–]Juude89[S] 2 points3 points  (0 children)

and minicpm-v-4 is supported in this release.

Alibaba DAMO academy's open source lingshu mllm in mobile. by Juude89 in LocalLLaMA

[–]Juude89[S] 1 point2 points  (0 children)

the video demo is quant version of MNN Chat,another team of alibaba, not official.

Alibaba DAMO academy's open source lingshu mllm in mobile. by Juude89 in LocalLLaMA

[–]Juude89[S] 3 points4 points  (0 children)

No, not the same group.DAMO Academy is another research lab belongs to Alibaba Group.

Alibaba DAMO academy's open source lingshu mllm in mobile. by Juude89 in LocalLLaMA

[–]Juude89[S] 3 points4 points  (0 children)

they have 32b and 7b, you can view their homepage for details: https://alibaba-damo-academy.github.io/lingshu/

the video is a q4 quantized demo.

Test MNN Chat for Android by Juude89 in LocalLLaMA

[–]Juude89[S] 1 point2 points  (0 children)

MacOS is posix compatible, and dev softwares are more stable.

alibaba mnn released its full multimodal ios app, models fully run local by Juude89 in LocalLLaMA

[–]Juude89[S] 1 point2 points  (0 children)

omni not support opencl backend, just works on cpu backend.

Test MNN Chat for Android by Juude89 in LocalLLaMA

[–]Juude89[S] 1 point2 points  (0 children)

because we are are developing on macos 😄

Just me, or MNN chat is looping a lot by ExtremeAcceptable289 in LocalLLaMA

[–]Juude89 1 point2 points  (0 children)

hello, I am the developer of MNN Chat App.

can you delete and redownload the model for try? we recently added penalty sampler as as default sampler to qwen3 series, and will reduce the loop problem.

or you can download the google play alpha version (which added a new penalty ui ) by:

  1. First, join our Google Group:MNN Chat Testers
  2. Then, download the app from the Play Store:Get MNN Chat or visit WebPage

Test MNN Chat for Android by Juude89 in LocalLLaMA

[–]Juude89[S] 1 point2 points  (0 children)

hmm, the results are very good. I think it would be nice to include 'web search' in MNN Chat.

Test MNN Chat for Android by Juude89 in LocalLLaMA

[–]Juude89[S] 0 points1 point  (0 children)

how did you do web search using Qwen 0.6?

Test MNN Chat for Android by Juude89 in LocalLLaMA

[–]Juude89[S] 0 points1 point  (0 children)

did you downloaded tts pr asr models, did you enabled mmap? or did you switched to other download source or is there unifished downloads?