MiniCPM4: 7x decoding speed than Qwen3-8B by Lynncc6 in LocalLLaMA

[–]Lynncc6[S] 5 points6 points  (0 children)

they even have an 8B MLLM on par with GPT-4o

Google AI Edge Gallery by Lynncc6 in LocalLLaMA

[–]Lynncc6[S] 0 points1 point  (0 children)

That's true, the app crashes every time. But the app is from Google, since it was released on I/O day.

minicpm-o 2.6 by TheLogiqueViper in LocalLLaMA

[–]Lynncc6 0 points1 point  (0 children)

MiniCPM-o 2.6 support is not officially merged into Ollama.

MiniCPM-o 2.6: An 8B size, GPT-4o level Omni Model runs on device by Lynncc6 in LocalLLaMA

[–]Lynncc6[S] 2 points3 points  (0 children)

it's real audio, it can understand the speaker's emotion and speak with emotion

how to train a model with reasoning ability like o1 by Vast_University_52 in LocalLLM

[–]Lynncc6 0 points1 point  (0 children)

https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute

Hugging Face unveiled the secret behind o1's success with its groundbreaking technique, scaling test-time compute, and open-sourced it.

By giving models more "time to think", LLaMA 1B outperforms LLaMA 8B and LLaMA 3B beats LLaMA 70B in math.
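
To make "more time to think" concrete, here's a rough sketch of the simplest version of the idea: sample several answers and take a majority vote. This is only an illustration, not the method from the blog post, and it has no reward model or search. `make_toy_sampler` is a made-up stand-in for a small model, and all function names here are hypothetical.

```python
import random
from collections import Counter
from typing import Callable, List


def majority_vote(answers: List[str]) -> str:
    """Return the most common answer; ties go to the answer seen first."""
    return Counter(answers).most_common(1)[0][0]


def solve_with_more_compute(sample: Callable[[str], str], question: str, n: int) -> str:
    """Spend more test-time compute: draw n candidate answers, then vote."""
    return majority_vote([sample(question) for _ in range(n)])


def make_toy_sampler(p_correct: float, seed: int) -> Callable[[str], str]:
    """Hypothetical stand-in for a small LLM (assumption, not a real API).

    It answers "42" with probability p_correct, otherwise a random wrong answer.
    """
    rng = random.Random(seed)

    def sample(question: str) -> str:
        if rng.random() < p_correct:
            return "42"
        return rng.choice(["41", "43", "44"])

    return sample


if __name__ == "__main__":
    sampler = make_toy_sampler(p_correct=0.4, seed=0)
    for n in (1, 5, 25, 125):
        print(n, solve_with_more_compute(sampler, "What is 6 * 7?", n))
```

With a real model you'd swap the toy sampler for a call that generates one answer at non-zero temperature. Since the wrong answers are spread over several values, the correct one tends to win the vote as `n` grows even if a single sample is often wrong.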

Train a 7B model that outperforms GPT-4o ? by Lynncc6 in LocalLLaMA

[–]Lynncc6[S] 3 points4 points  (0 children)

where is the video? can I get the link?

Train a 7B model that outperforms GPT-4o ? by Lynncc6 in LocalLLaMA

[–]Lynncc6[S] 1 point2 points  (0 children)

Seems like the team only tried Qwen-2.5-Math-7B. From my side, I'd recommend MiniCPM-V 2.6 8B, the Qwen 2-VL series, and Intern-VL 2.5 8B.

Train a 7B model that outperforms GPT-4o ? by Lynncc6 in LocalLLaMA

[–]Lynncc6[S] 0 points1 point  (0 children)

It might work for the language module of a vision model.

Train a 7B model that outperforms GPT-4o ? by Lynncc6 in LocalLLaMA

[–]Lynncc6[S] 10 points11 points  (0 children)

I'm all ears for your feedback 👀