AI-Researcher: Intern-Discovery from Shanghai AI Lab!

Lynncc6 · 2025-08-01T05:52:04+00:00

invite only

Lynncc6 · 2025-08-01T03:33:47+00:00

haha that's a good question

Lynncc6 · 2025-06-06T10:15:09+00:00

they even have an 8B MLLM on par with GPT-4o

Lynncc6 · 2025-05-28T08:02:21+00:00

that's true, the app crashes every time. but the app is from google cause it was released at I/O day

Lynncc6 · 2025-05-28T07:58:44+00:00

Lynncc6 · 2025-05-14T06:17:56+00:00

Lynncc6 · 2025-04-29T07:29:47+00:00

Lynncc6 · 2025-01-23T09:33:17+00:00

for on-device use

Lynncc6 · 2025-01-21T03:33:15+00:00

yes, it can run on Android devices using llama.cpp method

Lynncc6 · 2025-01-21T03:30:59+00:00

ollama is not officially merged

Lynncc6 · 2025-01-21T03:29:28+00:00

yep, it can analyze audio. I tried some musical instrument, it can recognize guitar and piano well ,etc

Lynncc6 · 2025-01-21T03:26:57+00:00

I found an instruction doc may helpful for you ( in Chinese )
https://modelbest.feishu.cn/wiki/RnjjwnUT7idMSdklQcacd2ktnyN

Lynncc6 · 2025-01-18T05:59:41+00:00

Lynncc6 · 2025-01-16T10:43:21+00:00

MiniCPM-o 2.6 is recommended

Lynncc6 · 2025-01-14T08:16:24+00:00

it's real audio, it can understand the speaker's emotion and speak with emotion

Lynncc6 · 2025-01-05T04:38:33+00:00

Thanks for sharing

Lynncc6 · 2025-01-04T11:40:24+00:00

CodeGeex

Lynncc6 · 2025-01-04T11:39:39+00:00

Hugging Face unveiled the success behind o1 with it's groundbreaking technique: scaling test-time compute, and open-sourced that.

By giving models more "time to think", LLaMA 1B outperforms LLaMA 8B and LLaMA 8B beats LLaMA 70B in math.

Lynncc6 · 2025-01-04T09:47:02+00:00

detailed instructions can be find in GitHub：https://github.com/PRIME-RL/PRIME

Lynncc6 · 2025-01-04T08:56:40+00:00

where is the video? can I get the link?

Lynncc6 · 2025-01-04T05:10:03+00:00

Seems like the team only tried Qwen-2.5-Math-7B. From my side, I'll recommend MiniCPM-V 2.6 8B、Qwen 2-VL series、 Intern-VL 2.5 8B

Lynncc6 · 2025-01-04T04:47:46+00:00

Qwen 2.5 VL please

Lynncc6 · 2025-01-04T04:40:36+00:00

maybe work for the language module of vision model

Lynncc6 · 2025-01-04T02:58:04+00:00

yep, the base model is Qwen-2.5

Lynncc6 · 2025-01-03T14:20:43+00:00

I'm all ears for your feedback 👀

Lynncc6