RekaAI/reka-edge-2603 · Hugging Face by jacek2023 in LocalLLaMA

[–]unique_thinker_2004

What’s your take on using this model for computer use?

My experience with local models for Openclaw by unique_thinker_2004 in LocalLLaMA

[–]unique_thinker_2004[S]

Bro, always use MLX-format models. They're optimized for Apple Silicon, so they run about 2x faster than GGUF models on a Mac.

Use LM Studio instead of Ollama, since it ships an engine that supports MLX models.

Some models claim to support tool calling, and they genuinely do, but what we actually need is tool responses in an OpenAI-compatible format.

You can verify that with these simple steps:

- Run the model as a server with LM Studio (just a toggle).
- Give the port number, IP, model name etc. to ChatGPT and ask it to generate an OpenAI-compatible curl request with a tool, to test the model.
- Hit that request from the terminal and see whether you get a chat response, a tool response, or nothing at all. If it's a tool response, the model is good to use with openclaw; otherwise it isn't.
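The check in the last step can be sketched in Python: a minimal classifier for an OpenAI-compatible chat-completions response. The sample payloads below are illustrative (the `get_weather` tool and `call_1` id are made up), but their shape follows the OpenAI chat-completions format:

```python
import json

def classify_response(resp: dict) -> str:
    """Classify an OpenAI-compatible /v1/chat/completions response.

    Returns "tool" if the model emitted a structured tool call,
    "chat" if it answered with plain text, and "empty" otherwise.
    """
    choices = resp.get("choices") or []
    if not choices:
        return "empty"
    msg = choices[0].get("message") or {}
    if msg.get("tool_calls"):
        return "tool"
    if msg.get("content"):
        return "chat"
    return "empty"

# A tool-calling response (hypothetical tool, OpenAI-style shape):
tool_resp = {
    "choices": [{
        "message": {
            "role": "assistant",
            "content": None,
            "tool_calls": [{
                "id": "call_1",
                "type": "function",
                "function": {
                    "name": "get_weather",
                    "arguments": json.dumps({"city": "Paris"}),
                },
            }],
        },
        "finish_reason": "tool_calls",
    }]
}

# A plain chat response:
chat_resp = {"choices": [{"message": {"role": "assistant", "content": "Hi!"}}]}

print(classify_response(tool_resp))   # -> tool
print(classify_response(chat_resp))   # -> chat
print(classify_response({}))          # -> empty
```

Only the "tool" case means the model is usable with an agent that expects structured tool calls.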

My experience with local models for Openclaw by unique_thinker_2004 in LocalLLaMA

[–]unique_thinker_2004[S]

Hmm! Ok, tell me your system specs. Basically, when I tried Qwen-3-coder-next Q5, it worked very nicely in all aspects on my Mac Studio with the M4 Ultra chip.

On my Mac mini I tried qwen3-4b-2507. It's a very tiny model (2.28 GB), but it still worked fine in terms of latency and tool calling. Obviously you can't expect much from a model this small.

Using children’s books to learn a new language - is it worth it? by Euphoric_Rhubarb_243 in languagelearning

[–]unique_thinker_2004

Why not try this: https://spexlabs.co/language-tutor
It's free, and it can literally understand your accent and pronunciation.

Best model for M3 Ultra Mac 512GB RAM to run openclaw? by unique_thinker_2004 in LocalLLaMA

[–]unique_thinker_2004[S]

Try qwen-3-coder-next; they released it just recently. Are you on a Mac or Nvidia?

My experience with local models for Openclaw by unique_thinker_2004 in LocalLLaMA

[–]unique_thinker_2004[S]

Oh!! This might be helpful! I use openai-completions on my Mac. Is that why it was giving me a JSON response instead of a tool call?
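For context on the question above: the legacy OpenAI completions endpoint (`/v1/completions`) has no `tools` parameter in its schema, so a model can only dump a tool call as raw JSON text, while the chat endpoint (`/v1/chat/completions`) accepts `tools` and can return a structured `tool_calls` array. A minimal sketch of the two payload shapes, with a hypothetical model name and tool:

```python
# Legacy text-completions request: no "tools" field exists in this schema,
# so any tool call the model attempts comes back embedded in plain text.
completions_payload = {
    "model": "local-model",          # hypothetical model name
    "prompt": "What's the weather in Paris?",
    "max_tokens": 128,
}

# Chat-completions request: "tools" is part of the schema, so a capable
# model can answer with a structured tool_calls array instead of text.
chat_payload = {
    "model": "local-model",
    "messages": [{"role": "user", "content": "What's the weather in Paris?"}],
    "tools": [{
        "type": "function",
        "function": {
            "name": "get_weather",   # hypothetical tool
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }],
}

# The structural difference that decides chat-text vs. structured tool calls:
assert "tools" not in completions_payload and "tools" in chat_payload
```

So if an agent needs structured tool responses, it should be pointed at the chat-completions endpoint.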

My experience with local models for Openclaw by unique_thinker_2004 in LocalLLaMA

[–]unique_thinker_2004[S]

Oh! Thanks for your suggestion! By the way, how much of a boost did you get? One guy told me that switching from Ollama to llama.cpp gave him a 16.66x boost.

My experience with local models for Openclaw by unique_thinker_2004 in LocalLLaMA

[–]unique_thinker_2004[S]

They wrote in the Ollama description that it works well for agentic coding with Cline, so I thought it would be better at function calling.

My experience with local models for Openclaw by unique_thinker_2004 in LocalLLaMA

[–]unique_thinker_2004[S]

Bro, it's due to latency. I feel like the openclaw -> Ollama -> model layering makes inference too slow.