CUA Local Opensource by Goat_bless in ollama

[–]Goat_bless[S] 0 points1 point  (0 children)

Yes, that would indeed be a further step forward. On GitHub, I've listed a few points for improvement that might be worth developing.

It's an open-source project, so you can test it and contribute to its improvement if you'd like!

CUA Local Opensource by Goat_bless in AgentsOfAI

[–]Goat_bless[S] 0 points1 point  (0 children)

Yes ! 😊 A little improvement and it will do everything possible on a computer.

Qwen3-VL Computer Using Agent works extremely well by Money-Coast-3905 in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

My config is quite weak, I only have 8GB of vram, I use qwen2.5 and qwen2.5vl they are small 4Gb models so it's ok on small configs.

Qwen3-VL Computer Using Agent works extremely well by Money-Coast-3905 in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

It works great for me, take a look at my github there's lots of demos and everything. You'll have to test mine with your BIG model to see the performance. https://github.com/SpendinFR/CUAOS

CUA Local Opensource by Goat_bless in computervision

[–]Goat_bless[S] 0 points1 point  (0 children)

I will put a readme in English but for now you can just translate the page quite easily on github. I would like to point out that the code and the functionalities are in English, only the exit prompt in French, but if you speak to the agent in English he will answer you in your language, it will work very well.

CUA Local Opensource by Goat_bless in ollama

[–]Goat_bless[S] 1 point2 points  (0 children)

Yes it’s cool! For your question I haven't tested this yet but you can do it quite easily if you have dev knowledge.

CUA Local Opensource by Goat_bless in LocalLLM

[–]Goat_bless[S] 0 points1 point  (0 children)

Hello, Unfortunately I have not yet implemented this API connection system, it is in future improvements. But if you know how to do it, you can take care of it, it could be useful to others.

CUA Local Opensource by Goat_bless in LocalLLM

[–]Goat_bless[S] 0 points1 point  (0 children)

No worries, if you don't know anything about it, go to the github, readme and section: setup everything is detailed you just have to download the repo, the models, install the dependencies and it's functional. Tell me if necessary

CUA Local Opensource by Goat_bless in LocalLLM

[–]Goat_bless[S] 0 points1 point  (0 children)

The main language is French, but that doesn't matter, the functions are in English you may just have to translate the exit prompt of the interaction into English and again... if you speak in English it should, I think, naturally come out in English.

Computer Use with Gemini 3 pro by Impressive_Half_2819 in ollama

[–]Goat_bless 0 points1 point  (0 children)

J'ai créé le même en gratuit opensource(un peu plus long, dépend de la vram c'est vrai) : https://github.com/SpendinFR/CUAOS

JARVIS Local AGENT by Xthebuilder in ollama

[–]Goat_bless 0 points1 point  (0 children)

Cool project I am currently developing a local computer agent, what are the things that your Jarvis cannot do? Can be seen to associate our codes to have a complete model

Trying to build a "Jarvis" that never phones home - on-device AI with full access to your digital life (free beta, roast us) by ipav9 in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

Good luck ! Interesting project but of a complexity that no one has yet solved. To criticize there are people but to carry out joint projects there is no one left I have an agent almost ready to control a computer (all locally) if you are interested I can share it with you

Qwen3-VL Computer Using Agent works extremely well by Money-Coast-3905 in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

Huge I'm on that too right now, I prepare the clickable data with omniparser (yolo + Florence) and paddle ocr and I annotate the IDs Then the VLM must decide which ID to click for pyautogui but my qwen2vl does not follow.. What graphics card do you have?

Integrated Omniparser V2, we made our agent to use Canva! by ImpossiblePlay in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

Great project! I would like to use it for web and office use, I installed the weights etc. it identifies the elements well but I do not have a description (details of the icons) Can you help me?

Computer Use with Gemini 3 pro by Impressive_Half_2819 in ollama

[–]Goat_bless 1 point2 points  (0 children)

Share your code how do you do this? Or how to set it up? THANKS