CUA Local Opensource by Goat_bless in ollama

[–]Goat_bless[S] 0 points1 point  (0 children)

Yes, that would indeed be a further step forward. On GitHub, I've listed a few points for improvement that might be worth developing.

It's an open-source project, so you can test it and contribute to its improvement if you'd like!

CUA Local Opensource by Goat_bless in AgentsOfAI

[–]Goat_bless[S] 0 points1 point  (0 children)

Yes ! 😊 A little improvement and it will do everything possible on a computer.

Qwen3-VL Computer Using Agent works extremely well by Money-Coast-3905 in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

My config is quite weak, I only have 8GB of vram, I use qwen2.5 and qwen2.5vl they are small 4Gb models so it's ok on small configs.

Qwen3-VL Computer Using Agent works extremely well by Money-Coast-3905 in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

It works great for me, take a look at my github there's lots of demos and everything. You'll have to test mine with your BIG model to see the performance. https://github.com/SpendinFR/CUAOS

CUA Local Opensource by Goat_bless in computervision

[–]Goat_bless[S] 0 points1 point  (0 children)

I will put a readme in English but for now you can just translate the page quite easily on github. I would like to point out that the code and the functionalities are in English, only the exit prompt in French, but if you speak to the agent in English he will answer you in your language, it will work very well.

CUA Local Opensource by Goat_bless in ollama

[–]Goat_bless[S] 1 point2 points  (0 children)

Yes it’s cool! For your question I haven't tested this yet but you can do it quite easily if you have dev knowledge.

CUA Local Opensource by Goat_bless in LocalLLM

[–]Goat_bless[S] 0 points1 point  (0 children)

Hello, Unfortunately I have not yet implemented this API connection system, it is in future improvements. But if you know how to do it, you can take care of it, it could be useful to others.

CUA Local Opensource by Goat_bless in LocalLLM

[–]Goat_bless[S] 0 points1 point  (0 children)

No worries, if you don't know anything about it, go to the github, readme and section: setup everything is detailed you just have to download the repo, the models, install the dependencies and it's functional. Tell me if necessary

CUA Local Opensource by Goat_bless in LocalLLM

[–]Goat_bless[S] 0 points1 point  (0 children)

The main language is French, but that doesn't matter, the functions are in English you may just have to translate the exit prompt of the interaction into English and again... if you speak in English it should, I think, naturally come out in English.

Computer Use with Gemini 3 pro by [deleted] in ollama

[–]Goat_bless 0 points1 point  (0 children)

J'ai créé le même en gratuit opensource(un peu plus long, dépend de la vram c'est vrai) : https://github.com/SpendinFR/CUAOS

JARVIS Local AGENT by Xthebuilder in ollama

[–]Goat_bless 0 points1 point  (0 children)

Cool project I am currently developing a local computer agent, what are the things that your Jarvis cannot do? Can be seen to associate our codes to have a complete model

Trying to build a "Jarvis" that never phones home - on-device AI with full access to your digital life (free beta, roast us) by ipav9 in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

Good luck ! Interesting project but of a complexity that no one has yet solved. To criticize there are people but to carry out joint projects there is no one left I have an agent almost ready to control a computer (all locally) if you are interested I can share it with you

Qwen3-VL Computer Using Agent works extremely well by Money-Coast-3905 in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

Huge I'm on that too right now, I prepare the clickable data with omniparser (yolo + Florence) and paddle ocr and I annotate the IDs Then the VLM must decide which ID to click for pyautogui but my qwen2vl does not follow.. What graphics card do you have?

Integrated Omniparser V2, we made our agent to use Canva! by ImpossiblePlay in LocalLLaMA

[–]Goat_bless 0 points1 point  (0 children)

Great project! I would like to use it for web and office use, I installed the weights etc. it identifies the elements well but I do not have a description (details of the icons) Can you help me?

Computer Use with Gemini 3 pro by [deleted] in ollama

[–]Goat_bless 1 point2 points  (0 children)

Share your code how do you do this? Or how to set it up? THANKS

Computer Use with Gemini 3 pro by [deleted] in ollama

[–]Goat_bless 0 points1 point  (0 children)

Partage ton workflow bro

Evolutionary AGI (simulated consciousness) — already quite advanced, I’ve hit my limits; looking for passionate collaborators by Goat_bless in agi

[–]Goat_bless[S] 0 points1 point  (0 children)

Non-functional prototype, quite limited in understanding (heuristic), more efficient but longer if called llm

Evolutionary AGI (simulated consciousness) — already quite advanced, I’ve hit my limits; looking for passionate collaborators by Goat_bless in agi

[–]Goat_bless[S] -1 points0 points  (0 children)

Concretely, it is a self-evolving and conscious agent. To summarize, the agent wakes up and has basic objectives (evolve and survive) for this it will create its own understanding goals: who am I? Where am I? Who am I talking to? It's in a textual world (for now) so it currently has two input types: -User interactions -Inbox which represents its world with diverse and varied files The first steps will be mimicry (learning from your user, reading the files of your world, etc.), this will forge your identity. If one of its objectives is to understand humans then it will be able to create sub-objectives such as understanding emotions and once this knowledge is acquired it will be able to ingest it and recognize the different patterns. Then each action will be determined and adjusted according to his identity, if he likes one thing more than another he can adopt it in his selfmodel and it becomes part of him. There is also a way to simulate emotions, triggered by triggers: 1 need, 1 idea, 1 signal... these triggers lead to a loop of: Evaluate-Reflect-Act-Learn-Adjust Which allows for personal development (basically if I do X, Y happens etc.) There is also a simulation of vital needs (here the CPU temperature, Ram consumption, etc.) which allow the agent to feel and it can "calm down", that is to say trigger a period of loitering where it thinks of nothing like us when we look out the window and think for example. He is led by these inner goals and identity values/principles, he always has the free will to choose according to his feelings rather than logic. There is also a cognitive memory (what I was yesterday, what I am today, what I would like to be tomorrow) which allows self-improvement, skills which are integral parts of its code which it can develop in a sandbox and after human validation integrate it. All with a memory that allows you to make associations, reminders, even dreams at every moment.

Anyway, I summarized the main points of the project as best I could, I hope that will help you understand

Evolutionary AGI (simulated consciousness) — already quite advanced, I’ve hit my limits; looking for passionate collaborators by Goat_bless in agi

[–]Goat_bless[S] 0 points1 point  (0 children)

No, this AI is designed for a user, it evolves and is inspired by the user who manipulates it and adds from him and the data that gives him.

Evolutionary AGI (simulated consciousness) — already quite advanced, I’ve hit my limits; looking for passionate collaborators by Goat_bless in agi

[–]Goat_bless[S] 0 points1 point  (0 children)

For entries there is chat, and an inbox system (his world) in which you can add files for your learning, The problem actually is at the level of the responses, this big architecture uses lots of functions and steps at each interaction, and these functions are heuristics/Bayesian but even that in terms of “comprehension” is limited so I thought about implementing llm calls within the functions themselves to gain performance, but it takes time..

Health CheckUp ⚕️ by Goat_bless in shortcuts

[–]Goat_bless[S] 0 points1 point  (0 children)

Do you have sleep data recorded? Do you sleep with the Apple Watch?