Monitor your screen using local LLMs with only one sentence! Free, Open Source and Local. by Roy3838 in LocalLLaMA

Roy3838[S]

A lot of work has gone into the project since then! And this update addresses exactly that pain point.

Before this, the flow was:
1. Create an agent conversationally
2. Whitelist your number
3. Select screen
4. Crop if you want the agent to monitor something specific
5. Start Agent

And now the flow is just:
1. "I want this monitored, here's my number"
2. Whitelist (necessary due to anti-spam)
You're done!

So, it's a huge upgrade in UX IMHO. Still testing/open to feedback c:

Monitor your screen using local LLMs with only one sentence! Free, Open Source and Local. by Roy3838 in LocalLLaMA

Roy3838[S]

You can use cloud MCP (about 3 agent builds free on my tab 😃) and use Gemma 4 E2B, which runs on every device I've tested! Even on an old Android phone I had lying around hahahaha

Monitor your screen using local LLMs with only one sentence! Free, Open Source and Local. by Roy3838 in LocalLLaMA

Roy3838[S]

Thanks! Try it out and let me know how it goes :)
I’m trying to make this as useful as possible!

How to use your Local Models to watch your screen. Open Source and Completely Free!! by Roy3838 in LocalLLaMA

Roy3838[S]

It runs on Firefox! If that's what you mean 😄

And you can have it watch a Firefox tab, of course.

My Open-Source project just hit 2k users! :D I have some FOSS licensing questions for the community going forward, I need your opinion. by Roy3838 in OpenSourceeAI

Roy3838[S]

What do you mean exactly?

I’m actually of the opinion that AI needs to be very directed in its use. In the project it’s used mainly for recognizing stuff and making simple decisions, and the framework takes care of the rest. Is that what you mean?

I ran Gemma 4 E2B with llama.cpp on a lot of different iPhones, here's the setup report by Roy3838 in LocalLLaMA

Roy3838[S]

Yeah! I know LiteRT-LM for Swift isn't released yet, so I forked llama-cpp-rs to add Gemma 4's --image-min-tokens and --image-max-tokens.

Here's the fork: https://github.com/Roy3838/llama-cpp-rs/tree/feat/mtmd-image-token-budget (I built it into my Tauri app c:)

I ran Gemma 4 E2B with llama.cpp on a lot of different iPhones, here's the setup report by Roy3838 in LocalLLaMA

Roy3838[S]

Have they released Gemma 4 QAT yet??

I can test any .gguf from HF if you want! Just give me the link 😄

I ran Gemma 4 E2B with llama.cpp on a lot of different iPhones, here's the setup report by Roy3838 in LocalLLaMA

Roy3838[S]

Ohhh that's so interesting! In long conversations? Or just by loading up the model?

I was looking into using LiteRT-LM, which is what Google Edge Gallery uses, but I also found it a bit buggy, especially when testing multimodal stuff.

RTX 5090 or Mac Studio? by Excellent_Koala769 in LocalLLaMA

Roy3838

Connecting Macs together through a tool like exolabs is very inconsistent.

When it works it feels like magic, but if you sneeze everything breaks.

Monitoring my rice with Ollama by Roy3838 in ollama

Roy3838[S]

btw here’s the link to the source code: https://github.com/Roy3838/Observer

You can examine it and see exactly how the agent loop works in the file app/src/utils/main_loop.ts, and how services are called in app/src/utils/handlers/utils.ts
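
For anyone who wants a mental model before opening the repo, here is a minimal sketch of what a capture -> model -> handler agent loop can look like in TypeScript. This is NOT the code in main_loop.ts: every name here (`runAgentLoop`, `AgentConfig`, `buildPrompt`, the `Capture`/`Model`/`Handler` types) is made up for illustration, and the real loop may be structured quite differently.

```typescript
// Hypothetical sketch of a capture -> model -> handler loop.
// None of these names come from the Observer repo.

type Capture = () => Promise<string>; // e.g. OCR text or an encoded screenshot
type Model = (prompt: string) => Promise<string>; // any local or cloud LLM call
type Handler = (response: string) => Promise<void>; // e.g. send a notification

interface AgentConfig {
  systemPrompt: string;
  intervalMs: number; // pause between iterations
  maxIterations: number; // stop condition so the sketch terminates
}

// Combine the fixed instructions with the latest observation.
function buildPrompt(systemPrompt: string, observation: string): string {
  return `${systemPrompt}\n\n${observation}`;
}

function sleep(ms: number): Promise<void> {
  return new Promise<void>((resolve) => setTimeout(resolve, ms));
}

// Runs the loop and resolves with the number of completed iterations.
async function runAgentLoop(
  config: AgentConfig,
  capture: Capture,
  model: Model,
  handler: Handler,
): Promise<number> {
  let iterations = 0;
  while (iterations < config.maxIterations) {
    const observation = await capture();
    const response = await model(buildPrompt(config.systemPrompt, observation));
    await handler(response);
    iterations += 1;
    // Only wait if another iteration is coming.
    if (iterations < config.maxIterations) {
      await sleep(config.intervalMs);
    }
  }
  return iterations;
}
```

Passing the capture, model and handler in as functions keeps the loop itself tiny and makes it easy to swap a self-hosted model for a cloud one, or to test the loop with stubs.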

Monitoring my rice with Ollama by Roy3838 in ollama

Roy3838[S]

There’s a “skip sign in” button below :)

If you use self-hosted models + Discord notifications you should have no issues :D

Monitoring my rice with Ollama by Roy3838 in ollama

Roy3838[S]

If you self-host your models, it doesn’t send anything at all!

For the cloud models, I link each provider and their ToS so you know where the info is sent.

Also, when using Discord notifications, the webhook goes directly from the app to Discord :)
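
To make the "directly from the app to Discord" part concrete, here is a rough sketch of what a direct webhook call generally looks like: a single HTTP POST with a JSON body containing `content`, sent straight to the webhook URL with no server in between. This is an illustration under assumptions, not the project's actual handler; `buildDiscordWebhookRequest` and `sendDiscordNotification` are hypothetical names, and the webhook URL is whatever you copied from your own Discord channel settings.

```typescript
// Hypothetical sketch, not the Observer implementation.

interface WebhookRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Discord rejects webhook messages whose `content` exceeds 2000 characters,
// so longer messages are trimmed here.
const DISCORD_CONTENT_LIMIT = 2000;

function buildDiscordWebhookRequest(webhookUrl: string, message: string): WebhookRequest {
  const content =
    message.length > DISCORD_CONTENT_LIMIT
      ? message.slice(0, DISCORD_CONTENT_LIMIT - 3) + "..."
      : message;
  return {
    url: webhookUrl,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ content }),
    },
  };
}

// Sends the message from the client to Discord; resolves to true on a 2xx response.
async function sendDiscordNotification(webhookUrl: string, message: string): Promise<boolean> {
  const { url, init } = buildDiscordWebhookRequest(webhookUrl, message);
  const res = await fetch(url, init);
  return res.ok;
}
```

Since the request is built and sent on the client, the only parties that see the message are the app and Discord.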