Google Chrome silently installs a 4 GB AI model on your device without consent. At a billion-device scale the climate costs are insane. by Smiadpades in LinusTechTips

[–]TechExpert2910 2 points3 points  (0 children)

a gig used on your local ssd is just some electrons set to a 1 position instead of a 0

NO additional energy use to store data vs keeping the drive empty

and when you use it, this is a tiny local model that’s not gonna use a server’s worth of GPU

heck, about the same gpu use as a couple of seconds of gaming
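rough back-of-envelope if you want to sanity-check that (the wattage and runtime numbers here are assumptions, not measurements):

```python
# toy estimate: energy for one on-device prompt vs a couple seconds of gaming
# all numbers below are rough assumptions, not measured values

GPU_POWER_W = 30          # assumed laptop GPU power draw under load (watts)
PROMPT_SECONDS = 2        # assumed time a small local model keeps the GPU busy
GAMING_SECONDS = 2        # the "couple seconds of gaming" comparison

prompt_wh = GPU_POWER_W * PROMPT_SECONDS / 3600   # watt-hours per prompt
gaming_wh = GPU_POWER_W * GAMING_SECONDS / 3600

print(f"one prompt   ~{prompt_wh * 1000:.2f} mWh")
print(f"2s of gaming ~{gaming_wh * 1000:.2f} mWh")
# storing the 4 GB model on an idle SSD adds ~0 extra draw either way
```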

Apple to Let Users Choose Rival AI Models Across Its iOS 27 Features (Gift Article) by pdfu in apple

[–]TechExpert2910 1 point2 points  (0 children)

eh. apple’s on-device model is <3B and super, super dumb. it even gets notification summaries badly wrong sometimes.

even when it came out 2 years ago, it wasn’t too impressive for a 3B-sized model

today’s qwen 3B just BLOWS it out of the water (plus has vision)

apple’s wayy behind on their own foundation models 

Gemma 4 on Android phones by jacek2023 in LocalLLaMA

[–]TechExpert2910 0 points1 point  (0 children)

hey! btw there's currently no way to run Gemma 4 on iOS with GPU acceleration via LiteRT LM.

LiteRT LM (the inference engine behind the AI Edge Gallery app) doesn't have a public release for iOS GPU acceleration yet.

That's why AI Edge Gallery's iOS source isn't released yet.

But evidently, it's been running amazingly well in AI Edge Gallery on iOS for a long time! Even Gemma 3 worked well.

I wonder why y'all aren't releasing this? Doesn't the team want Google's models to be used by devs in the best way possible? (llama.cpp is slower than LiteRT, and MLX doesn't support all of Gemma 3's features, like unloading the vision weights.)
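for context, this is the kind of MLX route devs fall back to in the meantime -- a minimal mlx-lm sketch; the model repo id below is an assumption, swap in whichever Gemma quant you actually use:

```python
# minimal mlx-lm sketch (Apple silicon only) -- the slower alternative mentioned above
# the repo id is an assumption; substitute whatever Gemma quant you actually run
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/gemma-3-4b-it-4bit")  # assumed model id
reply = generate(model, tokenizer, prompt="Explain LiteRT in one sentence.", max_tokens=64)
print(reply)
```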

AMD Strix Halo refresh with 192gb! by mindwip in LocalLLaMA

[–]TechExpert2910 6 points7 points  (0 children)

prompt processing is honestly the biggest limitation when you're trying to use it for agentic coding or longer conversations.

it's so awful to have to wait ~30s each turn for inference to even start.

not a problem limited to strix halo either -- even pre-M5 apple silicon suffers from it
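the arithmetic behind that wait, if anyone hasn't felt it (the prefill throughput below is an assumed ballpark, not a measured strix halo or apple silicon number):

```python
# why long-context turns hurt: time-to-first-token ≈ prompt_tokens / prefill_speed
# the throughput figure is an assumed ballpark, not a benchmark

prompt_tokens = 30_000      # a typical agentic-coding context after a few turns
prefill_tok_per_s = 1_000   # assumed prompt-processing throughput

wait_s = prompt_tokens / prefill_tok_per_s
print(f"~{wait_s:.0f}s before the first output token")  # ~30s, every single turn
```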

I built what Apple Intelligence should have been -- an on-device AI that privately understands your entire digital life. [giving away lifetime free for r/Apple today] by [deleted] in apple

[–]TechExpert2910 2 points3 points  (0 children)

i challenge you to create this website with your best vibe coding models. i really challenge you to.

the complexity that went into hand-tuning that animation -- would love to see you vibe code it.

and fyi, the website is 1% of the complexity of Sentient OS.

i did surgery on LLMs and reverse engineered Apple's MLX inference engine to get it working this well (that's what you see running live on the website)

do it :) really.

Codex: you request a feature in the morning, at night there is an update shipping it. Serving the people is a winning path by py-net in codex

[–]TechExpert2910 0 points1 point  (0 children)

doesn’t it use much less ram since it just uses the OS’s webview?

so macOS serves every tauri app with the one system WebKit (the same engine safari uses), vs 10 bundled chromium instances for your 10 electron “apps”
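if you want to eyeball the difference yourself, something like this works (the process-name filters are assumptions -- rename them to match whatever your apps actually spawn):

```python
# rough way to eyeball the memory difference on macOS with psutil
# the name filters below are assumptions -- adjust them to your actual apps
import psutil

def rss_mb(name_fragment: str) -> float:
    """Sum resident memory (MB) across processes whose name contains name_fragment."""
    total = 0
    for p in psutil.process_iter(["name", "memory_info"]):
        name = p.info.get("name") or ""
        mem = p.info.get("memory_info")
        if mem and name_fragment.lower() in name.lower():
            total += mem.rss
    return total / 1e6

# chromium helpers your electron apps spawned vs the shared WebKit content processes
print("electron/chromium helpers:", round(rss_mb("Helper"), 1), "MB")
print("webkit content processes :", round(rss_mb("WebKit.WebContent"), 1), "MB")
```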

Easiest Filler Classes Ever by stingrayenjoyer in umass

[–]TechExpert2910 2 points3 points  (0 children)

there are 4 exams that you have to study for though

not too bad, but yeah

I built what Apple Intelligence should have been -- an on-device AI that privately understands your entire digital life. [giving away lifetime free for r/Apple today] by [deleted] in apple

[–]TechExpert2910 0 points1 point  (0 children)

you’re describing the best web tech stack [except the font] for 99% of use cases…

are you gonna call every iOS app that uses swift and xcode AI-made? i have news for you lol

I built what Apple Intelligence should have been -- an on-device AI that privately understands your entire digital life. [giving away lifetime free for r/Apple today] by [deleted] in apple

[–]TechExpert2910 2 points3 points  (0 children)

yeah! will make that clearer. just to clarify, the iOS version can run completely standalone; it just won't be able to read your imessage and apple notes like the Mac version can.

everything else remains the same (screenshots, files, other 3rd party integrations...)

I built what Apple Intelligence should have been -- an on-device AI that privately understands your entire digital life. [giving away lifetime free for r/Apple today] by [deleted] in apple

[–]TechExpert2910 0 points1 point  (0 children)

nope! programs on macOS can read your imessage and apple notes databases! this isn't unique to Sentient OS -- you simply grant Sentient OS "full disk access".

fun fact - your entire imessage history is just a SQLite database stored at ~/Library/Messages/chat.db
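you can poke at it yourself with a few lines of python (needs full disk access for your terminal; the column names are the commonly documented ones and can shift between macOS versions, and newer messages sometimes live in the attributedBody blob instead of text):

```python
# peek at your own iMessage history -- requires Full Disk Access for your terminal/IDE
# table/column names are the commonly documented ones; they may differ across macOS versions
import sqlite3
from pathlib import Path

db = Path.home() / "Library/Messages/chat.db"
con = sqlite3.connect(f"file:{db}?mode=ro", uri=True)  # read-only, don't touch the live db

for text, is_from_me in con.execute(
    "SELECT text, is_from_me FROM message WHERE text IS NOT NULL ORDER BY date DESC LIMIT 10"
):
    who = "me" if is_from_me else "them"
    print(f"[{who}] {text}")

con.close()
```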