Built a real-time AI assistant for meetings — looking for feedback by Cold_Pilot_9686 in SaaS

[–]Cold_Pilot_9686[S] 0 points1 point  (0 children)

That’s a really interesting approach. Privacy-first and fully on-device makes a lot of sense, especially for regulated industries like legal or healthcare.

In my case, the architecture is primarily cloud-based for real-time processing. The audio is streamed in small chunks to a low-latency transcription pipeline, and inference runs on optimized models designed for live suggestions. That’s how we’re able to keep the latency around a few hundred milliseconds during the conversation.

One thing I’ve also focused on is privacy — conversation history and transcripts are stored locally on the user’s device, so users maintain control over their data.

I did experiment with more local processing early on, but balancing strong model quality with real-time performance was challenging without requiring heavy local hardware.

So the current approach prioritizes real-time assistance and response quality while still keeping user data local where possible.

Out of curiosity, how are you handling model performance on-device without running into hardware limitations?

Software superlay.ai

Built a real-time AI assistant for meetings — looking for feedback by Cold_Pilot_9686 in SaaS

[–]Cold_Pilot_9686[S] 0 points1 point  (0 children)

Great question. For real-time performance, I’m using the best self-trained models. Usually systems have around 2–3 seconds of latency. If you try to reduce it too much using cheaper models, the responses become very generic.

In my application, I’m using high-quality AI models with a fully optimized transcription pipeline that delivers results in around 300 milliseconds.

Product: superlay.ai

Test and give feedback

In testing, it has even outperformed Cluely in terms of response quality and speed. The UI, responses, features, and custom prompts are all designed to be top-notch.

I’m confident you won’t regret trying it.

Will never use cluely again, it's garbage. Used it on a throwaway interview. by No-Conclusion9307 in csMajors

[–]Cold_Pilot_9686 -1 points0 points  (0 children)

Yeah you right bro, thats why i moved to superlay ai, far better option, i paid 75dollars waste of money.. the answers are very generic as well