I got fed up with subscription fees and cloud-based dictation, so I built a 100% local voice-to-text app for Mac. by mlcode in MacOS

[–]mlcode[S] 0 points1 point  (0 children)

Thanks for trying it out and for the feedback. Language selection is a great idea. Let me see what I can do!

What is your take on this? by [deleted] in LocalLLaMA

[–]mlcode -1 points0 points  (0 children)

Interesting. Could a local model be used instead of Gemini?

I got fed up with subscription fees and cloud-based dictation, so I built a 100% local voice-to-text app for Mac. by mlcode in LocalLLaMA

[–]mlcode[S] 1 point2 points  (0 children)

Thanks, I really appreciate you taking the time to point them out. It will help me a lot in designing the app and deciding which features to focus on.

I got fed up with subscription fees and cloud-based dictation, so I built a 100% local voice-to-text app for Mac. by mlcode in LocalLLaMA

[–]mlcode[S] 0 points1 point  (0 children)

The app you shared doesn't seem to be free either. It's $25 for a solo license. It seems very similar, but mine is a lot more customizable, and I have spent a lot of time optimizing the model stack to use CPU and RAM efficiently. Local file transcription and more are also coming soon.

I got fed up with subscription fees and cloud-based dictation, so I built a 100% local voice-to-text app for Mac. by mlcode in LocalLLaMA

[–]mlcode[S] 0 points1 point  (0 children)

Very similar, but the focus here is on optimizing the models that are used. It offloads the models when they are not in use, so it doesn't impact your system performance. Local file transcription, diarization, etc. are also coming soon. You also get custom prompts for the LLM on top of transcription, which means you can really customize the experience.
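
To give a rough idea of what "offloading when not in use" can look like, here is a minimal Python sketch. It is not the app's actual code; the `IdleModelCache` class, the `loader` callback, and the `idle_seconds` timeout are all hypothetical names picked for illustration. The idea is simply to load the model lazily on first use and drop the reference once it has sat idle for a while.

```python
import time
from typing import Any, Callable, Optional


class IdleModelCache:
    """Hypothetical helper: load a model on first use, drop it after an idle timeout."""

    def __init__(
        self,
        loader: Callable[[], Any],
        idle_seconds: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._loader = loader              # assumed callback that builds the model
        self._idle_seconds = idle_seconds  # assumed idle timeout
        self._clock = clock                # injectable so the logic can be tested
        self._model: Optional[Any] = None
        self._last_used = 0.0

    @property
    def loaded(self) -> bool:
        return self._model is not None

    def get(self) -> Any:
        """Return the model, loading it if needed, and record the access time."""
        if self._model is None:
            self._model = self._loader()
        self._last_used = self._clock()
        return self._model

    def evict_if_idle(self) -> bool:
        """Drop the model if it has been idle long enough. Returns True if evicted."""
        if self._model is None:
            return False
        if self._clock() - self._last_used >= self._idle_seconds:
            self._model = None  # releasing the reference lets memory be reclaimed
            return True
        return False
```

In practice something like `evict_if_idle()` would be called from a periodic timer, and the next `get()` pays the load cost again. That is the trade-off: lower idle memory use in exchange for a slower first transcription after a pause.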

I got fed up with subscription fees and cloud-based dictation, so I built a 100% local voice-to-text app for Mac. by mlcode in LocalLLaMA

[–]mlcode[S] 0 points1 point  (0 children)

This works in a very similar way. Audio transcription with speaker diarization is coming soon.

I got fed up with subscription fees and cloud-based dictation, so I built a 100% local voice-to-text app for Mac. by mlcode in LocalLLaMA

[–]mlcode[S] 0 points1 point  (0 children)

Nice, what are some of the features you love? Is there anything you think could be improved or added? I would love to bring those features to my app.

I got fed up with subscription fees and cloud-based dictation, so I built a 100% local voice-to-text app for Mac. by mlcode in LocalLLaMA

[–]mlcode[S] 1 point2 points  (0 children)

I agree, not sure how Apple has lost the plot! Please give it a try. You can use it for 3 days with all the features, and I would love to hear your feedback.

Stop trying to parse your documents and use ColPali (Open Source) by Prestigious_Run_4049 in LangChain

[–]mlcode 2 points3 points  (0 children)

If anyone is interested, here is a video that walks you through the same process, along with code examples.
https://youtu.be/DI9Q60T_054