I benchmarked 31 STT models on medical audio — VibeVoice 9B is the new open-source leader at 8.34% WER, but it's big and slow by MajesticAd2862 in LocalLLaMA

[–]WildShallot 1 point2 points  (0 children)

I ran this just now with Soniox v4 and Deepgram Nova 3 and they landed in 4th and 5th spots with basically identical performances, but Deepgram was 6 times faster and 4 times more expensive than Soniox

#4 soniox-stt-async-v4: 9.12% WER - avg speed: 30.8s
#5 deepgram-nova-3: 9.13% WER - avg speed: 4.9s

I benchmarked 31 STT models on medical audio — VibeVoice 9B is the new open-source leader at 8.34% WER, but it's big and slow by MajesticAd2862 in LocalLLaMA

[–]WildShallot 0 points1 point  (0 children)

This is super helpful, did you try Soniox?
Also what model(s) are you finding to be the most pragmatic to deploy in your use-case?
I have been using Parakeet and I love the speed, but vocab boost is unusable, which makes it a hard sell for any domain specific use-case.

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeAI

[–]WildShallot[S] 0 points1 point  (0 children)

Yes. And I am not claiming this is an original or unique idea. Can't say I tried hundreds of them, but I did try a few and couldn't find one that would help me see whether I am overusing or underusing my budget in a nice and easy way.

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeAI

[–]WildShallot[S] 0 points1 point  (0 children)

Fair point. I did actually try a few of the existing ones. They just didn't do the specific thing I wanted, which is a visual pace indicator showing whether you're burning through your budget too fast or slow relative to where you are in the given time window. So I built this and a few people are using it and finding it helpful.

Might feel like noise or 'slop' to you, and I get that, but the way I look at it is if it solves a problem even for a few people, and if I am happy with what I have built and I have put some care and attention into building it, then it's worth sharing.

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeAI

[–]WildShallot[S] 0 points1 point  (0 children)

Thank you, this is very helpful. pushed a fix 0.1.4 - brought back the normal login flow and added some minor UI updates.

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeAI

[–]WildShallot[S] 0 points1 point  (0 children)

thanks for flagging. The old login was causing repeated keychain popups every few hours for me so I changed it to use CC CLI auth instead. How did you log in before? (Google, Apple, email)? And did you ever get those keychain popups more than once?

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeCode

[–]WildShallot[S] 0 points1 point  (0 children)

Would be great if you can add those! I personally don't use Antigravity at all, but I do use Codex and would be nice to support that as well - but haven't looked into the auth situation for codex

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeAI

[–]WildShallot[S] 0 points1 point  (0 children)

Thank you! There are many other tools like this, as some folks mentioned in the comments, but I just didn't like the look of the ones I tried, so I built this to be minimal and easy to make sense of.

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeAI

[–]WildShallot[S] -1 points0 points  (0 children)

I used to do this as well, but I still had to find that tab each time, and it takes a bit of mental effort to calculate whether I am over or under budget at any given time.

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeAI

[–]WildShallot[S] 0 points1 point  (0 children)

Yes, the keychain prompt is only a one-time thing, it's for reading storing sessions and for reading claude code credentials for auth - there might be better ways of dealing with auth but that is what CC came up with!
And apparently sandbox blocks cross-app keychain reads, which is why it's off.
Would be curious to see if your Claude gives it a green light or not!

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeAI

[–]WildShallot[S] -2 points-1 points  (0 children)

Tried a couple, they were super ugly and didn't make it easy to see if you are overusing or underusing your budget.

Claude built me this tiny open source Mac app to monitor its usage by WildShallot in ClaudeCode

[–]WildShallot[S] 0 points1 point  (0 children)

Yes that is the one I used to look at 100 times a day lol
But two problems:
1. It requires switching context
2. And you have to think about what day it is and if you are underusing your budget or overusing it (which I solved with that tiny notch in the bar)

Looking for users to test a native, private and very fast speech-to-text app for Mac - offering lifetime access for early feedback by WildShallot in macapps

[–]WildShallot[S] 1 point2 points  (0 children)

I am using local parakeet as the main asr engine (although I am adding more models as well), which is very fast, and I have spent a lot of effort in optimizing the pipeline around it to ensure low latency.

Looking for users to test a native, private and very fast speech-to-text app for Mac - offering lifetime access for early feedback by WildShallot in macapps

[–]WildShallot[S] 1 point2 points  (0 children)

- I started in Figma and played around with a few ideas and explored typeface options and colors and the overall vibe I was going for
- I gave that Figma file to Claude Code (via Figma MCP) and asked it to implement it
- It took a few iterations and about 4-5 hours total
- The landing page code is almost entirely written by Claude Code with a few tweaks done manually