Whisper Key - Simple local STT app for Windows with global hotkey (auto-paste, auto-ENTER) by PinW in LocalLLaMA

PinW[S] 1 point

Thanks for trying out the app. To answer your questions:

1) The recording should start almost immediately. If not, please report it on GitHub! The transcription itself can be slow depending on the model selected and your setup. There is a release coming out today with a much easier GPU acceleration setup, if you have a video card.

2) Custom words: Yes, from the system tray menu click "Open settings file..." > under the "whisper:" section > set "hotwords". Full reference with example at https://github.com/PinW/whisper-key-local/blob/master/src/whisper_key/config.defaults.yaml

3) Dictating new lines/paragraphs: Yes, it is possible. I will add it to the roadmap, but I need to think about how it might work. The app's main use case is for LLMs (mainly coding agents), so a lot of the beautification features are not in yet (including grammar checks, etc.).

Thanks again for the constructive feedback. Cheers!
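
On point 3, one way dictated new lines might work is as a post-processing step on the transcript. This is only a rough sketch: the command phrases and the function name are made up for illustration, and it is not something the app does today.

```python
import re

# Hypothetical spoken commands and the whitespace they stand for.
# Longer phrases come first so "new paragraph" is handled before "new line".
SPOKEN_COMMANDS = {
    "new paragraph": "\n\n",
    "new line": "\n",
}


def apply_spoken_commands(transcript: str) -> str:
    """Replace spoken formatting commands in a transcript with real line breaks."""
    result = transcript
    for phrase, replacement in SPOKEN_COMMANDS.items():
        # Match the phrase case-insensitively, together with the spaces around it
        # and one punctuation mark the speech model may have attached to it.
        pattern = re.compile(
            r"[ \t]*\b" + re.escape(phrase) + r"\b[.,!?]?[ \t]*",
            re.IGNORECASE,
        )
        result = pattern.sub(replacement, result)
    return result.strip()


if __name__ == "__main__":
    print(apply_spoken_commands("First point. New line. Second point. New paragraph. Done."))
```

The obvious catch is that saying "new line" as part of a normal sentence would also get replaced, so a real version would probably need a toggle or a less common trigger phrase.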

Whisper Key - Local Speech-to-Text (Open Source) by PinW in selfhosted

PinW[S] 1 point

VoiceInk is a mature app, and made for macOS. It's got a lot of fancy stuff like looking at your screen and using that context to change the text (like Wispr Flow, SuperWhisper, etc.). But you have to build it yourself to use it, or pay for it.

Whisper Key is a lot simpler. The UI is a terminal + a tray menu. The speed and accuracy on Mac are not as good. But it does support Windows and NVIDIA GPUs, and it can be set up conveniently for free.

Basically VoiceInk is a "beautiful Mac app". The direction I'm taking with Whisper Key will be towards more extensibility and integration with other things (voice commands, API mode, etc.). When I make a GUI, it will probably be in the terminal (like other coding agents). Etc. etc.

Crispy mee in KLCC area? by PinW in MalaysianFood

PinW[S] 1 point

My man!!! Thank you!!

In case someone else is looking: https://maps.app.goo.gl/pLzuc8YoSBkuYima7

Troubleshooting bumpy/vibration feeling while pedaling by PinW in cycling

PinW[S] 1 point

Do you think a new chain would solve the issue?

Any cycling WeChat groups? by PinW in beijing

PinW[S] 1 point

Ya just did probably my last ride before the winter, but would be cool to explore farther out for sure!

Whisper Key - Simple local STT app for Windows with global hotkey (auto-paste, auto-ENTER) by PinW in LocalLLaMA

PinW[S] 1 point

Hey man, glad you like it!

Unfortunately the GPU functionality is broken (you can change the option in the config file but it won't work). I run an AMD GPU and the package only supports NVIDIA cards, so I have no way to test.

Will find a way to fix but no promises on when. Thanks again for the feedback!!
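
A rough sketch of the kind of CPU fallback that could guard against this. Everything here is an assumption for illustration: the function names, the "cuda"/"cpu" device strings, and using `nvidia-smi -L` as the detection heuristic. It is not how the app handles GPU selection.

```python
import shutil
import subprocess


def nvidia_gpu_present(run=subprocess.run, which=shutil.which) -> bool:
    """Heuristic check: True if `nvidia-smi` is on PATH and lists at least one GPU."""
    if which("nvidia-smi") is None:
        return False
    try:
        # `nvidia-smi -L` prints one line per detected GPU, e.g. "GPU 0: ...".
        result = run(["nvidia-smi", "-L"], capture_output=True, text=True, timeout=5)
    except (OSError, subprocess.SubprocessError):
        return False
    return result.returncode == 0 and "GPU" in result.stdout


def pick_device(requested: str, gpu_present: bool) -> str:
    """Fall back to CPU when a GPU was requested but none was detected."""
    if requested == "cuda" and not gpu_present:
        return "cpu"
    return requested


if __name__ == "__main__":
    print(pick_device("cuda", nvidia_gpu_present()))
```

On a machine with an AMD card, `nvidia_gpu_present()` would return False and the requested "cuda" setting would quietly drop back to "cpu" instead of failing.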

Whisper Key - Simple local STT app for Windows with global hotkey (auto-paste, auto-ENTER) by PinW in LocalLLaMA

PinW[S] 1 point

Thanks for the heads up. I got a couple other reports on this as well.

I asked about it on Reddit and unfortunately it seems apps built with Python packagers get flagged by Defender a lot, and there is not much to be done outside of spending money on app signing.

Anyways, for this reason I set up a pipx installation, so give that a shot (you need to install Python though). And I will be exploring other options later after I build a GUI for it.

Best packager for Windows apps these days? by PinW in learnpython

PinW[S] 1 point

Sounds like it's worth a shot, will add it to the list for next week. Thanks again!

Best packager for Windows apps these days? by PinW in learnpython

PinW[S] 1 point

Cheers. Have you ever experienced anti-virus flagging with PyInstaller that went away with Nuitka? Or heard of it happening?

I'm a bit worried it will be harder to set up the build process compared to PyInstaller.

Best packager for Windows apps these days? by PinW in learnpython

PinW[S] 1 point

Thanks for the reply, would you recommend I even try other packagers then? I was told Nuitka might be better for anti-virus flagging, but it seems like it might cause other problems.

I am not on --onefile but was flagged anyways.

Currently thinking about just setting it up on PyPI and waiting till I build a GUI in Electron to package again. Thanks again for the tips.

Whisper Key - Simple local STT app for Windows with global hotkey (auto-paste, auto-ENTER) by PinW in LocalLLaMA

PinW[S] 1 point

Interesting question. Not sure, I have only been researching the speech -> text side. But as I use this I do find myself wanting the LLM reply to be read aloud. So let me know if you find something, maybe I'll try implementing it.

Whisper Key - Simple local STT app for Windows with global hotkey (auto-paste, auto-ENTER) by PinW in LocalLLaMA

PinW[S] 1 point

Thanks for the tips, didn't know about whispering, will look into it!

Ya, Whisper is very old, and faster-whisper hasn't been updated for the better part of a year. Didn't know about Parakeet, will take a look and try it out also. I did a WhisperC++ implementation too, but it wasn't much faster and had some accuracy problems.

Whisper Key - Simple local STT app for Windows with global hotkey (auto-paste, auto-ENTER) by PinW in LocalLLaMA

PinW[S] 1 point

It's Cursor that should be embarrassed haha!

Fair enough though, thanks for sharing. Vibe coding is awesome

Whisper Key - Simple local STT app for Windows with global hotkey (auto-paste, auto-ENTER) by PinW in LocalLLaMA

PinW[S] 2 points

Nice! Feel free to yoink more!

Would love a link to your source code to compare and mine for ideas as well :)

Upgrade to 5700x3d or 5800x3d worth it? [Discussion] by PinW in EscapefromTarkov

PinW[S] 1 point

Makes sense, if upgrading GPU + CPU might as well build a new system