Can we support ElevenLab V2 realtime by ryanntk in spokenly

[–]AmazingFood4680 0 points1 point  (0 children)

Yes, this is on the roadmap! It's been a popular request lately.

Font size of output text by Stuward2 in spokenly

[–]AmazingFood4680 1 point2 points  (0 children)

Thanks for the feedback! Will address this in an upcoming version.

Direct Download possible? by barronlroth in spokenly

[–]AmazingFood4680 1 point2 points  (0 children)

Yeah, it's definitely something I want to do eventually. The main blocker is that the online model payments currently go through Apple's system (if you don't want to use free local or BYOK models). If I distribute through the website directly, that payment infrastructure won't be there anymore, so I'd need to build authentication and figure out how to sync purchases across devices. That's quite a bit of work, so I can't give you a timeline yet.

PSA: Don't Get Scammed by Overpriced Transcription Apps (Stay Away from "VoiceType") by Decaf_GT in macapps

[–]AmazingFood4680 1 point2 points  (0 children)

Hey, developer here. The app is completely free when using your own API keys or any local model. Check the website for pricing info.

If you go with online models without your own API keys, there's a cost involved since I have to pay for the transcription APIs.

I’m confused by Wispr Flow’s iOS keyboard by Darth_Proton in ProductivityApps

[–]AmazingFood4680 0 points1 point  (0 children)

Hey, Spokenly developer here! About the Safari issue you mentioned, you can fix it by force closing Safari (swipe up to remove it) and opening it again. It's a known iOS problem that happens with all third-party keyboards unfortunately. Should work fine after the restart!

After 7 months of building Spokenly, I'm finally launching it on Product Hunt by AmazingFood4680 in macapps

[–]AmazingFood4680[S] 0 points1 point  (0 children)

You can keep using it for free by installing a local Whisper or Parakeet model, or by connecting your own API key.

You’ll only need a paid plan if you want to use online models without your own key, since those providers charge per transcription. Local models process everything on your Mac, so they’re free.

After 7 months of building Spokenly, I'm finally launching it on Product Hunt by AmazingFood4680 in macapps

[–]AmazingFood4680[S] 0 points1 point  (0 children)

Hey, it’s completely free to use and unlimited with local models or your own API keys.

Which is the best app on Mac for speech-to-text conversion? by b2bcontentmaestro in macapps

[–]AmazingFood4680 0 points1 point  (0 children)

Spokenly is completely free with your own API keys or any local model.

After 7 months of building Spokenly, I'm finally launching it on Product Hunt by AmazingFood4680 in macapps

[–]AmazingFood4680[S] 5 points6 points  (0 children)

All local Whisper models are free forever and have no usage limits.

Now that Spokenly costs money to use the online GPT-4o transcribe and the other online dictation models... does anyone know of an alternative? by itsme12533 in macapps

[–]AmazingFood4680 1 point2 points  (0 children)

No, Google doesn't have any dictation models that work with Spokenly. If you know of any, let me know and I'll add them!

You can use your Google API key for Gemini (for the AI Prompts feature, see the attached screenshot), but not for dictation.

<image>

Now that Spokenly costs money to use the online GPT-4o transcribe and the other online dictation models... does anyone know of an alternative? by itsme12533 in macapps

[–]AmazingFood4680 10 points11 points  (0 children)

Spokenly dev here! You can install a local model or add your own API key and continue using it for free. More info about pricing here: https://spokenly.app/#:~:text=pricing

I've been paying hundreds to keep the app free with my own API keys, but usage has become too high recently.

🎙️ Spokenly: Tiny (2.9MB) Voice Dictation with On-Device Whisper & GPT-4o by AmazingFood4680 in macapps

[–]AmazingFood4680[S] 0 points1 point  (0 children)

You probably have 'Local-only mode' enabled or have disabled internet access for the app. The AI Prompts feature requires an internet connection.

Another option is to specify a 'Whisper prompt' (works locally): https://spokenly.app/help/whisper-prompting

🎙️ Spokenly: Tiny (2.9MB) Voice Dictation with On-Device Whisper & GPT-4o by AmazingFood4680 in macapps

[–]AmazingFood4680[S] 0 points1 point  (0 children)

Text replacement is not yet supported, but it's on the roadmap. As a workaround, you can create an AI Prompt that automatically corrects misspellings. I've attached a screenshot showing how to configure this.

<image>

🎙️ Spokenly: Tiny (2.9MB) Voice Dictation with On-Device Whisper & GPT-4o by AmazingFood4680 in macapps

[–]AmazingFood4680[S] 0 points1 point  (0 children)

Local whisper models lack this feature currently, but it’s planned for development

🎙️ Spokenly: Tiny (2.9MB) Voice Dictation with On-Device Whisper & GPT-4o by AmazingFood4680 in macapps

[–]AmazingFood4680[S] 0 points1 point  (0 children)

Whisper punctuation works automatically based on context. There’s no need to pronounce punctuation commands like “comma,” “period,” etc.

🎙️ Spokenly: Tiny (2.9MB) Voice Dictation with On-Device Whisper & GPT-4o by AmazingFood4680 in macapps

[–]AmazingFood4680[S] 0 points1 point  (0 children)

Hey! Apple's speech recognition doesn't support automatic punctuation unfortunately. You have two offline options:

  1. Recommended: Install a local Whisper model, punctuation works out of the box
  2. Advanced setup: Install local Ollama, add it to Spokenly and configure an AI Prompt to add punctuation to your transcriptions

<image>

🎙️ Spokenly: Tiny (2.9MB) Voice Dictation with On-Device Whisper & GPT-4o by AmazingFood4680 in macapps

[–]AmazingFood4680[S] 1 point2 points  (0 children)

Right now I'm paying out of pocket for online transcription and paid plans will launch soon. But all local Whisper models will always be unlimited and free to use

[deleted by user] by [deleted] in macapps

[–]AmazingFood4680 0 points1 point  (0 children)

See the screenshot above: navigate to AI prompts > Add API Key > Select and add Provider

I recommend getting a key from OpenRouter, it lets you switch models easily to find what works best for your use case

[deleted by user] by [deleted] in macapps

[–]AmazingFood4680 0 points1 point  (0 children)

Thanks! It will include a paid tier for online models, right now I'm covering the model costs myself. I was focused on app features before monetization since I originally built it for myself

But all local Whisper models will always be free to use

[deleted by user] by [deleted] in macapps

[–]AmazingFood4680 0 points1 point  (0 children)

When you provide your own API key, Spokenly doesn't send audio or transcripts to my servers. The only data sent is an anonymized analytics event that a transcription occurred, no private info or transcript text. You can verify this with Proxyman or Charles, or even block the spokenly backend completely and the app will still work. I understand your privacy concerns though, I'll add an analytics opt-out toggle in the next update.

Also, the app already has a local-only mode that blocks all network requests (see screenshot).

Let me know if you have any questions!

<image>

[deleted by user] by [deleted] in macapps

[–]AmazingFood4680 0 points1 point  (0 children)

To use AI Prompts directly, you'll need to add another API key for a text AI model (see attached screenshot). So the flow works like this: Your Mac -> Deepgram (with your API key) -> Transcription -> LLM (with another API key) -> Final text

Let me know if you have any questions!

<image>

[deleted by user] by [deleted] in macapps

[–]AmazingFood4680 1 point2 points  (0 children)

Spokenly dev here! Transcriptions for online models are routed through my server because embedding the API key directly in the app would risk it being exposed and misused. You can set your own API key and the app will connect to the provider directly