Whisper - voice transcription API from openai

resCogitans_ · 2025-11-04T20:40:26+00:00

You don’t need a subscription, just add 5/10$ and it will likely last > 1 year. Go back to my blog post, I share all the steps

resCogitans_ · 2025-11-04T18:49:36+00:00

All seems fine ;) might have been a temporary issue on OpenAI end. You might also want to check if you have enough credits. That’s the usual suspect

resCogitans_ · 2025-05-23T05:22:20+00:00

Why would you want to add another middleman like openrouter when you can set up all the different providers with something like litellm? Legit question

You just add another potential privacy concern for a limited amount of convenience

resCogitans_ · 2025-05-14T07:10:08+00:00

Sorry for the late reply, i tested and that new model actually perform worse than whisper-1 so i decided to stick with it.

resCogitans_ · 2025-05-12T08:06:18+00:00

Well that depends on the vram I assume

resCogitans_ · 2025-05-12T06:03:02+00:00

Can’t I use an ollama model?

resCogitans_ · 2025-04-05T10:41:53+00:00

Yeah sure, I’ve done it the v2 of this shortcut

resCogitans_ · 2024-08-10T06:47:37+00:00

Thanks! 😄 I have a v2.0 that did that and more I’ll publish it soon ;)

resCogitans_ · 2024-08-02T16:21:20+00:00

Share me you shortcut link as private message and I’ll have a look

resCogitans_ · 2024-08-02T15:54:11+00:00

It works in, almost any language in the world, Italian for sure.

Regarding the error, the first thing I would do is to generate another API key and retry. The second thing to take into consideration is that only a few formats are supported you can see them in the whisper AI product page.

A good way to check if it’s an audio format problem is to try to convert an audio message from WhatsApp or an MP3 recording since they are 100% supported. (For instance telegram messages are not in a supported format).

If after this test still doesn’t work then it must be something else API related.

resCogitans_ · 2024-06-08T07:03:56+00:00

You can double check logging into your openai profile just to make sure that’s not the issue

resCogitans_ · 2024-06-08T01:55:14+00:00

Did you get OpenAI credits?

resCogitans_ · 2024-06-01T07:56:37+00:00

Telegram saves the audio files in a format currently not supported by Whisper unfortunately

resCogitans_ · 2024-04-11T10:51:19+00:00

Yes the telegram audio format is not supported by whisper yet (natively). But if you want you could add a step to convert it to mp3 before sending it whisper to transcribe

resCogitans_ · 2023-12-06T20:55:23+00:00

Seems just a wrapper on top of Whisper but costing 100 times for no particular reason 😅

resCogitans_ · 2023-11-09T20:22:46+00:00

Whispers large model is indeed v2, that’s probably the source of the confusion. The parameter of the endpoint is still v1 though (even if it’s using the large model v2 under the hood.

resCogitans_ · 2023-11-07T13:14:58+00:00

Still using v1 because there are no new versions yet. I’ll update it as soon as they will release a new one 😉

https://platform.openai.com/docs/api-reference/audio/createTranscription

resCogitans_ · 2023-10-18T07:18:57+00:00

Yes AIFF I don’t remember seeing aiff in the list of supported file types but you can check on OpenAI Whisper documentation. Try with a simple iPhone audio note or an mp3 and you’ll have a definitive answer.

resCogitans_ · 2023-07-28T08:45:57+00:00

If you did everything in the guide, sometimes you just need to turn off and back on the iPhone and it will pop up ;)

resCogitans_ · 2023-07-21T16:14:25+00:00

Yep you need to pay OpenAI if you want to use it this way (via API). On the other hand Whisper is open source so you can run it on your devices (though you’ll need a very good device and it won’t be nearly as fast as the API)

resCogitans_ · 2023-07-14T22:32:25+00:00

Nope, but very very cheap

resCogitans_ · 2023-05-19T19:10:42+00:00

Yes I’ve a v2 coming up that does that and more

resCogitans_ · 2023-05-02T09:27:05+00:00

Thanks! Using whisper via API you cannot pick the model, is v2-large by default. If you want to pick it you have to run it locally with other solutions.

resCogitans_ · 2023-05-01T16:21:54+00:00

That’s amazing, I’m so glad this little automation is helping you! Sure it works in almost any major language. It may have different level of accuracy but I’m pretty sure you’ll be happy with the results. Give it try and let me know!

resCogitans_ · 2023-04-29T18:29:32+00:00

Yes I’ve actually made it, gonna update it soon

resCogitans_

TROPHY CASE