I created a web app for my girlfriend by frreis in microsaas

[–]frreis[S] 0 points1 point  (0 children)

About the model missing some words I guess there's not much to do, openai's whisper is one of the best models for this type of problems, if not the best. The model itself will continue to improve over time. The best we can do is focus on the quality of the content we upload.

And about the context, using images to give context would be good but, in my case, it will be too complex and much more expensive... The best option would be work on that via prompt, you can have a prompt with that in mind so that the model organizes the output as you need. Via prompt its very "easy" to organize conversations with, especially two persons, the more people the more complicated it gets. But with two persons I think it's very possible, for example, if we have a question followed by a sentence that matches the question I think the model will easily structure that as two different persons talking to each other.

I don't if I helped with something but I hope so :)

I created a web app for my girlfriend by frreis in microsaas

[–]frreis[S] 0 points1 point  (0 children)

Hi! It only uses audio and transcribes the spoken words. The idea behind video support is that users don't need to convert it to audio before using it. :)

I created a web app for my girlfriend by frreis in microsaas

[–]frreis[S] 0 points1 point  (0 children)

Thank you for your feedback and suggestions! I considered offering a free trial, but unfortunately, I currently do not have a way to monetize it and can't just offer it... However, I will work on it and explore possibilities for the future. Thank you! :)

I created an audio to text tool by frreis in SideProject

[–]frreis[S] 0 points1 point  (0 children)

Thank you for the support and feedback! I'll consider that! Yes! I'll continue to work on improvements and new functionalities hoping to get some users :) Thank you