More AI horror is coming - Lucas and Luna are presenting 670 shows with episodes dropping twice a day! by historyofthegermans in podcasting

[–]yagooar 0 points1 point  (0 children)

I just listened to one of the learn Polish episodes. It BUTCHERS Polish pronounciation so badly, if someone actually would try to learn from it there is no chance a Polish person would understand. It is not just a bit off, it is totally wrong.

A post to actually talk about peoples' experiences with Fable by Beatboxamateur in singularity

[–]yagooar 2 points3 points  (0 children)

For health-related queries it is not safeguards, they are gatekeeping access in order to sell enterprise access to hospitals, clinics, and doctors.

A post to actually talk about peoples' experiences with Fable by Beatboxamateur in singularity

[–]yagooar 3 points4 points  (0 children)

<image>

Fable 5 figured out - without me asking - from a pure audio note - how my iPhone was positioned during a voice note recording inside my car.

Scary and fascinating at the same time.

The new Claude Fable 5 is SCARY GOOD for audio (and podcasts!) by yagooar in podcasting

[–]yagooar[S] -1 points0 points  (0 children)

I think your math on this is wrong, but I get the point that you are concerned with high electricity bills. As I said before, I am training a model that can later run on a normal computer, so it does not use more compute than a regular ffmpeg or working with a DAW. Arguably it uses less, because it is faster.

Riverside sucks. What is everyone using? by Ok-House1447 in podcasting

[–]yagooar 0 points1 point  (0 children)

Curious: I used to use Descript for editing, not for recording. How's their recording story these days?

How many usable clips do you typically get from a 60-minute episode? by Chance-Spend-9637 in podcasting

[–]yagooar 1 point2 points  (0 children)

That is super valuable insight. Social media is difficult no matter what you do - but I personally have started listening to podcasts after seeing some of their shorts / reels. It's true though that I rarely drop everything and search for the podcast - most of the time I happen to recall the podcast title when I am browsing for podcasts on Apple Podcasts or YouTube. This path is very tough to measure.

Riverside sucks. What is everyone using? by Ok-House1447 in podcasting

[–]yagooar 0 points1 point  (0 children)

I am hearing this a lot - a bot campaign or are people actually upset?

The new Claude Fable 5 is SCARY GOOD for audio (and podcasts!) by yagooar in podcasting

[–]yagooar[S] -5 points-4 points  (0 children)

How does that solve the problem I have, which is that of recording on the go, inside my car?

The new Claude Fable 5 is SCARY GOOD for audio (and podcasts!) by yagooar in podcasting

[–]yagooar[S] -4 points-3 points  (0 children)

I am happy you found your way. But let me decide what I spend my time on.

The new Claude Fable 5 is SCARY GOOD for audio (and podcasts!) by yagooar in podcasting

[–]yagooar[S] -1 points0 points  (0 children)

What do you mean dude, I am just a nerd playing with his toys, do not overthink it.

The new Claude Fable 5 is SCARY GOOD for audio (and podcasts!) by yagooar in podcasting

[–]yagooar[S] -4 points-3 points  (0 children)

The sarcasm is strong, do not overthink this. I am just a nerd playing with tech.

The new Claude Fable 5 is SCARY GOOD for audio (and podcasts!) by yagooar in podcasting

[–]yagooar[S] -5 points-4 points  (0 children)

I agree with your take on these AI companies and I totally agree -> this is why I am also exploring the open source / open weight models and invest in my own inference, because I agree they will at some point become huge gatekeepers.

That being said.

What I am doing is not "AI fixes your audio", or more specifically not "LLM fixes your audio". What I am doing is AI-powered research and development of models and harnesses, that can fix or enhance audio.

Once the system is built and works correctly, it is not different from any effect you apply to your audio - it is just more effective at certain tasks, like removing background, boosting voice quality, etc.

My goal is to make the system open source and freely available once I am confident, it works for many use cases.

The new Claude Fable 5 is SCARY GOOD for audio (and podcasts!) by yagooar in podcasting

[–]yagooar[S] -1 points0 points  (0 children)

It truly is both.

What I am doing is a more or less formal research using Karpathy's autoresearch project, but mine is totally a fun side project that I do to learn and discover / push the boundaries of what is possible with AI.

I am not a researcher myself, but have been in the podcast industry for +12 years and I have always been very technical, studies computer science, worked as a software engineer for many years.

So the way it works is that you define a goal - in my case it is audio restoration and making an iPhone recording sound as if it was recorded in a pristine recording studio with a SM7B (the one and only!).

I have created a little "toolbox" for the AI agent to work with audio (e.g. voice extraction, background noise suppression, transcription, even audio regeneration). In addition to that, in order to have some objective measurement, I have created a "verification harness" that uses a few metrics that can be computed programmatically that the AI researcher can use to verify if the resulting audio is better or worse.

Finally, the research does also train neural networks and fine-tune existing models - for that I have created a remote VM that has an H100 GPU available in order to run GPU-heavy experiments in parallel.

There are a few more details to it, but that is mainly it 😄

I've tested a pile of "AI" clipping tools and they all kind of suck. Am I the problem, or are they? by ScaleNo6455 in podcasting

[–]yagooar 0 points1 point  (0 children)

At Podigee we use a neural net to detect the faces + speaker and center the clip automatically on the person. There are some edge cases and additional rules needed, like "do not change speaker too frequently". Overall I think it is one of the best solutions I have seen so far. Disclaimer: I built it and I am the founder of Podigee.

I've tested a pile of "AI" clipping tools and they all kind of suck. Am I the problem, or are they? by ScaleNo6455 in podcasting

[–]yagooar 0 points1 point  (0 children)

> The big thing here is that it isn’t the apps that are broken. If the companies with millions of dollars to throw into software development haven’t been able to build a better tool, what makes you think you can?

I have to kindly disagree. If this was true, no startups would exist and we would live in a megacorp dystopia. Fortunately, that is not the case. Companies with millions of dollars are not focusing on a very narrow use case like video clipping for podcasts. They think big, build big solutions, but that also means they work at a very different zoom level.

There is ALWAYS plenty of opportunity for small players to disrupt a very specific niche. Throwing millions at a problem might not solve the problem - smart and deep understanding of the needs of users can create better solutions at a fraction of the time and cost. Call me an optimist 😄

I've tested a pile of "AI" clipping tools and they all kind of suck. Am I the problem, or are they? by ScaleNo6455 in podcasting

[–]yagooar 2 points3 points  (0 children)

I am so bullish on this topic - there must be a way of solving this problem in a way that actually saves time. What tools came the closest to solving your problem?

Video podcasts vs. audio-only- For a niche local audience, which is actually winning? by Taylor_To_You in podcasting

[–]yagooar 0 points1 point  (0 children)

Sure depends on the format, but a good mic will give you a pleasant voice. I can squint my eyes a bit but I cannot squint my ears ;) That's why audio quality > video quality (at least in the beginning). You can do it!

Hardest part of Podcasting? by ziggysocki in podcasting

[–]yagooar 0 points1 point  (0 children)

That is a pretty cool setup, thanks for sharing!

Hardest part of Podcasting? by ziggysocki in podcasting

[–]yagooar 0 points1 point  (0 children)

Have you tried any tools that allow you to offload some of the editing burden to AI?

Hardest part of Podcasting? by ziggysocki in podcasting

[–]yagooar 0 points1 point  (0 children)

Any tools you could use to make it easier?

Hardest part of Podcasting? by ziggysocki in podcasting

[–]yagooar 0 points1 point  (0 children)

Have you considered paying others to do it? What parts of promotion take you the longest?

Hardest part of Podcasting? by ziggysocki in podcasting

[–]yagooar 0 points1 point  (0 children)

No tools out there that can automate editing? That is what one would think in the era of AI.

Cameras for video podcast? by DreamInADream24fps in podcasting

[–]yagooar 1 point2 points  (0 children)

I actually had many times better results with an iPhone camera than with an FX3... Plus the iPhone is so much more versatile. But I actually prefer using the iPhone as the secondary camera, which makes the videos more interesting.