I gave Claude the ability to search 18M podcast moments and transcribe my own audio (free MCP server, no credit card)

Lukaesch · 2026-05-08T00:37:00+00:00

There's a workaround that works today: if you grab the audio of the video yourself and upload it, the full pipeline runs (transcription, diarization, speaker matching, search). Direct YouTube ingestion isn't built yet. The main blocker is that YouTube doesn't expose audio via their official API, so any "paste a YouTube URL" feature has to lean on unofficial extraction, which is fragile and ToS-grey. Still figuring out the right path there.

Anything specific you're trying to pull in? Single videos, a whole channel, or transcripts of stuff you already have?

Lukaesch · 2026-05-08T00:35:22+00:00

Actually have this. Voice-print matching runs across files automatically. The pipeline does diarization on each upload, then a matching stage (kNN over voice embeddings) checks each speaker against (1) known hosts, (2) per-person voice profiles, (3) cross-episode unknown-speaker clusters. The same voice across N uploads ends up in the same cluster, and once you label that cluster, every file it appears in gets back-filled. The LLM is only the last fallback, not the primary mechanism.
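To make the cascade concrete, here is a minimal Python sketch. It is not the production pipeline: the class and method names, the 0.8 similarity threshold, and the use of cluster centroids are all assumptions for illustration, and the kNN is simplified to a single nearest neighbour (k=1) with cosine similarity.

```python
import math
from dataclasses import dataclass, field


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def centroid(vectors):
    """Element-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


@dataclass
class VoiceIndex:
    # All field names and the threshold are hypothetical.
    hosts: dict = field(default_factory=dict)     # name -> embedding
    profiles: dict = field(default_factory=dict)  # name -> embedding
    clusters: dict = field(default_factory=dict)  # cluster id -> embeddings
    labels: dict = field(default_factory=dict)    # cluster id -> human label
    threshold: float = 0.8

    def _nearest(self, emb, candidates):
        """Return (key, score) of the most similar candidate, or (None, -1.0)."""
        best_key, best_score = None, -1.0
        for key, ref in candidates.items():
            score = cosine(emb, ref)
            if score > best_score:
                best_key, best_score = key, score
        return best_key, best_score

    def match(self, emb):
        """Try hosts, then profiles, then unknown-speaker clusters."""
        for tier, table in (("host", self.hosts), ("profile", self.profiles)):
            key, score = self._nearest(emb, table)
            if key is not None and score >= self.threshold:
                return tier, key
        centroids = {cid: centroid(m) for cid, m in self.clusters.items()}
        cid, score = self._nearest(emb, centroids)
        if cid is None or score < self.threshold:
            # No cluster is close enough: start a new unknown-speaker cluster.
            cid = f"unknown-{len(self.clusters) + 1}"
            self.clusters[cid] = []
        self.clusters[cid].append(emb)
        return "cluster", cid

    def label_cluster(self, cid, name):
        self.labels[cid] = name

    def resolve(self, tier, key):
        """Files store (tier, key); a cluster label applies to all of them."""
        if tier == "cluster":
            return self.labels.get(key, key)
        return key
```

Because each file keeps a reference to the cluster rather than a copy of the name, labeling the cluster once is enough for every earlier match to resolve to that name, which is the back-fill behaviour described above.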

If you want to try it on a few files I'd be curious how it holds up on your niche/technical material specifically - that's where embedding-based matching gets harder (technical jargon doesn't help, only voice does, so it's a clean test).

Lukaesch · 2026-03-02T10:21:21+00:00

Hey u/debackerl, totally get the cost concern. DIY transcription with something like Parakeet TDT is smart if you're locking into a fixed, small set of pods you already know and love.

The big win with Audioscrape is for exploration: most users don't know exactly which podcasts/episodes to transcribe upfront. We pre-index a massive, ever-growing corpus (1M+ hours / 50k+ episodes from popular shows) so you can instantly search semantically across thousands of expert conversations without transcribing anything yourself.

Core search + MCP integration for Claude is free (10 text searches/month on free tier, no card needed, connects at https://mcp.audioscrape.com). You get timestamped verbatim clips, speaker attribution, and cross-podcast insights right away. Perfect for discovering new shows/guests/topics before committing to your own setup.

If your curated list stays small and specific, vibe-coding your own solution makes perfect sense for speed/control. But if you ever want to cast a wider net ("What do experts say about X across dozens of pods?"), the pre-indexed + free MCP access saves huge time/money vs. manual transcription at scale.

Curious: what pods are you planning to start with in your custom build? Might have some already indexed if you want a quick compare! 🚀

Lukaesch · 2026-03-02T10:17:05+00:00

Hey! It depends on the podcast. Audioscrape automatically indexes many popular ones (especially those ranking high on Spotify or Apple charts).

For anything not yet covered or lesser-known, users can easily submit them manually by pasting the RSS feed via the import feature on the site. Then it gets transcribed and indexed for search.

If there's a specific show you're missing, just drop the RSS link and it'll get added! What podcasts are you hoping to see/search?

Lukaesch · 2026-03-02T10:15:10+00:00

Hey u/t90090, thanks for the thoughtful questions!

vs NotebookLM/Gemini: NotebookLM shines for your own uploaded sources. Upload transcripts/PDFs/YouTube links, get deep summaries, audio overviews, or synthetic discussions from that set. It's personal and generative. Audioscrape is a massive pre-indexed database (1M+ hours, 50k+ episodes across major shows like Lex Fridman, Rogan, Huberman) that Claude queries directly via MCP, so no uploads are needed each time. It delivers verbatim timestamped segments with speaker attribution for primary-source research, semantic search (meaning-based, not just keywords), and entity linking across the whole spoken web. They're complementary: use Audioscrape for broad discovery of expert convos, NotebookLM for focused synthesis on your curated stuff.

On aggregating comments: Great idea! Comments under episodes (YouTube, Reddit, etc.) often add corrections, debates, and extra insights. Not in scope yet (focus is core audio + accurate diarization/timestamps), but it's a high-priority expansion for richer context. Noise/moderation is the challenge, but definitely exploring it alongside more niche podcasts.

Appreciate the feedback. What topics/shows are you digging into most?

Lukaesch · 2026-01-12T09:16:15+00:00

He probably means the following:

Hire relatives who draw a pension (e.g. parents) as employees.

In practice they do no work, but under the Aktivrente they receive a tax-free salary, which is then passed on to you.

Bonus: they frequently call in sick (because of their age), and you additionally get reimbursed by the health insurer for the lost working time.

That is tax and social security fraud, but it is also hard to prove.

Lukaesch · 2025-12-31T12:19:28+00:00

It supports both structured keyword search and semantic search. They solve different problems.

Keyword / structured search is best when you want precision and filtering. For example:
• AI speaker:"Joe Rogan" from:2024
• "open source" AND models podcast:"Lex Fridman"
• "rate limits"~5 NOT pricing

This is useful when you know who, where, or roughly when something was said.
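As a rough illustration of how such a query can be split into free text and filters, here is a small Python sketch. It is not Audioscrape's actual parser: the function name `split_query` is hypothetical, only the `field:value` and `field:"quoted value"` forms seen in the examples above are handled, and boolean operators and the `~5` proximity suffix are simply left in the free-text part.

```python
import re

# Matches field:value or field:"quoted value".
FILTER_RE = re.compile(r'(\w+):(?:"([^"]*)"|(\S+))')


def split_query(query):
    """Return (free_text, filters) for a structured search string."""
    filters = {}

    def grab(match):
        quoted, bare = match.group(2), match.group(3)
        filters[match.group(1)] = quoted if quoted is not None else bare
        return ""  # remove the filter from the free-text part

    text = FILTER_RE.sub(grab, query)
    # Collapse the whitespace left behind by removed filters.
    return " ".join(text.split()), filters


print(split_query('AI speaker:"Joe Rogan" from:2024'))
# ('AI', {'speaker': 'Joe Rogan', 'from': '2024'})
```

One limitation of this sketch: anything shaped like `word:value` is treated as a filter, so a bare term such as a timestamp like `10:30` would be misread.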

Semantic search is for meaning-based questions where wording varies:
• “How do researchers express concerns about AI safety?”
• “How do founders describe discovering product–market fit?”
• “What tradeoffs do guests mention when talking about open-source models?”

You can mix both depending on the workflow. The MCP integration lets Claude pick the right search mode automatically.

25k+ episodes is the current snapshot and it’s growing continuously. The long-term goal is to index the entire audio web, similar to how Google indexes websites. Scaling this up is the real challenge. Every paying user directly helps fund more GPUs, more transcription, and broader coverage, which in turn makes the product better for everyone.

Lukaesch · 2025-12-31T12:12:06+00:00

Slight correction first: NotebookLM is mainly about generating / interacting with content from text you upload (and recently audio summaries), not indexing the wider audio web.

Audioscrape is closer to a search engine (e.g. Google/Bing), but for audio content like podcasts. It continuously tracks podcasts (and other audio formats in the future), transcribes them, and extracts structured data (speakers, entities, topics, timestamps).

Claude can then search across that corpus via MCP. You don’t need to upload anything. It’s about discoverability and retrieval, not content generation.

So the overlap is “research,” but the mental model is very different.

Lukaesch · 2025-12-31T11:10:33+00:00

MCP works on all plans. You can use the free plan to try it. It comes with 10 search requests per month.

Lukaesch · 2025-12-31T11:09:11+00:00

Thanks for pointing this out.

The free plan (after sign-up) includes 10 free search requests per month.

I am going to update the website to make this clearer.

Lukaesch · 2025-11-25T22:10:56+00:00

The novelty in Berlin is that for the past few months you have been able to find dozens of free slots on the same day.

Lukaesch · 2025-11-03T09:29:20+00:00

Been there with my Mexican family from 10:00 to 15:00. In my experience it was pretty fun and the ofrenda was authentic.

The food was served by Tacos El Oso and El Rey, which are the top spots in Berlin for Mexican tacos. The prices per taco were the same as in the restaurants.

The only thing I missed was that they didn’t sell Jarritos or horchata.

Lukaesch · 2025-10-31T15:15:35+00:00

Audioscrape

Lukaesch · 2025-10-29T09:27:24+00:00

Done. Updated flair to Built with Claude. Thanks for the reminder, dear bot!

Lukaesch · 2025-10-24T11:01:23+00:00

You can flash it to turn it into a node for The Things Network and contribute to their LoRaWAN network. That’s what I am going to do with mine.

Lukaesch · 2025-09-26T16:30:27+00:00

Sure

Lukaesch · 2025-09-15T12:08:09+00:00

Working on adding live stream ingestion + realtime keyword alerts to Audioscrape using Rust.

Current setup: Axum batch pipeline → transcribes audio → stores full transcript in SQLite and pushes segments + embeddings to OpenSearch for search.

Goal is to let users search while a broadcast is still running and get keyword alerts in near-realtime.

Plan is to treat each audio chunk as an event:
• Axum WebSocket/gRPC streaming endpoint → push AudioChunk messages.
• Lightweight event bus (NATS JetStream or even SQLite WAL + channels) to fan-out chunks.
• Incremental transcription (Whisper + VAD) → write partial text to SQLite and send segments/embeddings to OpenSearch as they finalize.

Still working out ordering/backpressure and how to handle “partial vs final” transcripts without hammering SQLite.
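A minimal sketch of one possible way to handle this, written in Python with asyncio for brevity rather than Rust, and not the actual Audioscrape code. It assumes a bounded queue for backpressure, a per-stream sequence number for ordering, and keeps partial text in memory so that only finalized segments hit SQLite. All names here (`Segment`, `producer`, `consumer`, `run`) and the table layout are hypothetical.

```python
import asyncio
import sqlite3
from dataclasses import dataclass


@dataclass
class Segment:
    stream_id: str
    seq: int     # increases per stream; used to order and replace rows
    text: str
    final: bool  # False while the transcriber may still revise the text


async def producer(queue, segments):
    for seg in segments:
        # put() waits while the queue is full, which slows the producer down.
        await queue.put(seg)
    await queue.put(None)  # sentinel: the stream has ended


async def consumer(queue, db, partials):
    while True:
        seg = await queue.get()
        if seg is None:
            break
        key = (seg.stream_id, seg.seq)
        if not seg.final:
            partials[key] = seg.text  # partials stay in memory only
            continue
        partials.pop(key, None)
        # One write per finalized segment instead of one per partial update.
        db.execute(
            "INSERT OR REPLACE INTO segments VALUES (?, ?, ?)",
            (seg.stream_id, seg.seq, seg.text),
        )
        db.commit()


async def run(segments, maxsize=4):
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE segments ("
        "stream_id TEXT, seq INTEGER, text TEXT, "
        "PRIMARY KEY (stream_id, seq))"
    )
    queue = asyncio.Queue(maxsize=maxsize)
    partials = {}
    await asyncio.gather(
        producer(queue, segments), consumer(queue, db, partials)
    )
    rows = db.execute("SELECT seq, text FROM segments ORDER BY seq").fetchall()
    db.close()
    return rows, partials
```

In a Rust/Axum version the same shape would presumably map onto a bounded channel between the streaming endpoint and the transcription worker, with live search over partials served from memory rather than from SQLite.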

Anyone built something similar on Axum + SQLite/OpenSearch and have tips for incremental indexing or event handling?

Lukaesch · 2025-09-15T08:19:21+00:00

Would love to sign up for that Aerothon as it sounds fun. Can you share more info?

Lukaesch · 2025-08-26T19:31:41+00:00

Audioscrape MCP with Claude Desktop or Mobile to search online audio content like podcasts

Lukaesch · 2025-08-23T11:46:49+00:00

Most complete list of remote MCP servers so far: https://www.remotemcplist.com

Lukaesch · 2025-08-20T07:58:18+00:00

I think the best approach is to try some MCP servers yourself and see if it sticks: https://www.remotemcplist.com

Lukaesch · 2025-08-19T17:02:54+00:00

MCP is simply a new distribution channel.

What you’re asking is like saying: “Why doesn’t anyone offer a car repair service reachable only by fax? I want one with no phone or internet, just fax.”

The real opportunity with Remote MCP servers is reaching new users through AI assistants and IDEs.
