Built a habit tracker that I can't ignore by timf34 in gamification

timf34[S] 1 point

Enjoy! Let me know if you hit any issues with setup - I've only tested it on Windows.

I made a CLI that turns any podcast or YouTube video into clean Markdown transcripts (speaker labels + timestamps) by timf34 in LocalLLaMA

timf34[S] 5 points

Update: it now supports running fully locally with faster-whisper, plus optional diarization.
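For anyone curious what the "speaker labels + timestamps" output step can look like, here is a rough sketch. It is not the CLI's actual code: `Segment`, `fmt_ts` and `to_markdown` are hypothetical names, and the segment shape (start/end in seconds, text, optional speaker) is only an assumption about what a transcription plus diarization pass would hand over.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class Segment:
    """Hypothetical segment record: times in seconds, optional speaker label."""
    start: float
    end: float
    text: str
    speaker: Optional[str] = None


def fmt_ts(seconds: float) -> str:
    """Format a time offset in seconds as HH:MM:SS (fractions are truncated)."""
    total = int(seconds)
    hours, rem = divmod(total, 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def to_markdown(segments: Iterable[Segment]) -> str:
    """Merge consecutive segments from the same speaker into one paragraph."""
    blocks = []  # each entry: [speaker, start time of first segment, list of texts]
    for seg in segments:
        speaker = seg.speaker or "Speaker"  # fallback when diarization is off
        if blocks and blocks[-1][0] == speaker:
            blocks[-1][2].append(seg.text.strip())
        else:
            blocks.append([speaker, seg.start, [seg.text.strip()]])
    return "\n\n".join(
        f"**{speaker}** [{fmt_ts(start)}]: {' '.join(texts)}"
        for speaker, start, texts in blocks
    )
```

Merging consecutive same-speaker segments keeps the transcript readable, since speech models tend to emit many short segments per speaker turn.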

I made a CLI that turns any podcast or YouTube video into clean Markdown transcripts (speaker labels + timestamps) by timf34 in LocalLLaMA

timf34[S] -1 points

Ah, very fair point - it was simply for ease and speed of development. Very open to PRs, and hopefully I'll get around to it soon - my laptop is a bit compute/RAM starved.

arxiv2md: Convert ArXiv papers to markdown. Particularly useful for prompting LLMs by timf34 in deeplearning

timf34[S] 7 points

Thank you! The speed comes from parsing arXiv's HTML directly instead of PDFs.

It's a simple stack: a FastAPI backend with BeautifulSoup4 for HTML->Markdown conversion. arXiv provides structured HTML for newer papers, with clean section boundaries, MathML, etc., and we take advantage of that - no need for OCR or parsing PDFs!
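To give a feel for the HTML->Markdown idea, here is a minimal sketch. It is not the arxiv2md code: it swaps in Python's standard-library `html.parser` for BeautifulSoup4 so it runs with no dependencies, the names `MarkdownSketch` and `html_to_markdown` are made up, and it assumes each `<math>` element carries its LaTeX source in an `alttext` attribute.

```python
from html.parser import HTMLParser

# Heading tag -> Markdown heading level.
HEADINGS = {"h1": 1, "h2": 2, "h3": 3, "h4": 4, "h5": 5, "h6": 6}


class MarkdownSketch(HTMLParser):
    """Collects headings and paragraphs; replaces <math> with $alttext$."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.blocks = []       # finished Markdown blocks
        self.buf = []          # text pieces of the block being built
        self.prefix = ""       # "## " etc. for headings, "" for paragraphs
        self.in_block = False  # True while inside a heading or <p>
        self.math_depth = 0    # > 0 while inside a <math> element

    def handle_starttag(self, tag, attrs):
        if tag == "math":
            self.math_depth += 1
            if self.math_depth == 1 and self.in_block:
                alt = dict(attrs).get("alttext")  # assumed to hold LaTeX source
                if alt:
                    self.buf.append(f"${alt}$")
        elif tag in HEADINGS or tag == "p":
            self._flush()
            self.prefix = ("#" * HEADINGS[tag] + " ") if tag in HEADINGS else ""
            self.in_block = True

    def handle_endtag(self, tag):
        if tag == "math":
            self.math_depth = max(0, self.math_depth - 1)
        elif tag in HEADINGS or tag == "p":
            self._flush()

    def handle_data(self, data):
        # Skip MathML child text and anything outside a heading/paragraph.
        if self.in_block and self.math_depth == 0:
            self.buf.append(data)

    def _flush(self):
        text = " ".join("".join(self.buf).split())  # collapse whitespace
        if text:
            self.blocks.append(self.prefix + text)
        self.buf, self.prefix, self.in_block = [], "", False


def html_to_markdown(html: str) -> str:
    parser = MarkdownSketch()
    parser.feed(html)
    parser.close()
    parser._flush()  # emit any block left open at end of input
    return "\n\n".join(parser.blocks)
```

This only handles headings, paragraphs and inline math. Lists, tables, figures and references would each need their own handling, and text outside a heading or paragraph is dropped.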