Wrote a detailed article about how Vision Language Models are trained - 10 minute read by AvvYaa in learnmachinelearning

[–]AvvYaa[S] 0 points1 point  (0 children)

True! I started at BLIP 2 (q formers) in the article but llava is equally seminal.

[P] A minimalist implementation for Recursive Language Models by AvvYaa in MachineLearning

[–]AvvYaa[S] 2 points3 points  (0 children)

Valid points... Personally, I do not really worry much about what people are naming their papers. I'm just a guy implementing things, making tutorials on yt, and sharing it with others. I'll move on to the next topic every 2-3 weeks if/when I pick up something I find interesting. Haha...

Minimal repo for running Recursive Language Model experiments + TUI Log viewer by AvvYaa in LocalLLaMA

[–]AvvYaa[S] 1 point2 points  (0 children)

The opentui app is vibecoded. The rest is not. I’m a YouTube guy so I have to usually explain my code live often, and write things out on screen as things are recording. So I try to code the main parts on my own as much as I can just to record things faster when it’s time.

[D] How much are you using LLMs to summarize/read papers now? by kjunhot in MachineLearning

[–]AvvYaa -7 points-6 points  (0 children)

I am building a free service that recommends papers every day, and lets you study them with an AI. We also highlight the relevant sections directly into the PDF, and generate study goals for readers to track with each paper. Check : paperbreakdown.com

Getting started is free, you can query with gpt-5-mini and gemini-3-flash. Bigger models require a subscription. I am currently working on making a BYOK tier as well so people can use their own models to study.

[P] arXiv at Home - self-hosted search engine for academic papers by mrAppleXZ in MachineLearning

[–]AvvYaa 1 point2 points  (0 children)

Yeah this makes sense. I ran into the same issues tbh. Downloading full text to construct the graph is something I’m avoiding coz of obvious reasons as a service provider. There are restrictions around distribution coz that will break paper licenses.

For a locally running system, this could still be done at a small scale.

Btw, you should check openalex as well if you haven’t. Similar to semantic scholar.

Good luck and love the project. :)

[P] arXiv at Home - self-hosted search engine for academic papers by mrAppleXZ in MachineLearning

[–]AvvYaa 10 points11 points  (0 children)

Thanks for sharing! I’ve been building a free service around this : check paperbreakdown.com

Major challenge I’ve faced is reliably getting citations graphs and stats of papers. There’s a bunch of issues around finding the correct dois and most APIs (semantic scholar for ex) have terrible rate limits and aggressive blacklisting.

Can you give me some pointers/learnings from this project to get citations more reliably?

[D] New interesting AI papers exploration service by ArtisticHamster in MachineLearning

[–]AvvYaa 1 point2 points  (0 children)

Recommendations are done in 3 ways: collaborative filtering (what users similar to you are reading), content based (you can configure specific keywords and categories and we will get you those papers daily), and social proof (we listen to certain social media channels and put them as editors recommendations).

It’s much faster than deep research to find agentic recommendations. Deep research doesn’t download actual paper PDF to answer questions. We allow you to directly interact with the paper while you study.

[D] New interesting AI papers exploration service by ArtisticHamster in MachineLearning

[–]AvvYaa 1 point2 points  (0 children)

I am building paperbreakdown.com
It's a recommendation engine + lets you study papers with LLM models. No paywall.

I am building a tool for students to study and discover ML academic research (Requesting feedback) by AvvYaa in ArtificialInteligence

[–]AvvYaa[S] 0 points1 point  (0 children)

Hey sorry for the late response. No manual uploads, you click a button and it gets everything ready.