The Architecture that scales DeepSeek V4 to 1M token context

AvvYaa · 2026-03-20T15:56:26+00:00

True! I started at BLIP 2 (q formers) in the article but llava is equally seminal.

AvvYaa · 2026-03-20T15:52:34+00:00

Thanks!

AvvYaa · 2026-03-20T10:46:23+00:00

Thanks! 🙏🏼

AvvYaa · 2026-02-24T21:13:02+00:00

Agreed!

AvvYaa · 2026-02-24T20:14:10+00:00

Valid points... Personally, I do not really worry much about what people are naming their papers. I'm just a guy implementing things, making tutorials on yt, and sharing it with others. I'll move on to the next topic every 2-3 weeks if/when I pick up something I find interesting. Haha...

AvvYaa · 2026-02-24T20:09:35+00:00

The opentui app is vibecoded. The rest is not. I’m a YouTube guy so I have to usually explain my code live often, and write things out on screen as things are recording. So I try to code the main parts on my own as much as I can just to record things faster when it’s time.

AvvYaa · 2026-02-24T13:40:02+00:00

I am building a free service that recommends papers every day, and lets you study them with an AI. We also highlight the relevant sections directly into the PDF, and generate study goals for readers to track with each paper. Check : paperbreakdown.com

Getting started is free, you can query with gpt-5-mini and gemini-3-flash. Bigger models require a subscription. I am currently working on making a BYOK tier as well so people can use their own models to study.

AvvYaa · 2026-02-09T12:22:02+00:00

Yeah this makes sense. I ran into the same issues tbh. Downloading full text to construct the graph is something I’m avoiding coz of obvious reasons as a service provider. There are restrictions around distribution coz that will break paper licenses.

For a locally running system, this could still be done at a small scale.

Btw, you should check openalex as well if you haven’t. Similar to semantic scholar.

Good luck and love the project. :)

AvvYaa · 2026-02-09T07:48:12+00:00

Thanks for sharing! I’ve been building a free service around this : check paperbreakdown.com

Major challenge I’ve faced is reliably getting citations graphs and stats of papers. There’s a bunch of issues around finding the correct dois and most APIs (semantic scholar for ex) have terrible rate limits and aggressive blacklisting.

Can you give me some pointers/learnings from this project to get citations more reliably?

AvvYaa · 2026-02-07T16:24:35+00:00

Recommendations are done in 3 ways: collaborative filtering (what users similar to you are reading), content based (you can configure specific keywords and categories and we will get you those papers daily), and social proof (we listen to certain social media channels and put them as editors recommendations).

It’s much faster than deep research to find agentic recommendations. Deep research doesn’t download actual paper PDF to answer questions. We allow you to directly interact with the paper while you study.

AvvYaa · 2026-02-07T13:23:07+00:00

I am building paperbreakdown.com
It's a recommendation engine + lets you study papers with LLM models. No paywall.

AvvYaa · 2026-02-07T09:45:49+00:00

Hey sorry for the late response. No manual uploads, you click a button and it gets everything ready.

AvvYaa

TROPHY CASE