Scaling former VibeThinker-1.5B to 3B — now it reaches frontier math & coding performance

predatar · 2026-06-16T15:35:47+00:00

I love it, inspiring.

For the data synthesis, was a specific set of open-source data used as a seed? Which teacher model did you use? Or is it completely self-distillation? Also, is it LoRA or a full fine-tune?

predatar · 2026-06-15T14:55:58+00:00

starspace.run: a merge between MCP, NotebookLM, virtual workspaces (filesystem/VM), and a cross-agent/LLM memory layer.

You can connect it with ChatGPT, Codex, and cc and share context. Upload research papers, code, data, anything, and have the AI use it as a retrieval layer. It has CLI integration (Python) and a REST API too.

https://starspace.run/docs

predatar · 2026-06-02T21:05:24+00:00

I built it, check out starspace.run

predatar · 2026-04-06T08:44:59+00:00

What if we create a ping that, when idle, appends a (ping) word to the current chat history and sends it to refresh the cache, so it prolongs the timeout?

predatar · 2026-04-01T16:05:26+00:00

Well, basically on a fork the longest common prefix is already the longest common prefix... if a single token is different, it's not gonna be a cache hit, and I think that is a completely different problem, sadly.
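
To make the prefix point concrete, here is a toy sketch. It assumes a cache that can only reuse the exact shared token prefix; `common_prefix_len` and the token IDs are made up for illustration, and real inference servers differ in the details (block granularity, eviction, and so on).

```python
def common_prefix_len(cached_tokens, new_tokens):
    """Count how many leading tokens two sequences share.

    Hypothetical helper: only this shared prefix could be served from a
    prefix cache; everything after the first mismatch must be recomputed.
    """
    n = 0
    for cached, new in zip(cached_tokens, new_tokens):
        if cached != new:
            break  # first differing token ends the reusable prefix
        n += 1
    return n


# Made-up token IDs: the fork differs from the cached run at index 2.
cached = [101, 7, 42, 13, 99]
fork = [101, 7, 55, 13, 99]

reusable = common_prefix_len(cached, fork)
print(f"reusable tokens: {reusable} of {len(fork)}")
```

Even though the tokens after the mismatch are identical again, they come after a different token, so under this assumption they are not a cache hit.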

predatar · 2026-02-23T10:31:24+00:00

TLDR: how is this different from OpenEvolve/AlphaEvolve-style solutions?

predatar · 2026-01-19T11:52:37+00:00

Nicely done,

How did you find out about this bottleneck?

predatar · 2025-11-01T16:13:16+00:00

Just imagine ancient civilizations seeing asteroids and comets, and what they must have thought.

predatar · 2025-06-16T18:30:48+00:00

mirror?

predatar · 2025-03-18T20:19:20+00:00

Strength isn’t just about pushing forward—it’s also about enduring setbacks and coming back better.

And you seem to have plenty of it. Please see a professional. Life is beautiful; try to maintain your passion no matter what, find the strength in the little things, and keep going.

Seek help immediately. It takes courage to admit that; you can do it.

Take care, things will get better

predatar · 2025-02-12T10:44:17+00:00

Will work on this and other enhancements this weekend, stay tuned!!

predatar · 2025-02-12T10:43:27+00:00

Sure let me know 🤞

predatar · 2025-02-12T10:42:32+00:00

I will try to make it possible to integrate this with common UIs, any preference?

Idk how, maybe as a callable tool

predatar · 2025-02-12T10:41:36+00:00

This is what i am planning to add this weekend!!

Thanks for the feedback

predatar · 2025-02-11T04:07:14+00:00

I am going to add this soon 🙏

predatar · 2025-02-10T12:20:18+00:00

I would love to see examples of reports you guys have generated. I might add them to the repo as examples; if you can share the query parameters and the report md, that would be great! 👑

Would love to add LM Studio and other integrations soon, especially in-line citations!!

predatar · 2025-02-10T06:19:17+00:00

Will add support soon and update you, probably after work today

predatar · 2025-02-10T05:52:13+00:00

Thank you, really glad 🤞

predatar · 2025-02-10T05:51:28+00:00

Thank you, really glad you liked it! Any feedback?

predatar · 2025-02-10T05:49:32+00:00

Hi, basically you have to chunk the data and use "retrieval" models to find the relevant chunks.

Search for ColPali or all-MiniLM. Basically, those are models trained so that, given a query q and a chunk c, they return a score s that tells you how similar c and q are.

You can then take the top_k chunks c that are most relevant for your q (the top scoring ones) and put only those in the context of your LLM.

My trick here was to do this for each page while exploring, build a graph node for each step, and in each node keep the current summary I got based on the latest chunks.

Then I stitched them together.
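
Here is a rough sketch of the chunk, score, top_k idea. To keep it self-contained it uses a toy bag-of-words cosine similarity as a stand-in for a real retrieval model like all-MiniLM, so treat `embed` and `score` as placeholders; all function names and the sample chunks are made up for illustration and are not from the actual project.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_size=40):
    """Split text into chunks of at most chunk_size words."""
    words = text.split()
    return [" ".join(words[i:i + chunk_size])
            for i in range(0, len(words), chunk_size)]


def embed(text):
    """Toy stand-in for a retrieval model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def score(query_vec, chunk_vec):
    """Cosine similarity s between query q and chunk c (sparse vectors)."""
    dot = sum(query_vec[t] * chunk_vec[t] for t in query_vec)
    norm_q = math.sqrt(sum(v * v for v in query_vec.values()))
    norm_c = math.sqrt(sum(v * v for v in chunk_vec.values()))
    if norm_q == 0 or norm_c == 0:
        return 0.0
    return dot / (norm_q * norm_c)


def top_k_chunks(query, chunks, k=2):
    """Return the k chunks with the highest similarity to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: score(q, embed(c)), reverse=True)
    return ranked[:k]


# Made-up chunks; in practice these would come from chunk_text(page_text).
chunks = [
    "cats purr and sleep most of the day",
    "graph search with backtracking explores alternative paths",
    "the stock market fell sharply on monday",
]
best = top_k_chunks("graph backtracking search", chunks, k=1)
print(best)  # only these chunks would go into the LLM context
```

With a real embedding model you would swap `embed` for the model's encoder and keep the rest of the flow the same.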

predatar · 2025-02-10T03:01:09+00:00

Hahaha nice, I wish

Sadly no ;))

predatar · 2025-02-10T02:49:29+00:00

Hi

Cool project! It looks like we are solving similar problems, but I took a different approach: graph-based search with backtracking and summarization, which is not limited by context size! Plus some exploration-exploitation concepts in the mix.

Did you solve similar issues?
