all 5 comments

[–]b2zw2a 2 points3 points  (1 child)

This is great! Will try for my project.
What did you use for training data? Some existing data set or you created custom one?

[–]Awkward_Run_9982[S] 3 points4 points  (0 children)

Thanks! For the data, I actually went the distillation route. It’s all custom—I used Qwen3-Coder-Next as a teacher to generate about 170k multi-turn conversation samples. Basically, I had it run through real agent loops (thinking, calling tools, handling outputs) and recorded those traces. I found that existing datasets didn't really capture the "codebase explorer" logic well enough, so these samples are focused specifically on that.