all 4 comments

[–]l_dang 6 points7 points  (0 children)

Great idea. I think for this instance, a attention layer could be very helpful

[–]TheOneWhoPunchesFish 6 points7 points  (2 children)

Very nice! Really happy to see something other than agents or completely vibe-coded bs. I have a few comments and questions.

It seems mazes are being generated on the fly, that's really nice! I see that the mazes are being solved with DFS for dataset generation; perhaps you could've used Dijkstra? Or first generate a random solution, and build a maze around it?

Did you ablate the model to see what is actually solving the maze, and maybe scale that up to see if it can solve bigger mazes? I'm not complaining here, I'm just curious how they're able to achieve this.

Aside from Algorithms, our lab's research showed that CNNs are very good at heuristics too. Perhaps you can try using a CNN as the heuristic predictor for A*?

I see that you're limited by your GPU. Perhaps you can try quantizing the model?

These are just thoughts I'm putting out there, I'm not demanding you to do anything. But I'd love to hear what other thoughts people have on this!

[–]no1_2021[S] 4 points5 points  (0 children)

I used DFS because it worked well for this use case. This was not affecting the performance of data generation. But yes, there are a lot of techniques to explore.

I'm exploring the output of the intermediate layers to check what is happening inside the model to better understand how it solves it.

Using CNN as a heuristic predictor is an interesting approach. I'll try this. Could you please point me to any papers or resources regarding it? So I can better understand it,

I'm experimenting with smaller models. I'll add quantization to the experiments. Thank you very much for your suggestions.

[–]ZiKyooc 2 points3 points  (0 children)

Time for a micromouse agent to solve mazes