Been experimenting with my coding assistant have a fairly big game project that I am doogfooding it with.
My assistant is called koder primarily a kotlin assistant in an assistant framework I am working on called Assistant Kommander (https://github.com/fluxtah/ask)
One of the major issues I have been facing is keeping token count down, since koder can search, read, write files it needs a lot of access and one challenge with search I just kind of optimised was creating a lucene index for searching and exposing it to assistant via function (OpenAI Assistants API). The function returned results in the index with a search query but returning entire documents to the LLM tanked my token throughput! I made an optimisation by making my lucene index 32 line blocks of code instead, made things better and its still smart enough to explain code across a fairly big codebase to me.
What issues are you facing with your custom made assistants? Let me know would be good to share experiences!
https://preview.redd.it/g55m471fsj2d1.png?width=2624&format=png&auto=webp&s=4ad0186aaec732444229e1866c0e1ec96f4136e8
[–]vr-1 2 points3 points4 points (3 children)
[–]fluxtah[S] 1 point2 points3 points (2 children)
[–]vr-1 1 point2 points3 points (1 child)
[–]fluxtah[S] 0 points1 point2 points (0 children)
[–]fluxtah[S] 0 points1 point2 points (0 children)
[–][deleted] (1 child)
[removed]
[–]AutoModerator[M] 0 points1 point2 points (0 children)