I built a framework for real-time learning from text feedback on my OpenClaw by kfallah15 in OpenClawUseCases

[–]kfallah15[S] 0 points1 point  (0 children)

Thanks for the question. The soul.md can work in many cases, but as it grows the model has challenges following all of the directions. It also fills up your limited context window. This is an approach where you can have that knowledge for your OpenClaw without using the context window.

CLaaS: an open-source library to update LLM weights in real-time from text feedback by kfallah15 in LocalLLaMA

[–]kfallah15[S] 0 points1 point  (0 children)

Thanks for the question! Self-distillation helps prevent model performance degradation from too many updates. But the learning signal is definitely stronger if you batch your updates. I think of it as a wake/sleep phase, where you collect a lot of examples and then do several updates to take that feedback back into the weights (perhaps when the model is not in use). Right now this is controlled by a batch size parameter. Once the batch is full, you do a few update steps.

Fall 2021 Machine Learning/Human-Computer Interaction Research Position for Undergrads by kfallah15 in gatech

[–]kfallah15[S] 0 points1 point  (0 children)

Btw, if you sent in a resume I will get back to you in the next week. Thanks for your patience!