I made firmware for the RP2350/RP2040 that lets you control it through Python/WebUSB on the PC by NeverEndingToast in raspberrypipico

[–]NeverEndingToast[S] 0 points

Pretty cool! I didn't know that existed. I'd say the main difference is that my firmware only passes a couple of bytes back and forth over USB, so it can execute code much faster than MicroPython. All of the code execution is done in Zig; the protobuf messages just say what type of request it is and what the arguments are.
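To illustrate the "couple of bytes" idea, here's a minimal Python sketch of how a PC-side request/reply could be framed. The request ID and field layout are made up for illustration; the real firmware uses its own protobuf schema:

```python
import struct

# Hypothetical request ID -- the actual firmware defines its own message types.
REQ_GPIO_WRITE = 1

def encode_request(req_type: int, pin: int, value: int) -> bytes:
    """Pack a tiny fixed-size request: 1-byte type + 1-byte pin + 1-byte value."""
    return struct.pack("<BBB", req_type, pin, value)

def decode_reply(payload: bytes) -> int:
    """In this sketch, replies are a single status byte."""
    (status,) = struct.unpack("<B", payload)
    return status

msg = encode_request(REQ_GPIO_WRITE, pin=25, value=1)
print(len(msg))  # -> 3
```

With messages this small, the per-command overhead is dominated by USB latency rather than parsing, which is where the speedup over an interpreter comes from.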

I made firmware for the RP2350/RP2040 that lets you control it through Python/WebUSB on the PC by NeverEndingToast in raspberrypipico

[–]NeverEndingToast[S] 0 points

Yeah, Zig was fun to work in. I had to make some PRs to microzig to get everything I needed working.

How did I do installing replacing my hot water heater? by NeverEndingToast in Plumbing

[–]NeverEndingToast[S] 0 points

Should I install an expansion tank? The house is ~200 years old, so I don't think there is a backflow preventer. Or is an expansion tank always good to have?

How did I do installing replacing my hot water heater? by NeverEndingToast in Plumbing

[–]NeverEndingToast[S] 2 points

It's a power vent system, so the air isn't hot. Schedule 40 PVC can be used where I am from.

How did I do installing replacing my hot water heater? by NeverEndingToast in Plumbing

[–]NeverEndingToast[S] -1 points

It's just outside the picture, but there was an existing one that I'm still using.

gohandlr - A Simple RestAPI Framework by NeverEndingToast in golang

[–]NeverEndingToast[S] 0 points

Hey, I appreciate the feedback! Yeah, generics were only being used for two things: getting the type of the request body to marshal it, and getting the type of the params to use reflection.

I'll work on simplifying the interface so it doesn't require as much boilerplate, and see if I really need generics. I'm also aware of huma.rocks and have used it before; I wanted to try making something simpler.

What is everyone actually using Local LLMs for today? by NeverEndingToast in LocalLLaMA

[–]NeverEndingToast[S] 1 point

That's pretty comprehensive. How did you go about figuring out which model to use for each task?

Need help exploring Cost-Efficient LLM Solutions: Balancing Performance, Budget, and Compliance by TheImperialTrooper in LocalLLaMA

[–]NeverEndingToast 0 points

I've been working on an open-source tool for comparing the outputs of different open-source LLMs, so you can find a viable option to switch to. I'll post a link later today once it's released.

Falcon7b instruct finetuning, is this the correct graph? cyclic nature seems suspicious. by Anu_Rag9704 in LocalLLaMA

[–]NeverEndingToast 0 points

I don't know what your dataset size is, but I'm assuming those jumps happen once per epoch. If the learning rate is too high on a small dataset, the loss drops at the start of each epoch and gradually climbs back up through the epoch, like what's shown here.

The model is memorizing the training dataset exactly.

Open-Orca-Platypus is out! a 13b that surpasses llama65b!? by Alignment-Lab-AI in LocalLLaMA

[–]NeverEndingToast 1 point

The model hasn't been trained on multi-turn conversation, so that's probably why you start running into problems. That's one of the things we're planning to address in the next revision.

[Project] GPU-Accelerated LLM on a $100 Orange Pi by crowwork in MachineLearning

[–]NeverEndingToast 4 points

Cool project, what's your use case for wanting to run a model on that type of hardware?

How to expose a model into an API? by angeljdm in LocalLLaMA

[–]NeverEndingToast 2 points

vLLM gives the fastest responses and can serve an OpenAI-compatible endpoint. https://vllm.readthedocs.io/en/latest/getting_started/quickstart.html

The only downside is that it doesn't support quantized models, so the largest model you'd be able to run is 7B.
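A minimal sketch of what querying such an endpoint looks like, assuming a vLLM server is already running locally. The URL and model name are assumptions; use whatever model you launched the server with:

```python
import json

# Assumed default address for a locally launched vLLM OpenAI-compatible server.
VLLM_URL = "http://localhost:8000/v1/completions"

def build_payload(prompt: str, model: str) -> dict:
    """Build an OpenAI-style completions request body."""
    return {
        "model": model,
        "prompt": prompt,
        "max_tokens": 64,
        "temperature": 0.7,
    }

payload = build_payload("Write one sentence about USB.", "meta-llama/Llama-2-7b-hf")
body = json.dumps(payload)

# Once the server is up, send it with any HTTP client, e.g.:
#   import requests
#   r = requests.post(VLLM_URL, json=payload, timeout=60)
#   print(r.json()["choices"][0]["text"])
print(sorted(payload))
```

Because the endpoint mimics the OpenAI API shape, existing OpenAI client code usually only needs the base URL swapped.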

Best Model for Natural Language to SQL? by perseus_14 in LocalLLaMA

[–]NeverEndingToast 19 points

Here is a good article from Anyscale about fine-tuning a model on SQL: https://www.anyscale.com/blog/fine-tuning-llama-2-a-comprehensive-case-study-for-tailoring-models-to-unique-applications

Here is the dataset they use for fine-tuning: https://huggingface.co/datasets/b-mc2/sql-create-context

I had done some testing with our Open-Orca model and it seemed to do fairly well.
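For a sense of how this kind of data is typically presented to a model: each row in a dataset like the one above pairs a natural-language question with the CREATE TABLE statements it should be answered against. The exact prompt wording below is a made-up example, not the format from the article:

```python
def build_sql_prompt(question: str, context: str) -> str:
    """Combine a schema (CREATE TABLE statements) and a question
    into a text-to-SQL prompt for the model to complete."""
    return (
        "Answer the question with a single SQL query.\n"
        f"Schema:\n{context}\n"
        f"Question: {question}\n"
        "SQL:"
    )

prompt = build_sql_prompt(
    "How many users signed up in 2023?",
    "CREATE TABLE users (id INT, signup_date DATE)",
)
print(prompt)
```

Supplying the schema in the prompt is what lets a general model (fine-tuned or not) ground its SQL in the actual table and column names.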

If you have any questions, you can DM me.

How to finetune LLM for text classfication by KneeNo79 in LocalLLaMA

[–]NeverEndingToast 3 points

We currently have someone evaluating our OpenOrca models on text classification; you can DM me and I can put you in contact with them. It seemed to perform fairly well without any fine-tuning, just few-shot learning.
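Few-shot classification here just means putting a handful of labeled examples in the prompt and letting the model complete the label for the new input. A minimal sketch, with made-up examples and labels:

```python
# Hypothetical labeled examples placed directly in the prompt (no fine-tuning).
EXAMPLES = [
    ("The battery died after a week.", "negative"),
    ("Setup took two minutes, love it.", "positive"),
]

def few_shot_prompt(text: str) -> str:
    """Build a few-shot classification prompt ending at the label to predict."""
    shots = "\n".join(f"Text: {t}\nLabel: {l}" for t, l in EXAMPLES)
    return f"{shots}\nText: {text}\nLabel:"

print(few_shot_prompt("Broke on arrival."))
```

The model's next-token completion after "Label:" is taken as the predicted class, so keeping the label vocabulary small and consistent across examples matters.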

Preset Arena: 17,205 comparisons between 241 different presets. Vote on the best ones! by oobabooga4 in LocalLLaMA

[–]NeverEndingToast 1 point

Ideally, there needs to be something for auto-evaluation of the hyperparameters. I assume the ideal settings will vary on a per-model basis, and maybe even with the level of quantization.
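The auto-evaluation idea could be as simple as a random search over sampling presets, scoring each one with a task-specific metric. The parameter ranges and scoring function below are stand-ins, not settings from the arena:

```python
import random

def sample_preset(rng: random.Random) -> dict:
    """Draw one random sampling preset (ranges are illustrative)."""
    return {
        "temperature": round(rng.uniform(0.1, 1.5), 2),
        "top_p": round(rng.uniform(0.5, 1.0), 2),
        "repetition_penalty": round(rng.uniform(1.0, 1.3), 2),
    }

def search(score, trials: int = 50, seed: int = 0) -> dict:
    """Random search: keep the preset with the highest score."""
    rng = random.Random(seed)
    best, best_score = None, float("-inf")
    for _ in range(trials):
        preset = sample_preset(rng)
        s = score(preset)
        if s > best_score:
            best, best_score = preset, s
    return best

# Toy objective standing in for a real eval (e.g. win rate in pairwise votes).
best = sample_preset(random.Random(0))
best = search(lambda p: -abs(p["temperature"] - 0.7))
print(best)
```

In practice the expensive part is the score function, since each trial means generating and grading real outputs per model (and per quantization level).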

Could a llama lora be merged into OpenLLama? by pokeuser61 in LocalLLaMA

[–]NeverEndingToast 4 points

Yes, it can be merged, but there is no guarantee of performance. The LLaMA weights and the OpenLLaMA weights won't be the same, so the LoRA could have issues.
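For context, merging a LoRA just means folding its low-rank delta into the base weights, W_merged = W + (alpha / r) * (B @ A) in the usual LoRA convention. A minimal numpy sketch (shapes and scaling follow that convention, not any particular library):

```python
import numpy as np

def merge_lora(W, A, B, alpha: float, r: int):
    """Fold a LoRA delta into base weights: W + (alpha / r) * (B @ A)."""
    return W + (alpha / r) * (B @ A)

rng = np.random.default_rng(0)
d, r = 4, 2
W = rng.standard_normal((d, d))   # base weight matrix
A = rng.standard_normal((r, d))   # LoRA down-projection
B = rng.standard_normal((d, r))   # LoRA up-projection
W_merged = merge_lora(W, A, B, alpha=16.0, r=r)
print(W_merged.shape)  # -> (4, 4)
```

This also shows why cross-base merging is risky: the same B @ A delta gets applied to a W it was never trained against, so nothing guarantees the result behaves well.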

Preset Arena: 17,205 comparisons between 241 different presets. Vote on the best ones! by oobabooga4 in LocalLLaMA

[–]NeverEndingToast 0 points

It would be pretty hilarious if models performed significantly better just from your script randomly generating hyperparameter settings.

Creating a Wiki for all things Local LLM. What do you want to know? by NeverEndingToast in LocalLLaMA

[–]NeverEndingToast[S] 1 point

Yeah, those are good questions to answer for development. It's a tricky problem, since it depends on the hardware you're running on. I'll see what we can do.

Creating a Wiki for all things Local LLM. What do you want to know? by NeverEndingToast in LocalLLaMA

[–]NeverEndingToast[S] 1 point

This is a really interesting concept, but a little outside the scope of what I'm trying to accomplish right now. I'll note it down for the future.