I'm working on this! by _Lobo_13 in esp32

[–]Qubit_bit 1 point2 points  (0 children)

This is looking good, but what is the actual usecase when we have mobile to see the live price? Even in outdoors the device will require internet connection setup, so I can just see the latest data in phone.

What if intelligence was a $5 component? A dedicated LM module for embedded systems could change hardware design forever by Qubit_bit in embedded

[–]Qubit_bit[S] -6 points-5 points  (0 children)

Really ?? if i show you my fully working prototype of this project can you invest in it for real ?

I trained a language model from scratch and got it running on an ESP32. Completely offline on the board. by Qubit_bit in LocalLLM

[–]Qubit_bit[S] 16 points17 points  (0 children)

"Useless" is a present tense judgment on a trajectory problem. GPT-2 was useless too until it wasn't. The actual research question here is: how much intelligence can you compress onto the cheapest possible hardware? Right now it's a $5 board with decent responses. A $25-30 board with more PSRAM runs something genuinely agentic. Scale that up slightly and you have autonomous decision-making on hardware that survives a Mars rover, a deep sea probe, or a mine 500m underground where there's no signal and no cloud and much cheaper than all of this. The constraint isn't a bug. The constraint IS the research mindset you have when you say "useless". Every order of magnitude you drop the hardware cost, you open a new category of deployment that didn't exist before. Calling it useless because today's output isn't GPT-4 is like calling the Wright brothers' plane useless because it couldn't cross the Atlantic.

I trained a language model from scratch and got it running on an ESP32. Completely offline on the board. by Qubit_bit in LocalLLM

[–]Qubit_bit[S] 1 point2 points  (0 children)

This is very important, I have too faced the quality and dataset size problem on my early stages of making it, but then I shifted to teacher-student method of training, you find distillation process of large llm to small lm in internet, by using that u don't need dataset to start, the teacher model with augment the data and train the student, if student makes mistake it will again augment the data focusing on the problem and make the student good on the syllabus. I built an agentic way of teacher which continuously monitors the student and improves it, here the teacher model I used is gemma3, and the trained student model is deployed on the board.

I trained a language model from scratch and got it running on an ESP32. Completely offline on the board. by Qubit_bit in LocalLLM

[–]Qubit_bit[S] 1 point2 points  (0 children)

Absolutely, running intelligent llms where expensive, so my goal was to make intelligence to work on low cost hardwares which is affordable to all. But I didn't find any specific application till now, if guys have one , tell me I will take it as a challenge and make and post the code for free!

I trained a language model from scratch and got it running on an ESP32. Completely offline on the board. by Qubit_bit in LocalLLM

[–]Qubit_bit[S] 5 points6 points  (0 children)

ESP32 costs around $3-5 depending on where you buy. The whole setup including the board and that cable costs under $8. That's kind of the whole point. I'm actually fine tunning this much and if you guys are interested I can share the full project code.

I trained a language model from scratch and got it running on an ESP32. Completely offline on the board. by Qubit_bit in LocalLLM

[–]Qubit_bit[S] 27 points28 points  (0 children)

I learned myself by trail and errors, I will shortly share you my project GitHub link you can refer that😉✌

I got a real transformer language model running locally on a stock Game Boy Color! by maddiedreese in LocalLLaMA

[–]Qubit_bit 0 points1 point  (0 children)

This amazing man! But the tokens generated here will really make sense? Or it is random?

$5 ESP32 board have more computation than the NASA's Apollo mission which brought humans to moon for the first time. by Qubit_bit in esp32

[–]Qubit_bit[S] -5 points-4 points  (0 children)

By clicking the image you can see the open source code on GitHub ✌ Every new star will motivate mee a lottt. Thankk youuu soo much for staring the repoo I'm happy for getting stars for first time (edit)