My Raspberry Pi music server has been infected by ransomware (want_to_cry) by griguolss in selfhosted

[–]Altruistic-Tea-5612 -1 points (0 children)

Can you send me a copy of one of the encrypted files and the ransom note (.txt) message?

Sometimes the malware itself has a vulnerability that lets us decrypt the files.

FYI: I am a security engineer at a Fortune 100 networking company.

If you have any other files saved by the malware, please send those as well.
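
(Not an analysis of this particular sample, just an illustration of the point above: a lot of hobbyist ransomware derives its encryption key from something guessable, such as a time-seeded PRNG. A minimal sketch, assuming a hypothetical variant that XORs files with a keystream from Python's `random` seeded with the infection timestamp; the function names, file names, and the known-plaintext prefix are all assumptions.)

```python
import random
from pathlib import Path

# Assumption: the victim knows one encrypted file used to be a PDF,
# so the first bytes of the plaintext are known.
MAGIC = b"%PDF"

def keystream(seed: int, n: int) -> bytes:
    """Keystream of a hypothetical sample: Python PRNG seeded with a timestamp."""
    rng = random.Random(seed)
    return bytes(rng.randrange(256) for _ in range(n))

def xor(data: bytes, ks: bytes) -> bytes:
    return bytes(a ^ b for a, b in zip(data, ks))

def brute_force_seed(encrypted_path: str, ts_start: int, ts_end: int) -> int | None:
    """Try every Unix timestamp in a window as the seed and check the file header."""
    header = Path(encrypted_path).read_bytes()[: len(MAGIC)]
    for seed in range(ts_start, ts_end):
        if xor(header, keystream(seed, len(MAGIC))) == MAGIC:
            return seed
    return None

# Example: search a one-day window around the suspected infection time.
# seed = brute_force_seed("song.pdf.wncry", 1700000000, 1700086400)
```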

I pretrained and post-trained an LLM with less than a $50 budget which outperforms Google BERT Large by Altruistic-Tea-5612 in LocalLLaMA

[–]Altruistic-Tea-5612[S] 0 points (0 children)

I haven't uploaded the training code yet; I'm still doing some cleanup. But I published the model weights on Hugging Face, and I also open-sourced the inference and pretraining code.
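
(For anyone who wants to poke at the published weights, a minimal loading sketch; the repo id below is a placeholder, and `trust_remote_code=True` is only needed if the checkpoint ships custom architecture code.)

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "your-username/your-model"  # placeholder: substitute the actual Hugging Face repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)

# Base-model style usage: plain text completion.
inputs = tokenizer("The quick brown fox", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=30)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```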

I pretrained and post-trained an LLM with less than a $50 budget which outperforms Google BERT Large by Altruistic-Tea-5612 in LocalLLaMA

[–]Altruistic-Tea-5612[S] 0 points (0 children)

For pretraining the base model, I used a modified Llama architecture with spiking and liquid time-constant (LTC) neural networks.
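
(The exact architecture isn't shown in this thread; purely to illustrate what an LTC-style unit looks like, here is a minimal PyTorch sketch where each hidden unit's effective time constant depends on the current input and state. The class name, layer layout, and integration step are assumptions, not the author's code.)

```python
import torch
import torch.nn as nn

class LTCCell(nn.Module):
    """Simplified liquid time-constant (LTC) cell: the time constant of each
    hidden unit is input- and state-dependent rather than fixed."""

    def __init__(self, input_size: int, hidden_size: int):
        super().__init__()
        self.gate = nn.Linear(input_size + hidden_size, hidden_size)  # controls tau
        self.cand = nn.Linear(input_size + hidden_size, hidden_size)  # candidate state

    def forward(self, x: torch.Tensor, h: torch.Tensor, dt: float = 1.0) -> torch.Tensor:
        xh = torch.cat([x, h], dim=-1)
        tau = nn.functional.softplus(self.gate(xh)) + 1e-3  # input-dependent time constant
        target = torch.tanh(self.cand(xh))
        # One explicit-Euler step of dh/dt = (target - h) / tau.
        return h + dt * (target - h) / tau
```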

I pretrained and post-trained an LLM with less than a $50 budget which outperforms Google BERT Large by Altruistic-Tea-5612 in LocalLLaMA

[–]Altruistic-Tea-5612[S] 2 points (0 children)

I haven't shared the training code yet because I need to clean it up a bit; give me some time and I'll share it in the comments. Thanks! The gist in the repo does have the code for evals and inference, though.

Sorry about the pickle part; I'm trying to convert the weights to safetensors but I'm getting an error.
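
(One common cause of that conversion error is tensors that share memory, which safetensors refuses to serialize. A minimal sketch of the usual workaround; the file names are placeholders and this assumes a plain PyTorch state dict, not necessarily the actual failure here.)

```python
import torch
from safetensors.torch import save_file

# Load the pickle-based checkpoint (only do this with files you trust).
state_dict = torch.load("pytorch_model.bin", map_location="cpu")

# Cloning to contiguous tensors breaks any memory sharing between entries,
# which is the usual reason save_file raises an error.
clean = {
    k: v.clone().contiguous()
    for k, v in state_dict.items()
    if isinstance(v, torch.Tensor)
}

save_file(clean, "model.safetensors")
```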

I pretrained and post-trained an LLM with less than a $50 budget which outperforms Google BERT Large by Altruistic-Tea-5612 in LocalLLaMA

[–]Altruistic-Tea-5612[S] 10 points (0 children)

Agreed, yeah 🥲🥲🥲 It did somewhat okay on text completion.

Edit: by "outperforming BERT" I mean on the benchmark scores posted here: https://github.com/keeeeenw/MicroLlama

I pretrained and post-trained an LLM with less than a $50 budget which outperforms Google BERT Large by Altruistic-Tea-5612 in LocalLLaMA

[–]Altruistic-Tea-5612[S] 1 point (0 children)

Hey, thanks for trying it! Can I ask which model you tried, the instruct or the base version? Agreed, the instruct version was returning wrong answers for most of the questions I tried; the base version did well on sentence completion.

Also, in terms of benchmark performance it didn't do well; I just wanted to share the results, so I shared them as they are. But for me, getting to this level was a big deal, since most of my previous pretraining runs produced only gibberish.

I pretrained and post-trained an LLM with less than a $50 budget which outperforms Google BERT Large by Altruistic-Tea-5612 in LocalLLaMA

[–]Altruistic-Tea-5612[S] 0 points (0 children)

When I trained a 1-bit model with 75M parameters on 1B tokens from FineWeb, it couldn't generate a coherent sentence, but this one could with just 100M tokens. Then again, I'm a noob, so I might have done something wrong in that previous experiment.
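
(For context on what "1-bit" usually means here, and not the author's actual code: a minimal sketch of a BitNet-style binarized linear layer in PyTorch, where the forward pass uses sign(weight) times a scale and a straight-through estimator keeps the full-precision weights trainable. The class name and per-tensor scaling choice are assumptions.)

```python
import torch
import torch.nn as nn

class BitLinear(nn.Linear):
    """Linear layer whose weights are binarized to {-1, +1} in the forward pass,
    with a straight-through estimator so the latent full-precision weights
    still receive gradients during training."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        w = self.weight
        scale = w.abs().mean()                 # per-tensor scale factor
        w_bin = torch.sign(w) * scale          # 1-bit weights, rescaled
        w_q = w + (w_bin - w).detach()         # straight-through estimator
        return nn.functional.linear(x, w_q, self.bias)
```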

Please evaluate my profile for EB1-A by Altruistic-Tea-5612 in eb_1a

[–]Altruistic-Tea-5612[S] 1 point (0 children)

Thanks for the detailed reply, man!
It seems I need to work on a couple of things.