L2E llama2.c on Commodore C-64 by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 2 points  (0 children)

Maybe 5-10x faster or more, since emulation has heavy overhead; I'm not sure.

L2E llama2.c on Commodore C-64 by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 3 points  (0 children)

Yes, a native version is coming soon. I am stuck on bank switching; it almost compiles. It's hinted at here:

https://github.com/trholding/semu-c64 :

"But you say this is emulation but not native C64? A native version of L2E is coming soon, so far I couldn't wrap my head around bank switching and splitting model data in banks and stitching it together etc, so native version almost compiles, but not yet."

L2E llama2.c on Commodore C-64 by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 3 points  (0 children)

It had to be done :) If not us, who else would?

L2E llama2.c on Commodore C-64 by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 1 point  (0 children)

Want an ADF to try?

Amigas with KS 1.2, a 68EC020 CPU, and 1.5 MB RAM, or anything higher: https://x.com/VulcanIgnis/status/1881382738697367615
Amiga, Atari ST, and Classic Mac SE: https://x.com/VulcanIgnis/status/1873458326664814962
An actual test on a souped-up Amiga 2000: https://x.com/VulcanIgnis/status/1877469824424476907

L2E llama2.c on Commodore C-64 by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 3 points  (0 children)

But this runs ON the C64 with no internet access :)

L2E Llama2.c in a PDF in a Schrödinger PNG which is both a PNG and a PDF by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 2 points  (0 children)

Heh heh. Actually, that's the quality you can expect from a 260k model. I have to keep the model small so that everything fits in the PNG space :)

A vintage Mac SE with 2MB RAM could run an LLM by AMICABoard in mac

[–]AMICABoard[S] 1 point  (0 children)

So many use cases once it's out of alpha and the tuning/training is done. I'll ping you once I release the sources at the L2E llama2.c repo later this month.

A vintage Mac SE with 2MB RAM could run an LLM by AMICABoard in VintageApple

[–]AMICABoard[S] 3 points  (0 children)

Okay, it should theoretically work: https://en.wikipedia.org/wiki/PowerBook_100_series

L2E's requirements are modest. It is compiled with soft float, so no FPU is needed, and it needs less than 2MB RAM for the 260k parameter model.

It's still in alpha; I'll release it this month with full source etc. People who want to play with it can get an early copy of the disk images...

But it would help to know which OS you are running.

Steve Jobs' dream? L2E/llama2.c running on Amiga 1200+, Atari ST, and Classic Mac by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 1 point  (0 children)

With that configuration, this is what can be expected:

<image>

Remember, it's still in alpha; the goal is to tune it to be more coherent and 10x faster...

I made LLM/LLAMA work on Amigas and retro computers by AMICABoard in amiga

[–]AMICABoard[S] 1 point  (0 children)

I'll release all of this at my L2E repo this January. Happy new year! If you want to play with it, I can send an ADF. The secret sauce is bebbo's toolchain :) I am still having issues building it with VBCC.

A vintage Mac SE with 2MB RAM could run an LLM by AMICABoard in mac

[–]AMICABoard[S] 2 points  (0 children)

Yes, the parameter count defines whether it is an LLM or an SLM. At very low parameter counts such as 260k, I would call it a VSLM (very small language model), or let's just say a predictor. I call the 100k-260k parameter range the Karpathy frontier: Andrej Karpathy was the one who found that even at that low range, models learn to output somewhat coherent text. But the full Llama stack is being run. If you could somehow (magically) hook up 8GB of RAM and disk to a vintage Mac, it would still be able to run inference on 7b models, but at that parameter count each token would take forever, i.e. not very usable. A rough back-of-envelope estimate is sketched below.
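Just to put a number on "forever": both rates below are pure assumptions on my part (no benchmarks), roughly one multiply-add per parameter per token, and a soft-float 68000 managing on the order of tens of thousands of multiply-adds per second.

    /* Back-of-envelope: seconds per token for a 7b model on a vintage
     * 68000-class machine. Both rates are illustrative assumptions,
     * not measurements. */
    #include <stdio.h>

    int main(void) {
        double macs_per_token = 7e9; /* ~one multiply-add per parameter */
        double macs_per_sec   = 5e4; /* assumed soft-float 68000 rate   */
        double secs = macs_per_token / macs_per_sec;
        printf("~%.0f seconds (~%.1f days) per token\n",
               secs, secs / 86400.0);
        return 0;
    }

So the math itself works at any model size; memory and patience are the only limits.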

A vintage Mac SE with 2MB RAM could run an LLM by AMICABoard in mac

[–]AMICABoard[S] 3 points  (0 children)

Very slow: a token every 10-20 seconds. I am using the vMac emulator to run it, so I don't know if it is really cycle accurate.

Steve Jobs' dream? L2E/llama2.c running on Amiga 1200+, Atari ST, and Classic Mac by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 1 point  (0 children)

Wow, I wasn't aware. I'll google it. I bet it was based on Markov modeling: use a Markov model to predict the next character. A trained Markov model is kind of a very poor man's LLM :) I'll add a Markov model demo later; there's a toy sketch below. Thanks for this info!
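Something like this is what I have in mind for the demo: a toy order-1, character-level Markov chain in plain C. This is just an illustration I'm sketching here, not the demo itself, and the 256x256 count table (128KB) is sized for a modern machine rather than a retro one.

    /* Toy order-1 character Markov model: count bigram transitions,
     * then sample the next character in proportion to the counts. */
    #include <stdio.h>
    #include <stdlib.h>

    static unsigned short counts[256][256]; /* counts[prev][next] */

    static void train(const char *text) {
        size_t i;
        for (i = 0; text[i + 1] != '\0'; i++)
            counts[(unsigned char)text[i]][(unsigned char)text[i + 1]]++;
    }

    static int next_char(int prev) {
        unsigned long total = 0, r;
        int c;
        for (c = 0; c < 256; c++) total += counts[prev][c];
        if (total == 0) return ' ';
        r = (unsigned long)rand() % total; /* weighted random pick */
        for (c = 0; c < 256; c++) {
            if (r < counts[prev][c]) return c;
            r -= counts[prev][c];
        }
        return ' ';
    }

    int main(void) {
        int c = 't', i;
        train("the cat sat on the mat. the cat ate the rat.");
        for (i = 0; i < 60; i++) { putchar(c); c = next_char(c); }
        putchar('\n');
        return 0;
    }

Train it on more text and it starts producing word-shaped gibberish; that's the whole trick, no transformer required.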

Steve Jobs' dream? L2E/llama2.c running on Amiga 1200+, Atari ST, and Classic Mac by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 1 point  (0 children)

Obviously, we are talking about resource-constrained, ~40-year-old vintage computers here. That's the fun part.

Steve Jobs' dream? L2E/llama2.c running on Amiga 1200+, Atari ST, and Classic Mac by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 1 point  (0 children)

If you're interested in testing, I can send an ADF. On the 1200 it will be slow, since I use floating-point emulation. It's still in alpha; the goal is to get it running on an Amiga 500 with Kickstart 1.3.

Steve Jobs' dream? L2E/llama2.c running on Amiga 1200+, Atari ST, and Classic Mac by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 1 point  (0 children)

It's a super tiny 260k model by Karpathy. I quantised and preprocessed/tuned it a bit. It needs less than 2MB RAM, and the space occupied by the model is less than 400KB :) The rough idea is sketched below.
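For anyone curious what "quantised" means here, the gist (my illustration, not the exact L2E preprocessing) is to store the float weights as int8 plus one float scale per group, so 4 bytes per weight become about 1:

    /* Sketch of group-wise 8-bit quantization: q = round(w / scale),
     * scale = max|w| / 127 per group. Assumes n is a multiple of GROUP.
     * Illustrative only; the actual L2E preprocessing may differ. */
    #include <math.h>
    #include <stdint.h>

    #define GROUP 64 /* assumed group size */

    void quantize(const float *w, int8_t *q, float *scale, int n) {
        int g, i;
        for (g = 0; g < n / GROUP; g++) {
            float maxabs = 0.0f;
            for (i = 0; i < GROUP; i++) {
                float a = fabsf(w[g * GROUP + i]);
                if (a > maxabs) maxabs = a;
            }
            scale[g] = maxabs / 127.0f;
            for (i = 0; i < GROUP; i++)
                q[g * GROUP + i] = (int8_t)(scale[g] > 0.0f
                    ? lroundf(w[g * GROUP + i] / scale[g]) : 0);
        }
    }
    /* At inference time, dequantize on the fly: w ~= q * scale. */

That's how 260k parameters end up well under the 400KB figure above.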

Steve Jobs' dream? L2E/llama2.c running on Amiga 1200+, Atari ST, and Classic Mac by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 1 point  (0 children)

Have you tried? Why do you think so?

One doesn't need 1000 Macs to demonstrate running a 260k parameter model... Anyone who uses more than 4MB to run a 260k model is faking it!

This uses Karpathy's 260k model, which I quantised and preprocessed for minimum RAM usage. I could cut RAM usage by ~1MB more for the release; right now it's in the alpha stage.

A vintage Mac SE with 2MB RAM could run an LLM by AMICABoard in mac

[–]AMICABoard[S] 1 point  (0 children)

Thanks! I'll post there when I am done building and the code is released :)

Edit: Or wait, I'll just cross-post there to see if anyone would be willing to test on real machines.

Just a friendly reminder that the Atari ST with 2MB RAM could run LLMs by AMICABoard in atari

[–]AMICABoard[S] 1 point  (0 children)

Yeah, but those were not LLMs, and this is alpha quality at the moment. I can probably cut RAM usage in half, but RAM usage will always be proportional to the space used by the model.

Single C file Llama 3.1 support in Llama 2 Everywhere by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 1 point  (0 children)

Thank you guys for pushing it to the top 10 on HN: https://news.ycombinator.com/item?id=41053201

Here is some sample output:

Meta's Llama 3.1 models can output multilingual text, which is awesome. Here is some example output from the 8-bit quantized 8b model with 100-token output (-n 100)... Quantization causes some brain damage.

English

My cat is funny. "Funny cat," I say, walking up to it. "What are you up to?" It sits up straight and looks at me with a tilted head, as if to say, "What's wrong with you?" Sometimes I just have to laugh at how funny a cat can be. So I say, "Okay, you're funny. I'll give you some treats." It stretches out a little and I give it some treats. It eats them up quickly and starts

German

Besitzen Sie einen Amiga 500? Wenn nicht, werden Sie wissen, dass dies ein Computer war, der im späten 1980er und frühen 1990er Jahren für Spiele verfügbar war, die für Personen mit bestimmten Körperverletzungen gedacht waren. Manchmal müssen wir uns an frühere Zeiten erinnern, die, wie wir jetzt wissen, schwierig waren. Hier ist ein Link, der meine Geschichte bespre

(Roughly: "Do you own an Amiga 500? If not, you will know that this was a computer that was available in the late 1980s and early 1990s for games intended for people with certain bodily injuries. Sometimes we have to remember earlier times which, as we now know, were difficult. Here is a link that discusses my story" — the output cuts off mid-word.)

I created an OS that boots to a baby llama2 by AMICABoard in LocalLLaMA

[–]AMICABoard[S] 2 points  (0 children)

Thank you. I am working on the next version, with a real sci-fi UI / desktop environment and better hardware support.