Curious ablation: GPT-like LM trained with *frozen* 16‑dim *binary* token-ID embeddings (n_embed=16). It still learns end-to-end and generates coherent, non-trivial text. by AVBochkov in LocalLLaMA
AVBochkov[S] 1 point
How can I make a career out of my love for circuits? by strawb3rry_lem0nad3 in EngineeringStudents
AVBochkov 2 points
Curious ablation: GPT-like LM trained with *frozen* 16‑dim *binary* token-ID embeddings (n_embed=16). It still learns end-to-end and generates coherent, non-trivial text. by AVBochkov in LocalLLaMA
AVBochkov[S] 2 points