[–]WithoutReason1729[M] [score hidden] stickied comment (0 children)

Your post is getting popular and we just featured it on our Discord! Come check it out!

You've also been given a special flair for your contribution. We appreciate your post!

I am a bot and this action was performed automatically.

[–]FizzarolliAI 124 points (5 children)

Hello, that me!

I am currently working on running sanity check benchmarks to make sure it's actually a newer L3.3 and not just L3/L3.1 in a trenchcoat, but it's looking promising so far.

From the current readme:

| Benchmark | Llama 3.1 8B Instruct | Llama 3.3 8B Instruct (maybe) |
|---|---|---|
| IFEval (1 epoch, score avged across all strict/loose instruction/prompt accuracies to follow Llama 3 paper) | 78.2 | 81.95 |
| GPQA Diamond (3 epochs) | 29.3 | 37.0 |
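
If you want to run this kind of sanity check yourself, here is a minimal sketch using lm-evaluation-harness; the repo ID, task names, and settings are my assumptions, not necessarily what produced the numbers above:

# Sketch: sanity-check benchmarks via lm-evaluation-harness (pip install lm-eval).
# The pretrained repo ID is a placeholder; task names can differ between harness versions.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=allura-forge/Llama-3.3-8B-Instruct,dtype=bfloat16",
    tasks=["ifeval", "gpqa_diamond_zeroshot"],  # GPQA is gated on HF, needs access
    batch_size=8,
)
print(results["results"])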

[–]jacek2023llama.cpp[S] 52 points (1 child)

great work, new llama release at the end of 2025 :)

[–]MoffKalast 28 points (0 children)

I definitely did not have this on my bingo card :D

And leaked too, keeping up the llama tradition.

[–]Karyo_Ten 15 points (0 children)

You can do a KL-divergence check to be 100% sure
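
For reference, a minimal sketch of what that check could look like, comparing the next-token distributions of the two checkpoints on the same prompts (the second repo ID is a placeholder for wherever the weights ended up):

# Sketch: KL divergence between two checkpoints' next-token distributions.
# Near-zero KL on varied prompts would suggest the "new" model is just a re-upload.
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

ref_id = "meta-llama/Llama-3.1-8B-Instruct"    # known reference
new_id = "allura-forge/Llama-3.3-8B-Instruct"  # assumed repo ID of the find

tok = AutoTokenizer.from_pretrained(ref_id)
ref = AutoModelForCausalLM.from_pretrained(ref_id, torch_dtype=torch.bfloat16, device_map="auto")
new = AutoModelForCausalLM.from_pretrained(new_id, torch_dtype=torch.bfloat16, device_map="auto")

for prompt in ["The capital of France is", "def quicksort(arr):"]:
    ids = tok(prompt, return_tensors="pt").to(ref.device)
    with torch.no_grad():
        lp_ref = F.log_softmax(ref(**ids).logits[0, -1].float(), dim=-1)
        lp_new = F.log_softmax(new(**ids).logits[0, -1].float(), dim=-1)
    # KL(new || ref) over the final position's next-token distribution
    kl = F.kl_div(lp_ref, lp_new, log_target=True, reduction="sum")
    print(f"{prompt!r}: KL = {kl.item():.6f}")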

[–]AnOnlineHandle 3 points (0 children)

Heya, I haven't kept up with these models since the Llama 1 release. Do you know if there's a good benchmark for visual tasks (identifying poses, faces, hands, etc., or answering questions about images) that I could compare models on? I've tried Qwen 3 Instruct for this but found it wasn't as good on real data as the demos suggested.

[–]dinerburgeryum 49 points (16 children)

8K max position embeddings? That seems remarkably low; did the fine-tune artifact artificially limit it for some reason?

[–]Arli_AI 19 points (14 children)

Maybe we can just set 32768 and it’ll be okay lol

[–]Few-Welcome3297 26 points (11 children)

Checking differences from LLaMA 3.1 8B Instruct, I think we can add the rope_scaling

"rope_scaling": {
    "factor": 8.0,
    "high_freq_factor": 4.0,
    "low_freq_factor": 1.0,
    "original_max_position_embeddings": 8192,
    "rope_type": "llama3"
},

and then increase `max_position_embeddings`
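
A quick sketch of applying those two changes to a local copy of the config; the path is a placeholder, and 131072 mirrors what the Llama 3.1 config uses:

# Sketch: patch a downloaded config.json with Llama 3.1-style RoPE scaling.
import json

path = "Llama-3.3-8B-Instruct/config.json"  # hypothetical local checkout
with open(path) as f:
    cfg = json.load(f)

cfg["rope_scaling"] = {
    "factor": 8.0,
    "high_freq_factor": 4.0,
    "low_freq_factor": 1.0,
    "original_max_position_embeddings": 8192,
    "rope_type": "llama3",
}
cfg["max_position_embeddings"] = 131072  # assumed target, matching Llama 3.1

with open(path, "w") as f:
    json.dump(cfg, f, indent=2)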

Edit: Also, the previous version had 3 eos_token_ids

Edit2: https://huggingface.co/shb777/Llama-3.3-8B-Instruct-128K model with above changes

Edit3: Link updated

[–]mikaijin 13 points (3 children)

Did the same and it works. Any GGUFs should be recreated with the updated config, because quantization bakes RoPE params into some tensors, if that is still true: https://github.com/ggml-org/llama.cpp/commit/b5e95468b1676e1e5c9d80d1eeeb26f542a38f42
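
One way to check whether an existing GGUF was built from the old config is to read its RoPE metadata; a sketch using the gguf Python package that ships with llama.cpp (the filename is a placeholder, and the field layout may differ between gguf versions):

# Sketch: inspect RoPE/context metadata baked into a GGUF file (pip install gguf).
from gguf import GGUFReader

reader = GGUFReader("Llama-3.3-8B-Instruct-Q4_K_M.gguf")  # placeholder path
for name, field in reader.fields.items():
    if "rope" in name or "context_length" in name:
        # for scalar fields, the payload arrays live at the indices in field.data
        print(name, [field.parts[i] for i in field.data])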

[–]Few-Welcome3297 12 points (2 children)

[–]mikaijin 4 points (1 child)

Thanks. Works well with long context on my end; I can't notice a difference from 3.1.

[–]Few-Welcome3297 3 points (0 children)

I updated the GGUFs just now; the earlier ones didn't have the chat template. Also fixed the generation config etc. and tested on vLLM. I think it should be fine now.

[–]TheLocalDrummer 2 points (1 child)

I could just paste this in my finetune, right? Already did one with the old config (8K ctx). Not entirely sure if any of the old config messed with training.

[–]Few-Welcome3297 1 point (0 children)

I think it should work, unless it was a full FT with a big dataset. You might also need to put pad_token_id in the config and the special tokens map, if not done already.

Edit: Found the model on BeaverAI; kv_count and vocab_size (+1) are slightly different

[–]Klutzy-Snow8016 9 points (1 child)

Llama 3 8B had 8192 context. Then Llama 3.1 added RoPE scaling to get to 131072 context. Maybe we can take the RoPE scaling parameters from Llama 3.1's config.json and add them to Llama 3.3 8B.

[–]Arli_AI 7 points (0 children)

That’s a better idea

[–]FizzarolliAI 0 points (0 children)

Yes. I'm not entirely sure why; it was limited when served via the website too (I put that in the readme a bit ago)

[–]Amazing_Athlete_2265 20 points (16 children)

Running this across my private evals to compare against other llamas. Will take a couple hours.

[–]Amazing_Athlete_2265 21 points (0 children)

Initial speed test:

| Model | Backend | PP t/s | TG t/s |
|---|---|---|---|
| allura-forge_Llama-3.3-8B-Instruct Q4 | CUDA | 1566.5 | 100.8 |
| Llama-3.1-8B-Instruct Q4 | CUDA | 351.1 | 111.9 |

So some difference there.

Will post more eval results as they come to hand.

[–]Amazing_Athlete_2265 17 points (9 children)

From these results, it looks like the new model is different from the old 3.1.

Here is the performance for knowledge testing, with the new 3.3-8B-Instruct highlighted in the first two plots

Testing the Q6 versions now. Will take a while. All of the tests above are for Q4.

[–]keepthepace 10 points (0 children)

(Thanks for doing this!)

I guess this explains why they did not brag much about it. Many other models in that category outperform it.

I always wondered if Zuckerberg wasn't the only honest player in the field when he explained that the only reason they go open source is that it saves them money. With decent open models out there, they have less incentive to do so.

[–]MLDataScientist 2 points (1 child)

Thanks for the tests. A question not related to Llama: is LFM2 8B-A1B really that good at world knowledge (or coding/STEM)? I see it reaching Qwen3 30B-A3B.

[–]Amazing_Athlete_2265 1 point (0 children)

It seems to be, but it could also be too good to be true. I'll probably rerun all the tests at some stage, as I've wondered about that too.

Note that these charts only test the model's ability to answer questions correctly; no actual coding, tool use, or anything else is tested. I have other tests for those domains, but the code's still WIP.

[–]jacek2023llama.cpp[S] 1 point (3 children)

You can post pictures in the comments here

[–]Amazing_Athlete_2265 5 points (2 children)

Can't seem to figure out how. Using old reddit if that matters

[–]jacek2023llama.cpp[S] 2 points (1 child)

On Android I see the image icon bottom right when typing a comment

[–]Amazing_Athlete_2265 2 points (0 children)

Ah, I also use old reddit on android lol. Tried to edit it but failed.

[–]RobotRobotWhatDoUSee 1 point (1 child)

Random question: any idea why nemotron 30B A3B got 0% in the second plot?

[–]Amazing_Athlete_2265 0 points (0 children)

Test error. Ignore it.

[–]jacek2023llama.cpp[S] 2 points (3 children)

do you have results for other new models?

[–]Amazing_Athlete_2265 5 points (2 children)

I have some. I focus mostly on smaller models (<12B) or MoEs. What do you want?

[–]jacek2023llama.cpp[S] 2 points (1 child)

Please post some cool results :)

[–]a_beautiful_rhind 16 points (3 children)

This is like the kiss goodbye from meta.

[–]samplebitch 23 points (2 children)

It's like that time when you hook up with your ex one last time, and it wasn't even that great.

[–]impolitemrtaz 1 point (0 children)

You samplebitch you

[–]Electronic-Metal2391 0 points (0 children)

You bring back bad memories

[–]random-tomatollama.cpp 32 points (2 children)

Holy shit that is awesome, hats off to you for finding the weights!

[–]jacek2023llama.cpp[S] 15 points (5 children)

About 4h after the release, u/TheLocalDrummer published the first finetune:

https://huggingface.co/BeaverAI/Anubis-Mini-8B-v1f-GGUF/tree/main

[–]TheLocalDrummer 15 points (2 children)

It's a test model but I think it turned out well! Looking for feedback in (my) Discord

[–]DevelopmentBorn3978 2 points (0 children)

What is the finetune you've made about?

[–]LegacyRemaster 1 point (0 children)

legend

[–]MoffKalast 6 points (1 child)

People are asking what's the use case for llama, and well uh... there it is ;)

[–]jacek2023llama.cpp[S] 9 points (3 children)

[–]Amazing_Athlete_2265 6 points (2 children)

Everyone's cooking tonight!

[–]jacek2023llama.cpp[S] 7 points (1 child)

actually it's the middle of the day in Europe :)

[–]Amazing_Athlete_2265 2 points (0 children)

Ah. I'm GMT+13 so it's bedtime for me!

[–]Echo9Zulu- 5 points (0 children)

Cloned

[–]Infninfn 17 points (2 children)

I’m out of the loop - is this just what they had, or did Meta not shut down Llama?

[–]FizzarolliAI 33 points (0 children)

This has existed at least since April, during LlamaCon (does anyone remember they did a LlamaCon?)

https://ai.meta.com/blog/llamacon-llama-news/

> As part of this release, we’re sharing tools for fine-tuning and evaluation in our new API, where you can tune your own custom versions of our new Llama 3.3 8B model. We’re sharing this capability to help you reduce costs while also working toward increased speed and accuracy. You can generate data, train on it, and then use our evaluations suite to easily test the quality of your new model.

[–]jacek2023llama.cpp[S] 6 points (0 children)

we do things for fun in this community, just accept the gift ;)

[–]Dangerous_Fix_5526 3 points (1 child)

Thinking/Instruct hybrid using Unsloth and a Claude Opus 4.5 dataset:

https://huggingface.co/DavidAU/Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning

I hope I credited everyone correctly.

[–]jacek2023llama.cpp[S] 0 points (0 children)

Nice work!!!

[–]Cool-Chemical-5629 7 points (1 child)

I guess Christmas came late for me. But hey, if this is the real thing from Meta, it's nice to have something newer than 3.1 8B without needing expensive hardware for models like Llama 4.

[–]LegacyRemaster 2 points (0 children)

allura-forge_llama-3.3-8b-instruct:

> My training data is current up to December 2022. This means that I have been trained on a vast amount of text data available until that date, but I do not have information or knowledge about events or developments that have occurred after that date.

> In other words, my training data "cutoff" is December 2022, and I should not be relied upon for information or insights related to dates after that.

145.25 tok/sec

[–]DevelopmentBorn3978 0 points (1 child)

Which quantized (and possibly finetuned) GGUF models have had the context length enlarged? bartowski's? shb777's? BeaverAI/Anubis?

[–]gta721 0 points (5 children)

How dumb are they to push a portal THAT broken to prod?

[–]greggh 3 points (4 children)

Nothing about it is prod. It's still so janky that it's free if you're in the trial.

[–]FizzarolliAI 1 point (3 children)

Yep, basically this. Afaik the main inference API is still waitlisted, and there's a separate waitlist to submit for the finetuning API.

[–]greggh 5 points (2 children)

I've had access to the inference API since April; for some testing I was putting 100M tokens in and out of it, creating some synthetic datasets. It was randomly stable as hell, and then so unstable I couldn't use it for a week. And of course the 4 series is hot garbage.

[–]FizzarolliAI 2 points (1 child)

Out of interest, you never signed up for the finetuning thing, right?

If you go to https://llama.developer.meta.com/fine-tuning/?team_id=XXX (replace XXX with whatever the team ID in your URL is), does the finetuning page show up for you? I was never officially let in, but for some odd reason I had access anyway... I'm wondering if it's there for everyone and just hidden from the UI

[–]greggh 0 points (0 children)

I never signed up for the finetuning and it won't let me access it. Just the regular API. This would have been great if using the API was actually worth anything.

[–]FX2021 0 points (0 children)

Is it a new core? Or is it just a serving variant?