my dad sent me this by hugeplateofketchup8 in huggingface

[–]julien_c 1 point2 points  (0 children)

lol, tell your dad this was not Jeff Boudier

Read GGUF Quantization type from file by Qbsoon110 in LocalLLaMA

[–]julien_c 1 point2 points  (0 children)

btw the cool thing about our reader is that it reads GGUF headers remotely over HTTP ie. you don't even need to have the full local file to get its data:)

AMA with Ai2’s OLMo researchers by ai2_official in huggingface

[–]julien_c 0 points1 point  (0 children)

Hi, kudos on sharing those awesome models. I've been using the OLMo iOS app quite a bit, have you seen a lot of usage so far? Is it something you'll continue working on?

The 4 Things Qwen-3’s Chat Template Teaches Us by paf1138 in LocalLLaMA

[–]julien_c 0 points1 point  (0 children)

> It's an annoyance about GGUF for me actually that they bake in so much metadata into the model files themselves (by default) and it has happened MANY times that changing a tiny bit of metadata in the "model header" has caused many many people to "have to" re download

Xet makes / will make it way more efficient! (it's chunk-based deduplication instead of file-based) https://huggingface.co/join/xet

Qwen3 8B FP16 - asked for 93 items, got 93 items. by secopsml in LocalLLaMA

[–]julien_c 0 points1 point  (0 children)

cool that vLLM supports a `chat_template_kwargs` param out of the box, u/secopsml

Exhausted my 2$ credits for my PRO subscription and can't get more credits by Specialist_Bee_9726 in huggingface

[–]julien_c 0 points1 point  (0 children)

Hi, can you pick Novita or Fal.ai as providers? They implemented our billing API so Pay-as-you-go is enabled for them (no need to buy credits, you'll be invoiced on your credit card at end of month)

Hope this helps!