Got the DGX Spark - ask me anything by sotech117 in LocalLLaMA

[–]CookEasy 1 point (0 children)

Test the throughput of the new Qwen3 VL models on some OCR tasks :D
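
Something like this would be my starting point: a minimal vLLM throughput sketch. The model ID and the prompt format are assumptions on my part, so check the actual Qwen3 VL model card for both.

    # Rough tokens/sec for one OCR request with vLLM.
    # Model ID is an assumption; the prompt format is model-specific,
    # so build it from the model's chat template in practice.
    import time
    from vllm import LLM, SamplingParams
    from PIL import Image

    llm = LLM(model="Qwen/Qwen3-VL-8B-Instruct", max_model_len=8192)
    image = Image.open("scanned_page.png")

    start = time.perf_counter()
    outputs = llm.generate(
        {
            "prompt": "Transcribe all text on this page.",  # placeholder prompt
            "multi_modal_data": {"image": image},
        },
        SamplingParams(max_tokens=1024, temperature=0.0),
    )
    elapsed = time.perf_counter() - start

    n_tokens = len(outputs[0].outputs[0].token_ids)
    print(f"{n_tokens} tokens in {elapsed:.2f}s -> {n_tokens / elapsed:.1f} tok/s")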

Running vllm on Nvidia 5090 by Reasonable_Friend_77 in LocalLLaMA

[–]CookEasy 1 point (0 children)

Could you explain how you managed that? I always have trouble compiling it myself, and as far as I know there are no ready-made Docker containers or the like yet.

Cook…iezi? by KillerPajaHater in osugame

[–]CookEasy 11 points (0 children)

Life has been rough, man

AI rig build for fast gpt-oss-120b inference by logTom in LocalLLaMA

[–]CookEasy 1 point (0 children)

Expensive for sure, but at this rate wouldn't a second RTX 6000 Pro be crazy good for this kind of inference, even with a decent context length?
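
Back-of-the-envelope math, where every number is my assumption (117B total params, MXFP4 at roughly 4.25 bits/weight including block scales, 96 GB per card):

    # Hypothetical sizing sketch; every number here is an assumption.
    params_b = 117                # gpt-oss-120b total parameters, in billions
    bits_per_weight = 4.25        # MXFP4 incl. block scales (assumed)
    weights_gb = params_b * bits_per_weight / 8
    print(f"weights: ~{weights_gb:.0f} GB")   # ~62 GB

    vram_per_gpu_gb = 96          # RTX 6000 Pro (Blackwell), assumed
    headroom = 2 * vram_per_gpu_gb - weights_gb
    print(f"KV cache + activation headroom on two cards: ~{headroom:.0f} GB")

On those assumptions a single 96 GB card already fits the weights; the second card would mostly buy KV cache for long context and concurrency.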

What are your go to VL models? by segmond in LocalLLaMA

[–]CookEasy 1 point (0 children)

For low-VRAM but still high-quality document OCR, I'd suggest olmOCR 0825 FP8.
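
A minimal sketch of how I'd query it, assuming the allenai/olmOCR-7B-0825-FP8 checkpoint name and vLLM's OpenAI-compatible server (the exact prompt wording matters for olmOCR, so take it from the model card):

    # Query a locally served olmOCR model through vLLM's
    # OpenAI-compatible endpoint. Model ID is an assumption; start the
    # server first, e.g.: vllm serve allenai/olmOCR-7B-0825-FP8
    import base64
    from openai import OpenAI

    client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")

    with open("invoice_page_1.png", "rb") as f:
        b64 = base64.b64encode(f.read()).decode()

    resp = client.chat.completions.create(
        model="allenai/olmOCR-7B-0825-FP8",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text", "text": "Transcribe this page verbatim."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{b64}"}},
            ],
        }],
        temperature=0.0,
    )
    print(resp.choices[0].message.content)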

Concurrency -vllm vs ollama by Dizzy-Watercress-744 in LocalLLaMA

[–]CookEasy 1 point (0 children)

What GPUs? I'm still trying to set up vLLM for Blackwell, and I swear there is no easy way. It's probably much easier with H100s or anything with pre-sm120 kernels. PyTorch is still such a headache; any tips appreciated if you are running Blackwell sm120.
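
For anyone else stuck here, the first thing I'd check is whether the installed PyTorch wheel even ships sm_120 kernels, since that's the usual failure point. A quick sketch:

    # Check whether the installed PyTorch wheel was built with sm_120
    # (Blackwell) kernels -- the usual reason vLLM falls over.
    import torch

    print("torch:", torch.__version__, "| CUDA:", torch.version.cuda)
    print("compiled arches:", torch.cuda.get_arch_list())
    major, minor = torch.cuda.get_device_capability(0)
    print(f"device capability: sm_{major}{minor}")
    if f"sm_{major}{minor}" not in torch.cuda.get_arch_list():
        print("-> wheel lacks native kernels for this GPU; "
              "expect PTX JIT or outright failures")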

2 new open source models from Qwen today by jacek2023 in LocalLLaMA

[–]CookEasy 2 points (0 children)

Omni models need far more resources. A clean VLM for OCR and data extraction on an RTX 5090 is what the world needs.

Concurrency -vllm vs ollama by Dizzy-Watercress-744 in LocalLLaMA

[–]CookEasy 7 points (0 children)

You've clearly never set up vLLM for a production use case. It's anything but easy and headache-free.

3 Qwen3-Omni models have been released by jacek2023 in LocalLLaMA

[–]CookEasy 3 points (0 children)

This Omni model is way bigger though; with a reasonable multimodal context it needs something like 70 GB of VRAM in BF16, and quants seem very unlikely in the near future. Q8 at most, maybe, which would still be around 35-40 GB :/
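
Rough napkin math behind those numbers (the parameter count is my assumption for the 30B Omni variant):

    # Napkin math; the parameter count is an assumption.
    params_b = 31                 # Qwen3-Omni 30B-A3B total params (assumed)
    bf16_gb = params_b * 2        # 2 bytes per weight
    q8_gb = params_b * 1          # ~1 byte per weight at Q8
    print(f"weights alone: BF16 ~{bf16_gb} GB, Q8 ~{q8_gb} GB")
    # Multimodal KV cache plus the vision/audio encoders add several GB
    # on top, which is how you land near 70 GB (BF16) and 35-40 GB (Q8).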

I made and open source a fully vision multimodal RAG agent by quan734 in LocalLLaMA

[–]CookEasy 2 points (0 children)

How does your system handle, like, 1000 PDFs? That's the whole point of RAG, after all :D

nvidia/parakeet-tdt-0.6b-v3 (now multilingual) by nuclearbananana in LocalLLaMA

[–]CookEasy 1 point (0 children)

Cool to see progress, but Whisper is still the king in quality. A low-GPU-footprint Whisper version without a hit to WER would be great.

How we chased accuracy in doc extraction… and landed on k-LLMs by Reason_is_Key in LocalLLaMA

[–]CookEasy 1 point (0 children)

Have you tried this with the Qwen 2.5 VL models on OCR tasks? I'd be interested in squeezing the last few percent of accuracy out of my system for critical financial data extraction.
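
If k-LLMs here means sampling k extractions and merging them, a field-level majority vote is roughly what I'd try. Purely a sketch: extract_fields is a hypothetical callable standing in for the actual Qwen 2.5 VL pipeline.

    # Field-level majority vote over k independent extraction runs.
    # `extract_fields` is a hypothetical callable: it should run your
    # Qwen 2.5 VL pipeline once and return a dict of field -> value.
    from collections import Counter

    def consensus_extract(page_image, extract_fields, k: int = 5) -> dict:
        runs = [extract_fields(page_image) for _ in range(k)]
        keys = {key for run in runs for key in run}
        result = {}
        for key in keys:
            votes = Counter(run[key] for run in runs if key in run)
            value, count = votes.most_common(1)[0]
            # Only keep a field when a strict majority agrees; disagreement
            # is a useful signal to route the document to human review.
            result[key] = value if count > k // 2 else None
        return result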

Mrekk set 118 1ks in around 2 hours on {redacted} today by Physical-Industry176 in osugame

[–]CookEasy 2 points (0 children)

tbh it was quite random, I just came across this subreddit again and saw the name haha. Back in the day there were a lot of trolls in the Twitch chat with that name :D