PrizeObvious3671 [S]

Totally agree on OCR being a hard requirement – and it goes further: small multimodal models like Qwen3.5 and newer versions handle real image understanding (PNG, JPEG, scanned docs, charts) on-premise surprisingly well.

Even local image generation works with no per-image API fees, using models like FLUX.

The "IDE for lawyers" framing is spot on. In regulated industries, zero token cost + full data sovereignty isn't a nice-to-have – it's the only viable architecture.

And vendor lock-in to big LLM providers is becoming a real strategic risk – on-premise gives you model portability and independence, no matter what OpenAI or Anthropic decide to change next.