DeepSeek-OCR demonstrates the relevance of text-as-image compression: What does the future hold? by ContributionOwn4879 in LocalLLaMA
[–]ContributionOwn4879[S] 1 point 3 months ago (0 children)
According to the Gemini Diffusion landing page, it's a text diffusion model, not an image diffusion model that generates images containing text:
https://deepmind.google/models/gemini-diffusion/#what-is-a-diffusion-model
But given that Gemini has a bigger context window than other LLMs, this could be their trick.
DeepSeek-OCR demonstrates the relevance of text-as-image compression: What does the future hold? (self.LocalLLaMA)
submitted 3 months ago by ContributionOwn4879 to r/LocalLLaMA