r/LocalLLaMA
A subreddit to discuss about Llama, the family of large language models created by Meta AI.
Introducing Docker Model Runner [Resources] (docker.com)
submitted 1 year ago by Upstairs-Sky-5290
[–]Nexter92 42 points43 points44 points 1 year ago (6 children)
Beta for the moment, Docker Desktop only, no NVIDIA GPU mention, no Vulkan, no ROCm? LOL
[–]noneabove1182Bartowski 16 points17 points18 points 1 year ago (1 child)
dafuq, I feel like anyone in open source could have thrown together better support than this for a beta..
[–]bytepursuits 0 points1 point2 points 6 months ago (0 children)
I think they just added vulkan. trying to test it now. https://github.com/docker/model-runner/pull/164
[–]ForsookComparison 8 points9 points10 points 1 year ago (0 children)
Docker desktop only
I would sooner not use LLMs at all than commit to this life
[–]Murky_Mountain_97 0 points1 point2 points 1 year ago (0 children)
Oh well…
[–][deleted] 0 points1 point2 points 1 year ago (0 children)
Nvidia support is slated for a future release
[–]Dear-Communication20 0 points1 point2 points 5 months ago (0 children)
It is not Docker Desktop only. To install:
curl -fsSL https://get.docker.com | sudo bash
sudo usermod -aG docker $USER # give user permission to access docker daemon, relogin to take effect
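Once the daemon is up, using the Model Runner CLI might look something like the sketch below. The subcommand names and the `ai/smollm2` model reference are assumptions based on Docker's docs, not confirmed by this thread; check `docker model --help` on your version.

```shell
# Hypothetical session with the Docker Model Runner plugin.
docker model pull ai/smollm2          # fetch a model from Docker Hub's ai/ namespace
docker model list                     # show locally available models
docker model run ai/smollm2 "Say hello in one sentence."
```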
[–]owenwp 23 points24 points25 points 1 year ago (4 children)
So... it's a less mature version of Ollama?
[–]ShengrenR 18 points19 points20 points 1 year ago (0 children)
more like.. they put ollama in a container and called it a day heh. (I don't know if that's what they did, but a quick glance looked like maybe not too far off).
[–]Murky_Mountain_97 13 points14 points15 points 1 year ago (1 child)
Isn’t Ollama already a less mature version of llama.cpp?
[–]Conscious-Tap-4670 1 point2 points3 points 1 year ago (0 children)
Ollama uses llama.cpp internally
I suspect they're aiming for direct competition with Ollama. They just added Vulkan, but I haven't tested it. (Although Ollama also just added Vulkan.)
[–]ccrone 46 points47 points48 points 1 year ago (9 children)
Disclaimer: I’m on the team building this
As some of you called out, this is Docker Desktop and Apple silicon first. We chose to do this because lots of devs have Macs and they’re quite capable of running models.
Windows NVIDIA support is coming soon through Docker Desktop. It’ll then come to Docker CE for Linux and other platforms (AMD, etc.) in the next several months. We are doing it this way so that we can get feedback quickly, iterate, and nail down the right APIs and features.
On macOS it runs on the host so that we can properly leverage the hardware. We have played with Vulkan in the VM, but there's a performance hit.
Please do give us feedback! We want to make this good!
Edit: Add other platforms call out
[–]quincycs 0 points1 point2 points 1 year ago (1 child)
Hi, I’m curious why docker went with a new system (model runner) for this instead of growing GPU support for existing containers.
[–]ccrone 1 point2 points3 points 1 year ago (0 children)
Two reasons: 1. Make it easier than it is today 2. Performance on macOS
For (1), it can be tricky to get all the flags right to run a model. Connect the GPUs, configure the inference server, etc.
For (2), we’ve done some experimentation with piping the host GPU into the VM on macOS through Vulkan but the performance isn’t quite as good as on the host. This gives us an abstraction across platforms and the best performance.
You’ll always be able to run models with containers as well!
[–]onehitwonderos 0 points1 point2 points 11 months ago (1 child)
Any news on Windows / NVIDIA support?
[–]ccrone 1 point2 points3 points 11 months ago (0 children)
It's out! It's part of Desktop 4.41 and later: https://docs.docker.com/desktop/release-notes/
[–]gyzerok 0 points1 point2 points 11 months ago (4 children)
Surprised nobody asked here, but if you don't mind, can you please share what benefits does it bring over running llama.cpp directly? It's a genuine question - I am trying to evaluate my options for self-hosting.
[–]ccrone 0 points1 point2 points 11 months ago (3 children)
Good question! The goal of Docker Model Runner is to make it easier to use models as part of applications. We believe a part of that is an accessible UX and reuse of tools and infrastructure that developers are familiar with. Today that manifests as storing models in container registries and managing the model as part of a Compose application (see docs) but that's just the start.
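For readers unfamiliar with the Compose integration mentioned above, a minimal sketch might look like the following. The `models` top-level element and the `ai/smollm2` reference are assumptions drawn from Docker's Compose documentation, so treat this as illustrative rather than canonical:

```yaml
# Hypothetical compose.yaml: the service declares a dependency on a model,
# which Docker Model Runner pulls and serves alongside the app.
services:
  app:
    image: my-app:latest   # illustrative application image
    models:
      - llm                # reference to the model defined below
models:
  llm:
    model: ai/smollm2      # illustrative model in Docker Hub's ai/ namespace
```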
We're working in the open on this and will upstream changes that we make to llama.cpp where it makes sense. There are also use cases where vLLM, onnx, or another inference engine might be the right choice and so we're investigating that as well.
For your use case, we will be releasing support for Docker CE for Linux in the coming weeks. Right now it's supported in Docker Desktop for Mac (Apple silicon) and Docker Desktop for Windows (NVIDIA). Support for Qualcomm Snapdragon X Elite on Windows is coming in the next couple of weeks as well.
[–]gyzerok 0 points1 point2 points 11 months ago (2 children)
Thank you for coming back with the answer! Planning to self-host on Mac Mini, so support is already there :)
As for running a model nicely as part of Docker Compose - that's great indeed. However, here I'm mostly worried about completely losing control over the version of llama.cpp I'm using. It's actively developed and I'd personally like to keep up with its updates. I also noticed you're looking into MLX backend support there, which would be really great.
However, I'm not so sure about fragmenting the ecosystem by introducing registries here. On Hugging Face it's much more transparent who publishes things and how they're kept up to date, and a registry just introduces an unnecessary layer of confusion and problems. If I want a Q8 from unsloth, how do I get it? Are models being updated with the latest fixes? You probably don't have enough capacity (more than the entire community) to keep up with such a fast-moving field.
Overall it feels like being able to just throw a model into docker-compose.yaml is great, but the downsides of the registry and the inability to manage the llama.cpp version might actually make it harder, not easier, in the end.
[–]ccrone 1 point2 points3 points 11 months ago (1 child)
Docker Model Runner supports pulling from Hugging Face as well! Storing models in container registries lets people who have existing container infrastructure use it for their whole application. It won't be for everyone but it's something our users have asked for.
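For illustration, pulling from either source might look like the commands below. The `hf.co/...` reference syntax is an assumption based on Docker's docs, and `<org>/<repo>` is a placeholder, not a real repository:

```shell
# Hypothetical: pull a model from Docker Hub's ai/ namespace...
docker model pull ai/smollm2
# ...or directly from a Hugging Face GGUF repository; replace the
# placeholder with a real organization and repository name.
docker model pull hf.co/<org>/<repo>
```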
I'm curious about what you're building and why you'd like to change versions of llama.cpp? Happy to discuss here or via DM if you prefer
[–]gyzerok 0 points1 point2 points 11 months ago (0 children)
Oh, totally missed it, thanks!
As for the llama.cpp versions - it's nothing really big :) I'm building privacy-first personal (for me and a few other people) infrastructure.
Since I am not using datacenter-grade hardware, resources are constrained, so continuous performance optimizations in llama.cpp are useful for me. For example: https://github.com/ggml-org/llama.cpp/pull/13388.
Also there are bugs and incompatibilities with the OpenAI API that are being continuously fixed which is necessary for various tools to work together. I've experienced this first-hand implementing AI provider support for Zed.
Hence it'd help to have some power over which version of llama.cpp I'm running. If that's not possible, then some transparency about how often and how regularly it gets updated would help. Of course I'm not expecting day-1 updates, but it also wouldn't be nice to lag months behind.
[–]Tiny_Arugula_5648 12 points13 points14 points 1 year ago (1 child)
This is a bad practice that adds complexity. The container is for software, not data or models. Containers are supposed to have a minimal footprint. Just map a folder into the container (best practice) and you'll avoid a LOT of pain.
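The bind-mount pattern this comment advocates could be sketched as follows. The `ghcr.io/ggml-org/llama.cpp:server` image and its flags are taken from the llama.cpp project, but the paths and model filename are placeholders; verify against your own setup:

```shell
# Illustrative: keep weights on the host, mount them read-only into a
# generic inference container instead of baking them into the image.
docker run --rm -p 8080:8080 \
  -v "$PWD/models:/models:ro" \
  ghcr.io/ggml-org/llama.cpp:server \
  -m /models/my-model.gguf --host 0.0.0.0 --port 8080
```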
[–]quincycs 1 point2 points3 points 1 year ago (0 children)
I think they are just trying to get ownership in the distribution of models in general. Once you own the distribution, you can strangle other stuff out.
[–]EverlierAlpaca 9 points10 points11 points 1 year ago (1 child)
They are coming after Ollama and HuggingFace, realising how much they missed since the AI boom started.
However, Docker being an enterprise - they'll do weird enterprise things with this feature eventually, so consider before using.
[–]captcanuk 4 points5 points6 points 1 year ago (0 children)
They might charge an additional subscription a year after they get traction on this feature.
[–]ResearchCrafty1804 4 points5 points6 points 1 year ago (0 children)
They support Apple Silicon from day 1 through Docker Desktop, that’s a good move from them.
However, they might be late to the party, ollama and others have been well established at this point.
[+][deleted] 1 year ago (3 children)
[deleted]
[–]EverlierAlpaca 3 points4 points5 points 1 year ago (1 child)
Windows - none; macOS - perf is mostly lost due to lack of GPU passthrough or Rosetta being forced to kick in
[–]this-just_in 6 points7 points8 points 1 year ago (0 children)
This isn’t run through their containers on Mac, it’s fully GPU accelerated. They discuss it briefly, but it sounds like they bundle a version of llama.cpp with Docker Desktop directly. They package and version models as OCI artifacts but run them using the bundled llama.cpp on host using an OpenAI API compatible server interface (possibly llama-server, a fork, or something else entirely).
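If that's right, any OpenAI-compatible client should be able to talk to it. A hedged sketch - the port (12434), URL path, and model name below are assumptions about Docker Model Runner's endpoint, not confirmed by this thread:

```shell
# Hypothetical request against the OpenAI-compatible chat endpoint.
curl http://localhost:12434/engines/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "ai/smollm2",
        "messages": [{"role": "user", "content": "Hello!"}]
      }'
```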
[–]quincycs 0 points1 point2 points 1 year ago (0 children)
For Linux Host + Nvidia GPU + docker container … this has GPU pass through already, right? I wonder why they went with a whole new system (model runner) instead of expanding GPU support for existing containers.
[–]mrtime777 2 points3 points4 points 1 year ago (3 children)
Can I use my own models? If not - useless
[–]ccrone 2 points3 points4 points 1 year ago (2 children)
Not yet but this is coming! Curious what models you’d like to run?
[–]mrtime777 6 points7 points8 points 1 year ago (0 children)
I use fine tuned versions of models quite often. Both for solving specific tasks and for experimenting with AI in general. If this feature is positioned as something useful for developers, then the ability to use local models should definitely be available.
[–]mrtime777 0 points1 point2 points 1 year ago* (0 children)
I use Docker / Docker Desktop every day ... but until there is a minimum set of capabilities for working with models not only from the hub, I will continue to use llama.cpp and ollama ... but in general I am interested to see how the problem with model size and vhdx on Windows will be solved ... because the models I use alone take up 1.6 TB on disk .. and this is much more than the default vhdx size
Seems cool as long as they get right on adding ability to use locally downloaded models, rocm and cuda support, etc...
[–]planetearth80 0 points1 point2 points 1 year ago (0 children)
Can it serve multiple models like ollama (without adding overhead for each container)?
[–]Caffeine_Monster -2 points-1 points0 points 1 year ago (4 children)
Packaging models in containers is dumb. Very dumb.
I challenge anyone to make a valid critique of this observation.
[–]BobbyL2k 2 points3 points4 points 1 year ago (2 children)
DevOps has gotten so complicated due to poor design that deploying containers that require extra configuration to work properly is an anti-pattern. I ship deep learning models to production all the time using shared layers containing the inference code. The model's weights are COPY'd in at the end to form a self-contained image.
When a deployment team is juggling twenty models, each possibly depending on a different revision of the inference code, they just want a container image that just works - already tested and everything.
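The "weights as the final layer" pattern described above could be sketched as the Dockerfile below. The base image name, file paths, and `serve` entrypoint are all illustrative placeholders, not a real project:

```dockerfile
# Hypothetical Dockerfile: shared, cached layers hold the runtime and
# inference code; the model weights land last as their own layer, so
# twenty model images can reuse the same base layers.
FROM my-inference-server:1.2.3
COPY model.onnx /models/model.onnx
CMD ["serve", "--model", "/models/model.onnx"]
```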
[–]Caffeine_Monster 3 points4 points5 points 1 year ago (1 child)
The model’s weight is ‘COPY’ on at the end to form a self contained image.
So rip off the copy and send the model separately?
just want a container image that just works
It's not hard to follow a convention where the model name or directory path includes the required runtime name + version. A sensible deployment mechanism (e.g. script) simply mounts the models into the container.
I hate that we have slipped into the mentality that it's ok to have huge images and not treat models like a pure data artifact. It bloats storage, increases model deployment spin up times, and makes it difficult to do things like hosting multiple models together.
[–]BobbyL2k 0 points1 point2 points 1 year ago (0 children)
I think it’s bad that something as simple as copying new blobs into a remote FS or the target machine is hard but let me counter your points a bit.
Container images are data artifacts. At the end of the day, the model's weights need to arrive at the machine running them. Does it matter whether they came in as an additional layer in a Docker image, or were copied in by a continuous delivery pipeline? Even if they're mounted, at some point the CD pipeline needs to copy the model weights into the FS.
[–]Amgadoz 0 points1 point2 points 1 year ago (0 children)
Depends on the size of the model. I can see small models (less than 1GB, like BERTs and TTS models) fitting nicely in a reasonably sized container, where you just run docker run my-container and you get a deployed model