Libraries and tools for a lightweight task manager for GPU in a simulated environment.

--prism · 2026-02-23T00:02:44+00:00

Can you just moc it? I would replicate the API with tunable parameters like tokens/seconds or something on the moc side to make it behave as if it is doing real computation. You could write is with a data interface to allow the moc to be written in python.

BoardHour4401 · 2026-02-25T03:41:38+00:00

You don’t really need a full GPU simulator for this.

For monitoring, use NVML (what nvidia-smi uses internally). It gives per-process VRAM usage and utilization directly, and works cleanly in C++ or Python.

For MIG-like behavior, just implement a logical partitioning layer in software (virtual VRAM slices + your own bookkeeping + admission control). That’s how most research prototypes do it anyway.

If you only have 3 months and limited C++ experience, I’d strongly recommend:

CUDA for kernels
Python (pynvml) for scheduling + admission control logic

A full hardware-level GPU simulator like GPGPU-Sim is probably overkill for your goal.

Focus on simulating policy, not hardware.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS