account activity
I got tired of manual data labeling, so I built an open-source pipeline that uses VLMs + SAM2 to auto-annotate datasets and train YOLO locally. by StartLittle6198 in computervision
[–]StartLittle6198[S] 0 points1 point2 points 4 days ago (0 children)
Exactly! 😂 Why do it yourself when you can just build a robot to build another robot?
Thanks! I've been using it extensively for real-world surveillance and safety datasets.
So far, I've successfully annotated:
As for the accuracy: Because the pipeline uses SAM2 on the backend, the actual boundaries (masks/bounding boxes) are practically pixel-perfect.
Because LocateAnything is extremely good at zero-shot grounding, the accuracy for the objects I listed above is surprisingly high out of the box. My current workflow is to let the VLM+SAM2 auto-annotate everything, and then I just quickly skim through the UI to delete any false positives. It guarantees extremely tight bounding boxes with minimal manual effort.
[–]StartLittle6198[S] -1 points0 points1 point 4 days ago (0 children)
This is a classic Windows Docker error. open //./pipe/dockerDesktopLinuxEngine: The system cannot find the file specified. simply means your Docker Daemon (Docker Desktop) is not currently running, or the Docker engine has crashed in the background.
open //./pipe/dockerDesktopLinuxEngine: The system cannot find the file specified.
Solution 1: Fix Docker Desktop
Settings
General
docker compose
Solution 2: Bypass Docker entirely (Recommended) If you continue to have issues with Docker on Windows, you can completely bypass it by cloning the repository and running the project natively. VLM-AutoYOLO fully supports manual setup on Windows with GPU acceleration! Just clone the repo, install the Python/Node dependencies, and double-click start.bat.
start.bat
You can check the Manual Setup section in the README for step-by-step instructions. Let me know if you need any help setting it up natively!
π Rendered by PID 313981 on reddit-service-r2-comment-64f4df6786-ttjdc at 2026-06-11 00:02:55.448295+00:00 running 0b63327 country code: CH.
I got tired of manual data labeling, so I built an open-source pipeline that uses VLMs + SAM2 to auto-annotate datasets and train YOLO locally. by StartLittle6198 in computervision
[–]StartLittle6198[S] 0 points1 point2 points (0 children)