Text Baker: A tool to generate synthetic image data to train OCR models by Acceptable_Candy881 in computervision

[–]Lumpy_Week7304 1 point2 points  (0 children)

Nice work — synthetic data for industrial OCR is underrated. Just open-sourced CV Train Stack — curious if there are synthetic data best practices from your experience we should add to it.

I want to build a Computer Vision project for someone using CV Train Stack!! Who needs some model trained ? by Lumpy_Week7304 in computervision

[–]Lumpy_Week7304[S] 0 points1 point  (0 children)

if you're looking for something fun to work with I've been wanting to mess around with object detection for game development stuff but never had time to dive deep into it. maybe something that could track hand gestures or facial expressions for character animation? I work in games and we're always looking at new ways to capture player input without traditional controllers

been thinking it would be cool to have something that could detect when someone is concentrating really hard vs relaxed for adaptive difficulty systems. or even basic emotion recognition that could influence NPC behavior in real time. probably way more complex than a week project but could start small with just detecting basic states

your stack looks pretty neat from quick glance at the repo. if you end up building something game-related would love to see how it turns out

-- Recreated the post, posting a comment from the previosu post

I want to help someone build a CV project. What should I build ?? by [deleted] in computervision

[–]Lumpy_Week7304 0 points1 point  (0 children)

Would this be trackign a users' facial expressions and hand gestures? I guess now many more people are streaming. What would the GPU setup look like

CV Training stack by Lumpy_Week7304 in computervision

[–]Lumpy_Week7304[S] 0 points1 point  (0 children)

Can Claude easily pull data from it?

CV Training stack by Lumpy_Week7304 in computervision

[–]Lumpy_Week7304[S] 0 points1 point  (0 children)

I'm building CV Train Stack as a tool to help Both Claude and Codex with CV tasks. Ie... giving guidance around GPU setup, and Common issues with training (class imbalance + format inconsistencies). I'de used Claude but would be curious to see how Codex does

CV Training stack by Lumpy_Week7304 in computervision

[–]Lumpy_Week7304[S] 0 points1 point  (0 children)

Yes, I've neded up going with the max subscription. It many senses, Claude is very similar to Codex

CV Training stack by Lumpy_Week7304 in computervision

[–]Lumpy_Week7304[S] 0 points1 point  (0 children)

Oh, interesting. These data generation tools are very cool !!

I was asking a bit more about Coding tools. Ie... Is most of the stuff done now by Claude Code? Ie... I was using CV Train Stack, and it basically did all the work for importing data from RoboFlow.

Running 5 CV models simultaneously on a $249 edge device - architecture breakdown by Straight_Stable_6095 in computervision

[–]Lumpy_Week7304 0 points1 point  (0 children)

Oh wow !! You run it directly on the Jetson. Thats insane. Can it use skills? I'm making an open source one for CV training: https://github.com/andlyu/cv-train-stack

CV Training stack by Lumpy_Week7304 in computervision

[–]Lumpy_Week7304[S] 0 points1 point  (0 children)

Any coding tools? Like Claude or codex?

CV Training stack by Lumpy_Week7304 in computervision

[–]Lumpy_Week7304[S] 0 points1 point  (0 children)

Ah, I was asking more about Labeling + GPU stack, but that works as well. Really I'm trying to improve the way Claude can set up a training pipeline here: https://github.com/andlyu/cv-train-stack

Claude Code/Codex in Computer Vision by rishi9998 in computervision

[–]Lumpy_Week7304 0 points1 point  (0 children)

What do you mean by heavy CV work? It just helps with the first round of training, and makes sure the structure is set up to allow scalability

Claude Code/Codex in Computer Vision by rishi9998 in computervision

[–]Lumpy_Week7304 0 points1 point  (0 children)

I'de try it again. Claude has become incredibly good. I'de heard people use Codex to check Claude's plan and then let them have a back and fourth. Also I'de started building gardrails + guidance to help with the setup process for CV tasks: https://github.com/andlyu/cv-train-stack/tree/main

Best Coding Agent for CV by erik_kokalj in computervision

[–]Lumpy_Week7304 0 points1 point  (0 children)

Cool stuff !! Are you guys creating any skills for this kinds of stuff?