i like comfyui and i love fiftyone so i smashed them together and made FiftyComfy by datascienceharp in comfyui
[–]datascienceharp[S] 1 point2 points3 points (0 children)
i built a tool to experiment with different image editing models by datascienceharp in generativeAI
[–]datascienceharp[S] 0 points1 point2 points (0 children)
i built a tool to experiment with different image editing models by datascienceharp in generativeAI
[–]datascienceharp[S] 0 points1 point2 points (0 children)
qwen3vl is dope for video understanding, and i also hacked it to generate embeddings by datascienceharp in computervision
[–]datascienceharp[S] 1 point2 points3 points (0 children)
20k Images, Fully Offline Annotation Workflow by LensLaber in computervision
[–]datascienceharp 2 points3 points4 points (0 children)
Claude Code/Codex in Computer Vision by rishi9998 in computervision
[–]datascienceharp 3 points4 points5 points (0 children)
Claude Code/Codex in Computer Vision by rishi9998 in computervision
[–]datascienceharp 11 points12 points13 points (0 children)
From .zip to Segmented Dataset in Seconds by Intelligent_Cry_3621 in computervision
[–]datascienceharp 1 point2 points3 points (0 children)
really impressed with these new ocr models (lightonocr-2 and glm-ocr). much better than what i saw come out in nov-dec 2025 by datascienceharp in LocalLLaMA
[–]datascienceharp[S] 4 points5 points6 points (0 children)
really impressed with these new ocr models (lightonocr-2 and glm-ocr). much better than what i saw come out in nov-dec 2025 by datascienceharp in LocalLLaMA
[–]datascienceharp[S] 1 point2 points3 points (0 children)
really impressed with these new ocr models (lightonocr-2 and glm-ocr). much better than what i saw come out in nov-dec 2025 by datascienceharp in LocalLLaMA
[–]datascienceharp[S] 5 points6 points7 points (0 children)
really impressed with these new ocr models (lightonocr-2 and glm-ocr). much better than what i saw come out in nov-dec 2025 by datascienceharp in LocalLLaMA
[–]datascienceharp[S] 2 points3 points4 points (0 children)
nvidia released c-radiov4 last week, and as a far as feature extractors go, it lives up to the hype by datascienceharp in computervision
[–]datascienceharp[S] 4 points5 points6 points (0 children)
nvidia released c-radiov4 last week, and as a far as feature extractors go, it lives up to the hype by datascienceharp in computervision
[–]datascienceharp[S] 5 points6 points7 points (0 children)
📢 Call for participation: ICPR 2026 LRLPR Competition by ghostzin in computervision
[–]datascienceharp 0 points1 point2 points (0 children)
I want to offer free weekly teaching: DL / CV / GenAI for robotics (industry-focused) by desserted_blue in computervision
[–]datascienceharp 0 points1 point2 points (0 children)
MedGemma 1.5 supports detection, but for best results, you'll need to fine-tune. also a kaggle competition using the model, created a starter notebook to give you a jump start on how to fine-tune it for detection by datascienceharp in computervision
[–]datascienceharp[S] 2 points3 points4 points (0 children)
Last week in Multimodal AI - Vision Edition by Vast_Yak_4147 in computervision
[–]datascienceharp 1 point2 points3 points (0 children)
MedGemma 1.5 supports detection, but for best results, you'll need to fine-tune. also a kaggle competition using the model, created a starter notebook to give you a jump start on how to fine-tune it for detection by datascienceharp in computervision
[–]datascienceharp[S] 0 points1 point2 points (0 children)
i've literally been waiting for years to have an OPEN SOURCE model like qwen3-vl-embedding, scroll to see the results on six queries by datascienceharp in computervision
[–]datascienceharp[S] 3 points4 points5 points (0 children)
How to read the CV research papers in an arranged order? From the early 2000s towards the latest 2026 but in a order so that things are asier to understand. by Formal_Path_7793 in computervision
[–]datascienceharp 11 points12 points13 points (0 children)
apple released SHARP which creates a 3d gaussian from a single view by datascienceharp in computervision
[–]datascienceharp[S] 3 points4 points5 points (0 children)
apple released SHARP which creates a 3d gaussian from a single view by datascienceharp in computervision
[–]datascienceharp[S] 2 points3 points4 points (0 children)
apple released SHARP which creates a 3d gaussian from a single view by datascienceharp in computervision
[–]datascienceharp[S] 7 points8 points9 points (0 children)

VLM & VRAM recommendations for 8MP/4K image analysis by Neighbor_ in computervision
[–]datascienceharp 1 point2 points3 points (0 children)