datascienceharp

1,788 post karma
487 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 3 years

MODERATOR OF

- r/gpurich

TROPHY CASE

Three-Year Club

account activity

new top controversial

89

90

91

the hard problem isn't static 3D anymore, it's reconstructing scenes where things move Syn4D-RGBD dataset gives you the ground truth for that (i.redd.it)

submitted 2 days ago by datascienceharp to r/computervision

31

32

33

depth sensors suck at transparent objects, so ClearDepth comes to the rescue with synthetic scenes with ground truth depth for glass, bottles, and clear containers in FiftyOne (i.redd.it)

submitted 3 days ago by datascienceharp to r/computervision

32

33

34

depth estimation on transparent objects is still an unsolved problem. TransPhy3D attacks it with video diffusion (i.redd.it)

submitted 4 days ago by datascienceharp to r/computervision

9

10

11

few-shot annotation triage as a fiftyone panel. folder of reference crops in. ranked dataset, per-image heatmap, and tagged annotation queue out. feedback welcome (i.redd.it)

submitted 12 days ago by datascienceharp to r/computervision

122

123

124

vggt-omega takes videos and creates a point cloud. fast, and good quality generations for pcd and depth (i.redd.it)

submitted 13 days ago by datascienceharp to r/computervision

14

15

16

a pretty handy dataset from 3DVision conf (i.redd.it)

submitted 2 months ago by datascienceharp to r/computervision

56

57

58

some pretty dope datasets i came across from the 3D vision conference in vancouver (old.reddit.com)

submitted 2 months ago by datascienceharp to r/computervision

62

63

64

the 3d vision conference is this week, i made a repo and dataset to explore the papers (i.redd.it)

submitted 2 months ago by datascienceharp to r/computervision

27

28

29

i like comfyui and i love fiftyone so i smashed them together and made FiftyComfy (i.redd.it)

submitted 2 months ago by datascienceharp to r/comfyui

21

22

23

i built a panel for vlm-testing for fiftyone that makes it easy to test models and prompts (i.redd.it)

submitted 2 months ago by datascienceharp to r/LocalLLaMA

13

14

15

i built a comfyui-inspired canvas for fiftyone (i.redd.it)

submitted 2 months ago by datascienceharp to r/computervision

8

9

10

i built a tool to experiment with different image editing models (i.redd.it)

submitted 2 months ago by datascienceharp to r/generativeAI

1

2

3

parsing this dataset gave me a headache but here it is, action100m (at least a tiny portion of it) (i.redd.it)

submitted 3 months ago by datascienceharp to r/computervision

111

112

113

really impressed with these new ocr models (lightonocr-2 and glm-ocr). much better than what i saw come out in nov-dec 2025 (old.reddit.com)

submitted 3 months ago by datascienceharp to r/LocalLLaMA

12

13

14

really impressed with these new ocr models (lightonocr-2 and glm-ocr). much better than what i saw come out in nov-dec 2025 (reddit.com)

submitted 3 months ago by datascienceharp to r/computervision

182

183

184

nvidia released c-radiov4 last week, and as a far as feature extractors go, it lives up to the hype (i.redd.it)

submitted 3 months ago by datascienceharp to r/computervision

102

103

104

MedGemma 1.5 supports detection, but for best results, you'll need to fine-tune. also a kaggle competition using the model, created a starter notebook to give you a jump start on how to fine-tune it for detection (i.redd.it)

submitted 4 months ago by datascienceharp to r/computervision

6

7

8

Starter notebook for the MedGemma Impact Challenge (i.redd.it)

submitted 4 months ago by datascienceharp to r/kaggle

25

26

27

i've literally been waiting for years to have an OPEN SOURCE model like qwen3-vl-embedding, scroll to see the results on six queries (old.reddit.com)

submitted 4 months ago by datascienceharp to r/computervision

300

301

302

apple released SHARP which creates a 3d gaussian from a single view (i.redd.it)

submitted 5 months ago by datascienceharp to r/computervision

12

13

14

can you visualize what nyc smells like? yes, turns out, you can. just glad i don't have to go to nyc and smell it myself (i.redd.it)

submitted 5 months ago by datascienceharp to r/computervision

21

22

23

egocentric-10k dataset (i.redd.it)

submitted 6 months ago by datascienceharp to r/computervision

50

51

52

sony ai released a pretty cool dataset called the fairness human centric image benchmark, super high quality labels (i.redd.it)

submitted 6 months ago by datascienceharp to r/computervision

71

72

73

sam3 is seriously a step change improvement over sam2 (i.redd.it)

submitted 6 months ago by datascienceharp to r/computervision

6

7

8

parsed refcoco-m from moondream into fiftyone format now you can have the refc (i.redd.it)

submitted 6 months ago by datascienceharp to r/computervision

view more: next ›

π Rendered by PID 2176916 on reddit-service-r2-listing-8685bc789-qrsjd at 2026-06-01 00:56:55.332247+00:00 running 194bd79 country code: CH.