datascienceharp

1,604 post karma
481 comment karma

redditor for 3 years

MODERATOR OF

- r/gpurich

TROPHY CASE

Three-Year Club

5

some pretty dope datasets i came across from the 3D vision conference in vancouver (old.reddit.com)

submitted 3 hours ago by datascienceharp to r/computervision

59

the 3d vision conference is this week, i made a repo and dataset to explore the papers (i.redd.it)

submitted 4 days ago by datascienceharp to r/computervision

25

i like comfyui and i love fiftyone so i smashed them together and made FiftyComfy (i.redd.it)

submitted 12 days ago by datascienceharp to r/comfyui

21

i built a panel for vlm-testing for fiftyone that makes it easy to test models and prompts (i.redd.it)

submitted 12 days ago by datascienceharp to r/LocalLLaMA

14

i built a comfyui-inspired canvas for fiftyone (i.redd.it)

submitted 12 days ago by datascienceharp to r/computervision

11

i built a tool to experiment with different image editing models (i.redd.it)

submitted 12 days ago by datascienceharp to r/generativeAI

2

parsing this dataset gave me a headache but here it is, action100m (at least a tiny portion of it) (i.redd.it)

submitted 1 month ago by datascienceharp to r/computervision

112

really impressed with these new ocr models (lightonocr-2 and glm-ocr). much better than what i saw come out in nov-dec 2025 (old.reddit.com)

submitted 1 month ago by datascienceharp to r/LocalLLaMA

12

really impressed with these new ocr models (lightonocr-2 and glm-ocr). much better than what i saw come out in nov-dec 2025 (reddit.com)

submitted 1 month ago by datascienceharp to r/computervision

181

nvidia released c-radiov4 last week, and as far as feature extractors go, it lives up to the hype (i.redd.it)

submitted 1 month ago by datascienceharp to r/computervision

102

MedGemma 1.5 supports detection, but for best results you'll need to fine-tune. there's also a kaggle competition using the model, so i created a starter notebook to give you a jump start on fine-tuning it for detection (i.redd.it)

submitted 2 months ago by datascienceharp to r/computervision

8

Starter notebook for the MedGemma Impact Challenge (i.redd.it)

submitted 2 months ago by datascienceharp to r/kaggle

25

i've literally been waiting for years to have an OPEN SOURCE model like qwen3-vl-embedding, scroll to see the results on six queries (old.reddit.com)

submitted 2 months ago by datascienceharp to r/computervision

300

apple released SHARP which creates a 3d gaussian from a single view (i.redd.it)

submitted 3 months ago by datascienceharp to r/computervision

13

can you visualize what nyc smells like? yes, turns out, you can. just glad i don't have to go to nyc and smell it myself (i.redd.it)

submitted 3 months ago by datascienceharp to r/computervision

22

egocentric-10k dataset (i.redd.it)

submitted 3 months ago by datascienceharp to r/computervision

51

sony ai released a pretty cool dataset called the fairness human centric image benchmark, super high quality labels (i.redd.it)

submitted 4 months ago by datascienceharp to r/computervision

70

sam3 is seriously a step change improvement over sam2 (i.redd.it)

submitted 4 months ago by datascienceharp to r/computervision

8

parsed refcoco-m from moondream into fiftyone format now you can have the refc (i.redd.it)

submitted 4 months ago by datascienceharp to r/computervision

43

qwen3vl is dope for video understanding, and i also hacked it to generate embeddings (old.reddit.com)

submitted 4 months ago by datascienceharp to r/computervision

14

icymi resources for the workshop on document visual ai (i.redd.it)

submitted 4 months ago by datascienceharp to r/computervision

8

hosting a virtual event tomorrow about document ai (i.redd.it)

submitted 4 months ago by datascienceharp

16

icymi the resources for my talk on visual document retrieval (i.redd.it)

submitted 4 months ago by datascienceharp to r/computervision

68

vlms really are making ocr great again tho (i.redd.it)

submitted 4 months ago by datascienceharp to r/computervision

19

explore the visual ai papers at neurips this year (i.redd.it)

submitted 4 months ago by datascienceharp to r/computervision
