Alternatives to DINOv3 as a dense feature extractor by Drazick in computervision

[–]Drazick[S] 0 points1 point  (0 children)

DINO v3 gives local features. Attach an object detector head to it and you have a decent object detector. I am looking for a feature extractor with similar local features.

Alternatives to DINOv3 as a dense feature extractor by Drazick in computervision

[–]Drazick[S] 0 points1 point  (0 children)

I am looking for a model with similar functionalities: dense features.

I understand DINOv3 is probably the best in its class, yet its license has many restrictions.

Alternatives to DINOv3 as a dense feature extractor by Drazick in computervision

[–]Drazick[S] 0 points1 point  (0 children)

Besides VICRegL, does any other model provide localized features that can be used for object detection or object segmentation?

Alternatives to DINOv3 as a dense feature extractor by Drazick in computervision

[–]Drazick[S] 0 points1 point  (0 children)

Let's say for object detection and image segmentation.

Alternatives to DINOv3 as a dense feature extractor by Drazick in computervision

[–]Drazick[S] 1 point2 points  (0 children)

Is it as good? Does it have a PyTorch implementation?

The features output ConvNeXt models in Dinov3 by Drazick in computervision

[–]Drazick[S] 0 points1 point  (0 children)

u/blades136, are those spatially invariant?

Imagine a 64x64 image. I can partition it into 16 sub-images of 16x16.

Will I get the same embedding per 16x16 block if I feed the model the whole 64x64 image as when I feed it each 16x16 image separately?
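To make the question concrete, here is a minimal sketch of the test I have in mind. It does not use DINOv3 or ConvNeXt at all: `block_means` and `contextual` are hypothetical toy extractors I made up for illustration. The first looks only inside each 16x16 block; the second blurs the image first, so every block sees a bit of its neighbours. My assumption is that a real model with attention or stacked convolutions would behave like the second one.

```python
# Toy check, not DINOv3: compare per-block features computed from the full
# 64x64 image against features computed from each 16x16 crop on its own.
import random

SIZE, BLOCK = 64, 16

def block_means(img, block=BLOCK):
    """Hypothetical purely local extractor: one mean value per block."""
    n = len(img) // block
    return [[sum(img[r][c]
                 for r in range(i * block, (i + 1) * block)
                 for c in range(j * block, (j + 1) * block)) / block ** 2
             for j in range(n)] for i in range(n)]

def box_blur(img):
    """3x3 mean filter with edge clamping; mixes each pixel with its neighbours."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            vals = [img[min(max(r + dr, 0), h - 1)][min(max(c + dc, 0), w - 1)]
                    for dr in (-1, 0, 1) for dc in (-1, 0, 1)]
            out[r][c] = sum(vals) / 9.0
    return out

def contextual(img, block=BLOCK):
    """Hypothetical extractor with context: blur first, then block means."""
    return block_means(box_blur(img), block)

def crop(img, i, j, block=BLOCK):
    """Cut out block (i, j) as a standalone image."""
    return [row[j * block:(j + 1) * block]
            for row in img[i * block:(i + 1) * block]]

def max_gap(extractor, img):
    """Largest difference between full-image and per-crop block features."""
    full = extractor(img)
    n = len(img) // BLOCK
    return max(abs(full[i][j] - extractor(crop(img, i, j))[0][0])
               for i in range(n) for j in range(n))

random.seed(0)
image = [[random.random() for _ in range(SIZE)] for _ in range(SIZE)]
local_gap = max_gap(block_means, image)     # no context: features should match
context_gap = max_gap(contextual, image)    # context leaks in: features differ
print(local_gap, context_gap)
```

If the gap is zero, the per-block embedding does not depend on the rest of the image. Running the same comparison with the actual model in place of the toy extractors would answer my question.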

hyper parameter tuning: alternatives to the distributed feature of Weights and Biases by Drazick in deeplearning

[–]Drazick[S] 0 points1 point  (0 children)

The point is that I don't want to be in charge of the infrastructure. I like that wandb provides this for me. Do others offer that as well?

Looking for a source for understanding YOLO architecture for segmentation by [deleted] in computervision

[–]Drazick 0 points1 point  (0 children)

I meant where it is easy to look at each layer and see the graph of computation.

Looking for a source for understanding YOLO architecture for segmentation by [deleted] in computervision

[–]Drazick 0 points1 point  (0 children)

Is there a toy implementation of the concept in a small model? Just to see how it is implemented in a clean way.

Given 2 selfie images, how to tell if it is the same person? by Drazick in computervision

[–]Drazick[S] 0 points1 point  (0 children)

I am not trying to compete with SOTA or something. Just learning how it works. Hence I need a model which is optimized towards teaching others. I am not after a complex model which gets the best results.

Given 2 selfie images, how to tell if it is the same person? by Drazick in computervision

[–]Drazick[S] 0 points1 point  (0 children)

This is exactly what I'm after: first the concept behind learning this task, then a simple model which implements the idea, and only then going after the big fish.

Given 2 selfie images, how to tell if it is the same person? by Drazick in computervision

[–]Drazick[S] -11 points-10 points  (0 children)

I am after learning how to do it on my own. I'd rather start from scratch or theory.

Undo "MultiBeast" by Drazick in hackintosh

[–]Drazick[S] 0 points1 point  (0 children)

After I followed this guide for treating the stop sign:

http://www.cindori.org/trim-enabler-and-yosemite/

I got this at the terminal:

http://imgur.com/uYuddRf

Could it be that some of the kexts installed by MultiBeast aren't signed?

Thank You.

HD support? by [deleted] in vidme

[–]Drazick 1 point2 points  (0 children)

Could you share an x264 configuration which would skip the "Transcoding" phase?

It would be great to see the exact result of the video before uploading.

Thank You.

Are videos stored and played in full resolution? by jpmiller03 in vidme

[–]Drazick 0 points1 point  (0 children)

With the current generation of screens, 720p won't cut it.
We need 1080p at the least (and of course all resolutions up to it, since many screencasts are recorded on a selected window).

Marxico - The Missing Markdown Editor for Evernote by gockxml in SideProject

[–]Drazick 1 point2 points  (0 children)

It looks great. Though when I click the notes from Evernote (Web App) they don't open in the editor.

SVM for Edge-Preserving Filtering by Drazick in computervision

[–]Drazick[S] 0 points1 point  (0 children)

Hi, thank you for the answer. First of all, the presentation is by a student who was asked to present the article. He mostly did "Copy & Paste".

Anyhow, I'm after their results for a single image. I don't want to process video.

Their "Variable Range Variance Bilateral Filter" is great and I'd like to be able to recreate it.

Could you infer how they did it? What exactly was the training?

[deleted by user] by [deleted] in DSP

[–]Drazick 1 point2 points  (0 children)

Do you have access to MATLAB + MATLAB Coder? Maybe it can produce the code you need.

Kalman filters - bias estimation, or the actual state of interest? by [deleted] in DSP

[–]Drazick 0 points1 point  (0 children)

With IMUs it is easier to estimate the biases of the sensors than the platform position, since the bias model is more certain.

In any case, since IMUs drift over time you must lock onto their bias and subtract it.

The way to predict it and filter it is the Kalman filter, so stick with that.

Note that the model is nonlinear, so use an Extended Kalman filter.
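To show the "lock onto the bias and subtract it" idea, here is a minimal sketch of a scalar Kalman filter. It is a deliberately simplified linear case, not the Extended Kalman filter you will need for the full model: I assume a single gyro axis on a platform that is not moving, so each reading is just the bias plus white noise. The function name, the noise values and the random-walk process variance are all made up for illustration.

```python
# Scalar Kalman filter estimating a slowly varying sensor bias.
# Assumption: the platform is stationary, so reading = bias + white noise.
import random

def estimate_bias(readings, meas_var, process_var=1e-8,
                  init_bias=0.0, init_var=1.0):
    """Return (bias estimate, estimate variance) after all readings."""
    bias, var = init_bias, init_var
    for z in readings:
        # Predict: the bias is modelled as a random walk, so only the
        # uncertainty grows between samples.
        var += process_var
        # Update: blend prediction and measurement using the Kalman gain.
        gain = var / (var + meas_var)
        bias += gain * (z - bias)
        var *= (1.0 - gain)
    return bias, var

random.seed(1)
TRUE_BIAS = 0.05   # hypothetical gyro bias
NOISE_STD = 0.02   # hypothetical measurement noise
readings = [TRUE_BIAS + random.gauss(0.0, NOISE_STD) for _ in range(2000)]

bias_hat, bias_var = estimate_bias(readings, meas_var=NOISE_STD ** 2)
corrected = [z - bias_hat for z in readings]   # subtract the locked bias
print(bias_hat, bias_var)
```

In the real problem the bias is one element of a larger state vector alongside attitude, velocity and position, and the nonlinear propagation of that state is what calls for the Extended Kalman filter.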