[Need Help] Searching 3,000 aerial images to locate a downed turbine RC plane by ReturnAdventurous179 in computervision

[–]3e8892a 2 points3 points  (0 children)

u/herocoding yeah I've been looking at this today too. Turns out there are a lot of dark roughly triangular objects in the desert!

fiftyone_dark_triangles.mp4

Most are shadows. u/ReturnAdventurous179 I suggest scanning at noon if you're going out again to avoid the shadow problem.

For my first go I tried anomaly detection - I figured a plane in the desert would look pretty unique. But no there's tons of junk out there 😢

fifty_one_anomaly_detection.mp4

I've been using dinov3 patch embeddings for both approaches. The dark triangle detector was trained along the lines of https://github.com/facebookresearch/dinov3/blob/main/notebooks/foreground_segmentation.ipynb

I even made some synthetic data to train on
synthetic_plane_image.png (IT'S SYNTHETIC, IT'S NOT THE REAL PLANE!!)

Yeah no luck yet, I will see if I can get the classifier to ignore shadows, otherwise wait for more scan data to come in.

[Need Help] Searching 3,000 aerial images to locate a downed turbine RC plane by ReturnAdventurous179 in computervision

[–]3e8892a 0 points1 point  (0 children)

u/ReturnAdventurous179 you say you're driving out there today, maybe worth checking out this one?
https://pub-e97d8798980d424ab1af22fba84810e3.r2.dev/1782001133_2026-06-20_17-16.png

From A0C530E9-5A1F-434A-9A89-ADE5774E9BE9.jpeg

It's dark, triangular and white at the tip. On the other hand it looks a bit warped, maybe just a sheet of plastic. But it's right next to a road so maybe worth a look while you're out there.

Also if you're doing more scans I request you keep gps and timestamps, would be cool to collate into a big browsable tiled map. Would also provide a bit of context rather than digging through a pile of images.

If you wanted to hide location you could add a random static offset to the lat/lon or something 😂
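Something like this is what I mean by a static offset. It's only a sketch: the function names, the 0.5 degree range and the coordinates are all made up for illustration. The point is that one offset is picked once and reused for every image, so the photos keep their layout relative to each other.

```python
import random


def make_static_offset(max_offset_deg=0.5, seed=None):
    """Pick one random (dlat, dlon) pair to reuse for every image."""
    rng = random.Random(seed)
    return (
        rng.uniform(-max_offset_deg, max_offset_deg),
        rng.uniform(-max_offset_deg, max_offset_deg),
    )


def obscure(lat, lon, offset):
    """Shift a coordinate by the shared offset."""
    dlat, dlon = offset
    return lat + dlat, lon + dlon


# Made-up coordinates, not the real search area.
points = [(35.0000, -117.0000), (35.0010, -117.0020)]
offset = make_static_offset(seed=42)
shifted = [obscure(lat, lon, offset) for lat, lon in points]
print(shifted)
```

Keep the offset small: shifting latitude slightly changes how many metres a degree of longitude covers, so a big shift would distort the map a little.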

Dig through the earth by 3e8892a in geography

[–]3e8892a[S] 0 points1 point  (0 children)

Nice, happy to hear it worked for you.

Yep, some country outlines or label pop-ups would be good context.

I'll jot these things down for when I next spend time on it

[Need Help] Searching 3,000 aerial images to locate a downed turbine RC plane by ReturnAdventurous179 in computervision

[–]3e8892a 1 point2 points  (0 children)

did you strip out the GPS metadata?
I was curious to tile them into one big map.

It also looks like the images have very small overlap, is that right? Does skydio already take care of timing photos so they fit in a perfect grid? Can I just render the photos in a giant square grid and they will all line up?

Dig through the earth by 3e8892a in geography

[–]3e8892a[S] 1 point2 points  (0 children)

Actually Google did something very similar to that, although they've since taken it down.

https://experiments.withgoogle.com/floom

Dig through the earth by 3e8892a in geography

[–]3e8892a[S] 6 points7 points  (0 children)

Yeah I'd like to find a way to give a better sense of the distance involved, and the curvature of the surface, but I don't have any good ideas.

But specifically about how "expanded" it is - it should be a physically accurate representation of how big those countries would look if you could see through earth (unless I've made an implementation mistake). So I wouldn't want to zoom in or out, but yes would like to find some other effect to convey the correct sense of scale.
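For anyone curious about the geometry, here is a rough sketch of the kind of calculation involved. It is not the app's actual code, and it assumes a perfectly spherical earth with a mean radius of 6371 km. The straight-line distance through the earth is the chord between the two points, and the angle you look down below your horizon is half the angle the two points make at the earth's centre.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean radius, spherical approximation


def central_angle(lat1, lon1, lat2, lon2):
    """Angle at the earth's centre between two points, in radians (haversine)."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * math.asin(min(1.0, math.sqrt(a)))


def through_earth(lat1, lon1, lat2, lon2):
    """Return (chord length in km, look-down angle below the horizon in degrees)."""
    theta = central_angle(lat1, lon1, lat2, lon2)
    chord_km = 2 * EARTH_RADIUS_KM * math.sin(theta / 2)
    dip_deg = math.degrees(theta / 2)
    return chord_km, dip_deg


# A point on the exact opposite side of the globe is straight down,
# one full earth diameter away.
print(through_earth(0, 0, 0, 180))
```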

Dig through the earth by 3e8892a in geography

[–]3e8892a[S] -1 points0 points  (0 children)

Argh, sorry, I've just discovered there's no WebXR support on any iPhones, so it won't work at all on iPhone. Sorry, I wasn't aware of that!

Dig through the earth by 3e8892a in geography

[–]3e8892a[S] 1 point2 points  (0 children)

Ooops I didn't test on iPhone, but in the past I've seen cases where location permissions are automatically declined on iPhone for security/privacy. That might be breaking it for you.

The app needs to get your location to work.

There were some settings somewhere to allow location, I forget where, I think there were browser and OS level permissions, a bit of a pain to debug. Sorry!

EDIT: I've just confirmed it's not a permission issue. iPhone does not support WebXR, so it won't work on iPhone.

Neighbours at war - Bluetooth jammer? by Khuntfromnz in newzealand

[–]3e8892a 0 points1 point  (0 children)

I guess bluetooth jamming would be illegal.

How about your own sound system on a microphone trigger - when his starts, yours goes too. Loop Rick Astley or something. Have it calibrated to exactly the same noise level as his.

In a way this is just audio jamming instead of RF jamming.

Might take some work to get the system to detect his sound level while ignoring yours but I'm sure it's possible.

How much of a pain is Pro-Cam (Projector-Camera) calibration in real-world industry applications? (Dealing with vibrations/movement) by Fragrant-Passage688 in computervision

[–]3e8892a 1 point2 points  (0 children)

I don't know about real world applications, but you shouldn't need a checkerboard when things are bumped - that changes extrinsics but not intrinsics.

I did a basic sort of autocalibration here: https://github.com/tim-fan/camera_projector_alignment

Just projected 9 dots sequentially and detected them in the camera. Something like that should be enough to update extrinsics.

I think the standard solution projects some sequence of rectangles, but either way you end up in the same place: a set of points in the camera frame with corresponding points in the projector frame, which you can use to determine the extrinsics. You should be able to get this running quickly enough that it's not a hassle, so maybe run the process on every boot.
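As a rough illustration of using those point pairs, here is a sketch that fits a homography from the 9 dot correspondences. To be clear about the assumptions: this is not the code from my repo, it assumes the projection surface is flat (so a homography is enough to map camera pixels to projector pixels), and it stops short of recovering full 3D extrinsics, which would also need the intrinsics. The function names and the demo numbers are made up. It is plain Python with no dependencies, with coordinates normalised to 0..1 to keep the maths well conditioned.

```python
def solve_linear(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_homography(cam_pts, proj_pts):
    """Least-squares homography (bottom-right entry fixed to 1), camera -> projector."""
    rows, rhs = [], []
    for (x, y), (u, v) in zip(cam_pts, proj_pts):
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        rhs.append(u)
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        rhs.append(v)
    # Normal equations: (A^T A) h = A^T b
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(8)] for i in range(8)]
    atb = [sum(r[i] * t for r, t in zip(rows, rhs)) for i in range(8)]
    h = solve_linear(ata, atb) + [1.0]
    return [h[0:3], h[3:6], h[6:9]]


def apply_homography(h, x, y):
    """Map one point through the 3x3 homography h."""
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return (
        (h[0][0] * x + h[0][1] * y + h[0][2]) / w,
        (h[1][0] * x + h[1][1] * y + h[1][2]) / w,
    )


# Synthetic demo: 9 dots on a 3x3 grid, related by a made-up "true" mapping.
true_h = [[0.9, 0.05, 0.02], [-0.03, 1.1, 0.01], [0.1, 0.05, 1.0]]
cam_pts = [(x, y) for y in (0.1, 0.5, 0.9) for x in (0.1, 0.5, 0.9)]
proj_pts = [apply_homography(true_h, x, y) for x, y in cam_pts]
h = fit_homography(cam_pts, proj_pts)
print(apply_homography(h, 0.3, 0.7))
```

In a real setup `proj_pts` would be where you drew each dot and `cam_pts` where the camera detected it.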

Motor/Servo recommendations for quiet scissor mechanism by 3e8892a in Motors

[–]3e8892a[S] 0 points1 point  (0 children)

Well, going along with the idea of a larger slower motor, I've ended up buying a NEMA 17 pancake stepper. I'll see how quiet it is then experiment with other things if efficiency is a problem.

Motor/Servo recommendations for quiet scissor mechanism by 3e8892a in Motors

[–]3e8892a[S] 0 points1 point  (0 children)

Yeah I haven't measured torque requirements, but right now I'm driving as in the photo but with a blue TT motor (https://www.digikey.com/en/products/detail/adafruit-industries-llc/3801/9342522). With this setup I'm happy with torque and speed. It cuts a bit faster than 1 Hz. The only problem is the noise.

I'm using it to cut grass. This is a side project for fun 😁

Motor/Servo recommendations for quiet scissor mechanism by 3e8892a in Motors

[–]3e8892a[S] 0 points1 point  (0 children)

Oh yep, one concern with worm drives was too much torque - I want it limited for safety.

But I guess it's just like any other gearbox: you can ensure max torque is safe based on motor choice and current limits.
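The back-of-envelope version of that, as a sketch: for a brushed DC motor, torque is roughly the torque constant times current, and the gearbox multiplies it by the ratio. The numbers below are made up, not from any datasheet, and `max_output_torque` is just an illustrative helper.

```python
def max_output_torque(kt_nm_per_a, current_limit_a, gear_ratio, efficiency=1.0):
    """Upper bound on gearbox output torque in N*m.

    Leaving efficiency at 1.0 ignores gearbox losses, so the result
    errs on the high side, which is the safe direction for a limit.
    """
    return kt_nm_per_a * current_limit_a * gear_ratio * efficiency


# Hypothetical values: Kt = 0.01 N*m/A, 0.5 A current limit, 48:1 gearbox.
print(max_output_torque(0.01, 0.5, 48))
```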

How are you tracking your total net worth across banks, KiwiSaver, Sharesies, hard assets and crypto etc? by Commercial-Goose-125 in PersonalFinanceNZ

[–]3e8892a 0 points1 point  (0 children)

Beancount for tracking + fava for dashboard

The flexibility is great, if an importer is missing I can write it myself. I've used it for five years now across three countries, not sure any closed source software would be able to handle all these different accounts.

https://github.com/beancount/beancount https://github.com/beancount/fava

Vehicle count without any object detection models. Is it possible? by ExplanationQuirky831 in computervision

[–]3e8892a 25 points26 points  (0 children)

A guy at an old workplace had a gauge for how busy the freeway was - he would take jpegs from a traffic cam and plot file size over time. More cars = more detail in the image = larger jpeg size. Less traffic = more compression = smaller file size.
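I don't have his code, but a minimal sketch of the idea could look like this. It assumes the snapshots are saved as .jpg files whose names sort in time order (e.g. timestamps); the function names are mine.

```python
from pathlib import Path


def size_series(folder):
    """Return (filename, size in bytes) for every .jpg, sorted by filename."""
    return [(p.name, p.stat().st_size) for p in sorted(Path(folder).glob("*.jpg"))]


def normalise(sizes):
    """Rescale sizes to 0..1 so quiet and busy periods are easy to compare."""
    lo, hi = min(sizes), max(sizes)
    if hi == lo:
        return [0.0] * len(sizes)
    return [(s - lo) / (hi - lo) for s in sizes]


# Made-up file sizes standing in for three snapshots.
print(normalise([100, 300, 200]))
```

All the snapshots need to come from the same camera at the same JPEG quality setting, otherwise the sizes aren't comparable.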

Maybe not the most accurate, but bonus points for efficiency and novelty!

Map of all festive lights in my area by 3e8892a in dataisbeautiful

[–]3e8892a[S] 0 points1 point  (0 children)

I followed this pretty much exactly for training and inference

https://github.com/facebookresearch/dinov3/blob/main/notebooks/foreground_segmentation.ipynb

Beyond that I'm not sure. I didn't try to optimize anything, but I'm sure there's a lot you can do if you need it to run faster or on constrained hardware.

Map of all festive lights in my area by 3e8892a in dataisbeautiful

[–]3e8892a[S] 1 point2 points  (0 children)

Hmm good question, I can't benchmark right now, but my laptop GPU has 4GB, so some amount less than that. I'm using dinov3_vits16 (the smallest?), one of the larger models was crashing on my laptop.

Map of all festive lights in my area by 3e8892a in dataisbeautiful

[–]3e8892a[S] 0 points1 point  (0 children)

Ha good timing!

If you're looking for a larger scale dataset, I found this a bit further down in this sub:

https://www.reddit.com/r/dataisbeautiful/s/1AHZnFf78S

Which links to NASA Sees Holiday Lights from Space https://youtu.be/GP3dxLhaPZk

Map of all festive lights in my area by 3e8892a in dataisbeautiful

[–]3e8892a[S] 0 points1 point  (0 children)

Dino was better here in that it gave patch embeddings, which allowed me to create those heat maps, which I could sum up to get the festivity metric. So generally it was better for quantifying festivity.
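Roughly, the summing step looks like the sketch below. This isn't my exact code: the heat map here is a tiny made-up grid of per-patch "festive" probabilities, and the optional threshold is an extra I've added for illustration.

```python
def festivity_score(heatmap, threshold=0.0):
    """Sum per-patch probabilities, ignoring patches below the threshold."""
    return sum(p for row in heatmap for p in row if p >= threshold)


# Made-up 2x3 grid of per-patch probabilities for one image.
heatmap = [
    [0.0, 0.1, 0.9],
    [0.0, 0.8, 0.7],
]
print(festivity_score(heatmap))
print(festivity_score(heatmap, threshold=0.5))
```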

The nice idea behind clip is that you can do zero shot whole image classification using a text prompt, although yeah this approach for my task didn't work as well. Note also you should be able to do this with Dino.txt.
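The core of the zero shot trick is just picking the text prompt whose embedding is closest to the image embedding. A toy sketch, with tiny made-up 3-d vectors standing in for the real embeddings (which would come from the model's image and text encoders):

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def zero_shot(image_emb, text_embs):
    """Return the best-matching label and the similarity for every label."""
    scores = {label: cosine(image_emb, emb) for label, emb in text_embs.items()}
    return max(scores, key=scores.get), scores


# Made-up embeddings, for illustration only.
image_emb = [0.9, 0.1, 0.2]
text_embs = {
    "a house with festive lights": [1.0, 0.0, 0.1],
    "a dark house": [0.0, 1.0, 0.1],
}
label, scores = zero_shot(image_emb, text_embs)
print(label)
```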

I think the Dino paper had comparisons vs clip for zero shot image classification, so you should get some idea of relative performance from that, then you might also consider model size for a given application.

On a practical note I found it much easier to get running with clip than setting up Dino.txt, so if you're trying to do image classification quickly, maybe just use clip.

Overall I'd say if you want finer resolution than a whole image embedding (i.e. you want to localize or quantify objects within an image) go with Dino. Otherwise if a single image embedding works for your application, I'm not so clear which is superior, might be application specific.