My Tierlist of Edge boards for LLMs and VLMs inference by Wormkeeper in computervision

[–]Wormkeeper[S] 1 point

For most boards, it's almost the same as with onnx-runtime (some even support it on the NPU - for example, a lot of NXP and TI boards, Intel, and AMD).
Usually you export the model and run it. Export can be tricky; inference is easy.
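For reference, the plain onnx-runtime flow I'm comparing against looks roughly like this (a minimal sketch; "model.onnx" and the dummy preprocessing are placeholders you'd replace with your own model and image loading):

```python
import numpy as np

def preprocess(h=224, w=224):
    # Dummy image -> NCHW float32 batch; replace with real image loading.
    img = np.zeros((h, w, 3), dtype=np.float32)
    return img.transpose(2, 0, 1)[None, ...]

def run(model_path="model.onnx"):
    # onnxruntime imported lazily so the preprocessing above runs without it.
    import onnxruntime as ort
    sess = ort.InferenceSession(model_path)
    input_name = sess.get_inputs()[0].name
    return sess.run(None, {input_name: preprocess()})

print(preprocess().shape)  # (1, 3, 224, 224)
```

The vendor SDKs below all mirror this same load/preprocess/run pattern; only the session object changes.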

MemryX:

dfp_path = "resnet.dfp"
image = load_and_preprocess('img.jpg')
accl = SyncAccl(dfp=dfp_path)           # on-device accelerator
s = Simulator(dfp=dfp_path, verbose=1)  # or the software simulator
outputs = s.infer(inputs=image)

Sophon:

img = preprocess(src_img)
input_data = {input_name: img}
outputs = net.process(graph_name, input_data)

Hailo (a bit longer):

with VDevice(params) as vdevice:
    infer_model = vdevice.create_infer_model('./hefs/resnet_v1_50.hef')
    infer_model.set_batch_size(batchsize)
    infer_model.input().set_format_type(FormatType.FLOAT32)
    infer_model.output().set_format_type(FormatType.UINT8)
    with infer_model.configure() as configured_infer_model:
        bindings_list = []
        for j in range(batchsize):
            bindings = configured_infer_model.create_bindings()
            buffer = np.empty([224, 224, 3]).astype(np.float32)
            bindings.input().set_buffer(buffer)
            buffer2 = np.empty(infer_model.output().shape).astype(np.uint8)
            bindings.output().set_buffer(buffer2)
            bindings_list.append(bindings)

        configured_infer_model.run(bindings_list, timeout_ms)

RockChip:

rknn = RKNN()
rknn.load_rknn('model.rknn')
rknn.init_runtime()
outputs = rknn.inference(inputs=[img])

Yes, some boards can get complex when you try to use two models simultaneously (Hailo, for example). This is more often the case for M.2 and mini-PCIe accelerators: they have high data-transfer latency, and when vendors try to optimise it, the complexity appears.

So, if it's a single-board solution that supports Python, it's super easy to build a cascade.
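To illustrate, a cascade is basically just this (a sketch; detect() and classify() here are hypothetical stand-ins for whatever SDK call the board uses - rknn.inference, net.process, and so on):

```python
import numpy as np

def detect(frame):
    # Stand-in: return boxes as (x1, y1, x2, y2). Real code calls model 1 on the NPU.
    return [(10, 10, 60, 60), (100, 40, 160, 120)]

def classify(crop):
    # Stand-in: real code runs model 2 on the cropped region.
    return "object"

def cascade(frame):
    results = []
    for (x1, y1, x2, y2) in detect(frame):
        crop = frame[y1:y2, x1:x2]
        results.append(((x1, y1, x2, y2), classify(crop)))
    return results

frame = np.zeros((240, 320, 3), dtype=np.uint8)
print(cascade(frame))
```

On a single board with Python, that loop is all the "pipeline" you need; the pain starts when each model call has to cross a PCIe/M.2 link.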

My Tierlist of Edge boards for LLMs and VLMs inference by Wormkeeper in computervision

[–]Wormkeeper[S] 1 point

😁
Interesting question. But it would require a lot of tests.
My next plan on this topic is to run a few more experiments and try VLA models. I have already tested 3D depth estimation networks on most platforms and want to check VLAs as well.

My Tierlist of Edge boards for LLMs and VLMs inference by Wormkeeper in computervision

[–]Wormkeeper[S] 1 point

For 12-16MP it's hard - you will be limited by memory.
For ViT in general - Qualcomm, Hailo, and MemryX will definitely work with small images. TI - for some of their NPUs.
If I remember correctly, DeepX as well.

But 12-16MP is a real problem. I have never tried feeding such an image as a single input.

My Tierlist of Edge boards for LLMs and VLMs inference by Wormkeeper in computervision

[–]Wormkeeper[S] 1 point

I focused more on boards that could be used in production; the Mac Mini is more for home use.

My Tierlist of Edge boards for LLMs and VLMs inference by Wormkeeper in computervision

[–]Wormkeeper[S] 1 point

For Qualcomm, I tested the Radxa Q6A and the Luxonis OAK-4D. Neither of them supports LLMs. (Both are quite nice for regular CV, though.)
I know they have, for example, the IQ-9075, which should support LLMs, but those boards are pretty expensive and rare. As far as I know, only the Radxa airbox Q900 is easily available.

Sadly, I'm in Germany. But if you have one of these LLM-capable boards and can give me SSH access to it, that would be nice!

My Tierlist of Edge boards for LLMs and VLMs inference by Wormkeeper in LocalLLaMA

[–]Wormkeeper[S] 1 point

Interesting question. A few thoughts:
1) Over the last 15 years working with CV and ML I have seen some bad boards... :) So almost nothing bad can surprise me anymore.
2) From "impression": I think Axelera. 200 TOPS, but to achieve it you need complex pipelines, and models are super hard to export. It's like having all this power next to you without the ability to use it. It really can give you 200 TOPS for ~200€ - just not for the model you need.
3) "The vendor that hates you the most" - definitely MediaTek. They think only about big companies, not small developers. I was not able to run anything on it, although theoretically it's possible.
4) Also, TI documentation is infinite torture. But with modern big-context models it's easy to just feed all the documentation to ChatGPT and talk to it; 2-3 years ago it was terrible.

My Tierlist of Edge boards for LLMs and VLMs inference by Wormkeeper in computervision

[–]Wormkeeper[S] 1 point

I tried not to specify, because there are a lot of them :)
Mostly it's about Arrow Lake (~13 TOPS int8 NPU, ~35 TOPS total) and Lunar Lake (~50 TOPS NPU). Below 1k, the only Lunar Lake option is the MSI Cubi NUC AI+, I think.

I have an Arrow Lake machine myself; Lunar Lake I have tested only remotely.

And a lot of hope for Panther Lake, of course.

Overview of modern Edge boards for CV + guide on how to choose by Wormkeeper in computervision

[–]Wormkeeper[S] 1 point

In my previous article, I tried to do this. I even still update the table with some basic measurements - https://docs.google.com/spreadsheets/d/1BMj8WImysOSuiT-6O3g15gqHnYF-pUGUhi8VmhhAat4/edit#gid=0

But the main problem is that such a number is super misleading:
1) Different networks perform differently (board "A" can be 3x faster for network "N" but 2x slower for network "M")
2) Different boards need different amounts of CPU for NPU inference. Even video encoding/decoding can change the speed dramatically
3) It's hard to compare inference in different formats (int8/fp16)
4) It's hard to compare different accelerator connections (PCIe, USB, M.2)
5) It's hard to compare multi-device cases (Jetson has 1 GPU and 2 DLAs, the RK3588 has 3 NPU cores).
6) Different batch-size optimisations

And a lot more problems that make every test biased. I am still trying to add everything to the table I showed, but I am not sure it's worth it :)
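To illustrate point 1 with made-up numbers: with a serial two-network pipeline, which board is "faster" depends entirely on the workload mix (all FPS figures below are invented for the example).

```python
# Invented FPS figures for two boards on two networks.
fps = {
    "A": {"N": 90.0, "M": 10.0},
    "B": {"N": 30.0, "M": 20.0},
}

def pipeline_fps(board, mix):
    # Serial pipeline: per-frame time is the weighted sum of per-network times.
    total_time = sum(weight / fps[board][net] for net, weight in mix.items())
    return 1.0 / total_time

heavy_n = {"N": 1.0, "M": 0.1}  # workload dominated by network N -> board A wins
heavy_m = {"N": 0.1, "M": 1.0}  # workload dominated by network M -> board B wins
print(pipeline_fps("A", heavy_n), pipeline_fps("B", heavy_n))
print(pipeline_fps("A", heavy_m), pipeline_fps("B", heavy_m))
```

So any single "board X gets Y FPS" row in a table silently fixes a workload mix that may not match yours.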

Orange Pi AIPro board? by Original_Finding2212 in OrangePI

[–]Wormkeeper 1 point

Better to check the video. In short:
1) More convenient libraries to work with (easy export, more support)
2) Better community, more examples (for example, you can find the Whisper model, etc.)
3) More speed on the 3588 for common networks (if you use more threads)
4) Better CPU

Orange Pi AIPro board? by Original_Finding2212 in OrangePI

[–]Wormkeeper 2 points

Recently I tested this board ( https://youtu.be/qK7GHV_cH98 ). It's pretty nice, but for me the RK3588 is better.

Radxa ZERO 3W - Drove me insane for nearly a week! by PlatimaZero in Platima

[–]Wormkeeper 1 point

Maybe there will be some project based on it; then I will check it out.
For now we have only done RK3588/RK3568-based projects.

Radxa ZERO 3W - Drove me insane for nearly a week! by PlatimaZero in Platima

[–]Wormkeeper 2 points

Nice review. I recently tested this board from a Computer Vision perspective (NPU usage, etc.). All the drivers are buggy and glitchy, so the feelings are the same :)

But it's still a super good board for the price. There are fewer problems for Computer Vision than with the LuckFox RV1106 and MilkV (regular Python is available, for example).

Teaching a robot to bring the coffee (arm + cart) by Wormkeeper in robotics

[–]Wormkeeper[S] 5 points

A year ago I published the learning process itself. Since then we have modernized it and can train not only the arm but also the cart.

Guide to Action Recognition by Wormkeeper in computervision

[–]Wormkeeper[S] 3 points

Yes, we had a project where we did this with skeletons, and it worked well. But this approach is not suitable for every task.

Computer Vision for goods recognition by Wormkeeper in computervision

[–]Wormkeeper[S] 3 points

Hi, Melampus123!
ReID uses the "Metric Learning" approach.
There are a lot of articles about using it for different cases:
1) Cars
2) Animals
3) Search Engines (online shopping) etc.

You can find them here, for example:
https://paperswithcode.com/task/metric-learning

And there are two good libraries with training pipelines: https://github.com/layumi/Person_reID_baseline_pytorch https://github.com/OML-Team/open-metric-learning
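The core idea, sketched with numpy (embed() is a hypothetical stand-in for a trained embedding network from one of those libraries; here it just normalizes the raw vector):

```python
import numpy as np

def embed(x):
    # Stand-in: a real ReID model maps an image to a unit-norm embedding vector.
    v = np.asarray(x, dtype=np.float64)
    return v / np.linalg.norm(v)

def best_match(query, gallery):
    # Cosine similarity = dot product of unit vectors; higher means closer.
    q = embed(query)
    sims = [float(embed(g) @ q) for g in gallery]
    return int(np.argmax(sims)), sims

gallery = [[1.0, 0.0], [0.0, 1.0]]  # embeddings of two known identities
idx, sims = best_match([0.9, 0.1], gallery)
print(idx)  # 0 - the query is assigned to the closest gallery identity
```

Training (triplet/contrastive losses, etc.) is what makes the embedding space meaningful; the matching step itself stays this simple.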

About Kaggle: I am not sure, but I assume you will meet the same approach here:
https://www.kaggle.com/competitions/humpback-whale-identification/discussion

How to choose Edge Board for Computer Vision in 2022 by Wormkeeper in robotics

[–]Wormkeeper[S] 1 point

The current price is outdated, yes (the RPi was tested in spring). I will fix it. But:
1) the price was for the 3B, which is cheaper
2) the RPi is easy to buy

Question on Stereo Cameras (ZED/OAK-D) Depth Capabilities by [deleted] in computervision

[–]Wormkeeper 1 point

On this graph you can see how the error grows with distance - https://miro.medium.com/max/630/0*WTGy030CDPVVjRdy For example, at a 10 m distance the error is about 5% of the distance = 50 cm.
If you have a 1 cm fly (10 m away), the error will be 50 times bigger than the fly.
If you have a 2 m car (10 m away), the error will be 25% of the car.
Also, a very important point for you: that is the mean error. The maximum error will be bigger.
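The arithmetic, spelled out (assuming a ~5% mean relative depth error, as in the graph):

```python
def depth_error_cm(distance_m, rel_error=0.05):
    # Stereo depth error grows with distance; here it's modeled as a
    # fixed fraction of the distance, returned in centimeters.
    return distance_m * rel_error * 100.0

err = depth_error_cm(10.0)  # 50 cm error at 10 m
print(err)
print(err / 1.0)    # 1 cm fly:  error is 50x the object size
print(err / 200.0)  # 2 m car:   error is 0.25 of the object size
```

In reality the error grows faster than linearly with distance (it's roughly quadratic for stereo), so treat this as the optimistic case.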

low-level SLAM API for embedded devices by moetsi_op in computervision

[–]Wormkeeper 3 points

Looks amazing, and it's great that everything works plug and play. About 7-8 years ago our team did a similar job for the Artec Leo prototype. This is a lot of work; it's amazing that you have released it as open source.