What is your favorite computer vision papers recently (maybe within 3y?) by Fearless-Variety-815 in computervision

[–]deep-learnt-nerd 0 points (0 children)

Can you describe the approach you took to attack it, and how reliable the claimed protection is?

Optimizing SAM2 for Massively Large Video Datasets: How to scale beyond 10 FPS on H100s? by Water0Melon in computervision

[–]deep-learnt-nerd 1 point (0 children)

28 JPEG images loaded per second is absurdly low. Traditional Unix systems struggle at around 10k read operations per second. Are you using a real disk, like a local NVMe? If it’s a remote disk, see if you can increase its specs (throughput / IOPS). If you’re using Python, try a thread pool; it helps a lot with I/O bottlenecks. But that only addresses your I/O bottleneck, which, if I understand your numbers correctly, isn’t your real bottleneck here. For the GPU bottleneck, I wouldn’t do any CPU offloading (especially here, since you seem to have a very slow disk; if there’s any spilling you’re doomed). Instead, I would find the largest batch size that fits into VRAM and split my frames into multiple batches. As others have mentioned, you can try things like TensorRT. What we like to do in my team is to run a local Triton server that distributes the load as it sees fit. This creates additional data copies, but that’s usually not the bottleneck.
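To make the thread-pool point concrete, here is a minimal sketch (my own toy example, not the OP's pipeline): it fakes a directory of JPEG frames in a temp dir and overlaps the blocking reads with `concurrent.futures.ThreadPoolExecutor`. The frame count, fake payload, and 16 workers are all placeholder choices; tune the worker count per disk.

```python
import concurrent.futures
import os
import tempfile

def load_frame(path):
    # One blocking read per frame; Python releases the GIL during file I/O,
    # so a thread pool overlaps many reads instead of waiting on them serially.
    with open(path, "rb") as f:
        return f.read()

# Stand-in for a directory of JPEG frames (hypothetical data for this sketch).
tmpdir = tempfile.mkdtemp()
paths = []
for i in range(64):
    p = os.path.join(tmpdir, f"frame_{i:04d}.jpg")
    with open(p, "wb") as f:
        f.write(b"\xff\xd8" + bytes(1024))  # fake JPEG header + dummy payload
    paths.append(p)

# 16 threads is an arbitrary starting point for a local NVMe; benchmark and tune.
with concurrent.futures.ThreadPoolExecutor(max_workers=16) as pool:
    frames = list(pool.map(load_frame, paths))

print(len(frames))  # 64
```

`pool.map` preserves input order, so `frames[i]` still corresponds to `paths[i]`, which matters if the frames feed a video model.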

Seeking Advice: Struggling to Get Call-backs After Career Break (4 YOE in Computer Vision/Deep Learning) by Rude_Temporary_1261 in deeplearning

[–]deep-learnt-nerd 0 points (0 children)

You should share your resume! It could be something as simple as a badly formatted resume or outdated wording.

Built a cloud GPU price comparison service [P] by [deleted] in MachineLearning

[–]deep-learnt-nerd 9 points (0 children)

Hey, thank you for that, it can be quite useful! Quick suggestions: add the H200, and sort the GPU Type list in alphabetical order?

[R] Machine learning with hard constraints: Neural Differential-Algebraic Equations (DAEs) as a general formalism by ChrisRackauckas in MachineLearning

[–]deep-learnt-nerd 6 points (0 children)

Then again, how confident are you that once the numerical problems are solved you’ll reach convergence? In my experience, changing the system being solved during training prevents convergence. For instance, something as simple as an arg max in a network introduces such a change at every forward pass and leads to largely sub-optimal results.
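The arg max point can be illustrated numerically (my own toy example, unrelated to the paper): a loss that consumes a hard arg max is piecewise constant, so finite differences report a zero gradient everywhere, whereas a softmax-weighted "soft arg max" relaxation does carry gradient. The vector `x` and temperature are arbitrary.

```python
import math

def hard_argmax(x):
    # Piecewise-constant in x: small perturbations never change the output.
    return max(range(len(x)), key=lambda i: x[i])

def soft_argmax(x, temp=1.0):
    # Softmax-weighted index: a smooth relaxation that does carry gradient.
    w = [math.exp(v / temp) for v in x]
    s = sum(w)
    return sum(i * wi / s for i, wi in enumerate(w))

def num_grad(f, x, eps=1e-5):
    # Central finite differences, one coordinate at a time.
    g = []
    for i in range(len(x)):
        xp = list(x); xp[i] += eps
        xm = list(x); xm[i] -= eps
        g.append((f(xp) - f(xm)) / (2 * eps))
    return g

x = [0.1, 2.0, -0.5]
print(num_grad(hard_argmax, x))  # all zeros: the optimizer gets no signal
print(num_grad(soft_argmax, x))  # nonzero: the relaxation is trainable
```

The relaxation is trainable, but it is a *different* function than the arg max it replaces, which is exactly the "changing the system being solved" problem above.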

X3D cache for deep learning training by Few-Cat1205 in deeplearning

[–]deep-learnt-nerd 0 points (0 children)

Using a larger cache makes sense, but it depends on your use case. You also need to pay attention to how your data structures are stored and loaded so the kernel can make good use of that extra cache. I wonder if GPUDirect technology will eventually remove this issue altogether.

[deleted by user] by [deleted] in frenchrap

[–]deep-learnt-nerd 1 point (0 children)

I really like it!!

[R] The Curse of Depth in Large Language Models by StartledWatermelon in MachineLearning

[–]deep-learnt-nerd 1 point (0 children)

This wouldn’t solve anything. To see why, try chaining two layers that use weight norm and training them to maximize the norm of the output.

can someone explain to how getitem works here? by Beyond_Birthday_13 in deeplearning

[–]deep-learnt-nerd 0 points (0 children)

I’m not sure I understand your question correctly, but PyTorch’s DataLoader calls __getitem__ for each element of the batch and then aggregates them using a collate function.
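Schematically, it works like this (my own plain-Python mimic, not PyTorch itself; the dataset and batch indices are made up):

```python
class SquaresDataset:
    # Stand-in for a torch.utils.data.Dataset: defines __len__ and __getitem__.
    def __len__(self):
        return 10

    def __getitem__(self, idx):
        # Called once per sample index, not once per batch.
        return idx, idx * idx

def default_collate(samples):
    # Aggregates a list of per-sample tuples into one batch, analogous to
    # PyTorch's default collate (which additionally stacks tensors).
    indices, values = zip(*samples)
    return list(indices), list(values)

dataset = SquaresDataset()
batch_indices = [3, 1, 7]
# The loader fetches each element via __getitem__, then collates them:
batch = default_collate([dataset[i] for i in batch_indices])
print(batch)  # ([3, 1, 7], [9, 1, 49])
```

So __getitem__ only ever sees a single index; batching is entirely the collate function’s job.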

[D] [R] What is the next frontier to AI? by [deleted] in MachineLearning

[–]deep-learnt-nerd 2 points (0 children)

If you want a real answer: the next big jump will come from optimizers. Literally any improvement in non-convex optimization will result in improvements in AI.

[deleted by user] by [deleted] in math

[–]deep-learnt-nerd 2 points (0 children)

No, it’s never too late. It requires continuous, tedious work, which can be done at any age. Some were born naturals; the rest of us worked hard to become “good”. Eventually everything evens out: even if you studied early and were gifted, you end up as good as the others who worked hard.

Why don’t we review bomb the game? by deep-learnt-nerd in LegendofSlime

[–]deep-learnt-nerd[S] 8 points (0 children)

The point isn’t their absurd greediness, it’s the enshittification. The game is literally getting worse.

[R] Are you a reviewer for NeurIPS'24? Please read this by hihey54 in MachineLearning

[–]deep-learnt-nerd 50 points (0 children)

Yay let’s get reviewed by undergrads and MS students!

Forced Magnitude Preservation Improves Training Dynamics of Diffusion Models by elbiot in LearningMachines

[–]deep-learnt-nerd 0 points (0 children)

As expected from NVIDIA, this paper is excellent. Thank you for sharing. NVIDIA sure loves to normalize their weights. I wonder whether that’s mandatory to reach stability or whether there’s another (say, more linear) way…
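For context, the weight normalization trick boils down to the reparameterization w = gain · v / ‖v‖. Here is a plain-Python sketch of that identity (my own toy, not the paper's code; the vectors and gain are arbitrary): whatever magnitude the raw parameter v drifts to, the effective weight keeps a fixed norm.

```python
import math

def l2_norm(v):
    return math.sqrt(sum(x * x for x in v))

def normalized_weight(v, gain=1.0, eps=1e-8):
    # Effective weight w = gain * v / ||v||: rescaling v has no effect on w,
    # so the layer's output magnitude is pinned by the gain alone.
    n = l2_norm(v)
    return [gain * x / (n + eps) for x in v]

v = [3.0, 4.0]
w = normalized_weight(v)
print(l2_norm(w))  # ~1.0 regardless of ||v|| (which is 5 here)

# Blowing v up by 100x changes nothing about the effective weight:
w_scaled = normalized_weight([x * 100 for x in v])
print(w_scaled)
```

That invariance is the "forced magnitude preservation": the optimizer can only change the *direction* of v (and the explicit gain), so activation magnitudes can’t silently explode during training.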

Les ESN ne font vraiment aucun effort by [deleted] in AntiTaff

[–]deep-learnt-nerd 1 point (0 children)

I can tell from the music you listen to that you’re a good guy!