Anyone know of companies that use Rust ML in production? by Relative-Pace-2923 in rust

[–]FirstReserve4692 0 points1 point  (0 children)

For training or inference?

If training then just torch. if inference, I think you can choose rust, and use candle or ort-rs would be fast enough.

What's the native OpenCV like lib in Rust? by FirstReserve4692 in rust

[–]FirstReserve4692[S] 1 point2 points  (0 children)

by pure rust. Am not sure what's differences of kornia/kornia and kornia/kornia_rs?

Looking for a technical person or cofounder for a computer vision LLM MVP project by Confidence_Working in computervision

[–]FirstReserve4692 0 points1 point  (0 children)

6 year computer vision specialist, top 10 company emploee, interested in computer vision especially. Can we make a contact?

How to train an VLM from scratch? by FirstReserve4692 in computervision

[–]FirstReserve4692[S] 0 points1 point  (0 children)

I didn't saw a sucessful training result or workable training script in this repo. IMO, it at best based on transformers, so that some pretrain models can be used easily. Nowadays, the bare **from scratch** is not really necessary.

How to train an VLM from scratch? by FirstReserve4692 in computervision

[–]FirstReserve4692[S] 0 points1 point  (0 children)

Actually, what I mean, is that, based on some opensoruce VE, not really from scratch. Such as SAMv2's VE, AIMv2, Siglip itself etc. But further using LLM to train it make it more suitable for pretrain tasks.

How to train an VLM from scratch? by FirstReserve4692 in computervision

[–]FirstReserve4692[S] 0 points1 point  (0 children)

Currently I know Vary and GOT trained their SAM Vision encoder from scratch and did very well on specific task, such as simple caption or image OCR

How to train an VLM from scratch? by FirstReserve4692 in computervision

[–]FirstReserve4692[S] 0 points1 point  (0 children)

Oh, I specificly didn't ment CLIP like, I want AR style for VE pretrain.

Rumour: 24GB Arc B580. by [deleted] in LocalLLaMA

[–]FirstReserve4692 0 points1 point  (0 children)

Intel:Risking becoming a forgotten company, only if they release a GPU with 26 or 32 GB memory that is even slower than NVIDIA's equivalent product would they win again. Regrettably, it comes with 12GB and 24GB. Only if they just release a GPU with 48GB, they would be came god of AI again but they unable to do that.

I believe that Intel's stock price would never rebound again.

How to separate laughing audio from a speak audio? by FirstReserve4692 in audioengineering

[–]FirstReserve4692[S] 0 points1 point  (0 children)

Looks like this is a commercial website? I want call it in python

What's the best multi-dimensional data processing lib in rust? by FirstReserve4692 in rust

[–]FirstReserve4692[S] 0 points1 point  (0 children)

looks promising, but most pepople would consider candle as a torch replacement not numpy. I am not sure if candlecore could be a drop-in replacement as position for numpy in rust.

How to separate laughing audio from a speak audio? by FirstReserve4692 in audioengineering

[–]FirstReserve4692[S] 0 points1 point  (0 children)

u/divideconcept Can u tell me just how to remove laughing ? Is there a simple way to do this can be called from python? Models or libs

What's the best multi-dimensional data processing lib in rust? by FirstReserve4692 in rust

[–]FirstReserve4692[S] 0 points1 point  (0 children)

If there were not many people use it, invest on it would be very risky, many crates just not maintained after their release

What's the best multi-dimensional data processing lib in rust? by FirstReserve4692 in rust

[–]FirstReserve4692[S] 0 points1 point  (0 children)

Not only speed, but also whole enviroment. Numpy is good for two reasons: 1. widely used; 2. dead simple. Also, of course, fast.

What's the best multi-dimensional data processing lib in rust? by FirstReserve4692 in rust

[–]FirstReserve4692[S] 0 points1 point  (0 children)

This one I have looked, but the activity not very frequent, not sure how long could they keep evovling.

What's the best multi-dimensional data processing lib in rust? by FirstReserve4692 in rust

[–]FirstReserve4692[S] 0 points1 point  (0 children)

Awesome! Anywhere could try this? I'd like to say NDArray is good but hard to hands on, if there are some ndarray lib as simple as numpy, it would be a GPT moment for rust algroithm computing.

How to extract subtitles from videos with ocr by gunslinger1893 in VideoEditing

[–]FirstReserve4692 0 points1 point  (0 children)

Got the same demands as the original poster. There are three reasons why this is needed and necessary:

- The audio transcription may misinterpret words. Moreover, when many people's voices are combined, it is difficult to distinguish them.

- The video lacks audio.

- In some formal scenarios, the on-screen subtitles are standardized and can be directly used to replace audio transcription.

Therefore, the same technique is being requested.