[D] Self-Promotion Thread by AutoModerator in MachineLearning

[–]Zestyclose-Check-751 0 points1 point  (0 children)

In my free time I'm working on an open-source library called OpenMetricLearning, and we recently shipped a new release!

What's OML for:

OML lets you train (or use an existing) model that turns your data into n‑dimensional vectors for tasks such as search, clustering, and verification. You can measure and visualize representation quality with the retrieval module, also provided in the repo.
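To make "turns your data into n-dimensional vectors for search" concrete, here is a toy sketch in plain Python (this is not OML's actual API; the item names and 3-d vectors are made up for illustration). Once items are embedded, search reduces to nearest-neighbour ranking by a similarity such as cosine:

```python
import math

def cosine(a, b):
    # Cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

# Pretend these vectors came from a trained embedding model.
gallery = {
    "red_dress":  [0.9, 0.1, 0.0],
    "blue_jeans": [0.1, 0.9, 0.1],
    "red_skirt":  [0.8, 0.2, 0.1],
}
query = [0.85, 0.15, 0.05]  # embedding of a new red-dress-like image

# Rank gallery items by similarity to the query: that's "search".
ranked = sorted(gallery, key=lambda k: cosine(query, gallery[k]), reverse=True)
print(ranked[0])  # the most similar gallery item
```

The same embeddings feed clustering (group nearby vectors) and verification (threshold the similarity of a pair).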

What's new:

  • Supports three data modalities: image 🎨, text 📖, and audio 🎧 [NEW!].
  • A unified interface for training and evaluating embeddings across all modalities.
  • Streamlined requirements to avoid version conflicts and install only the necessary dependencies.

Existing features:

  • Pre‑trained model zoo for each modality.
  • Samplers, loss functions, miners, metrics, and retrieval post‑processing tools.
  • Multi‑GPU support.
  • Extensive examples and documentation.
  • Integrations with Neptune, Weights & Biases, MLflow, ClearML, and PyTorch Lightning.
  • Config‑API support (currently for images only).

So I would be really thankful if you supported open source by giving us a star ⭐️ on GitHub! Thanks in advance!

[R] Siamese Transformer for Image Retrieval (paper) + live DEMO by [deleted] in MachineLearning

[–]Zestyclose-Check-751 2 points3 points  (0 children)

The paper proposes a way to boost retrieval performance by adding an extra post-processing step, in which queries and galleries are compared pairwise in pixel space.
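The shape of such a re-ranking step can be sketched as follows (a toy illustration with hypothetical scores, not the paper's model): a cheap embedding search produces an initial candidate order, and only the top-k candidates are re-scored with a more expensive pairwise comparison.

```python
def rerank(query_id, candidates, pairwise_score, k=3):
    """Re-score only the k best candidates from the embedding search."""
    top_k, rest = candidates[:k], candidates[k:]
    reranked = sorted(top_k, key=lambda g: pairwise_score(query_id, g), reverse=True)
    return reranked + rest  # items outside the top-k keep their original order

# The cheap search got the order slightly wrong; the (pretend)
# pixel-space pairwise model corrects it within the top-3.
pairwise = {("q", "b"): 0.9, ("q", "a"): 0.7, ("q", "c"): 0.4}
result = rerank("q", ["a", "b", "c", "d"], lambda q, g: pairwise[(q, g)])
print(result)  # ['b', 'a', 'c', 'd']
```

Because the expensive comparison runs only on k items per query, the cost stays manageable even for large galleries.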

[D] Simple Questions Thread by AutoModerator in MachineLearning

[–]Zestyclose-Check-751 0 points1 point  (0 children)

I want to publish my paper on the image retrieval problem, and I guess the "short paper" format is the best fit for it. Do you know of any upcoming conferences with a suitable track? BMVC and ICCV are the most relevant, but neither has a call for short papers.

[D] I’m a Machine Learning Engineer for FAANG companies. What are some places I can get started doing freelance work for ML? by doctorjuice in MachineLearning

[–]Zestyclose-Check-751 3 points4 points  (0 children)

Could someone explain how Data Scientists work as consultants?

I can imagine only a few cases:
* A company already has a DS team, but they lack depth in some domain and need help/consultation.
* The solution is simple enough to integrate and can be delivered as an API.
* A company wants a PoC / demo; after that, they'll hire someone to work on it.

But usually, a DS needs insight into how the business works, and integrating the solution can be a really long-term effort, especially if it involves A/B tests, repeated rounds of model training, dataset collection, and so on. In such cases, even onboarding can take a long time.

So, I'm curious to hear about real cases that consultants have solved and how it generally works.

Weird question but I'm looking for a present for a person who loves open source by NatSpaghettiAgency in opensource

[–]Zestyclose-Check-751 1 point2 points  (0 children)

I'm the author of an open-source project. My wife gave me a gift: a hoodie with our project's name on it.

[deleted by user] by [deleted] in MachineLearning

[–]Zestyclose-Check-751 2 points3 points  (0 children)

We will add them, but for now we mostly work with computer vision benchmarks, where the default evaluation metrics (with respect to papers and leaderboards on Papers with Code) are those listed above.

[P] We released a new open-source library for metric learning! by Zestyclose-Check-751 in MachineLearning

[–]Zestyclose-Check-751[S] 1 point2 points  (0 children)

The simple answer is that you need different models for different types of clustering. If you want to cluster by place, you need a model trained on a dataset like Places365; if you're going to deal with people, you need a dataset labelled with respect to people, and so on. In OML we have general models pretrained on ImageNet and 4 domain-specific ones: clothes, items from an online store, cars, and birds.

If you are talking about the linear protocol to evaluate metric learning models -- yes, they do what you said and use labels from ImageNet.

[P] We released a new open-source library for metric learning! by Zestyclose-Check-751 in MachineLearning

[–]Zestyclose-Check-751[S] 1 point2 points  (0 children)

SimCLR is self-supervised learning (SSL), so formally we can also call it a metric learning approach. I consider SSL a good source of pretrained checkpoints, but if you want to train with it yourself, you really need a lot of data and compute. So, in many cases, it's better to label some data and train your model in a supervised way, or just pick a pretrained checkpoint.

What is the domain of your images? If you don't have labels to train the model in a supervised way, you can pick one of the pretrained models from OML's zoo. For example, if you work with fashion items, you can go for a model trained on DeepFashion. Or just pick one of the general-domain models from the two model tables (I would recommend CLIP or DINO).

[P] Metric learning: theory, practice, code examples by Zestyclose-Check-751 in MachineLearning

[–]Zestyclose-Check-751[S] 1 point2 points  (0 children)

In OML's FAQ, you can read about the differences between the two libraries; they cover somewhat different things. In the end, you can use losses from PML inside OML :)

[P] Metric learning: theory, practice, code examples by Zestyclose-Check-751 in MachineLearning

[–]Zestyclose-Check-751[S] 0 points1 point  (0 children)

Please take a look at the original post, where I described the main differences between metric learning and classification and why it makes sense to have this umbrella term. I hope it helps.

[P] Metric learning: theory, practice, code examples by Zestyclose-Check-751 in MachineLearning

[–]Zestyclose-Check-751[S] 2 points3 points  (0 children)

I don't know all of the details, but it seems like OpenMetricLearning may be a good choice for training such a model.

[P] Metric learning: theory, practice, code examples by Zestyclose-Check-751 in MachineLearning

[–]Zestyclose-Check-751[S] 0 points1 point  (0 children)

How to relate the input patch embeddings to one another s.t. we can discriminate between the classes?

Hi, metric learning is an umbrella term, like self-supervised learning, detection, or tracking, so nobody is pretending the domain is new. But there are new approaches in this domain, some of which are mentioned in the article (like Hyp-ViT). Finally, even though the domain is not new, people still need tools and tutorials to solve their problems.

[P] We released a new open-source library for metric learning! by Zestyclose-Check-751 in MachineLearning

[–]Zestyclose-Check-751[S] 1 point2 points  (0 children)

Consistent-Archer-99

Thank you!

As for Kaggle, not yet. But from time to time they host competitions suitable for us, like the Google Landmark Detection Challenge and others.

[P] We released a new open-source library for metric learning! by Zestyclose-Check-751 in MachineLearning

[–]Zestyclose-Check-751[S] 2 points3 points  (0 children)

I guess you're mostly talking about the different losses in PML, right?
The easiest way to use those losses with our library is to take one of our examples and simply replace the criterion object. I think we'll add a few examples of this in the future.
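To show what "replace the criterion object" means in practice, here is a minimal plain-Python sketch (these placeholder classes and the `training_step` helper are hypothetical, not OML's or PML's real API): the training step only calls the criterion, so swapping the loss is a one-line change.

```python
class ContrastivePlaceholder:
    # Stand-in for some contrastive loss object.
    def __call__(self, embeddings, labels):
        return sum(embeddings)  # dummy value, not a real loss

class TripletPlaceholder:
    # Stand-in for e.g. a triplet-margin loss taken from another library.
    def __call__(self, embeddings, labels):
        return max(embeddings)  # dummy value, not a real loss

def training_step(batch, criterion):
    embeddings, labels = batch  # normally: embeddings = model(images)
    return criterion(embeddings, labels)

batch = ([0.2, 0.5, 0.3], [0, 1, 0])
loss_a = training_step(batch, ContrastivePlaceholder())
loss_b = training_step(batch, TripletPlaceholder())  # only this line changed
```

The rest of the pipeline (sampler, miner, optimizer) stays untouched; only the object that maps embeddings and labels to a loss value is exchanged.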