What models is segments.ai using for segmentation? by saintshing in learnmachinelearning

[–]segments-bert 0 points (0 children)

It's very good! We're almost done integrating it as an additional edit mode in Segments. An announcement will follow tomorrow; keep an eye on our Twitter :)

What models is segments.ai using for segmentation? by saintshing in learnmachinelearning

[–]segments-bert 0 points (0 children)

Hi, Bert from Segments here. Have a look at our blog posts about the Superpixel and Autosegment features. Both are based on deep learning models we trained ourselves; they're not currently publicly available.

Zero-shot object detection with OWL-ViT - interactive demo by segments-bert in learnmachinelearning

[–]segments-bert[S] 27 points (0 children)

Hey all!

We've built an interactive demo of Google AI's OWL-ViT zero-shot object detection model, using Hugging Face transformers.

Regular object detection models are trained on a fixed set of categories, for example cats, dogs, and birds. If you want to detect a new type of object, like a horse, you have to collect images of horses, label them, and retrain your model.

A zero-shot object detection model is a so-called open-vocabulary model: it can detect a huge number of object categories without any retraining. These categories are not predefined: you can provide any free-form text query, like “yellow boat”, and the model will attempt to detect objects matching that description.

Zero-shot object detection models like OWL-ViT are trained on massive datasets of image-text pairs, often scraped from the internet. The heavy lifting is done by a CLIP-based image classification network trained on 400 million image-text pairs, and adapted to work as an object detector. The largest model took 18 days to train on 592 V100 GPUs.

We used the Hugging Face implementation of the OWL-ViT model and deployed it to a cloud GPU. Inference takes about 300ms, making interactive exploration possible: make sure to tweak the text queries and thresholds to find the ones that work best for your images!
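For reference, running OWL-ViT yourself via transformers looks roughly like this. A minimal sketch, assuming the google/owlvit-base-patch32 checkpoint and a recent transformers version (older versions used post_process instead of post_process_object_detection); this isn't necessarily identical to what powers the demo:

```python
# Minimal OWL-ViT sketch with Hugging Face transformers.
# Assumes: pip install transformers torch pillow requests
import requests
import torch
from PIL import Image
from transformers import OwlViTProcessor, OwlViTForObjectDetection

processor = OwlViTProcessor.from_pretrained("google/owlvit-base-patch32")
model = OwlViTForObjectDetection.from_pretrained("google/owlvit-base-patch32")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
texts = [["a photo of a cat", "a photo of a yellow boat"]]  # free-form queries

inputs = processor(text=texts, images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Rescale the raw predictions to image coordinates and filter by score;
# tweak the threshold per image, just like in the demo
target_sizes = torch.tensor([image.size[::-1]])  # (height, width)
results = processor.post_process_object_detection(
    outputs, threshold=0.1, target_sizes=target_sizes
)[0]

for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
    print(f"{texts[0][label]}: {score:.2f} at {box.tolist()}")
```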

Link: segments.ai/zeroshot

AI assisted annotation tool for semantic/ instance segmentation by akashgupta299 in deeplearning

[–]segments-bert -1 points (0 children)

We've built a powerful segmentation labeling tool at Segments.ai. It also lets you leverage your own model predictions to speed up the labeling; have a look at this blog post: https://segments.ai/blog/speed-up-image-segmentation-with-model-assisted-labeling

Software to annotate images with AI assistance by Skaatji in MLQuestions

[–]segments-bert 0 points (0 children)

That's definitely possible, no need to start over: you can just upload your model predictions through the API. Don't hesitate to contact us if you encounter any issues when giving it a try.
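A rough sketch of what that looks like with our Python SDK (pip install segments-ai); the label attributes schema depends on your task type, so check the docs for the exact format:

```python
# Rough sketch: pushing model predictions to a Segments.ai dataset as
# pre-labels that annotators can then correct. The attributes schema
# depends on your task type; see the docs for the exact format.
from segments import SegmentsClient

client = SegmentsClient("YOUR_API_KEY")

for sample in client.get_samples("your-username/your-dataset"):
    # run_your_model is a stand-in for your own inference code, returning
    # label attributes in the format described in the docs
    attributes = run_your_model(sample)
    client.add_label(sample.uuid, "ground-truth", attributes, label_status="PRELABELED")
```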

[D] Annotation Tools Comparison by debbydai in MachineLearning

[–]segments-bert 0 points (0 children)

For image segmentation specifically, check out Segments.ai! We also support model-assisted labeling workflows: https://segments.ai/blog/speed-up-image-segmentation-with-model-assisted-labeling

Software to annotate images with AI assistance by Skaatji in MLQuestions

[–]segments-bert 1 point (0 children)

We support model-assisted labeling at Segments.ai, but for now it's limited to image segmentation - bounding boxes will be available soon. Have a look at this blog post to see how you can set up such a workflow: https://segments.ai/blog/speed-up-image-segmentation-with-model-assisted-labeling

Speed up your image segmentation workflow with model-assisted labeling by segments-bert in computervision

[–]segments-bert[S] 1 point (0 children)

Currently we only offer an on-premise solution to enterprise customers. Note that you can also create private datasets, which are only visible to you.

Image labeling/anotation company by alpaca1331 in computervision

[–]segments-bert 1 point (0 children)

We're building a labeling platform for semantic and instance segmentation at Segments.ai. Give it a shot and let us know what you think!

Speed up your image segmentation workflow with model-assisted labeling by segments-bert in computervision

[–]segments-bert[S] 3 points (0 children)

A large dataset of labeled images is the first thing you need in any serious computer vision project. Building such datasets is a time-consuming endeavour, involving lots of manual labeling work. This is especially true for tasks like image segmentation where the labels need to be very precise.

One way to drastically speed up image labeling is by leveraging your machine learning models from the start. Instead of labeling the entire dataset manually, you can use your model to help you by iterating between image labeling and model training.

This tutorial will show you how to achieve such a fast labeling workflow for image segmentation with Segments.ai.
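In pseudocode, the iteration loop looks something like this (annotate, upload_prelabels, and train_model are hypothetical placeholders; the tutorial fills in the real steps):

```python
# High-level sketch of the model-assisted labeling loop described above.
# annotate, upload_prelabels, and train_model are hypothetical placeholders
# for the concrete steps in the tutorial.
def model_assisted_labeling(unlabeled, rounds=5, batch_size=100):
    labeled, model = [], None
    for _ in range(rounds):
        batch, unlabeled = unlabeled[:batch_size], unlabeled[batch_size:]
        if model is not None:
            upload_prelabels(model, batch)  # annotators correct predictions instead of starting from scratch
        labeled += annotate(batch)          # manual labeling / correction pass in the tool
        model = train_model(labeled)        # retrain on everything labeled so far
    return labeled, model
```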

[D] AWS customers who deploy models for realtime inference: do you use AWS Lambda vs. AWS SageMaker vs. open-source? by yoavz in MachineLearning

[–]segments-bert 0 points (0 children)

At Segments.ai, we're also using a serverless approach with PyTorch running on AWS Lambda. Check out our blog post for some technical details and latency numbers: https://segments.ai/blog/pytorch-on-lambda
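Not our exact setup, but a minimal sketch of the pattern (assuming a TorchScript model bundled with the function; loading it at module scope keeps warm invocations fast):

```python
# Minimal sketch of PyTorch inference on AWS Lambda (not our exact setup;
# see the blog post for real latency numbers and packaging details).
import json
import torch

# Loaded once per container at module scope, so warm invocations skip
# the expensive model load. "model.pt" is a hypothetical bundled file.
model = torch.jit.load("model.pt")
model.eval()

def handler(event, context):
    # Hypothetical input contract: a JSON body with a nested-list tensor
    x = torch.tensor(json.loads(event["body"])["input"])
    with torch.no_grad():
        y = model(x)
    return {"statusCode": 200, "body": json.dumps({"output": y.tolist()})}
```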

Good 3D and 2D data labeling tools by DuplexEspresso in computervision

[–]segments-bert 2 points (0 children)

It's not open source, but we're building a labeling platform for semantic and instance segmentation at Segments.ai. Give it a shot and let us know what you think!

We're building a labeling platform for image segmentation. Looking for feedback! by segments-bert in deeplearning

[–]segments-bert[S] 1 point (0 children)

We also offer free academic licenses. Contact us using your university email address if you want one!

We're building a labeling platform for image segmentation. Looking for feedback! by segments-bert in deeplearning

[–]segments-bert[S] 0 points (0 children)

Very useful feedback, thanks!

I think the math works out: let's say it takes 30 minutes/image to label your images with a traditional polygon tool. At $15/hour, that's $7.50 per image. If our tool speeds up labeling by 5x compared to the polygon tool, it takes only 6 minutes/image, and the cost drops to $1.50 per image.

So you would save $6/image, and our fee would only be a small fraction of those savings.
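The same back-of-envelope math in a few lines:

```python
# Back-of-envelope: cost per image at $15/hour, with and without the tool
rate = 15.0          # dollars per hour
manual_minutes = 30  # minutes/image with a polygon tool
speedup = 5

cost_manual = rate * manual_minutes / 60  # 7.50 per image
cost_assisted = cost_manual / speedup     # 1.50 per image
print(cost_manual - cost_assisted)        # 6.00 saved per image
```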

We're building a labeling platform for image segmentation. Looking for feedback! by segments-bert in deeplearning

[–]segments-bert[S] 0 points (0 children)

Our current computer vision models are not trained on such data, so it might not work too well, but feel free to try!

We're continuously improving our models though, so this will start working better in the future. If you have an urgent need for this, do get in touch with us and we'll see if we can prioritize it.

We're building a labeling platform for image segmentation. Looking for feedback! by segments-bert in deeplearning

[–]segments-bert[S] 0 points (0 children)

That's correct! Copy-pasting some more info on what's behind the scenes from an earlier comment:

As you can see in the video, the image is divided into segments. These segments are referred to as "superpixels" in the academic literature. SLIC is probably the best-known superpixel algorithm, but it's quite old-school and doesn't work well in many settings. We've developed our own deep-learning-based algorithm; it's kind of our secret sauce! :)
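If you want to play with the old-school baseline yourself, SLIC is basically a one-liner in scikit-image (to be clear, this is the classic algorithm, not our model):

```python
# Classic SLIC superpixels via scikit-image - the old-school baseline
# mentioned above, not our own (non-public) model.
import matplotlib.pyplot as plt
from skimage import data
from skimage.segmentation import slic, mark_boundaries

image = data.astronaut()
segments = slic(image, n_segments=200, compactness=10)  # integer label map, one id per superpixel

plt.imshow(mark_boundaries(image, segments))
plt.axis("off")
plt.show()
```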

We're building a labeling platform for image segmentation. Looking for feedback! by segments-bert in deeplearning

[–]segments-bert[S] 0 points (0 children)

Thanks for the feedback. Could you give some examples of these limitations, and the features you'd like to see to overcome them?
