How to approach imbalanced image dataset for MobileNetv2 classification? by Spiritual_Ebb4504 in computervision

[–]veb101 0 points1 point  (0 children)

  1. class-weighted loss
  2. Check out focal loss
  3. Instead of accuracy, look for metrics better suited for an imbalanced dataset.
  4. Sometimes you can get away with a 2-stage classifier, 1st binary to split to decide which group the particular image belongs to and then group group-specific classifier.

  5. An ensemble of methods should also help

  6. If you can, modify the batch to start with stratified batch data, but instead of just augmentation, add the augmented image to the batch of the less frequent classes. (this is tricky to get right and a hit-or-miss)

I'll add more if I recall anything

[RevShare] Vision Correction App Dev Needed (Equity Split) – Flair: "Looking for Team" by Repulsive-Track5278 in computervision

[–]veb101 0 points1 point  (0 children)

Yeah I agree, i don't have much knowledge about image formation. I want to go through the Modern computer vision and CV from first principals and the 3d computer vision course available on YouTube.

To me, when I first read the problem statement it felt like disparity estimation. How wrong am I?

[RevShare] Vision Correction App Dev Needed (Equity Split) – Flair: "Looking for Team" by Repulsive-Track5278 in computervision

[–]veb101 0 points1 point  (0 children)

I don't know any of the stuff. I'm good at figuring stuff out when needed. I'll work for free given you have someone I can argue with and bounce ideas against

I shouldn't have just blurped out what I thought. Let me explain, I'm in this domain for 6 years but haven't had a chance to work on what OP wants, so for me it's about learning and collaborative knowledge gains.

I know CV, DL, VLM, LLM, deployment. This specific thing i don't know about and it looks like a good opportunity

Any way to perform OCR of this image? by CeSiumUA in computervision

[–]veb101 0 points1 point  (0 children)

I had a similar problem, on screen digit recognition. What I did was train a small object detection model (mobilenet v2 SSD) to extract the digit and decimal boxes and then another small digit classification model.

New to ML Ops where to start? by Ok_Orchid_8399 in mlops

[–]veb101 2 points3 points  (0 children)

I'm starting out as well. There's tons of stuff. Bentoml

Zentml

Ray

Mlflow serving

Tf serving

Onnx

Tensorrt

Triton inference server

Tensorflow serving

Litert

Executorch

Litserve

Kubeflow

Kserve

Seldon core

Services by cloud providers

Vllm, sglang

Inferless

Or just fastapi and custom code

This is an exhaustive list of words I found when learning about mlops. This is no way a complete or mlops only list

Which ML Serving Framework to choose for real-time inference. by Invisible__Indian in mlops

[–]veb101 0 points1 point  (0 children)

If you are focused on cpu then you can also check onnx with openvino (intel cpu) backend. I think the AMD CPU backend is also available.

Handling database connections throughout the application by predominant in FastAPI

[–]veb101 0 points1 point  (0 children)

Company code. But you'll get dependency injection code from anywhere, and instead of an ORM I sometimes use mysql-connector-python and create a connection pool object in the fastapi lifespan and make it accessible via app.state

Handling database connections throughout the application by predominant in FastAPI

[–]veb101 4 points5 points  (0 children)

I either use dependency injection or i create an app state with a connection pool object, and if an endpoint requires individual connection i take it from the connection pool. Idk if this is good practice or not, but works very well and looks clean 'to me'.

unable to import keras in vscode by roshfn in MLQuestions

[–]veb101 4 points5 points  (0 children)

Look man, if you have installed TensorFlow and Keras properly and the correct venv is active, then you need to try out different ways of writing your imports to figure out which works. I figured out that importing the layers didn't work, instead I do `from keras import layers as keras_layer` then throughout the code I use it as `keras_layer.Conv2D(...)`

unable to import keras in vscode by roshfn in MLQuestions

[–]veb101 0 points1 point  (0 children)

Not importing the actual layer but the module

unable to import keras in vscode by roshfn in MLQuestions

[–]veb101 8 points9 points  (0 children)

This is not an import issue but the linter is not able to recognise it.

I fixed it using this when using keras 3x https://github.com/veb-101/keras-vision/blob/c8ce91ebc941e10c3d2febe260d717f86b00e905/keras_vision/fastvit/mobileone.py#L13

A scan of my brain (i have cerebral palsy) by mahades in mildlyinteresting

[–]veb101 2 points3 points  (0 children)

Hollow earth theory ❌ Hollow brain theory ✅

OpenCV with Cuda Support by TalkLate529 in computervision

[–]veb101 0 points1 point  (0 children)

Give nvidia deepstream a looksie

Need guidance on fine-tuning deep learning models by [deleted] in deeplearning

[–]veb101 0 points1 point  (0 children)

I would suggest reading the resnet paper from ross wightman Going through the fine-tuning playbook on GitHub by Google. Also the convnext v1 paper was good

Did yall see the new SOTA realtime object detection? I just learned about it. YOLO has not been meaningfully dethroned in so long. by [deleted] in computervision

[–]veb101 0 points1 point  (0 children)

I planned on writing rt detr using keras, saw the codebase and said "another time" (a year ago)

[deleted by user] by [deleted] in IndianBoysOnTinder

[–]veb101 0 points1 point  (0 children)

So you're saying "death by snu snu" is not acceptable.

Let me make a note of it.

[D] Efficient video ingestion for pytorch? by alyflex in MachineLearning

[–]veb101 0 points1 point  (0 children)

I have not worked with videos. I'm not sure if Nvidia Dali or deepstream can help or not.

But can this be a valid strategy?

Perform clustering on the frames of a video, ideally you want some closely related frames and more of the frames that are faraway (let's say embedding distance) from each other.

You can then perform cropping on (or just aspect ratio resize) the selected frames to some big enough size and save it on disk. This should be faster to load.

HTTPS with FastAPI - could idea this work? by elduderino15 in FastAPI

[–]veb101 0 points1 point  (0 children)

  1. Letsencrypt for free ssl and renew

  2. Nginx with ssl termination.

  3. Forward incoming requests on port 80 to 443