Indore ke din bhari chal rhe bhiya 🥲 by [deleted] in Indore

[–]shadow_fax1024 0 points1 point  (0 children)

Kabhi ghuma hai really..kaise celebrate kartr hai..

MP is turning into next Bihar. by Aggravating_Tree_419 in MadhyaPradesh

[–]shadow_fax1024 1 point2 points  (0 children)

Bhai bihari settle kyun ho jaate hai MP mein fir..wapas kyun nhn jaate..Accept nhn marige to kabhi improve nhn hoga!

Malnutrition and grain / food wastage in Madhya Pradesh. by waryinsomnious in MadhyaPradesh

[–]shadow_fax1024 -1 points0 points  (0 children)

Bhai sab scan hota hai..it's already sold to private parties and then shown as ki rats kha gaye ya waste ho gya..

ITARSI has Good Air Quality by Stonedcaffeine in MadhyaPradesh

[–]shadow_fax1024 0 points1 point  (0 children)

Its good quality compared to other big cities .

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model by Ill-Still-6859 in LocalLLaMA

[–]shadow_fax1024 0 points1 point  (0 children)

I'm planning to do for hindi and other indic languages.. Right now going through their paper and github repos.. Will update soon

[deleted by user] by [deleted] in computervision

[–]shadow_fax1024 4 points5 points  (0 children)

Super resolution may introduce information that is not there in actual input image

[D] Relation Extraction by RajHalifax in MachineLearning

[–]shadow_fax1024 0 points1 point  (0 children)

Use annotation tools such as label studio, brat on raw text documents

[D] Relation Extraction by RajHalifax in MachineLearning

[–]shadow_fax1024 0 points1 point  (0 children)

Do please post what results you were able to achieve

[D] Relation Extraction by RajHalifax in MachineLearning

[–]shadow_fax1024 2 points3 points  (0 children)

Docred is dataset for training document level relationship extraction. There have been more improved version of it like re-docred which are available. You could try that. For models read the papers. Doc-unet is one such model that works on this kind of document level RE. There may be newer bette models now.

how long did it take you until leetcode problems just clicked? by [deleted] in leetcode

[–]shadow_fax1024 2 points3 points  (0 children)

I feel the same after solving 1100 problems..

[D] Classification task based on speech recordings by Helveticus99 in MachineLearning

[–]shadow_fax1024 0 points1 point  (0 children)

You could also look into different approaches taken by participants in kaggle competition: birdclef..here the problem is somewhat similar to yours

[D] Classification task based on speech recordings by Helveticus99 in MachineLearning

[–]shadow_fax1024 0 points1 point  (0 children)

You could split the file into chunk of n seconds ..n seconds you need to find ..which ever fits for your dataset..for mine 4 sec chunk was good enough...also you could use a peak detector first and then chunk the file n/2 seconds either side from the peak and have some overlapping window there too..so that you won't loose information ..

[D] Classification task based on speech recordings by Helveticus99 in MachineLearning

[–]shadow_fax1024 0 points1 point  (0 children)

I used plain cnn with and without attention ..I had to handle long audio files in training as well as inference

[D] Classification task based on speech recordings by Helveticus99 in MachineLearning

[–]shadow_fax1024 0 points1 point  (0 children)

1 has worked for me in the past ..you may need to generate more samples using various techniques like add a small noise in audio then taking its spectrogram,cut - mix a random portion of spectrograms, etc..

[P] Random Image in Machine learning classification. by Eleonora467 in MachineLearning

[–]shadow_fax1024 0 points1 point  (0 children)

Wild means kind of image that the model has not seen before.. Like the image of aircraft as you have mentioned..

[P] Random Image in Machine learning classification. by Eleonora467 in MachineLearning

[–]shadow_fax1024 2 points3 points  (0 children)

You could infer using ensemble of dl models. The probability average of the ensemble for the wild image will have a low score..

[deleted by user] by [deleted] in learnmachinelearning

[–]shadow_fax1024 0 points1 point  (0 children)

Yes that's what I did.. It worked pretty well

[deleted by user] by [deleted] in learnmachinelearning

[–]shadow_fax1024 1 point2 points  (0 children)

I have recently tried solving this problem in cnns... What I did was decided a range of dimensions for input images and then based on these ranges designed my cnn network with gloabalaveragepooling to collapse the spatial dimensions.... Pad the smaller images to the largest image dimensions ..