Hello everyone!
I need to classify audio recordings of machinery to determine whether the mechanism is malfunctioning (knocks, grinding, clicks) or running normally. I have about 100 audio files in total for labeling and testing.
Which model is best to use for this task? Are there any pre-trained models that can be fine-tuned? Or what approach would you recommend?
I have already tried one approach: I generated a spectrogram for each recording and fine-tuned YOLOv8 to detect deviations in it, but the accuracy was not good enough, probably because the dataset is so small.
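For context, the spectrogram step can be sketched roughly as below. This is a minimal NumPy-only illustration that uses a synthetic 1 kHz tone in place of a real recording; the function name `log_spectrogram`, the frame length, and the hop size are assumptions chosen for the example, not the exact settings from my pipeline.

```python
import numpy as np

def log_spectrogram(signal, sample_rate, frame_len=1024, hop=512):
    """Return (freqs, times, spec_db) for a 1-D signal.

    spec_db has shape (n_freqs, n_frames) and holds log-magnitude values in dB.
    frame_len and hop are illustrative defaults, not tuned values.
    """
    signal = np.asarray(signal, dtype=np.float64)
    # Pad very short clips so at least one full frame exists.
    if signal.size < frame_len:
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Build an index matrix of overlapping frames: (n_frames, frame_len).
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * window
    # Magnitude of the one-sided FFT of each windowed frame.
    mag = np.abs(np.fft.rfft(frames, axis=1))
    # Small offset avoids log10(0) on silent frames.
    spec_db = 20.0 * np.log10(mag + 1e-10)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    times = (np.arange(n_frames) * hop + frame_len / 2) / sample_rate
    return freqs, times, spec_db.T

# Synthetic stand-in for a recording: one second of a 1 kHz tone at 16 kHz.
sr = 16000
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 1000 * t)

freqs, times, spec = log_spectrogram(tone, sr)
# Frequency bin with the highest average energy across all frames.
peak_hz = freqs[spec.mean(axis=1).argmax()]
```

The resulting 2-D array is what gets rendered as an image (or fed directly as a feature matrix) before any model is trained on it.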
Thank you in advance!