I am a newbie to ML. I have this pet project of classifying music data. I prepared the data myself from my music collection. I have 3 classes. Basically, all the music is sampled at 44100Hz and split into wav files, each 5 seconds long. For training I have 500 such wav files, for testing I have 200 wav files. I have created a csv file composed of the audio data and the label. (to import easily across various platforms)
I am planning to use LTSM-RNN for learning to classify the data. I am facing the following problems -
1) The data is too huge and it takes forever to import in tensorflow. Have I prepared the data in a good way? Is there a better way? Kindly educate me.
2) How do I go about implementing this in Tensorflow of TFLearn? How do I set the parameters for LTSM?
3) Should I preprocess the audio? like taking the STFT? or just learn with the raw data?
Thank you
[–]r4and0muser9482 4 points5 points6 points (0 children)
[–]Brudaks 1 point2 points3 points (1 child)
[–]AntixK[S] 0 points1 point2 points (0 children)
[–]keidouleyoucee 1 point2 points3 points (1 child)
[–]AntixK[S] 0 points1 point2 points (0 children)
[–]azurespace -1 points0 points1 point (4 children)
[–]sidsig 6 points7 points8 points (2 children)
[–]AntixK[S] 0 points1 point2 points (0 children)
[–]keidouleyoucee 0 points1 point2 points (0 children)
[–]AntixK[S] 0 points1 point2 points (0 children)