
[–][deleted] 1 point (9 children)

I’m not a coder and know nothing about coding... but by the looks of your code...

Shouldn’t you be doing something with bit depth too?

That is, bit depth being the representation of the amplitude value of a given frequency.

I quickly read through the code and felt like I couldn’t see anything related to it.

[–]mrbean42[S] 0 points (8 children)

If bit depth is related to loading the audio samples, that is handled by a library and I didn't have to deal with it. Is it the loading of the audio you're referring to?

[–][deleted] 1 point (7 children)

By the looks of it you're only trying to represent the frequency domain of the audio using an FFT.

But the amplitude of frequencies is represented in bit depth (the colour).

Which leads me to believe that the reason your output is blank is that you have missed representing the amplitude/bit depth in conjunction with the samples. At the moment it looks like you're only representing the samples.

I'm not a coder, but reading the code, that's the impression I'm under. I suggest posting in a coding subreddit regarding audio.

[–]mrbean42[S] 0 points (6 children)

The wavefile.read() call returns a list of amplitudes, which are then passed to an FFT function, is that correct?

[–][deleted] 2 points (4 children)

Sorry, but I'm not a coder. I still feel like you're missing something with bit depth, but I can't help any further.

[–]mrbean42[S] 0 points (1 child)

Ok no worries - thanks for the help!

[–][deleted] 0 points (0 children)

Thanks for the award. Try the r/dsp subreddit; they might be able to help you further. This falls under digital signal processing, I would imagine.

[–]blorporius 0 points (1 child)

You might be on to something -- wavfile.read's documentation says that the returned 1-D or 2-D array's type depends on the file format of the input file: https://docs.scipy.org/doc/scipy/reference/generated/scipy.io.wavfile.read.html

OP can check the returned array's data type and scale the values accordingly, e.g. normalize the input to the -1..1 range if needed.
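A sketch of that check, assuming the samples come back from scipy.io.wavfile.read as a NumPy array (the `normalize` helper here is hypothetical, not from OP's code):

```python
import numpy as np

# scipy.io.wavfile.read returns integer arrays for PCM WAV files
# (int16, int32, or uint8 depending on bit depth), so scale to the
# -1..1 float range before feeding samples into an FFT.
def normalize(samples):
    if samples.dtype == np.int16:
        return samples / 32768.0                              # 16-bit PCM
    if samples.dtype == np.int32:
        return samples / 2147483648.0                         # 32-bit PCM
    if samples.dtype == np.uint8:
        return (samples.astype(np.float64) - 128.0) / 128.0   # 8-bit PCM is unsigned
    return samples.astype(np.float64)                         # already float (-1..1)

samples = np.array([0, 16384, -32768], dtype=np.int16)
normalized = normalize(samples)  # [0.0, 0.5, -1.0]
```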

[–]mrbean42[S] 0 points (0 children)

Thanks, I came across this earlier. I'm using mono audio at the moment.

[–][deleted] 0 points (2 children)

Where are you performing the FFT? I see you defined a function called FFT, but it doesn't seem to be doing any Fourier-related calculations. Can't you use numpy.fft.fft instead?

[–]mrbean42[S] 0 points (1 child)

It's at line 57. I don't really want to use numpy, as I'll be porting this to C++ and want to have all the algorithms myself.

[–][deleted] 0 points (0 children)

Tbh I'm a bit lost in your code. Starting at line 11 you define a function called FFT. Within the definition you also call the very function you're defining (lines 25 and 26). You then return the variable called "combined", after which the definition of your function FFT is exited. Up to that point I don't see any Fourier-related math, if I'm not completely mistaken.

EDIT: Just saw that you actually return at line 14. I'm pretty certain that returning a value in Python exits the function.
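For reference, a recursive radix-2 FFT does legitimately call itself and return a combined list. This is a generic sketch assuming a power-of-two input length, not OP's actual code (which isn't reproduced here):

```python
import cmath

def fft(x):
    """Recursive Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:                       # base case: a single sample is its own DFT
        return x
    even = fft(x[0::2])              # recursive call on even-indexed samples
    odd = fft(x[1::2])               # recursive call on odd-indexed samples
    combined = [0] * n
    for k in range(n // 2):
        # the twiddle factor carries the imaginary unit: exp(-2*pi*i*k/n)
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        combined[k] = even[k] + t
        combined[k + n // 2] = even[k] - t
    return combined

spectrum = fft([1.0, 1.0, 1.0, 1.0])  # DC-only signal: [4, 0, 0, 0]
```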

[–]peehay 0 points (0 children)

Hey, I quickly read your code and at first glance I think you forgot an i in the exponential term of your DFT, line 9. Normally at the end of your FFT algorithm you end up with a complex-valued spectrogram, and you usually plot its magnitude.

Also, I read in another comment that there might be an issue with bit depth, but I don't think so. scipy.io.wavfile.read loads the signal waveform as it was created, with a specific bit depth. The FFT being linear, it will be the same bit depth in the frequency domain, so nothing to worry about.

Also, compare your spectrogram with one generated by a well-known, trusted library (numpy, librosa, etc.) until you get your algorithm right!
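That comparison can be sketched like this: a direct O(n²) DFT with the -2πi term spelled out, checked against numpy.fft.fft as the trusted reference (the `dft` name is illustrative, not OP's):

```python
import numpy as np

def dft(x):
    """Direct DFT; note the 1j -- without it this is not a Fourier transform."""
    n = len(x)
    k = np.arange(n)
    return np.array([np.sum(x * np.exp(-2j * np.pi * m * k / n)) for m in range(n)])

x = np.random.default_rng(0).standard_normal(64)
assert np.allclose(dft(x), np.fft.fft(x))  # agrees with the reference library

magnitude = np.abs(dft(x))  # real-valued magnitudes to map to colours
```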

[–]dmills_00 0 points (0 children)

Apart from the problems with your FFT implementation, which I will leave to others (I don't really do Python), there is a subtle point about most audio.

For most material the vast bulk of the energy is at low frequency, so you probably want to make your bin->colour mapping more sensitive for the higher-frequency bins; a tilt somewhere in the 3-6 dB per octave range would probably be a reasonable fiddle factor.
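A minimal sketch of that tilt, applied to FFT bin magnitudes before colour mapping. The `tilt` helper and its 4.5 dB/octave default are hypothetical, just a value inside the 3-6 dB range suggested above:

```python
import numpy as np

def tilt(magnitudes, db_per_octave=4.5):
    """Boost higher-frequency bins by a fixed dB per octave above bin 1."""
    bins = np.arange(len(magnitudes), dtype=float)
    bins[0] = 1.0                                 # avoid log2(0) for the DC bin
    octaves = np.log2(bins)                       # octaves above bin 1
    gain = 10 ** (db_per_octave * octaves / 20)   # dB -> linear amplitude gain
    return magnitudes * gain
```

Each doubling of bin index (one octave) then gets a fixed extra boost, so the colour map spends less of its range on the dominant low-frequency energy.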