Audio analytics in Python final project part two
Due date is December 19th at 10am
Each person submits their own code and accompanying audio files
I’ll be around at PierPont in the afternoon on Friday the 12th if anyone needs any help
I wouldn’t recommend using mpld3 in this instance as plotting long audio files using it can cause browser crashes due to the higher number of points
- Load in the urban audio recording ‘brooklyn_street.wav’
- Plot the waveform in the time domain with time in seconds on the x-axis
- Implement a moving average filter across the urban audio data. The frames should have these characteristics:
- A length of 125ms
- They should overlap 50% with the previous frame
- Each frame should be windowed with a Hanning window
- Calculate the Root Mean Squared magnitude of each frame
- Plot this moving RMS array on the plot generated in part 2
- Implemement a peak/max amplitude tracking algorithm using the same parameters as used in Part 3, apply it to the urban audio waveform and plot this on the same time domain plot from Part 2
- Generate a spectrogram of this audio data with time in seconds on the x-axis, frequency on the y-axis and color representing amplitude. Plot this below a plot of the signal in the time domain, ensuring that the x-axis lines up/corresponds in time
- Implement an event detection algorithm that will automatically identify salient sound sources within the urban audio recording. Salient sound sources mean sounds that stand out from the background noise as distinct sonic events.
- The algorithm can use a combination of time domain and frequency domain techniques
- The algorithm should utilise a moving window approach
- Add individually colored regions (<1.0 alpha) to a time domain and frequency domain (spectrogram) plots identifying the start and end time of each sound event
- Output start time, end time, duration, amplitude (RMS) and at least two spectral features for each identified event
- Save each identified sound event as a wave file, named by its start time in milliseconds
- There are around 10 salient events in the urban audio recording