Skip to content

Latest commit

 

History

History
32 lines (19 loc) · 1.13 KB

File metadata and controls

32 lines (19 loc) · 1.13 KB

Learning audio modeling: Audio Classification.

Summary

The purpose of this notebook is to teach myself audio processing from scratch. This notebooks are resources summarizing and showing example of processing techniques, data augmentation and modeling techniques for audio. Those notebooks are highly based on the references cited below and the toy example is a PyTorch implementation of Sath Adam's series of videos.

TODO's:

  • Audio Processing Techniques: Review and Summary.
  • Toy example: Instrument classification.
    • Data Visualization (fft, bank filters, mfcc).
    • Pre-processing.
    • CNN-modeling.
    • RNN-modeling.

References

Audio Preprocessing

Data Augmentation

Further reading