KEMBAR78
Audio Signal Processing For Machine Learning | PDF | Computers
0% found this document useful (0 votes)
321 views15 pages

Audio Signal Processing For Machine Learning

This document provides an overview of an upcoming course on audio signal processing for machine learning. The course will cover topics like sound waves, audio features, transformations like the Fourier transform, and applications such as audio classification, speech recognition, and music information retrieval. It will provide both theoretical foundations and coding tutorials using the librosa library. The intended audience is machine learning engineers, computer science students, and others interested in audio and music technology. A basic knowledge of Python is recommended.

Uploaded by

outanoute
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
321 views15 pages

Audio Signal Processing For Machine Learning

This document provides an overview of an upcoming course on audio signal processing for machine learning. The course will cover topics like sound waves, audio features, transformations like the Fourier transform, and applications such as audio classification, speech recognition, and music information retrieval. It will provide both theoretical foundations and coding tutorials using the librosa library. The intended audience is machine learning engineers, computer science students, and others interested in audio and music technology. A basic knowledge of Python is recommended.

Uploaded by

outanoute
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

Audio Signal Processing for

Machine Learning
Valerio Velardo
Problem

cat
Problem

car
Problem

car
Problem

car
Applications

● Audio classification
● Speech recognition / speaker verification
● Audio denoising / audio upsampling
● Music Information Retrieval
○ Music Instrument Classification
○ Mood Classification
○ …

● ...
Content

● Sound waves
● DAC / ADC
● Time- and frequency-domain audio features (e.g., rms, spectral centroid)
● Audio transformations
○ Fourier Transform / STFT
○ Constant-Q Transform
○ Mel Spectrograms
○ Chromograms

● ...
What should you expect?

● Theory
● Coding tutorials
Where do you I get the code/slides?
Technology stack
What you’ll learn

● Get a deep understanding of audio data


● Familiarise with frequency/time-domain audio features
● Extract features from raw audio
● Recognise what audio features to use for ML applications
● Preprocess audio data for ML
● Understand (some!) math behind audio transformations
● Use librosa for your audio projects
Don’t freak out!
Who’s this series for?

● ML/DL engineers
● Computer science students
● Software engineers
● Music technologists
● Tech-oriented musicians
Prerequisites

● Intermediate Python programming


Join the community!

thesoundofai.slack.com

You might also like