Select Page

Podcast Machine Learning All things Audio

Audio Signals Quality Diagnostics with Image Analysis. 
by Vasily Tolkachev, ZHAW

In this technical talk I present a number of interesting findings from a project with our industrial partner. The goal was to build a decent discriminative model which is able to distinguish between working and broken sound emitters based on the sound files produced by them. 
We approached the problem with various image analysis tools by applying different classifiers on spectrograms of these files. A technique called t-SNE, which led to the key findings in the project, is going to be introduced. Having faced a number of data artefacts such as erroneous labels and class imbalance, sufficiently good performance was already achieved with Random Forest after a number of important transformations. In conclusion, a comparison to variational autoencoders will be exemplified.


Powerpoint presentation

Audio Based Bird Species Identification Using Deep Learning Techniques 
by Elias Sprengel, ETHZ

Accurate bird species identification is essential for biodiversity conservation and acts as an important tool in understanding the impact of cities and commercial areas on ecosystems. Therefore, many attempts have been made to identify bird species automatically. These attempts usually rely on audio recordings because images of birds are harder to obtain. They work well when the number of bird species is low and the recordings contain little background noise, but they quickly deteriorate when employed in any real world scenario.

In this talk, we present a new audio classification approach based on recent advances in the domain of deep learning. With novel pre-processing and data augmentation methods, we train a neural network on the biggest publicly available dataset. This dataset contains crowd-sourced recordings of 999 bird species, providing an excellent way of evaluating our system in a more realistic scenario. Our convolutional neural network is able to surpass current state of the art results and won this year’s international BirdCLEF 2016 Recognition Challenge.

About The Author

Cédric Walter

I worked with various Insurances companies across Switzerland on online applications handling billion premium volumes. I love to continuously spark my creativity in many different and challenging open-source projects fueled by my great passion for innovation and blockchain technology.In my technical role as a senior software engineer and Blockchain consultant, I help to define and implement innovative solutions in the scope of both blockchain and traditional products, solutions, and services. I can support the full spectrum of software development activities, starting from analyzing ideas and business cases and up to the production deployment of the solutions.I'm the Founder and CEO of Disruptr GmbH.