Classify audio clips. In this repo, I train a model on the UrbanSound8K dataset and achieve about 80% accuracy on the test set.

A pre-trained model is provided in `urban_sound_train`, trained for 1000 epochs.
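For reference, UrbanSound8K covers ten urban sound classes. A label map like the following (class names and IDs taken from the dataset's metadata; the dict name itself is illustrative, not part of this repo) is handy when decoding model predictions:

```python
# The 10 UrbanSound8K classes, keyed by the dataset's classID column.
URBANSOUND8K_CLASSES = {
    0: "air_conditioner",
    1: "car_horn",
    2: "children_playing",
    3: "dog_bark",
    4: "drilling",
    5: "engine_idling",
    6: "gun_shot",
    7: "jackhammer",
    8: "siren",
    9: "street_music",
}

print(len(URBANSOUND8K_CLASSES))  # → 10
```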
- `audio_train.py`: Train the audio model from scratch or restore it from a checkpoint.
- `audio_params.py`: Configuration for training the model.
- `audio_inference_demo.py`: Demo for testing the trained model.
- `./audio/*`: Dependencies for training, the model, and the datasets.
- `./vggish/*`: Dependencies of VGGish for feature extraction.
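VGGish-style feature extraction turns a waveform into log-mel spectrogram patches before classification. The sketch below is a simplified NumPy illustration of that pipeline (the repo's actual extraction lives in `./vggish/*`); the parameter values are the usual VGGish ones (16 kHz audio, 25 ms / 10 ms STFT, 64 mel bands, 0.96 s examples), and all function names here are hypothetical:

```python
import numpy as np

SAMPLE_RATE = 16000          # VGGish expects 16 kHz mono audio
STFT_WINDOW = 400            # 25 ms window
STFT_HOP = 160               # 10 ms hop
FFT_SIZE = 512
NUM_MEL_BINS = 64
MEL_MIN_HZ, MEL_MAX_HZ = 125.0, 7500.0
EXAMPLE_FRAMES = 96          # 0.96 s of frames per example

def hz_to_mel(f):
    return 1127.0 * np.log1p(f / 700.0)

def mel_filterbank(num_bins, fft_size, sample_rate, fmin, fmax):
    # Triangular filters mapping linear STFT bins to mel bands.
    spec_hz = np.linspace(0.0, sample_rate / 2.0, fft_size // 2 + 1)
    mel_edges = np.linspace(hz_to_mel(fmin), hz_to_mel(fmax), num_bins + 2)
    hz_edges = 700.0 * np.expm1(mel_edges / 1127.0)
    fb = np.zeros((len(spec_hz), num_bins))
    for i in range(num_bins):
        lo, ctr, hi = hz_edges[i], hz_edges[i + 1], hz_edges[i + 2]
        rising = (spec_hz - lo) / (ctr - lo)
        falling = (hi - spec_hz) / (hi - ctr)
        fb[:, i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def waveform_to_examples(wave):
    # Frame the waveform, take the magnitude STFT, map to log-mel,
    # then chop into non-overlapping 96-frame examples.
    n_frames = 1 + (len(wave) - STFT_WINDOW) // STFT_HOP
    idx = (np.arange(STFT_WINDOW)[None, :]
           + STFT_HOP * np.arange(n_frames)[:, None])
    frames = wave[idx] * np.hanning(STFT_WINDOW)
    spec = np.abs(np.fft.rfft(frames, n=FFT_SIZE))
    mel = spec @ mel_filterbank(NUM_MEL_BINS, FFT_SIZE, SAMPLE_RATE,
                                MEL_MIN_HZ, MEL_MAX_HZ)
    log_mel = np.log(mel + 1e-6)
    n_examples = log_mel.shape[0] // EXAMPLE_FRAMES
    return log_mel[:n_examples * EXAMPLE_FRAMES].reshape(
        n_examples, EXAMPLE_FRAMES, NUM_MEL_BINS)

# Two seconds of noise -> two 96x64 log-mel examples.
wave = np.random.RandomState(0).randn(2 * SAMPLE_RATE)
examples = waveform_to_examples(wave)
print(examples.shape)  # → (2, 96, 64)
```

Each 96x64 patch is what the classifier (or a VGGish embedding network) consumes as one training example.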
Conda is recommended; setup takes one line: `conda env create -f conda.env.yml`
- Configure parameters: `audio_params.py`.
- Train the model: `python audio_train.py` (it creates the TFRecords automatically if they don't exist).
- Monitor training in TensorBoard: `tensorboard --logdir=./data/tensorboard`
- Test the model: `python audio_inference_demo.py`
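The "about 80% accuracy" figure above is the usual top-1 metric: the fraction of test clips whose arg-max predicted class matches the label. A minimal sketch of that computation (function and variable names are illustrative, not from this repo):

```python
import numpy as np

def accuracy(logits, labels):
    """Fraction of clips whose arg-max class matches the true label."""
    return float(np.mean(np.argmax(logits, axis=1) == labels))

# Toy check: 4 clips, 10 UrbanSound8K classes, 3 of 4 correct -> 0.75.
logits = np.zeros((4, 10))
labels = np.array([2, 5, 5, 9])
logits[np.arange(4), [2, 5, 1, 9]] = 1.0  # third prediction is wrong
print(accuracy(logits, labels))  # → 0.75
```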