Installation

Download the system

Download the latest package or clone the system directly from the repository. To clone the repository over HTTPS:

git clone https://github.com/TUT-ARG/DCASE2017-baseline-system.git

or over SSH:

git clone git@github.com:TUT-ARG/DCASE2017-baseline-system.git

The system is developed for Python 2.7, Python 3.5 and Python 3.6, and has been tested on Linux, Windows and macOS.

You can install either the official CPython or a Python distribution based on it. New users are advised to use the Anaconda Python distribution.
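As a quick sanity check before installing anything else, the interpreter version can be compared against the versions listed above. This is a minimal sketch, not part of the baseline system; the function name `is_supported_python` is hypothetical, and treating the list as an exact allow-list is an assumption.

```python
import sys

# Versions named in this section; treated here as an exact allow-list (an assumption).
SUPPORTED_VERSIONS = {(2, 7), (3, 5), (3, 6)}


def is_supported_python(version_info=None):
    """Return True if the (major, minor) pair is one the system was developed for."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) in SUPPORTED_VERSIONS


if __name__ == "__main__":
    if not is_supported_python():
        # Newer interpreters may still work; this only reports what the text lists.
        print("Warning: Python %d.%d is not among the listed versions"
              % tuple(sys.version_info[:2]))
```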

Using the system on Windows

The baseline system uses rather long directory paths, because it stores a 32-character MD5 hash of the system parameters in directory names. Some Windows systems have a path length limit (as low as 260 characters), which can cause problems. To avoid them, install the system as close to the drive root as possible.
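To see why the install location matters, the sketch below estimates how many characters remain under the 260-character limit once a few hash-named directories are added to a base path. It is illustrative only: `parameter_hash` is a hypothetical stand-in (the system's real hashing of its parameters may differ), and the number of hash-named directory levels is an assumed input, not a documented value.

```python
import hashlib
import json
import os

WINDOWS_MAX_PATH = 260  # the lowest limit mentioned in the text


def parameter_hash(params):
    """Hypothetical stand-in: MD5 hex digest of a JSON dump of the parameters."""
    payload = json.dumps(params, sort_keys=True).encode("utf-8")
    return hashlib.md5(payload).hexdigest()  # always 32 hex characters


def remaining_path_budget(base_dir, hashed_levels, filename, limit=WINDOWS_MAX_PATH):
    """Characters left under `limit` for a path with `hashed_levels` hash-named directories."""
    placeholder = "0" * 32  # same length as an MD5 hex digest
    parts = [base_dir] + [placeholder] * hashed_levels + [filename]
    return limit - len(os.path.join(*parts))
```

A negative result from `remaining_path_budget` means the path would exceed the limit; a shorter `base_dir` increases the budget by exactly the number of characters saved.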

External modules

To ensure that all external modules are installed, run the command:

pip install -r requirements.txt
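After installation, it can be useful to confirm that the modules actually import. The helper below is a small sketch; `find_missing_modules` is a hypothetical name, and the list of modules to check is supplied by the caller, since the contents of requirements.txt are not reproduced here.

```python
import importlib


def find_missing_modules(module_names):
    """Return the names from `module_names` that cannot be imported."""
    missing = []
    for name in module_names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing
```

For example, `find_missing_modules(["soundfile", "keras", "theano"])` checks the import names of the libraries discussed in this section. Note that importing Keras also initialises its backend, so the call may take a moment.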

PySoundFile

PySoundFile is used to read and write audio files in the system. The library depends on the system-level library libsndfile. On Linux (Debian/Ubuntu) you can install it with the command:

sudo apt-get install libsndfile1
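Whether a system-wide libsndfile is visible can be checked from Python with the standard library. This is a rough sketch with a hypothetical function name; a `False` result is only a hint, because some PySoundFile packages ship their own copy of the library, in which case the system-wide lookup fails even though audio I/O works.

```python
from ctypes.util import find_library


def libsndfile_available():
    """Return True if the system linker can locate a libsndfile shared library."""
    return find_library("sndfile") is not None


if __name__ == "__main__":
    print("libsndfile found:", libsndfile_available())
```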

Theano

By default, the system uses Theano as the Keras backend.

Theano 0.8.2 has a bug that affects the Task 2 system (when using a GPU and binary_crossentropy as the loss function), so make sure to use the 0.9.0 release. To install the correct version of Theano, run:

pip install theano==0.9.0

Or install the latest version from git with the command:

pip install --upgrade --no-deps git+https://github.com/Theano/Theano.git
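The installed Theano version can be compared against the 0.9.0 minimum with a small helper. This is a sketch under simple assumptions: the function names are hypothetical, only the leading numeric part of the version string is considered, and pre-release suffixes such as `.dev` are ignored.

```python
import re


def version_tuple(version):
    """Parse the leading numeric components, e.g. '0.9.0' -> (0, 9, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError("unrecognised version string: %r" % version)
    parts = [int(part) for part in match.group(0).split(".")]
    while len(parts) < 3:  # pad '0.9' to (0, 9, 0) so comparisons behave
        parts.append(0)
    return tuple(parts)


def theano_version_ok(version, minimum=(0, 9, 0)):
    """Return True if `version` is at least `minimum`."""
    return version_tuple(version) >= minimum
```

With Theano installed, the check would be `theano_version_ok(theano.__version__)` after `import theano`.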

TensorFlow

It is also possible to use TensorFlow as the Keras backend. If you plan to use it with a GPU, make sure you have installed the GPU-enabled package:

pip uninstall tensorflow
pip uninstall tensorflow-gpu
pip install tensorflow-gpu

The current version of TensorFlow supports only graphics cards with CUDA compute capability 3.0 or higher.
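Keras itself picks its backend from the `KERAS_BACKEND` environment variable (or from the `backend` field in `~/.keras/keras.json`). The sketch below sets the variable from Python; the helper name `select_keras_backend` is hypothetical, and whether the baseline system overrides this choice through its own parameters is not covered here.

```python
import os


def select_keras_backend(name):
    """Set KERAS_BACKEND; this must run before the first `import keras`."""
    if name not in ("theano", "tensorflow"):
        raise ValueError("unsupported backend: %r" % name)
    os.environ["KERAS_BACKEND"] = name
    return name
```

Calling `select_keras_backend("tensorflow")` at the very top of a script, before Keras is imported, makes Keras load the TensorFlow backend for that process only.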

Audio datasets

The system automatically downloads the required audio datasets and places them under the directory specified in the parameters (see parameter path->data).

Dataset                                   Type                       Audio files   Size on disk   License
TUT Acoustic scenes 2017, development     Acoustic scene             4680          22 GB          Academic use only (see EULA inside the package)
TUT Rare sound events 2017, development   Sound events / synthetic   1281          9.2 GB         Academic use only (see EULA inside the package)
TUT Sound events 2017, development        Sound events / realistic   24            2.6 GB         Academic use only (see EULA inside the package)
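Since the datasets are downloaded automatically, it may be worth checking free disk space on the data directory first. The following is a minimal sketch using the on-disk sizes from the table above; the function names are hypothetical, and `shutil.disk_usage` requires Python 3.3 or newer, so this check does not run on Python 2.7.

```python
import shutil

# On-disk sizes from the table above, in gigabytes.
DATASET_SIZES_GB = {
    "TUT Acoustic scenes 2017, development": 22.0,
    "TUT Rare sound events 2017, development": 9.2,
    "TUT Sound events 2017, development": 2.6,
}


def required_space_gb(sizes=None):
    """Sum the listed on-disk sizes in gigabytes."""
    if sizes is None:
        sizes = DATASET_SIZES_GB
    return sum(sizes.values())


def has_enough_space(path, required_gb):
    """Return True if the filesystem holding `path` has at least `required_gb` free."""
    free_gb = shutil.disk_usage(path).free / 1e9
    return free_gb >= required_gb
```

For instance, `has_enough_space(".", required_space_gb())` checks the current directory against the combined size of all three development datasets (about 33.8 GB).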