Datasets¶
Classes for dataset handling
Dataset - Base class¶
This is the base class, and all the specialized datasets are inherited from it. One should never use base class itself.
Usage examples:
1 2 3 4 5 6 7 8 9 10 11 | # Create class
dataset = TUTAcousticScenes_2017_DevelopmentSet(data_path='data')
# Initialize dataset, this will make sure dataset is downloaded, packages are extracted, and needed meta files are created
dataset.initialize()
# Show meta data
dataset.meta.show()
# Get all evaluation setup folds
folds = dataset.folds()
# Get all evaluation setup folds
train_data_fold1 = dataset.train(fold=folds[0])
test_data_fold1 = dataset.test(fold=folds[0])
|
Dataset(\*args, \*\*kwargs) |
Dataset base class |
Dataset.initialize() |
|
Dataset.show_info() |
|
Dataset.audio_files |
Get all audio files in the dataset |
Dataset.audio_file_count |
Get number of audio files in dataset |
Dataset.meta |
Get meta data for dataset. |
Dataset.meta_count |
Number of meta data items. |
Dataset.error_meta |
Get audio error meta data for dataset. |
Dataset.error_meta_count() |
Number of error meta data items. |
Dataset.fold_count |
Number of fold in the evaluation setup. |
Dataset.scene_labels |
List of unique scene labels in the meta data. |
Dataset.scene_label_count |
Number of unique scene labels in the meta data. |
Dataset.event_labels() |
List of unique event labels in the meta data. |
Dataset.event_label_count |
Number of unique event labels in the meta data. |
Dataset.audio_tags |
List of unique audio tags in the meta data. |
Dataset.audio_tag_count |
Number of unique audio tags in the meta data. |
Dataset.download_packages() |
Download dataset packages over the internet to the local path |
Dataset.extract(\*args, \*\*kwargs) |
Extract the dataset packages |
Dataset.train([fold]) |
List of training items. |
Dataset.test([fold]) |
List of testing items. |
Dataset.eval([fold]) |
List of evaluation items. |
Dataset.train_files([fold]) |
List of training files. |
Dataset.test_files([fold]) |
List of testing files. |
Dataset.eval_files([fold]) |
List of evaluation files. |
Dataset.validation_files([fold]) |
List of validation files if they are specified by the dataset. |
Dataset.folds([mode]) |
List of fold ids |
Dataset.file_meta(filename) |
Meta data for given file |
Dataset.file_error_meta(filename) |
Error meta data for given file |
Dataset.file_error_meta(filename) |
Error meta data for given file |
Dataset.relative_to_absolute_path(path) |
Converts relative path into absolute path. |
Dataset.absolute_to_relative(path) |
Converts absolute path into relative path. |
AcousticSceneDataset¶
AcousticSceneDataset(\*args, \*\*kwargs) |
Specialized classes inherited AcousticSceneDataset:
TUTAcousticScenes_2017_DevelopmentSet(\*args, ...) |
TUT Acoustic scenes 2017 development dataset |
TUTAcousticScenes_2016_DevelopmentSet(\*args, ...) |
TUT Acoustic scenes 2016 development dataset |
TUTAcousticScenes_2016_EvaluationSet(\*args, ...) |
TUT Acoustic scenes 2016 evaluation dataset |
SoundEventDataset¶
SoundEventDataset(\*args, \*\*kwargs) |
|
SoundEventDataset.event_label_count([...]) |
Number of unique scene labels in the meta data. |
SoundEventDataset.event_labels([scene_label]) |
List of unique event labels in the meta data. |
SoundEventDataset.train([fold, scene_label, ...]) |
List of training items. |
SoundEventDataset.test([fold, scene_label, ...]) |
List of testing items. |
Specialized classes inherited SoundEventDataset:
TUTRareSoundEvents_2017_DevelopmentSet(...) |
TUT Acoustic scenes 2017 development dataset |
TUTSoundEvents_2017_DevelopmentSet(\*args, ...) |
TUT Sound events 2017 development dataset |
TUTSoundEvents_2016_DevelopmentSet(\*args, ...) |
TUT Sound events 2016 development dataset |
TUTSoundEvents_2016_EvaluationSet(\*args, ...) |
TUT Sound events 2016 evaluation dataset |
AudioTaggingDataset¶
AudioTaggingDataset(\*args, \*\*kwargs) |