Overview
Artificial reverberation is added to anechoic speech data to train machine learning models for automatic speech processing that are more robust to reverberant conditions. We are developing methods for automatic speech recognition, source separation and localization, binaural audio generation, and speech emotion recognition.
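As a rough illustration of what adding artificial reverberation means in practice, the sketch below convolves a dry signal with a room impulse response (RIR). It does not use pygsound or any of the datasets listed on this page: the RIR is a hypothetical stand-in (exponentially decaying noise), the "speech" is a placeholder tone, and the function names `synthetic_rir` and `add_reverb` are invented for this example.

```python
import numpy as np


def synthetic_rir(rt60=0.5, sample_rate=16000, seed=0):
    """Hypothetical stand-in RIR: Gaussian noise under an exponential decay.

    This is NOT a geometric simulation; it only mimics the decaying tail
    of a room response so that the convolution step can be demonstrated.
    """
    rng = np.random.default_rng(seed)
    n = int(rt60 * sample_rate)
    t = np.arange(n) / sample_rate
    # Amplitude envelope 10^(-3 t / rt60) drops by 60 dB at t = rt60.
    envelope = 10.0 ** (-3.0 * t / rt60)
    rir = rng.standard_normal(n) * envelope
    rir[0] = 1.0  # crude direct-path component
    return rir / np.max(np.abs(rir))


def add_reverb(dry, rir):
    """Convolve a dry (anechoic) signal with an RIR and peak-normalize."""
    wet = np.convolve(dry, rir)  # full convolution: len(dry) + len(rir) - 1
    peak = np.max(np.abs(wet))
    return wet / peak if peak > 0 else wet


sample_rate = 16000
# Placeholder for an anechoic utterance: one second of a 440 Hz tone.
time_axis = np.arange(sample_rate) / sample_rate
dry = np.sin(2.0 * np.pi * 440.0 * time_axis)

rir = synthetic_rir(rt60=0.5, sample_rate=sample_rate)
wet = add_reverb(dry, rir)
```

In a real training pipeline the placeholder tone would be replaced by recorded anechoic speech and the stand-in RIR by a simulated or measured one; the convolution step itself stays the same.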
Software
pygsound: pygsound is a Python package for impulse response generation based on a state-of-the-art geometric sound propagation engine. The simulation is implemented in C++ and exposed to Python through pybind11.
Datasets
6-channel synthetic impulse response dataset: Dataset used in our paper: Improving Reverberant Speech Training Using Diffuse Acoustic Simulation.
Low-frequency compensated synthetic impulse response dataset: Dataset used in our paper: Low-frequency Compensated Synthetic Impulse Responses for Improved Far-field Speech Recognition.