Technical Program

Paper Detail

Paper:	PS-2B.52
Session:	Poster Session 2B
Location:	H Fläche 1.OG
Session Time:	Sunday, September 15, 17:15 - 20:15
Presentation Time:	Sunday, September 15, 17:15 - 20:15
Presentation:	Poster
Publication:	2019 Conference on Cognitive Computational Neuroscience, 13-16 September 2019, Berlin, Germany
Paper Title:	A Unifying Framework for Neuro-Inspired, Data-Driven Detection of Low-Level Auditory Features
Manuscript:	Click here to view manuscript
License:	This work is licensed under a Creative Commons Attribution 3.0 Unported License.
DOI:	https://doi.org/10.32470/CCN.2019.1245-0
Authors:	Lotte Weerts, Claudia Clopath, Dan F.M. Goodman, Imperial College London, United Kingdom
Abstract:	Our understanding of hearing and speech recognition rests on controlled experiments requiring simple stimuli. However, these stimuli often lack the characteristics of complex sounds such as speech. We propose an approach that combines neural modelling with machine learning to determine relevant low-level auditory features. Our approach bridges the gap between detailed neuronal models that capture specific auditory responses, and research on the statistics of real-world speech data and speech recognition. First, we introduce a feature detection model with a modest number of parameters that is compatible with auditory physiology. In order to objectively determine relevant feature detectors within the model parameter space, the model is tested in a speech classification task, using a simple classifier that approximates the information bottleneck. This framework allows us to determine the best model parameters and their neurophysiological and psychoacoustic implications. We show that our model can capture a variety of well-studied features (such as amplitude modulations and onsets) and allows us to unify concepts from different areas of hearing research. Our approach has various potential applications. Firstly, it could lead to new, testable experimental hypotheses for understanding hearing. Moreover, promising features could be directly applied as a new acoustic front-end for speech recognition systems.