User Manual Privacy Policy Disclaimer Contact us
  Advanced SearchBrowse




Conference Paper

Towards an end-to-end computational model of speech comprehension: simulating a lexical decision task


Ernestus,  Mirjam
Centre for Language Studies, Radboud University;
Language Comprehension Department, MPI for Psycholinguistics, Max Planck Society;

There are no locators available
Fulltext (public)

(Publisher version), 415KB

Supplementary Material (public)
There is no public supplementary material available

Ten Bosch, L., Boves, L., & Ernestus, M. (2013). Towards an end-to-end computational model of speech comprehension: simulating a lexical decision task. In Proceedings of INTERSPEECH 2013: 14th Annual Conference of the International Speech Communication Association (pp. 2822-2826).

Cite as: http://hdl.handle.net/11858/00-001M-0000-0014-4D67-1
This paper describes a computational model of speech comprehension that takes the acoustic signal as input and predicts reaction times as observed in an auditory lexical decision task. By doing so, we explore a new generation of end-to-end computational models that are able to simulate the behaviour of human subjects participating in a psycholinguistic experiment. So far, nearly all computational models of speech comprehension do not start from the speech signal itself, but from abstract representations of the speech signal, while the few existing models that do start from the acoustic signal cannot directly model reaction times as obtained in comprehension experiments. The main functional components in our model are the perception stage, which is compatible with the psycholinguistic model Shortlist B and is implemented with techniques from automatic speech recognition, and the decision stage, which is based on the linear ballistic accumulation decision model. We successfully tested our model against data from 20 participants performing a largescale auditory lexical decision experiment. Analyses show that the model is a good predictor for the average judgment and reaction time for each word.