Journal Article

How should a speech recognizer work?

MPS-Authors

Norris, Dennis
Language Comprehension Group, MPI for Psycholinguistics, Max Planck Society;
Decoding Continuous Speech, MPI for Psycholinguistics, Max Planck Society;

McQueen, James M.
Language Comprehension Group, MPI for Psycholinguistics, Max Planck Society;
Decoding Continuous Speech, MPI for Psycholinguistics, Max Planck Society;

Fulltext (public)

Scharenborg_2005_how should.pdf
(Publisher version), 676KB

Citation

Scharenborg, O., Norris, D., Ten Bosch, L., & McQueen, J. M. (2005). How should a speech recognizer work? Cognitive Science, 29(6), 867-918. doi:10.1207/s15516709cog0000_37.


Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-1E5D-C
Abstract
Although researchers studying human speech recognition (HSR) and automatic speech recognition (ASR) share a common interest in how information processing systems (human or machine) recognize spoken language, there is little communication between the two disciplines. We suggest that this lack of communication follows largely from the fact that research in these related fields has focused on the mechanics of how speech can be recognized. In Marr's (1982) terms, emphasis has been on the algorithmic and implementational levels rather than on the computational level. In this article, we provide a computational-level analysis of the task of speech recognition, which reveals the close parallels between research concerned with HSR and ASR. We illustrate this relation by presenting a new computational model of human spoken-word recognition, built using techniques from the field of ASR, that, in contrast to existing models of HSR, recognizes words from real speech input.