How should a speech recognizer work?

Scharenborg, Odette; Norris, Dennis; Ten Bosch, Louis; McQueen, James M.

doi:10.1207/s15516709cog0000_37

Item

ITEM ACTIONSEXPORT

Add to Basket

Please note that a newer version of this item is available:
https://pure.mpg.de/pubman/item/item_60379_4

DetailsSummary

Released

Journal Article

How should a speech recognizer work?

MPS-Authors

Norris, Dennis
Language Comprehension Group, MPI for Psycholinguistics, Max Planck Society;
Decoding Continuous Speech, MPI for Psycholinguistics, Max Planck Society;

/persons/resource/persons122

McQueen, James M.
Language Comprehension Group, MPI for Psycholinguistics, Max Planck Society;
Decoding Continuous Speech, MPI for Psycholinguistics, Max Planck Society;

External Resource

No external resources are shared

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

Fulltext (public)

Scharenborg_2005_how should.pdf
(Publisher version), 676KB

Supplementary Material (public)

There is no public supplementary material available

Citation

Scharenborg, O., Norris, D., Ten Bosch, L., & McQueen, J. M. (2005). How should a speech recognizer work? Cognitive Science, 29(6), 867-918. doi:10.1207/s15516709cog0000_37.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-1E5D-C

Abstract

Although researchers studying human speech recognition (HSR) and automatic speech recognition (ASR) share a common interest in how information processing systems (human or machine) recognize spoken language, there is little communication between the two disciplines. We suggest that this lack of communication follows largely from the fact that research in these related fields has focused on the mechanics of how speech can be recognized. In Marr's (1982) terms, emphasis has been on the algorithmic and implementational levels rather than on the computational level. In this article, we provide a computational-level analysis of the task of speech recognition, which reveals the close parallels between research concerned with HSR and ASR. We illustrate this relation by presenting a new computational model of human spoken-word recognition, built using techniques from the field of ASR that, in contrast to current existing models of HSR, recognizes words from real speech input.