Successes and critical failures of neural networks in capturing human-like 
speech recognition

Adolfi, Federico; Bowers, Jeffrey S.; Poeppel, David

doi:10.1016/j.neunet.2023.02.032

Local TagsRelease HistoryDetailsSummary

Successes and critical failures of neural networks in capturing human-like speech recognition

Adolfi, F., Bowers, J. S., & Poeppel, D. (2023). Successes and critical failures of neural networks in capturing human-like speech recognition. Neural Networks, 162, 199-211. doi:10.1016/j.neunet.2023.02.032.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-000D-1E78-C Version Permalink: https://hdl.handle.net/21.11116/0000-000D-1E79-B

Genre: Journal Article

Files

show Files

hide Files

:

Adolfi_2023_SuccessesAndCriticalFailures.pdf (Publisher version), 3MB

View Save

File Permalink:
https://hdl.handle.net/21.11116/0000-000D-1E7A-A

Name:
Adolfi_2023_SuccessesAndCriticalFailures.pdf

Description:
-

OA-Status:
Hybrid

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
2023

Copyright Info:
Copyright © 2023 The Authors

License:
http://creativecommons.org/licenses/by/4.0/

Locators

show

hide

Locator:
https://www.sciencedirect.com/science/article/pii/S0893608023001016 (Publisher version) Open Access Hybrid

Description:
-

OA-Status:
Hybrid

Creators

show

hide

Creators:
Adolfi, Federico^{1, 2}, Author
Bowers, Jeffrey S., Author
Poeppel, David^{1, 2}, Author

Affiliations:
1Ernst Strüngmann Institute (ESI) for Neuroscience in Cooperation with Max Planck Society, Max Planck Society, ou_2074314
2Poeppel Lab, Ernst Strüngmann Institute (ESI) for Neuroscience in Cooperation with Max Planck Society, Max Planck Society, Deutschordenstraße 46, 60528 Frankfurt, DE, ou_3381225

Content

show

hide

Free keywords: Audition Speech Neural networks Robustness Human-like AI

Abstract: Natural and artificial audition can in principle acquire different solutions to a given problem. The constraints of the task, however, can nudge the cognitive science and engineering of audition to qualitatively converge, suggesting that a closer mutual examination would potentially enrich artificial hearing systems and process models of the mind and brain. Speech recognition — an area ripe for such exploration — is inherently robust in humans to a number transformations at various spectrotemporal granularities. To what extent are these robustness profiles accounted for by high-performing neural network systems? We bring together experiments in speech recognition under a single synthesis framework to evaluate state-of-the-art neural networks as stimulus-computable, optimized observers. In a series of experiments, we (1) clarify how influential speech manipulations in the literature relate to each other and to natural speech, (2) show the granularities at which machines exhibit out-of-distribution robustness, reproducing classical perceptual phenomena in humans, (3) identify the specific conditions where model predictions of human performance differ, and (4) demonstrate a crucial failure of all artificial systems to perceptually recover where humans do, suggesting alternative directions for theory and model building. These findings encourage a tighter synergy between the cognitive science and engineering of audition.

Details

show

hide

Language(s):

Dates: Published Online: 2023-02-24Date issued: 2023-05

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: Peer

Identifiers: DOI: 10.1016/j.neunet.2023.02.032

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: Neural Networks

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: 162 Sequence Number: - Start / End Page: 199 - 211 Identifier: ISSN: 08936080