English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Successes and critical failures of neural networks in capturing human-like speech recognition

Adolfi, F., Bowers, J. S., & Poeppel, D. (2023). Successes and critical failures of neural networks in capturing human-like speech recognition. Neural Networks, 162, 199-211. doi:10.1016/j.neunet.2023.02.032.

Item is

Files

show Files
hide Files
:
Adolfi_2023_SuccessesAndCriticalFailures.pdf (Publisher version), 3MB
Name:
Adolfi_2023_SuccessesAndCriticalFailures.pdf
Description:
-
OA-Status:
Hybrid
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
2023
Copyright Info:
Copyright © 2023 The Authors

Locators

show
hide
Description:
-
OA-Status:
Hybrid

Creators

show
hide
 Creators:
Adolfi, Federico1, 2, Author
Bowers, Jeffrey S., Author
Poeppel, David1, 2, Author                 
Affiliations:
1Ernst Strüngmann Institute (ESI) for Neuroscience in Cooperation with Max Planck Society, Max Planck Society, ou_2074314              
2Poeppel Lab, Ernst Strüngmann Institute (ESI) for Neuroscience in Cooperation with Max Planck Society, Max Planck Society, Deutschordenstraße 46, 60528 Frankfurt, DE, ou_3381225              

Content

show
hide
Free keywords: Audition Speech Neural networks Robustness Human-like AI
 Abstract: Natural and artificial audition can in principle acquire different solutions to a given problem. The constraints of the task, however, can nudge the cognitive science and engineering of audition to qualitatively converge, suggesting that a closer mutual examination would potentially enrich artificial hearing systems and process models of the mind and brain. Speech recognition — an area ripe for such exploration — is inherently robust in humans to a number transformations at various spectrotemporal granularities. To what extent are these robustness profiles accounted for by high-performing neural network systems? We bring together experiments in speech recognition under a single synthesis framework to evaluate state-of-the-art neural networks as stimulus-computable, optimized observers. In a series of experiments, we (1) clarify how influential speech manipulations in the literature relate to each other and to natural speech, (2) show the granularities at which machines exhibit out-of-distribution robustness, reproducing classical perceptual phenomena in humans, (3) identify the specific conditions where model predictions of human performance differ, and (4) demonstrate a crucial failure of all artificial systems to perceptually recover where humans do, suggesting alternative directions for theory and model building. These findings encourage a tighter synergy between the cognitive science and engineering of audition.

Details

show
hide
Language(s):
 Dates: 2023-02-242023-05
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.1016/j.neunet.2023.02.032
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Neural Networks
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 162 Sequence Number: - Start / End Page: 199 - 211 Identifier: ISSN: 08936080