English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Information encoding by deep neural networks: what can we learn?

Ten Bosch, L., & Boves, L. (2018). Information encoding by deep neural networks: what can we learn? In Proceedings of Interspeech 2018 (pp. 1457-1461). doi:10.21437/Interspeech.2018-1896.

Item is

Basic

show hide
Genre: Conference Paper

Files

show Files
hide Files
:
TenBosch_Boves_2018_Information encoding by deep neural networks.pdf (Publisher version), 523KB
Name:
TenBosch_Boves_2018_Information encoding by deep neural networks.pdf
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Ten Bosch, Louis1, 2, Author           
Boves, L.1, Author
Affiliations:
1Centre for Language Studies, Radboud University, ou_55238              
2Other Research, MPI for Psycholinguistics, Max Planck Society, Nijmegen, NL, ou_55217              

Content

show
hide
Free keywords: -
 Abstract: The recent advent of deep learning techniques in speech tech-nology and in particular in automatic speech recognition hasyielded substantial performance improvements. This suggeststhat deep neural networks (DNNs) are able to capture structurein speech data that older methods for acoustic modeling, suchas Gaussian Mixture Models and shallow neural networks failto uncover. In image recognition it is possible to link repre-sentations on the first couple of layers in DNNs to structuralproperties of images, and to representations on early layers inthe visual cortex. This raises the question whether it is possi-ble to accomplish a similar feat with representations on DNNlayers when processing speech input. In this paper we presentthree different experiments in which we attempt to untanglehow DNNs encode speech signals, and to relate these repre-sentations to phonetic knowledge, with the aim to advance con-ventional phonetic concepts and to choose the topology of aDNNs more efficiently. Two experiments investigate represen-tations formed by auto-encoders. A third experiment investi-gates representations on convolutional layers that treat speechspectrograms as if they were images. The results lay the basisfor future experiments with recursive networks.

Details

show
hide
Language(s): eng - English
 Dates: 2018-10
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.21437/Interspeech.2018-1896
 Degree: -

Event

show
hide
Title: Interspeech 2018
Place of Event: Hyderabad, India
Start-/End Date: 2018-09-02 - 2018-09-06

Legal Case

show

Project information

show

Source 1

show
hide
Title: Proceedings of Interspeech 2018
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 1457 - 1461 Identifier: -