Item

ITEM ACTIONSEXPORT

Add to Basket

Local TagsRelease HistoryDetailsSummary

Information encoding by deep neural networks: what can we learn?

Ten Bosch, L., & Boves, L. (2018). Information encoding by deep neural networks: what can we learn? In Proceedings of Interspeech 2018 (pp. 1457-1461). doi:10.21437/Interspeech.2018-1896.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-0004-9482-1 Version Permalink: https://hdl.handle.net/21.11116/0000-0004-94C1-A

Genre: Conference Paper

Files

hide Files

:

TenBosch_Boves_2018_Information encoding by deep neural networks.pdf (Publisher version), 523KB

View Save

Open Access status unknown

File Permalink:
https://hdl.handle.net/21.11116/0000-0004-9484-F

Name:
TenBosch_Boves_2018_Information encoding by deep neural networks.pdf

Description:
-

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Ten Bosch, Louis^{1, 2}, Author
Boves, L.¹, Author

Affiliations:
1Centre for Language Studies, Radboud University, ou_55238
2Other Research, MPI for Psycholinguistics, Max Planck Society, Nijmegen, NL, ou_55217

Content

show

hide

Free keywords: -

Abstract: The recent advent of deep learning techniques in speech tech-nology and in particular in automatic speech recognition hasyielded substantial performance improvements. This suggeststhat deep neural networks (DNNs) are able to capture structurein speech data that older methods for acoustic modeling, suchas Gaussian Mixture Models and shallow neural networks failto uncover. In image recognition it is possible to link repre-sentations on the first couple of layers in DNNs to structuralproperties of images, and to representations on early layers inthe visual cortex. This raises the question whether it is possi-ble to accomplish a similar feat with representations on DNNlayers when processing speech input. In this paper we presentthree different experiments in which we attempt to untanglehow DNNs encode speech signals, and to relate these repre-sentations to phonetic knowledge, with the aim to advance con-ventional phonetic concepts and to choose the topology of aDNNs more efficiently. Two experiments investigate represen-tations formed by auto-encoders. A third experiment investi-gates representations on convolutional layers that treat speechspectrograms as if they were images. The results lay the basisfor future experiments with recursive networks.

Details

show

hide

Language(s): eng - English

Dates: Published Online: 2018-10

Publication Status: Published online

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: Peer

Identifiers: DOI: 10.21437/Interspeech.2018-1896

Degree: -

Event

show

hide

Title: Interspeech 2018

Place of Event: Hyderabad, India

Start-/End Date: 2018-09-02 - 2018-09-06

Legal Case

show

Project information

show

Source 1

show

hide

Title: Proceedings of Interspeech 2018

Source Genre: Proceedings

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 1457 - 1461 Identifier: -