Cross Domain Dialogue Act Classification

Amanova, Dilafruz

Local TagsRelease HistoryDetailsSummary

Cross Domain Dialogue Act Classification

Amanova, D. (2016). Cross Domain Dialogue Act Classification. Master Thesis, Universität des Saarlandes, Saarbrücken.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-002E-A0F2-F Version Permalink: https://hdl.handle.net/11858/00-001M-0000-002E-A0F3-D

Genre: Thesis

Files

show Files

hide Files

:

2016_Dilafruz Amanova_MSc thesis.pdf (Any fulltext), 740KB

File Permalink:
-

Name:
2016_Dilafruz Amanova_MSc thesis.pdf

Description:
-

OA-Status:

Visibility:
Restricted (Max Planck Institute for Informatics, MSIN; )

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Amanova, Dilafruz¹, Author
Petukhova, Volha², Advisor
Prof Klakow, Dietrich², Referee
Miettinen, Pauli³, Referee

Affiliations:
1International Max Planck Research School, MPI for Informatics, Max Planck Society, ou_1116551
2Universität des Saarlandes, Sprach- und Signalverarbeitung, C7 1, ou_persistent22
3Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

show

hide

Free keywords: -

Abstract: Nowadays the dialogue act classification is one of the hot topics in computational linguistics. Different machine learning algorithms were used for dialogue act classi- fication. In this thesis, we investigate the cross domain dialogue act classification using Support Vector Machines. The goal of the research reported in this work is to explore features for effective cross domain classification. The work includes two phases of data-driven investigation. The first phase involves collecting, and analyzing corpora, while the second phase involves domain independent feature selection and extraction. Dialogue act annotation were collected from three different corpora: AMI 1, HCRC MapTask 2 and SWBD DAMSL [1]. Based on ISO standards, these dialogue acts were mapped to corresponding groups. Number of various experiments were carried out to find features with the best predictive power. The results show that the combination of multiple features: bigrams of Part-Of-Speech, Chunks and words, lead to consistent improvement of the classifier's performance than features in isolation. Finally, we investigate the portability and generalibility of proposed approach on extracted features when using set of features that showed the best predictive results on unseen Metalogue corpus 3. The findings indicate that good classification accuracy can be achieved using our approach, and that there is a set of automatically extracted feature are shared between large corpora, that prove to be extremely reliable when used directly to classify Dialogue Acts.

Details

show

hide

Language(s): eng - English

Dates: Accepted: 2017-07-22Date issued: 2016

Publication Status: Issued

Pages: 60 p.

Publishing info: Saarbrücken : Universität des Saarlandes

Table of Contents: -

Rev. Type: -

Identifiers: BibTex Citekey: AmanovaMSc2017

Degree: Master

Event

show

Legal Case

show

Project information

show

Source

show