  Cross Domain Dialogue Act Classification

Amanova, D. (2016). Cross Domain Dialogue Act Classification. Master Thesis, Universität des Saarlandes, Saarbrücken.

Files

2016_Dilafruz Amanova_MSc thesis.pdf (Any fulltext), 740KB
 
File Permalink:
-
Name:
2016_Dilafruz Amanova_MSc thesis.pdf
Description:
-
OA-Status:
Visibility:
Restricted (Max Planck Institute for Informatics, MSIN; )
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Creators

 Creators:
Amanova, Dilafruz (1), Author
Petukhova, Volha (2), Advisor
Prof. Klakow, Dietrich (2), Referee
Miettinen, Pauli (3), Referee
Affiliations:
(1) International Max Planck Research School, MPI for Informatics, Max Planck Society, ou_1116551
(2) Universität des Saarlandes, Sprach- und Signalverarbeitung, C7 1, ou_persistent22
(3) Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

Free keywords: -
 Abstract: Dialogue act classification is an active research topic in computational linguistics, and a range of machine learning algorithms have been applied to it. In this thesis, we investigate cross-domain dialogue act classification using Support Vector Machines. The goal of the research reported here is to explore features for effective cross-domain classification. The work comprises two phases of data-driven investigation. The first phase involves collecting and analyzing corpora, while the second involves domain-independent feature selection and extraction. Dialogue act annotations were collected from three different corpora: AMI, HCRC MapTask and SWBD DAMSL [1]. Based on ISO standards, these dialogue acts were mapped to corresponding groups. A number of experiments were carried out to find the features with the best predictive power. The results show that combining multiple features (bigrams of part-of-speech tags, chunks and words) leads to a consistent improvement in classifier performance over the same features used in isolation. Finally, we investigate the portability and generalizability of the proposed approach by applying the set of features that showed the best predictive results to the unseen Metalogue corpus. The findings indicate that good classification accuracy can be achieved using our approach, and that there is a set of automatically extracted features, shared between large corpora, that proves to be extremely reliable when used directly to classify dialogue acts.

Details

Language(s): eng - English
 Dates: 2017-07-22, 2016
 Publication Status: Issued
 Pages: 60 p.
 Publishing info: Saarbrücken : Universität des Saarlandes
 Table of Contents: -
 Rev. Type: -
 Identifiers: BibTex Citekey: AmanovaMSc2017
 Degree: Master
