日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細

  Proactive Learning Algorithms: A Survey of the State of the Art and Implementation of Novel and Concrete Algorithm for (Unstructured) Data Classification

Anis, M. (2019). Proactive Learning Algorithms: A Survey of the State of the Art and Implementation of Novel and Concrete Algorithm for (Unstructured) Data Classification. Master Thesis, Universität des Saarlandes, Saarbrücken.

Item is

基本情報

表示: 非表示:
アイテムのパーマリンク: https://hdl.handle.net/21.11116/0000-0005-9C5B-6 版のパーマリンク: https://hdl.handle.net/21.11116/0000-0005-9C5C-5
資料種別: 学位論文

ファイル

表示: ファイル
非表示: ファイル
:
2019 MSc Thesis Myriam Anis.pdf (全文テキスト(全般)), 6MB
 
ファイルのパーマリンク:
-
ファイル名:
2019 MSc Thesis Myriam Anis.pdf
説明:
-
OA-Status:
閲覧制限:
制限付き (Max Planck Institute for Informatics, MSIN; )
MIMEタイプ / チェックサム:
application/pdf
技術的なメタデータ:
著作権日付:
-
著作権情報:
-
CCライセンス:
-

関連URL

表示:

作成者

表示:
非表示:
 作成者:
Anis, Myriam1, 著者
Klakow, Dietrich2, 学位論文主査
Petrenko, Pavlo2, 監修者
Hampp, Thomas2, 監修者
Klakow, Dietrich2, 監修者
Mirza, Paramita3, 監修者           
所属:
1International Max Planck Research School, MPI for Informatics, Max Planck Society, Campus E1 4, 66123 Saarbrücken, DE, ou_1116551              
2External Organizations, ou_persistent22              
3Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018              

内容説明

表示:
非表示:
キーワード: -
 要旨: Artificial Intelligence (AI) has become one of the most researched fields nowadays. Ma-
chine Learning (ML) is one of the most popular AI domains, where systems are created
with the capability of automatic learning and improving from the learning experience.
The current revolution in the size and cost of electronic storage allows for the existence
of enormous amount of data that can be used for ML training. Unfortunately, not all
of this data is labelled. The process of manually labelling documents can be expen-
sive, time consuming and subject to human errors. Active Learning (AL) addresses
this challenge by finding a sample of the enormous data corpus that, if labelled, can
substitute the use of the whole dataset. AL routes this sample to a human labeller to
formulate the training dataset needed for the ML model. AL assumes that there exists a
single, infallible and indefatigable labeller. These assumptions cannot cope to real world
problems. The main focus of this work is to introduce Proactive Learning (PL) to an
existing AL system. PL aims at generalizing the problem, solved by AL, by relaxing
all of its assumptions about the user. The main addition of this project is enhancing
automatic text classification by combining knowledge from the domain of PL and from
Instance Relabelling paradigms to update the currently implemented AL system. The
implemented PL system is tested on the 20 Newsgroups, Reuters and AG News datasets.
The system is capable of reaching impressive results in detecting and predicting users
actions, which allows the system to efficiently route labelling tasks to the best users,
leading to minimize the risk of receiving wrong labels.

資料詳細

表示:
非表示:
言語: eng - English
 日付: 2019-08-062019-08-062019-08-062019-08-06
 出版の状態: 出版
 ページ: 86 p.
 出版情報: Saarbrücken : Universität des Saarlandes
 目次: -
 査読: -
 識別子(DOI, ISBNなど): -
 学位: 修士号 (Master)

関連イベント

表示:

訴訟

表示:

Project information

表示:

出版物

表示: