Using titles vs. full-text as source for automated semantic document annotation

Galke, Lukas; Mai, Florian; Schelten, Alan; Brunch, Dennis; Scherp, Ansgar

doi:10.1145/3148011.3148039

アイテム詳細

登録内容を編集ファイル形式で保存

一時保存へ追加

タグ情報を表示リリース履歴を表示詳細要約

公開

会議論文

Using titles vs. full-text as source for automated semantic document annotation

MPS-Authors

There are no MPG-Authors in the publication available

External Resource

There are no locators available

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

フルテキスト (公開)

Galke_etal_2017_Using titles vs....pdf
(出版社版), 448KB

付随資料 (公開)

There is no public supplementary material available

引用

Galke, L., Mai, F., Schelten, A., Brunch, D., & Scherp, A. (2017). Using titles vs. full-text as source for automated semantic document annotation. In O., Corcho, K., Janowicz, G., Rizz, I., Tiddi, & D., Garijo (Eds.), Proceedings of the 9th International Conference on Knowledge Capture (K-CAP 2017). New York: ACM.

引用: https://hdl.handle.net/21.11116/0000-0009-F84A-D

要旨

We conduct the first systematic comparison of automated semantic
annotation based on either the full-text or only on the title metadata
of documents. Apart from the prominent text classification baselines
kNN and SVM, we also compare recent techniques of Learning
to Rank and neural networks and revisit the traditional methods
logistic regression, Rocchio, and Naive Bayes. Across three of our
four datasets, the performance of the classifications using only titles
reaches over 90% of the quality compared to the performance when
using the full-text.