Do Children Texts Hold The Key To Commonsense Knowledge?

Romero, Julien; Razniewski, Simon

doi:10.48550/arXiv.2210.04530

Do Children Texts Hold The Key To Commonsense Knowledge?

Romero, J., & Razniewski, S. (2022). Do Children Texts Hold The Key To Commonsense Knowledge? doi:10.48550/arXiv.2210.04530.

Item is 公開

表示: 全項目非表示: 全項目

基本情報

表示: 非表示:

アイテムのパーマリンク: https://hdl.handle.net/21.11116/0000-000B-58AA-3 版のパーマリンク: https://hdl.handle.net/21.11116/0000-000B-58AB-2

資料種別: 成果報告書

ファイル

表示: ファイル

非表示: ファイル

:

arXiv:2210.04530.pdf (プレプリント), 249KB

表示保存

ファイルのパーマリンク:
https://hdl.handle.net/21.11116/0000-000B-58AC-1

ファイル名:
arXiv:2210.04530.pdf

説明:
File downloaded from arXiv at 2022-10-31 09:16

OA-Status:
Green

閲覧制限:
公開

MIMEタイプ / チェックサム:
application/pdf / [MD5]

技術的なメタデータ:

表示

著作権日付:
-

著作権情報:
-

CCライセンス:
http://creativecommons.org/licenses/by/4.0/

作成者

表示:

非表示:

作成者:
Romero, Julien¹, 著者
Razniewski, Simon², 著者

所属:
1External Organizations, ou_persistent22
2Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

内容説明

表示:

非表示:

キーワード: Computer Science, Computation and Language, cs.CL,Computer Science, Artificial Intelligence, cs.AI

要旨: Compiling comprehensive repositories of commonsense knowledge is a
long-standing problem in AI. Many concerns revolve around the issue of
reporting bias, i.e., that frequency in text sources is not a good proxy for
relevance or truth. This paper explores whether children's texts hold the key
to commonsense knowledge compilation, based on the hypothesis that such content
makes fewer assumptions on the reader's knowledge, and therefore spells out
commonsense more explicitly. An analysis with several corpora shows that
children's texts indeed contain much more, and more typical commonsense
assertions. Moreover, experiments show that this advantage can be leveraged in
popular language-model-based commonsense knowledge extraction settings, where
task-unspecific fine-tuning on small amounts of children texts (childBERT)
already yields significant improvements. This provides a refreshing perspective
different from the common trend of deriving progress from ever larger models
and corpora.

資料詳細

表示:

非表示:

言語: eng - English

日付: 作成: 2022-10-10オンライン出版: 2022

出版の状態: オンラインで出版済み

ページ: 6 p.

出版情報: -

目次: -

査読: -

識別子（DOI, ISBNなど）: arXiv: 2210.04530
URI: https://arxiv.org/abs/2210.04530
DOI: 10.48550/arXiv.2210.04530
BibTex参照ID: Romero2210.04530

学位: -

アイテム詳細

基本情報

ファイル

関連URL

作成者

内容説明

資料詳細

関連イベント

訴訟

Project information

出版物