Item Details


Released

Paper

Recall Them All: Retrieval-Augmented Language Models for Long Object List Extraction from Long Documents

MPS-Authors
Singhania, Sneha
Databases and Information Systems, MPI for Informatics, Max Planck Society;

Razniewski, Simon
Databases and Information Systems, MPI for Informatics, Max Planck Society;

Weikum, Gerhard
Databases and Information Systems, MPI for Informatics, Max Planck Society;

Fulltext (public)

arXiv:2405.02732.pdf
(Preprint), 586KB

Citation

Singhania, S., Razniewski, S., & Weikum, G. (2024). Recall Them All: Retrieval-Augmented Language Models for Long Object List Extraction from Long Documents.


Cite as: https://hdl.handle.net/21.11116/0000-000F-75A0-8
Abstract
Methods for relation extraction from text mostly focus on high precision, at
the cost of limited recall. High recall is crucial, though, to populate long
lists of object entities that stand in a specific relation with a given
subject. Cues for relevant objects can be spread across many passages in long
texts. This poses the challenge of extracting long lists from long texts. We
present the L3X method which tackles the problem in two stages: (1)
recall-oriented generation using a large language model (LLM) with judicious
techniques for retrieval augmentation, and (2) precision-oriented
scrutinization to validate or prune candidates. Our L3X method outperforms
LLM-only generations by a substantial margin.
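
The two-stage idea in the abstract can be illustrated with a toy sketch. This is not the authors' L3X implementation: the keyword-overlap retriever, the capitalized-word extractor standing in for an LLM call, the support-count pruning rule, and every name and parameter here (`retrieve`, `generate_candidates`, `extract_objects`, `top_k`, `min_support`) are assumptions made purely for illustration.

```python
import re
from collections import Counter


def retrieve(passages, query_terms, top_k):
    """Toy retriever (assumption): rank passages by query-term overlap."""
    def score(passage):
        words = set(re.findall(r"\w+", passage.lower()))
        return len(words & query_terms)

    ranked = sorted(passages, key=score, reverse=True)
    # Keep only the top-k passages that match at least one query term.
    return [p for p in ranked[:top_k] if score(p) > 0]


def generate_candidates(passage, subject):
    """Stand-in for an LLM call (assumption): capitalized words except the subject."""
    names = re.findall(r"\b[A-Z][a-z]+\b", passage)
    return {name for name in names if name != subject}


def extract_objects(passages, subject, relation_terms, top_k=5, min_support=2):
    """Two-stage extraction: pool candidates broadly, then prune weak ones."""
    query_terms = {subject.lower()} | {t.lower() for t in relation_terms}

    # Stage 1 (recall-oriented): collect candidates from every retrieved passage.
    support = Counter()
    for passage in retrieve(passages, query_terms, top_k):
        for candidate in generate_candidates(passage, subject):
            support[candidate] += 1

    # Stage 2 (precision-oriented): keep candidates seen in enough passages.
    return sorted(c for c, n in support.items() if n >= min_support)
```

Lowering `min_support` in this sketch trades precision for recall, since candidates mentioned in only one retrieved passage are then kept as well.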