CEDR: Contextualized Embeddings for Document Ranking

MacAvaney, Sean; Yates, Andrew; Cohan, Arman; Goharian, Nazli

MacAvaney, S., Yates, A., Cohan, A., & Goharian, N. (2019). CEDR: Contextualized Embeddings for Document Ranking. Retrieved from http://arxiv.org/abs/1904.07094.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/21.11116/0000-0004-02C7-9
Version Permalink: https://hdl.handle.net/21.11116/0000-0004-02C8-8

Genre: Paper

Files

arXiv:1904.07094.pdf (Preprint), 878KB

File Permalink:
https://hdl.handle.net/21.11116/0000-0004-02C9-7

Name:
arXiv:1904.07094.pdf

Description:
File downloaded from arXiv at 2019-07-10 11:05. Accepted to SIGIR 2019, camera ready to follow.

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Creators

Creators:
MacAvaney, Sean¹, Author
Yates, Andrew², Author
Cohan, Arman¹, Author
Goharian, Nazli¹, Author

Affiliations:
¹ External Organizations, ou_persistent22
² Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

Free keywords: Computer Science, Information Retrieval, cs.IR, Computer Science, Computation and Language, cs.CL

Abstract: Although considerable attention has been given to neural ranking
architectures recently, far less attention has been paid to the term
representations that are used as input to these models. In this work, we
investigate how two pretrained contextualized language models (ELMo and BERT)
can be utilized for ad-hoc document ranking. Through experiments on TREC
benchmarks, we find that several existing neural ranking architectures can
benefit from the additional context provided by contextualized language models.
Furthermore, we propose a joint approach that incorporates BERT's
classification vector into existing neural models and show that it outperforms
state-of-the-art ad-hoc ranking baselines. We call this joint approach CEDR
(Contextualized Embeddings for Document Ranking). We also address practical
challenges in using these models for ranking, including the maximum input
length imposed by BERT and runtime performance impacts of contextualized
language models.

Details

Language(s): eng - English

Dates: Created: 2019-04-15; Modified: 2019-04-24; Published Online: 2019

Publication Status: Published online

Pages: 5 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 1904.07094
URI: http://arxiv.org/abs/1904.07094
BibTex Citekey: MacAvaney_arXiv1904.07094

Degree: -
