
Released

Paper

CEDR: Contextualized Embeddings for Document Ranking

MPS-Authors

Yates, Andrew
Databases and Information Systems, MPI for Informatics, Max Planck Society;

Fulltext (public)

arXiv:1904.07094.pdf (Preprint, 878 KB)

Citation

MacAvaney, S., Yates, A., Cohan, A., & Goharian, N. (2019). CEDR: Contextualized Embeddings for Document Ranking. Retrieved from http://arxiv.org/abs/1904.07094.


Cite as: http://hdl.handle.net/21.11116/0000-0004-02C7-9
Abstract
Although considerable attention has been given to neural ranking architectures recently, far less attention has been paid to the term representations that are used as input to these models. In this work, we investigate how two pretrained contextualized language models (ELMo and BERT) can be utilized for ad-hoc document ranking. Through experiments on TREC benchmarks, we find that several existing neural ranking architectures can benefit from the additional context provided by contextualized language models. Furthermore, we propose a joint approach that incorporates BERT's classification vector into existing neural models and show that it outperforms state-of-the-art ad-hoc ranking baselines. We call this joint approach CEDR (Contextualized Embeddings for Document Ranking). We also address practical challenges in using these models for ranking, including the maximum input length imposed by BERT and runtime performance impacts of contextualized language models.
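
As a rough illustration of the joint approach described in the abstract, the sketch below (in PyTorch) concatenates BERT's [CLS] classification vector with features produced by an existing neural ranking model before a final scoring layer, and includes one plausible sliding-window workaround for BERT's maximum input length. All class, function, and dimension names are hypothetical stand-ins, not the paper's actual implementation.

import torch
import torch.nn as nn

class CedrStyleJointScorer(nn.Module):
    # Joint scoring sketch: features from an existing neural ranking
    # model are concatenated with BERT's [CLS] vector before a final
    # linear scoring layer. Dimensions are illustrative assumptions.
    def __init__(self, cls_dim=768, ranker_feat_dim=16):
        super().__init__()
        self.score = nn.Linear(cls_dim + ranker_feat_dim, 1)

    def forward(self, cls_vec, ranker_feats):
        # cls_vec:      (batch, cls_dim), BERT's [CLS] output
        # ranker_feats: (batch, ranker_feat_dim), e.g. similarity
        #               features pooled by an existing ranking model
        joint = torch.cat([cls_vec, ranker_feats], dim=1)
        return self.score(joint).squeeze(-1)  # (batch,) relevance scores

def split_into_passages(token_ids, max_len=512, stride=256):
    # One common workaround for BERT's maximum input length: slide an
    # overlapping window over the document's token ids and score each
    # passage separately. How passage scores are aggregated is omitted;
    # this is an assumption, not necessarily the paper's exact method.
    passages = []
    for start in range(0, len(token_ids), stride):
        passages.append(token_ids[start:start + max_len])
        if start + max_len >= len(token_ids):
            break
    return passages

# Usage with random stand-in tensors: two query-document pairs.
model = CedrStyleJointScorer()
print(model(torch.randn(2, 768), torch.randn(2, 16)).shape)  # torch.Size([2])
print(len(split_into_passages(list(range(1200)))))           # 4 passages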