How Deep is your Learning: the DL-HARD Annotated Deep Learning Dataset

Mackie, Iain; Dalton, Jeffery; Yates, Andrew

Please note that a newer version of this item is available:
https://pure.mpg.de/pubman/item/item_3347916_2

Mackie, I., Dalton, J., & Yates, A. (2021). How Deep is your Learning: the DL-HARD Annotated Deep Learning Dataset. Retrieved from https://arxiv.org/abs/2105.07975.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/21.11116/0000-0009-67AB-3
Version Permalink: https://hdl.handle.net/21.11116/0000-0009-67AC-2

Genre: Paper

Files

arXiv:2105.07975.pdf (Preprint), 905KB

File Permalink:
https://hdl.handle.net/21.11116/0000-0009-67AD-1

Name:
arXiv:2105.07975.pdf

Description:
File downloaded from arXiv at 2021-10-26 12:54

OA-Status:
Not specified

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Creators

Creators:
Mackie, Iain¹, Author
Dalton, Jeffery¹, Author
Yates, Andrew², Author

Affiliations:
¹ External Organizations, ou_persistent22
² Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

Free keywords: Computer Science, Information Retrieval, cs.IR

Abstract: Deep Learning Hard (DL-HARD) is a new annotated dataset designed to more
effectively evaluate neural ranking models on complex topics. It builds on TREC
Deep Learning (DL) topics by extensively annotating them with question intent
categories, answer types, wikified entities, topic categories, and result type
metadata from a commercial web search engine. Based on this data, we introduce
a framework for identifying challenging queries. DL-HARD contains fifty topics
from the official DL 2019/2020 evaluation benchmark, half of which are newly
and independently assessed. We perform experiments using the official submitted
runs to DL on DL-HARD and find substantial differences in metrics and the
ranking of participating systems. Overall, DL-HARD is a new resource that
promotes research on neural ranking methods by focusing on challenging and
complex topics.

Details

Language(s): eng - English

Dates: Created: 2021-05-17; Published Online: 2021

Publication Status: Published online

Pages: 7 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 2105.07975
BibTex Citekey: Mackie_2105.07975
URI: https://arxiv.org/abs/2105.07975

Degree: -
