English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Uncovering Hidden Semantics of Set Information in Knowledge Bases

Ghosh, S., Razniewski, S., & Weikum, G. (2020). Uncovering Hidden Semantics of Set Information in Knowledge Bases. Retrieved from http://arxiv.org/abs/2003.03155.

Item is

Files

show Files
hide Files
:
arXiv:2003.03155.pdf (Preprint), 2MB
Name:
arXiv:2003.03155.pdf
Description:
File downloaded from arXiv at 2020-09-17 09:11 This work is under review in the Journal of Web Semantics, Special Issue on Language Technology and Knowledge Graphs. This is a revision draft
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Locators

show

Creators

show
hide
 Creators:
Ghosh, Shrestha1, Author           
Razniewski, Simon1, Author           
Weikum, Gerhard1, Author           
Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018              

Content

show
hide
Free keywords: Computer Science, Databases, cs.DB,Computer Science, Information Retrieval, cs.IR
 Abstract: Knowledge Bases (KBs) contain a wealth of structured information about
entities and predicates. This paper focuses on set-valued predicates, i.e., the
relationship between an entity and a set of entities. In KBs, this information
is often represented in two formats: (i) via counting predicates such as
numberOfChildren and staffSize, that store aggregated integers, and (ii) via
enumerating predicates such as parentOf and worksFor, that store individual set
memberships. Both formats are typically complementary: unlike enumerating
predicates, counting predicates do not give away individuals, but are more
likely informative towards the true set size, thus this coexistence could
enable interesting applications in question answering and KB curation.
In this paper we aim at uncovering this hidden knowledge. We proceed in two
steps. (i) We identify set-valued predicates from a given KB predicates via
statistical and embedding-based features. (ii) We link counting predicates and
enumerating predicates by a combination of co-occurrence, correlation and
textual relatedness metrics. We analyze the prevalence of count information in
four prominent knowledge bases, and show that our linking method achieves up to
0.55 F1 score in set predicate identification versus 0.40 F1 score of a random
selection, and normalized discounted gains of up to 0.84 at position 1 and 0.75
at position 3 in relevant predicate alignments. Our predicate alignments are
showcased in a demonstration system available at
https://counqer.mpi-inf.mpg.de/spo.

Details

show
hide
Language(s): eng - English
 Dates: 2020-03-062020-03-262020
 Publication Status: Published online
 Pages: 15 p.
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: arXiv: 2003.03155
URI: http://arxiv.org/abs/2003.03155
BibTex Citekey: Ghosh_arXiv2003.03155
 Degree: -

Event

show

Legal Case

show

Project information

show

Source

show