Uncovering Hidden Semantics of Set Information in Knowledge Bases

Ghosh, Shrestha; Razniewski, Simon; Weikum, Gerhard

Local TagsRelease HistoryDetailsSummary

Uncovering Hidden Semantics of Set Information in Knowledge Bases

Ghosh, S., Razniewski, S., & Weikum, G. (2020). Uncovering Hidden Semantics of Set Information in Knowledge Bases. Retrieved from http://arxiv.org/abs/2003.03155.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-0007-0662-4 Version Permalink: https://hdl.handle.net/21.11116/0000-0007-0663-3

Genre: Paper

Files

show Files

hide Files

:

arXiv:2003.03155.pdf (Preprint), 2MB

View Save

File Permalink:
https://hdl.handle.net/21.11116/0000-0007-0664-2

Name:
arXiv:2003.03155.pdf

Description:
File downloaded from arXiv at 2020-09-17 09:11 This work is under review in the Journal of Web Semantics, Special Issue on Language Technology and Knowledge Graphs. This is a revision draft

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Locators

show

Creators

show

hide

Creators:
Ghosh, Shrestha¹, Author
Razniewski, Simon¹, Author
Weikum, Gerhard¹, Author

Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

show

hide

Free keywords: Computer Science, Databases, cs.DB,Computer Science, Information Retrieval, cs.IR

Abstract: Knowledge Bases (KBs) contain a wealth of structured information about
entities and predicates. This paper focuses on set-valued predicates, i.e., the
relationship between an entity and a set of entities. In KBs, this information
is often represented in two formats: (i) via counting predicates such as
numberOfChildren and staffSize, that store aggregated integers, and (ii) via
enumerating predicates such as parentOf and worksFor, that store individual set
memberships. Both formats are typically complementary: unlike enumerating
predicates, counting predicates do not give away individuals, but are more
likely informative towards the true set size, thus this coexistence could
enable interesting applications in question answering and KB curation.
In this paper we aim at uncovering this hidden knowledge. We proceed in two
steps. (i) We identify set-valued predicates from a given KB predicates via
statistical and embedding-based features. (ii) We link counting predicates and
enumerating predicates by a combination of co-occurrence, correlation and
textual relatedness metrics. We analyze the prevalence of count information in
four prominent knowledge bases, and show that our linking method achieves up to
0.55 F1 score in set predicate identification versus 0.40 F1 score of a random
selection, and normalized discounted gains of up to 0.84 at position 1 and 0.75
at position 3 in relevant predicate alignments. Our predicate alignments are
showcased in a demonstration system available at
https://counqer.mpi-inf.mpg.de/spo.

Details

show

hide

Language(s): eng - English

Dates: Created: 2020-03-06Modified: 2020-03-26Published Online: 2020

Publication Status: Published online

Pages: 15 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 2003.03155
URI: http://arxiv.org/abs/2003.03155
BibTex Citekey: Ghosh_arXiv2003.03155

Degree: -

Event

show

Legal Case

show

Project information

show

Source

show