Factoring Out Prior Knowledge from Low-Dimensional Embeddings

Heiter, Edith; Fischer, Jonas; Vreeken, Jilles

Please note that a newer version of this record is available:
https://pure.mpg.de/pubman/item/item_3289867_2

Released

Research paper

MPG authors

Fischer, Jonas
Databases and Information Systems, MPI for Informatics, Max Planck Society;

Full texts (freely accessible)

arXiv:2103.01828.pdf
(Preprint), 13MB

Citation

Heiter, E., Fischer, J., & Vreeken, J. (2021). Factoring Out Prior Knowledge from Low-Dimensional Embeddings. Retrieved from https://arxiv.org/abs/2103.01828.

Citation link: https://hdl.handle.net/21.11116/0000-0008-16ED-5

Abstract

Low-dimensional embedding techniques such as tSNE and UMAP allow visualizing
high-dimensional data and thereby facilitate the discovery of interesting
structure. Although they are widely used, they visualize data as is, rather
than in light of the background knowledge we have about the data. What we
already know, however, strongly determines what is novel and hence interesting.
In this paper we propose two methods for factoring out prior knowledge in the
form of distance matrices from low-dimensional embeddings. To factor out prior
knowledge from tSNE embeddings, we propose JEDI, which adapts the tSNE objective
in a principled way using Jensen-Shannon divergence. To factor out prior
knowledge from any downstream embedding approach, we propose CONFETTI, in which
we directly operate on the input distance matrices. Extensive experiments on
both synthetic and real-world data show that both methods work well, providing
embeddings that exhibit meaningful structure that would otherwise remain
hidden.
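
For readers unfamiliar with the Jensen-Shannon divergence named in the abstract, the following is a minimal sketch of its textbook definition for two discrete probability distributions. It is not the JEDI objective, which this record does not spell out; the function names kl_divergence and js_divergence and the example vectors are illustrative assumptions.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) in nats.

    Terms with p_i == 0 contribute nothing, by the usual convention.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence: mean KL divergence of p and q to their mixture."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


# Hypothetical example distributions (not taken from the paper).
p = [0.5, 0.5, 0.0]
q = [0.0, 0.5, 0.5]

print(js_divergence(p, p))  # identical distributions
print(js_divergence(p, q))  # partially overlapping distributions
```

Unlike the Kullback-Leibler divergence, this quantity is symmetric in its arguments and bounded above by log 2 when measured in nats.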