Testing Conditional Independence on Discrete Data using Stochastic Complexity

Marx, Alexander; Vreeken, Jilles

Lokale TagsFreigabegeschichteDetailsÜbersicht

Testing Conditional Independence on Discrete Data using Stochastic Complexity

Marx, A., & Vreeken, J. (2019). Testing Conditional Independence on Discrete Data using Stochastic Complexity. Retrieved from http://arxiv.org/abs/1903.04829.

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/21.11116/0000-0004-027A-1 Versions-Permalink: https://hdl.handle.net/21.11116/0000-0004-027B-0

Genre: Forschungspapier

Dateien

einblenden: Dateien

ausblenden: Dateien

:

arXiv:1903.04829.pdf (Preprint), 923KB

Öffnen Speichern

Datei-Permalink:
https://hdl.handle.net/21.11116/0000-0004-027C-F

Name:
arXiv:1903.04829.pdf

Beschreibung:
File downloaded from arXiv at 2019-07-10 09:24 accepted at AISTATS'19, the proposed test was released in the R package SCCI

OA-Status:

Sichtbarkeit:
Öffentlich

MIME-Typ / Prüfsumme:
application/pdf / [MD5]

Technische Metadaten:

Öffnen

Copyright Datum:
-

Copyright Info:
-

Lizenz:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Externe Referenzen

einblenden:

Urheber

einblenden:

ausblenden:

Urheber:
Marx, Alexander¹, Autor
Vreeken, Jilles¹, Autor

Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Inhalt

einblenden:

ausblenden:

Schlagwörter: Statistics, Machine Learning, stat.ML,Computer Science, Learning, cs.LG

Zusammenfassung: Testing for conditional independence is a core aspect of constraint-based
causal discovery. Although commonly used tests are perfect in theory, they
often fail to reject independence in practice, especially when conditioning on
multiple variables.
We focus on discrete data and propose a new test based on the notion of
algorithmic independence that we instantiate using stochastic complexity.
Amongst others, we show that our proposed test, SCI, is an asymptotically
unbiased as well as $L_2$ consistent estimator for conditional mutual
information (CMI). Further, we show that SCI can be reformulated to find a
sensible threshold for CMI that works well on limited samples. Empirical
evaluation shows that SCI has a lower type II error than commonly used tests.
As a result, we obtain a higher recall when we use SCI in causal discovery
algorithms, without compromising the precision.

Details

einblenden:

ausblenden:

Sprache(n): eng - English

Datum: Erstellt: 2019-03-12Online veröffentlicht: 2019

Publikationsstatus: Online veröffentlicht

Seiten: 18 p.

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: arXiv: 1903.04829
URI: http://arxiv.org/abs/1903.04829
BibTex Citekey: Marx_arXiv1903.04829

Art des Abschluß: -

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle