Combining near-optimal feature selection with gSpan

Borgwardt, KM; Yan, X; Thoma, M; Cheng, H; Gretton, A; Song, L; Smola, A; Han, J; Hu, P; Kriegel, H-P

Lokale TagsFreigabegeschichteDetailsÜbersicht

Combining near-optimal feature selection with gSpan

Borgwardt, K., Yan, X., Thoma, M., Cheng, H., Gretton, A., Song, L., et al. (2008). Combining near-optimal feature selection with gSpan. In 6th International Workshop on Mining and Learning with Graphs (MLG 2008) (pp. 1-3).

Item is Freigegeben

einblenden: alle

Basisdaten

ausblenden:

Datensatz-Permalink: https://hdl.handle.net/21.11116/0000-0003-3D79-2 Versions-Permalink: https://hdl.handle.net/21.11116/0000-0003-3D7A-1

Genre: Konferenzbeitrag

Dateien

ausblenden: Dateien

:

MLG-2008-Borgwardt.pdf (beliebiger Volltext), 119KB

Öffnen Speichern

Datei-Permalink:
https://hdl.handle.net/21.11116/0000-0003-3D7B-0

Name:
MLG-2008-Borgwardt.pdf

Beschreibung:
-

OA-Status:

Sichtbarkeit:
Öffentlich

MIME-Typ / Prüfsumme:
application/pdf / [MD5]

Technische Metadaten:

Öffnen

Copyright Datum:
-

Copyright Info:
-

Lizenz:
-

Externe Referenzen

ausblenden:

externe Referenz:
http://research.ics.aalto.fi/events/MLG08/programme.html (Verlagsversion) Open Access Status unbekannt

Beschreibung:
-

OA-Status:

Urheber

ausblenden:

Urheber:
Borgwardt, KM, Autor
Yan, X, Autor
Thoma, M, Autor
Cheng, H, Autor
Gretton, A^{1, 2}, Autor
Song, L, Autor
Smola, A, Autor
Han, J, Autor
Hu, P, Autor
Kriegel, H-P, Autor

Affiliations:
1Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497794
2Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795

Inhalt

ausblenden:

Schlagwörter: -

Zusammenfassung: Graph classification is an increasingly important step in numerous application domains, such as function prediction of molecules and proteins, computerised scene analysis, and anomaly detection in program flows. Among the various approaches proposed in the literature, graph classification based on frequent subgraphs is a popular branch: Graphs are represented as (usually binary) vectors, with components indicating whether a graph contains a particular subgraph that is frequent across the dataset. On large graphs, however, one faces the enormous problem that the number of these frequent subgraphs may grow exponentially with the size of the graphs, but only few of them possess enough discriminative power to make them useful for graph classification. Efficient and discriminative feature selection among frequent subgraphs is hence a key
challenge for graph mining. In this article, we propose an approach to feature selection on frequent subgraphs, called CORK, that combines two central advantages. First, it optimises a submodular quality criterion, which means that we can yield a near-optimal solution using greedy feature
selection. Second, our submodular qual-
ity function criterion can be integrated into
gSpan, the state-of-the-art tool for frequent
subgraph mining, and help to prune the
search space for discriminative frequent subgraphs even during frequent subgraph mining.

Details

ausblenden:

Sprache(n):

Datum: Erschienen: 2008-07

Publikationsstatus: Erschienen

Seiten: -

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: -

Art des Abschluß: -

Veranstaltung

ausblenden:

Titel: 6th International Workshop on Mining and Learning with Graphs (MLG 2008)

Veranstaltungsort: Helsinki, Finland

Start-/Enddatum: 2008-07-04 - 2008-07-05

Entscheidung

einblenden:

Projektinformation

einblenden:

Quelle 1

ausblenden:

Titel: 6th International Workshop on Mining and Learning with Graphs (MLG 2008)

Genre der Quelle: Konferenzband

Urheber:

Affiliations:

Ort, Verlag, Ausgabe: -

Seiten: - Band / Heft: - Artikelnummer: - Start- / Endseite: 1 - 3 Identifikator: -