  A Time Machine for Text Search

Berberich, K., Bedathur, S., Neumann, T., & Weikum, G. (2007). A Time Machine for Text Search. In C. Clarke, N. Fuhr, N. Kando, W. Kraaij, & A. P. de Vries (Eds.), SIGIR'07: 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 519-526). New York, NY, USA: ACM.

Files

sigir2007.pdf (any fulltext), 5KB
 
File permalink:
-
Name:
sigir2007.pdf
Description:
-
OA status:
Visibility:
Private
MIME type / checksum:
application/pdf
Technical metadata:
Copyright date:
-
Copyright info:
-
License:
-

Creators

 Creators:
Berberich, Klaus (1), Author
Bedathur, Srikanta (1), Author
Neumann, Thomas (1), Author
Weikum, Gerhard (1), Author
Affiliations:
(1) Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

Keywords: -
 Abstract: Text search over temporally versioned document collections such as web archives has received little attention as a research problem. As a consequence, there is no scalable and principled solution to search such a collection as of a specified time. In this work, we address this shortcoming and propose an efficient solution for time-travel text search by extending the inverted file index to make it ready for temporal search. We introduce approximate temporal coalescing as a tunable method to reduce the index size without significantly affecting the quality of results. In order to further improve the performance of time-travel queries, we introduce two principled techniques to trade off index size for its performance. These techniques can be formulated as optimization problems that can be solved to near-optimality. Finally, our approach is evaluated in a comprehensive series of experiments on two large-scale real-world datasets. Results unequivocally show that our methods make it possible to build an efficient "time machine" scalable to large versioned text collections.
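
The abstract names two ideas without giving details in this record: postings that carry a validity time interval, and approximate temporal coalescing. The following is only a minimal illustrative sketch of how such ideas could look, not the authors' implementation. The `Posting` layout, the greedy merge rule, the relative-error threshold `eps`, and the function names `coalesce` and `search_as_of` are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Posting:
    # Hypothetical posting layout: one entry per document version.
    doc_id: str
    score: float  # assumed relevance score of the term in this version
    begin: int    # first time point at which the version is valid
    end: int      # first time point at which it is no longer valid


def coalesce(postings: List[Posting], eps: float) -> List[Posting]:
    """Greedily merge temporally adjacent postings of the same document.

    A posting is absorbed into its predecessor when the predecessor's
    score approximates its own score within relative error eps
    (an assumed criterion, chosen for illustration).
    """
    result: List[Posting] = []
    for p in sorted(postings, key=lambda x: (x.doc_id, x.begin)):
        last = result[-1] if result else None
        if (last is not None
                and last.doc_id == p.doc_id
                and last.end == p.begin
                and abs(p.score - last.score) <= eps * abs(p.score)):
            # Extend the validity interval; keep the representative score.
            result[-1] = Posting(last.doc_id, last.score, last.begin, p.end)
        else:
            result.append(p)
    return result


def search_as_of(index: Dict[str, List[Posting]], term: str, t: int) -> List[Posting]:
    """Return postings for `term` that are valid at time t, best score first."""
    hits = [p for p in index.get(term, []) if p.begin <= t < p.end]
    return sorted(hits, key=lambda p: p.score, reverse=True)


# Three versions of one document; the first two have nearly equal scores.
versions = [
    Posting("d1", 0.50, 0, 10),
    Posting("d1", 0.52, 10, 20),
    Posting("d1", 0.90, 20, 30),
]
index = {"archive": coalesce(versions, eps=0.1)}
```

With `eps=0.1` the first two versions collapse into a single posting valid over [0, 20), so a query as of time 15 sees the representative score 0.50 rather than 0.52. Setting `eps=0` leaves all three postings in place, which is the size-versus-accuracy trade-off the abstract describes as tunable.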

Details

Language(s): eng - English
 Date: 2008-03-19; 2007
 Publication status: Published
 Pages: -
 Place, publisher, edition: -
 Table of contents: -
 Review type: -
 Identifiers: eDoc: 356467
DOI: 10.1145/1277741.1277831
Other: Local-ID: C12573CC004A8E26-A6B6F424AE7674F9C12572B90021A0B8-BerberichBNW2007az
 Degree type: -

Event

Title: SIGIR 2007
Venue: Amsterdam, Netherlands
Start/end date: 2007-07-23 - 2007-07-27

Source 1

Title: SIGIR'07 : 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Source genre: Proceedings
 Creators:
Clarke, Charlie, Editor
Fuhr, Norbert, Editor
Kando, Noriko, Editor
Kraaij, Wessel, Editor
de Vries, Arjen P., Editor
Affiliations:
-
Place, publisher, edition: New York, NY, USA : ACM
Pages: -
Volume / issue: -
Article number: -
Start / end page: 519 - 526
Identifier: ISBN: 978-1-59593-597-7