Efficient Query Processing and Index Tuning Using Proximity Scores

Broschart, Andreas

doi:10.22028/D291-26400

Broschart, A. (2012). Efficient Query Processing and Index Tuning Using Proximity Scores. PhD Thesis, Universität des Saarlandes, Saarbrücken. doi:10.22028/D291-26400.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0014-6275-D
Version Permalink: https://hdl.handle.net/21.11116/0000-000C-480F-4

Genre: Thesis

Files

Locators

Locator:
http://scidok.sulb.uni-saarland.de/volltexte/2012/4981/ (Any fulltext) Open Access Green

Description:
-

OA-Status:
Green

Locator:
http://scidok.sulb.uni-saarland.de/doku/lic_ohne_pod.php?la=de (Copyright transfer agreement) Open Access status unknown

Description:
-

OA-Status:
Not specified

Creators

Creators:
Broschart, Andreas¹,², Author
Schenkel, Ralf¹, Advisor
Suel, Torsten³, Advisor

Affiliations:
1 Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018
2 International Max Planck Research School, MPI for Informatics, Max Planck Society, ou_1116551
3 External Organizations, ou_persistent22

Content

Free keywords: -

Abstract: In the presence of growing data, the need for efficient query processing under
result quality and index size control becomes more and more a challenge to
search engines. We show how to use proximity scores to make query processing
effective and efficient with focus on either of the optimization goals.
More precisely, we make the following contributions:
• We present a comprehensive comparative analysis of proximity score models and
a rigorous analysis of the potential of phrases and adapt a leading proximity
score model for XML data.
• We discuss the feasibility of all presented proximity score models for top-k
query processing and present a novel index combining a content and proximity
score that helps to accelerate top-k query processing and improves result
quality.
• We present a novel, distributed index tuning framework for term and term pair
index lists that optimizes pruning parameters by means of well-defined
optimization criteria under disk space constraints. Indexes can be tuned with
emphasis on efficiency or effectiveness: the resulting indexes yield fast
processing at high result quality.
• We show that pruned index lists processed with a merge join outperform top-k
query processing with unpruned lists at a high result quality.
• Moreover, we present a hybrid index structure for improved cold cache run
times.

Details

Language(s): eng - English

Dates: Accepted: 2012-10-09; Published Online: 2012; Date issued: 2012

Publication Status: Issued

Pages: -

Publishing info: Saarbrücken : Universität des Saarlandes

Table of Contents: -

Rev. Type: -

Identifiers: eDoc: 647546
Other: Local-ID: C1256DBF005F876D-DE4B2520B99264A3C1257B1900434A8C-Broschart_PhD2012
BibTex Citekey: Broschart_PhD2012
DOI: 10.22028/D291-26400
URN: urn:nbn:de:bsz:291-scidok-49816
Other: hdl:20.500.11880/26456

Degree: PhD
