English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Information Retrieval by Dimension Reduction A compartive Study

Parreira, J. X. (2003). Information Retrieval by Dimension Reduction A compartive Study. Master Thesis, Universität des Saarlandes, Saarbrücken.

Item is

Files

show Files
hide Files
:
Master-Parreira-Josaine-2003.pdf (Any fulltext), 3MB
 
File Permalink:
-
Name:
Master-Parreira-Josaine-2003.pdf
Description:
-
OA-Status:
Visibility:
Restricted (Max Planck Institute for Informatics, MSIN; )
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Parreira, Josiane Xavier1, Author           
Bast, Holger2, Advisor           
Weikum, Gerhard1, Referee           
Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018              
2Algorithms and Complexity, MPI for Informatics, Max Planck Society, ou_24019              

Content

show
hide
Free keywords: -
 Abstract: In this work we present a study of different techniques for semantic indexing by dimension reduction, with special emphasis on the LSI technique. Dimension reduction is important in the Information Retrieval (IR) context to enable fast retrieval and elimination of noisy data. LSI attempts to improve IR quality by deriving a latent semantic space with lower dimensionality, based on the co-occurrence of the terms in the documents from the document collection. It is a heuristic method and although experiments show that the LSI technique often improves the retrieval performance, there are deficiencies regarding mathematical models and rigorous theorems. Several variants of the LSI technique have been proposed, which differ in the function used for the mapping to the lower-dimensional space. Our comparative study is carried out using mathematical tools, like Linear Algebra, and systematic experiments. We present a theoreticla analysis of the two main LSI variants found in the literature - we call them Angle-stretching LSI and Angle-preserving LSI - and we prove that the results of the two can, in principle, arbitrarily, differ. The experiments reveal interesting features of the LSI variants and the differences in their behavior. In our experiments, the Angle-stretching LSI performs consistently worse than the Angle-preserving LSI.

Details

show
hide
Language(s): eng - English
 Dates: 20032003
 Publication Status: Issued
 Pages: -
 Publishing info: Saarbrücken : Universität des Saarlandes
 Table of Contents: -
 Rev. Type: -
 Identifiers: BibTex Citekey: XavierParreira2003
 Degree: Master

Event

show

Legal Case

show

Project information

show

Source

show