Classification and Intelligent Search on Information in XML

Fuhr, Norbert; Weikum, Gerhard

Local TagsRelease HistoryDetailsSummary

Classification and Intelligent Search on Information in XML

Fuhr, N., & Weikum, G. (2002). Classification and Intelligent Search on Information in XML. IEEE Data Engineering Bulletin, 25(1), 51-58.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-30C7-5 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-30C8-3

Genre: Journal Article

Files

show Files

hide Files

:

fuhrW02.pdf (Any fulltext), 48KB

File Permalink:
-

Name:
fuhrW02.pdf

Description:
-

OA-Status:

Visibility:
Private

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Fuhr, Norbert, Author
Weikum, Gerhard¹, Author

Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

show

hide

Free keywords: -

Abstract: {XML} will be the method of choice for representing all kinds of documents in product catalogs, digital libraries, scientific data repositories, and across the Web. This observation creates high expectations that {XML} will be a major catalyst in constructing the “Semantic Web”. However, merely casting all documents into {XML} format does not necessarily make a document’s semantics explicit and more amenable for effective information searching. Rather, to fully leverage {XML} on a global scale, significant progress is needed on the following issues: 1. providing an easy-to-use yet powerful and efficient search language that combines concepts from current {XML} pattern-matching languages (e.g., {XP}ath, {XQ}uery, etc.) with ontology-backed information-retrievalstyle search result ranking, 2. extracting more semantics from existing document collections by constructing structural and ontological skeletons (e.g., in the form of {DTD}s or {XML} schemas) that describe the data at a higher semantic level and can also facilitate new forms of indexing for efficiency, and 3. classifying existing documents according to a given thematic or personalized, hierarchical ontology to make searching more effective (e.g., exploit relevance feedback) and efficient (e.g., limit the search focus). {CLASSIX}, a joint project of the Universities of Dortmund and the Saarland in Germany, addresses these three issues. We describe our approaches for each of these topics in the remainder of this paper.

Details

show

hide

Language(s): eng - English

Dates: Modified: 2007-04-13Date issued: 2002

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: Peer

Identifiers: eDoc: 520372
Other: Local-ID: C1256DBF005F876D-49001CE30715F797C12572BC00404DAC-FuhrW02

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: IEEE Data Engineering Bulletin

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: 25 (1) Sequence Number: - Start / End Page: 51 - 58 Identifier: -