Object-Relational Database Representations for Text Indexing
Authors:
Panagiotis Papadakos,
Yannis Theoharis,
Yannis Marketakis,
Nikos Armenatzoglou,
Yannis Tzitzikas
Abstract:
One of the distinctive features of Information Retrieval systems compared to Database Management Systems is that they offer better compression for posting lists, resulting in better I/O performance and thus faster query evaluation. In this paper, we introduce database representations of the index that reduce the size (and thus the disk I/Os) of the posting lists. This is achieved not by redesigning the DBMS, but by exploiting the non-1NF features that existing Object-Relational DBMSs (ORDBMSs) already offer. Specifically, four different database representations are described, and detailed experimental results for one million pages are reported. Three of these representations are an order of magnitude more space efficient and faster (in query evaluation) than the plain relational representation.
Submitted 17 June, 2009;
originally announced June 2009.
The Anatomy of Mitos Web Search Engine
Authors:
Panagiotis Papadakos,
Giorgos Vasiliadis,
Yannis Theoharis,
Nikos Armenatzoglou,
Stella Kopidaki,
Yannis Marketakis,
Manos Daskalakis,
Kostas Karamaroudis,
Giorgos Linardakis,
Giannis Makrydakis,
Vangelis Papathanasiou,
Lefteris Sardis,
Petros Tsialiamanis,
Georgia Troullinou,
Kostas Vandikas,
Dimitris Velegrakis,
Yannis Tzitzikas
Abstract:
Engineering a Web search engine that offers effective and efficient information retrieval is a challenging task. This document presents our experiences from designing and developing a Web search engine offering a wide spectrum of functionalities, and reports some interesting experimental results. A rather peculiar design choice of the engine is that its index is based on a DBMS, while some of the distinctive functionalities that are offered include advanced Greek language stemming, real-time result clustering, and advanced link analysis techniques (also for spam page detection).
Submitted 16 March, 2008; v1 submitted 14 March, 2008;
originally announced March 2008.