Efficient Distributed Skylining for Web Information Systems
Author(s):
- Balke, Wolf-Tilo
- Guentzer, Ulrich
- Zheng, Jason Xin
- et al.
Published Web Location: http://springerlink.metapress.com/app/home/contribution.asp?wasp=7pelnglyvj7qhul3hjf3&referrer=parent&backto=issue,16,62;journal,397,1931;linkingpublicationresults,1:105633,1
Though skyline queries have already claimed their place in retrieval over central databases, their application in Web information systems has so far been impossible because retrieval over Web sources is distributed. Yet given the amount, variety, and volatile nature of the information accessible over the Internet, extended query capabilities are crucial. We show how to efficiently perform distributed skyline queries and thus essentially extend the expressiveness of querying today's Web information systems. Together with our innovative retrieval algorithm, we also present useful heuristics that further speed up retrieval in most practical cases, paving the road towards meeting even the real-time challenges of on-line information services. We discuss performance evaluations and point to open problems in the concept and application of skylining in modern information systems. For the curse of dimensionality, an intrinsic problem in skyline queries, we propose a novel sampling scheme that gives an early impression of the skyline for subsequent query refinement.
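For readers new to the term, the following is a minimal sketch of what a skyline query returns: the set of items that no other item dominates, where one item dominates another if it is at least as good in every dimension and strictly better in at least one. This is a naive, centralized, quadratic-time illustration only. It is not the distributed algorithm, the heuristics, or the sampling scheme described in the paper. The function names, the "smaller is better" convention, and the hotel data are assumptions made for the example.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if a dominates b (assumption: smaller is better in every dimension)."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def skyline(points: List[Point]) -> List[Point]:
    """Naive skyline: keep every point that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical data: (price per night, distance to the beach in km).
hotels = [(50, 8), (80, 2), (60, 5), (90, 3), (70, 6)]
print(skyline(hotels))
```

In this made-up data set, (90, 3) is dominated by (80, 2) and (70, 6) is dominated by (60, 5), so neither belongs to the skyline. The remaining three hotels each represent a different trade-off between price and distance. In a Web setting, each dimension would typically come from a separate source, which is the distributed case the abstract addresses.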