XyView: Universal Relations Revisited
15 pages
English

Dan Vodislav, CNAM/CEDRIC, Paris, France, vodislav@cnam.fr
Sophie Cluet, INRIA, Rocquencourt, France, Sophie.Cluet@inria.fr
Gregory Corona, Xyleme, Paris, France, Gregory.Corona@xyleme.com
Imen Sebei, CNAM/CEDRIC, Paris, France, imen.sebei@cnam.fr

Abstract

We present XyView, a practical solution for fast development of user-oriented (web forms) and machine-oriented (web services) applications over a repository of heterogeneous, schema-free XML documents. XyView provides the means to view such a repository as an array that can be queried using a QBE-like interface or through simple selection/projection queries. It extends the concept of universal relations mainly in two ways: (i) the input is not a relational schema but a potentially large set of XML data guides; (ii) the view is not defined explicitly by a unique query but implicitly by various mappings, so as to avoid the data loss and duplicates generated by joins. Developed on top of the Xyleme content management system, XyView can easily be adapted to any system supporting XQuery.

Keywords: XML views, heterogeneous data integration, application development tools, universal relation

1 Introduction

For decades, companies have produced digital data such as notes, contracts, emails, progress reports, minutes, etc. This data constitutes a mine of useful information that is largely unexploited. The advent of XML provides the opportunity to change that. Many enterprises are now considering storing their home data in XML repositories so as to be able to query them in a significant way, i.e., with tools more sophisticated than full-text search engines. In this paper, we address the problem of querying such repositories. More precisely, we are interested in developing, easily and quickly, simple query APIs (web services) or user interfaces (web forms) over these repositories.

An important characteristic of the applications we are considering is that they deal with legacy data that have mostly been produced by human beings using standard text editors. As a result, the data is (i) poorly typed (well-formed rather than valid XML) and (ii) highly heterogeneous (although documents have strong semantic connections). These features are particularly challenging since they call for sophisticated tools to ease the application programmer's task while at the same time disabling most existing approaches.

The solution we propose borrows from the universal relation paradigm of the seventies [18]: XyView provides the means to easily view a set of heterogeneous XML documents as a single array that can be queried through simple selections and projections. Obviously, the context being XML, the array contains XML subtrees and is built using XQuery. But the fundamental differences between universal relations and our approach are the following:

  • The array is not defined by one query but by a specification of how a simple selection-projection user query is to be translated into an XQuery. This difference is important. The problem with universal relations is that, unless the database schema has particularly nice properties, which is rarely the case, projection operations generate many duplicates that are not always easy to remove. This is due to the join operations entering the definition of the universal relation. Alternatively, the join operations can also be the cause of missing information. This is usually solved by intro-
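
The two join-related problems described in the introduction, duplicates under projection and missing information, can be reproduced on a toy universal relation. The following is a minimal sketch and is not taken from the paper: the relations `works_on` and `uses` and the helpers `natural_join` and `project` are hypothetical names introduced only for illustration.

```python
# Hypothetical toy relations (assumed for illustration, not from the paper).
# works_on(employee, project) and uses(project, tool)
works_on = [("ann", "p1"), ("bob", "p1"), ("cat", "p3")]
uses = [("p1", "xquery"), ("p1", "sql"), ("p2", "xslt")]


def natural_join(left, right):
    """Inner join on the shared project column.

    left rows are (employee, project); right rows are (project, tool).
    """
    return [
        (emp, proj, tool)
        for (emp, proj) in left
        for (other_proj, tool) in right
        if proj == other_proj
    ]


def project(rows, *indexes):
    """Bag projection: keep the selected columns without removing duplicates."""
    return [tuple(row[i] for i in indexes) for row in rows]


# The "universal relation" here is simply the join of the two base relations.
universal = natural_join(works_on, uses)

# Projecting on employee yields one row per (employee, tool) pair,
# so every employee on project p1 appears once per tool used by p1.
employees = project(universal, 0)

# Projecting on project shows which projects survived the join.
projects = project(universal, 1)

print(universal)
print(employees)
print(projects)
```

In this sketch, `ann` and `bob` each appear twice in `employees` because project `p1` uses two tools, while `cat` (whose project has no tool) and `p2` (which has no employee) disappear from the joined result altogether.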
