| Publisher | Association for Computing Machinery | ||
|---|---|---|---|
| Format | 112.6KB PDF | Date added | 25 Apr 2008 |
| Topics | XML, Programming Languages, Data Acquisition - ETL | ||
| Downloads | 3 | ||
This paper proposes a method for classifying XML documents and extracting XML schemas from them by inductive inference based on constraint logic programming. The goal of this work is to assign types to a large collection of XML documents approximately but efficiently. The method can also process XML documents written against different schemas, or documents that have no schema at all. The approach is intended to identify documents based on their syntax and semantics, using ontology-based information extraction, and to support retrieval and data management.
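To make the idea of typing schema-less XML concrete, here is a minimal Python sketch. It is not the paper's method: it uses exact structural matching in place of inductive inference and constraint logic programming, and it ignores the ontology-based semantic step entirely. The function names `infer_schema` and `classify` and the sample documents are hypothetical, introduced only for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def infer_schema(xml_text):
    """Return a crude structural schema: each element tag paired with
    the sorted set of child tags observed under it (assumption: order
    and repetition of children are ignored)."""
    root = ET.fromstring(xml_text)
    children = defaultdict(set)
    for elem in root.iter():
        # Touching children[elem.tag] registers leaf elements too.
        children[elem.tag].update(child.tag for child in elem)
    return frozenset((tag, tuple(sorted(kids))) for tag, kids in children.items())


def classify(documents):
    """Group document names whose inferred schemas are identical."""
    groups = defaultdict(list)
    for name, text in documents.items():
        groups[infer_schema(text)].append(name)
    return list(groups.values())


# Hypothetical sample collection with no declared schema.
docs = {
    "a.xml": "<book><title>X</title><author>Y</author></book>",
    "b.xml": "<book><author>Z</author><title>W</title></book>",
    "c.xml": "<order><item>1</item><item>2</item></order>",
}
groups = classify(docs)
```

In this sketch the two `book` documents fall into one group despite their different child order, while the `order` document forms its own group. An approximate approach like the one the abstract describes would presumably also tolerate documents whose structures differ slightly, which exact matching does not.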