One of the most critical steps in integrating heterogeneous e-Business applications that use different XML schemas is schema mapping, which is known to be costly and error-prone. Past schema mapping research has not fully utilized the semantic information in XML schemas. In this paper, we propose a semantic similarity analysis approach to facilitate XML schema mapping, merging, and reuse. Several key innovations are introduced, including 1) a layered semantic structure of XML schema; 2) layer-specific similarity measures using an information content-based approach; and 3) an approach for integrating similarities at all layers. Experimental results using two different schemas from the Automotive Industry Action Group demonstrate that the proposed approach is valuable for addressing difficulties in XML schema mapping.
Proceedings of the 2008 IEEE International Conference on Information Reuse and Integration
July 13-15, 2008
Las Vegas, NV
XML Schema, e-Business Integration, Schema Matching, Mapping, Merging, Reuse, Similarity Measure, Information Content