Using query interfaces of different Web databases, we propose a new complex schema matching approach, Parallel Schema Matching (PSM). A parallel schema is formed by comparing two individual schemas and deleting common attributes. The attribute matching can be discovered from the attribute-occurrence patterns if many parallel schemas are available. A count-based greedy algorithm identifies which attributes are more likely to be matched. Experiments show that PSM can identify both simple matching and complex matching accurately and efficiently.
Citation:
Weifeng Su, Jiying Wang, Frederick Lochovsky, "Holistic Query Interface Matching using Parallel Schema Matching," icde, pp.122, 22nd International Conference on Data Engineering (ICDE'06), 2006