An Instance Based Schema Matching Between Opaque Database Schemas
Univ Faisalabad, Sch Elect Engn, Faisalabad, Pakistan.
Univ Faisalabad, Sch Elect Engn, Faisalabad, Pakistan.
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science and Engineering.
2014 (English). In: 2014 4th International Conference on Engineering Technology and Technopreneurship (ICE2T), IEEE Computer Society, 2014, 177-182 p. Conference paper (Refereed)
Abstract [en]

Schema matching is needed among the schemas of relational datasets in database integration applications, and it plays a significant role in heterogeneous database integration. Most previous solutions to the schema matching problem are based on identifying similarity between column names or on recognizing common domains in the data stored in the schemas. These approaches are not applicable to datasets with unaligned schemas where both the column names and the data in the columns are opaque. In this paper we propose an instance-based approach to find the matching between the schemas of heterogeneous datasets that share common primary keys, where it is unknown which columns are the primary keys. The proposed approach consists of two main phases: Row Similarity and Attribute Similarity. In the row similarity phase, the approach determines all pairs of rows across the datasets that represent the same real-world entity, based on identical primary key values. In the attribute similarity phase, the approach finds the corresponding attributes by comparing the data values within those similar pairs of rows. Experiments on real-world datasets are performed to validate the proposed approach. The results demonstrate its viability.
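The two phases described in the abstract can be illustrated with a minimal sketch. The record does not give the paper's actual algorithm, so everything here is an assumption made for illustration: the function names (`unique_columns`, `row_similarity`, `attribute_similarity`), the heuristic of treating all-distinct columns as key candidates and picking the pair that shares the most values, the agreement `threshold`, and the greedy one-to-one assignment.

```python
from itertools import product


def unique_columns(rows):
    """Return indices of columns whose values are all distinct.

    Assumption: such columns are treated as primary-key candidates.
    Expects a non-empty list of equal-length tuples.
    """
    n_cols = len(rows[0])
    return [c for c in range(n_cols)
            if len({r[c] for r in rows}) == len(rows)]


def row_similarity(left, right):
    """Phase 1 (assumed heuristic): choose the pair of key-candidate
    columns that shares the most values, then pair rows that agree on it."""
    best = (0, None, None)
    for i, j in product(unique_columns(left), unique_columns(right)):
        shared = len({r[i] for r in left} & {r[j] for r in right})
        if shared > best[0]:
            best = (shared, i, j)
    _, i, j = best
    if i is None:
        return None, []  # no overlapping key candidates found
    index = {r[j]: r for r in right}
    pairs = [(l, index[l[i]]) for l in left if l[i] in index]
    return (i, j), pairs


def attribute_similarity(pairs, threshold=0.8):
    """Phase 2 (assumed heuristic): score every column pair by the fraction
    of matched row pairs with equal values, then assign greedily, best first."""
    if not pairs:
        return {}
    n_left, n_right = len(pairs[0][0]), len(pairs[0][1])
    scores = []
    for a, b in product(range(n_left), range(n_right)):
        agree = sum(1 for l, r in pairs if l[a] == r[b]) / len(pairs)
        if agree >= threshold:
            scores.append((agree, a, b))
    mapping, used = {}, set()
    for _, a, b in sorted(scores, key=lambda s: (-s[0], s[1], s[2])):
        if a not in mapping and b not in used:
            mapping[a] = b
            used.add(b)
    return mapping


# Hypothetical toy data: column names are unknown, values are opaque codes.
left = [(101, "x9", 7), (102, "k2", 3), (103, "m4", 7)]
right = [("k2", 3, 102, "q"), ("x9", 7, 101, "w"), ("zz", 5, 999, "e")]

key, pairs = row_similarity(left, right)
mapping = attribute_similarity(pairs)
print(key, mapping)
```

In this toy example the left table's column 0 and the right table's column 2 are picked as the shared key, and the remaining columns are then aligned from the values in the two matched row pairs. A real dataset would need tolerance for dirty values and ties, which this sketch does not attempt.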

Place, publisher, year, edition, pages
IEEE Computer Society, 2014. 177-182 p.
Keyword [en]
Attribute identification, Data integration, Schema matching, Heterogeneous databases, Relational datasets
National Category
Computer Science Information Systems
URN: urn:nbn:se:bth-10857
ISI: 000360302900036
ISBN: 978-1-4799-4621-1
OAI: diva2:862002
4th International Conference on Engineering Technology and Technopreneurship (ICE2T), Kuala Lumpur, Malaysia
Available from: 2015-10-20. Created: 2015-10-20. Last updated: 2016-02-02. Bibliographically approved.

