Data integrated from multiple sources may contain inconsistencies that violate integrity constraints. The constraint repair problem attempts to find "low cost" changes that, when applied, will cause the constraints to be satisfied. While in most previous work repair cost is stated in terms of tuple insertions and deletions, the authors follow recent work to define a database repair as a set of value modifications. In this context, this paper introduces a novel cost framework that allows for the application of techniques from record-linkage to the search for good repairs. The authors prove that finding minimal-cost repairs in this model is NP-complete in the size of the database, and introduce an approach to heuristic repair-construction based on equivalence classes of attribute values.
Related white papers
Live Webcast: Telecoms 2.0 - Where is telecoms heading?
Telecoms 2.0 - Where is telecoms heading? UK telecoms is at a crossroads. IT managers face new demands to enable flexible working, deliver converged networks and provide support for multiple applications...
Is Now the Time to Migrate to IP Telephony? Re-evaluating the Risks and Rewards
If your business finds itself with one or more aging digital PBXs, you are facing the challenge of deciding when a migration to a new telephony environment makes sense for...
Analysis of Wireline, Wireless, Voice and Data Services for Major Italian Telecom Operator WIND
WIND is a joint venture of France Telecom and Ente Nazionale per l'Energia (ENEL), the electrical provider for Italy. WIND needed a panoramic view of all aspects involved in telephone...
VoiceGenie Speech-Enables Companies With IBM and Linux Solution
To improve customer service, companies are turning to innovative firms such as VoiceGenie Technologies Inc. Based in Toronto, Canada, VoiceGenie is a next-generation interactive voice response (IVR) system provider focused...
Securing Voice Networks
Converged voice and data communication can be more secure than traditional PSTN communication with proper planning and design. Voice-related threats include toll fraud, denial of service attacks, impersonation exploits, and...
Leveraging the Power of Unified Application and Data Integration
This webcast focuses on how customers, such as Merial, are using TIBCO DataExchange with TIBCO BusinessWorks to rapidly deploy integration solutions to address challenges around data synchronization, business intelligence, and...
Envision Financial Banks on IP Telephony to Improve Customer Relationships
Envision Financial is Canada's third largest credit union which provides financial services. In order for the merger to be successful, Envision Financial needed to integrate the existing independent networks, with...


