Using Knowledge Graphs for Record Linkage: Challenges and Opportunities

AS Andreou, D Firmani, JG Mathew, M Mecella… - International Conference …, 2023 - Springer
International Conference on Advanced Information Systems Engineering, 2023Springer
In this paper, we explore how Knowledge Graphs (KGs) can potentially benefit Record
Linkage (RL). RL is the process of identifying and resolving duplicate records across
different data sources, including structured, semi-structured, and unstructured data (eg, in
data lakes). RL is a critical task for information systems that rely on data to make decisions
and is used in a wide variety of fields such as healthcare, finance, government and
marketing. Due to recent advances in machine learning, there has been a significant …
Abstract
In this paper, we explore how Knowledge Graphs (KGs) can potentially benefit Record Linkage (RL). RL is the process of identifying and resolving duplicate records across different data sources, including structured, semi-structured, and unstructured data (e.g., in data lakes). RL is a critical task for information systems that rely on data to make decisions and is used in a wide variety of fields such as healthcare, finance, government and marketing. Due to recent advances in machine learning, there has been a significant progress in building automated RL methods. However, when dealing with vertical applications, featuring specialized domains such as a particular hospital or industry, human experts are still required to enter domain-specific knowledge, making RL prohibitively expensive. Despite KGs can be powerful tools to represent and derive domain-specific knowledge, their application to RL has been overlooked. Inspired by a healthcare case study in the Republic of Cyprus, we aim at filling this gap by identifying challenges and opportunities of using KGs to reduce the effort of solving RL in vertical applications.
Springer
Showing the best result for this search. See all results