Abstract
We report on a database project which is in the process of linking 29 million vital event records encompassing the entire population of Scotland from 1856 until 1973. Since these records contain no common identifiers, the challenge is to form a pedigree by performing probabilistic linkage over the records. We describe the linkage methodology used to create links between records, for example identifying the birth and marriage records of a single person, and discuss the database technologies employed in the project. A graph database (Neo4j) is used to store both the original vital event records and the links made between them. A metric index is used to find potential links efficiently. Finally, we demonstrate how linkage can be improved by augmenting links based on record distance thresholds with local graph analysis.
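The idea of linking records by a distance threshold can be illustrated with a minimal Python sketch. It is not the project's implementation: the `Record` fields, the summed edit-distance metric, the threshold value, and the sample names are all assumptions made for illustration, and a plain linear scan stands in for the metric index and the Neo4j store described in the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """Hypothetical, heavily simplified vital event record."""
    record_id: str
    forename: str
    surname: str
    birth_year: str


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance between two strings."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def record_distance(x: Record, y: Record) -> int:
    """Sum of per-field edit distances (an illustrative choice of metric)."""
    return (levenshtein(x.forename, y.forename)
            + levenshtein(x.surname, y.surname)
            + levenshtein(x.birth_year, y.birth_year))


def link_by_threshold(births, marriages, threshold):
    """Return (birth_id, marriage_id, distance) for pairs within the threshold.

    The nested loop is a stand-in for a metric index, which would avoid
    comparing every pair of records.
    """
    links = []
    for b in births:
        for m in marriages:
            d = record_distance(b, m)
            if d <= threshold:
                links.append((b.record_id, m.record_id, d))
    return links


# Invented sample data, not taken from the real records.
births = [Record("B1", "Margaret", "McDonald", "1860"),
          Record("B2", "John", "Stewart", "1858")]
marriages = [Record("M1", "Margret", "MacDonald", "1860"),
             Record("M2", "James", "Stuart", "1859")]

links = link_by_threshold(births, marriages, threshold=2)
print(links)
```

With this invented data, only the pair whose names differ by small spelling variations falls within the threshold. A threshold alone cannot tell which of several close candidates is the true match, which is the gap the abstract proposes to address with local graph analysis.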
Original language | English |
---|---|
Pages | 291-302 |
Number of pages | 12 |
Publication status | Published - 5 Jul 2023 |
Event | 31st Italian Symposium on Advanced Database Systems, Galzignano Terme, Italy. Duration: 2 Jul 2023 → 5 Jul 2023. https://sebd2023.dei.unipd.it |
Conference
Conference | 31st Italian Symposium on Advanced Database Systems |
---|---|
Abbreviated title | SEBD 2023 |
Country/Territory | Italy |
Period | 2/07/23 → 5/07/23 |
Internet address | https://sebd2023.dei.unipd.it |
Keywords
- Metric indexing
- Metric search
- Data linkage
- Graph databases
- Similarity search
Fingerprint
Dive into the research topics of 'An approach to population linkage using graph databases'. Together they form a unique fingerprint.
- ADR UK Programme: University of Edinburgh 2022-2026 ADR UK Programme
  Dearle, A. (PI), Akgun, O. (CoI) & Kirby, G. N. C. (CoI)
  Economic & Social Research Council
  1/04/22 → 31/03/26
  Project: Standard
- Digitising Scotland: Digitising Scotland
  Kirby, G. N. C. (PI)
  Economic & Social Research Council
  31/10/14 → 31/10/20
  Project: Standard
- Administrative Data Research Centres: ESRC - Admin Data Service - Scottish Consortium
  Kirby, G. N. C. (PI)
  1/11/13 → 31/10/18
  Project: Standard