Title
Data Linkage techniques: Past, present and future.
Speaker
Dr. Peter Christen, Department of Computer Science, FEIT The Australian National University, Canberra.
Abstract
Techniques for matching, linking or integrating data are becoming increasingly popular in many organisations. While traditionally used mainly in health and statistics, today data linkage is increasingly being applied in and between government organisations to improve outcomes in taxation, census, immigration, social welfare, in crime and fraud detection, and in the assembly of terrorism intelligence.
Many businesses routinely deduplicate and link their data when compiling mailing lists, and databases containing customer data are commonly sold for marketing purposes.
Today, data linkage not only faces computational and operational challenges due to the increasing size of data collections and their complexity, but also privacy and confidentiality challenges due to growing concerns by the general public about their personal information being linked and shared between organisations.
In this talk Peter will (1) present a short history of data linkage, (2) provide an overview of various innovative linkage techniques that have been developed in the last few years, and (3) discuss the core technical research areas that need to be addressed in order to make large scale data linkage both feasible as well as acceptable by the general public.
Biography
Dr Peter Christen is a lecturer at the Department of Computer Science at the Australian National University. He received his Diploma in computer science engineering from the ETH Zurich (Switzerland) in 1995 and his PhD in computer science from the University of Basel (Switzerland) in 1999. His research interests are data mining (especially data preprocessing and data linkage), high-performance computing, and most recently security and privacy preservation (in the context of data linkage and health informatics).
In the last four years his research has concentrated on the project "Investigation and Development of Parallel Large Scale Record Linkage Techniques", an ARC Linkage project conducted in collaboration with and partially funded by the NSW Department of Health.
Location
Room G35 Ground Floor John Dedman Building 27 ANU. It is located between the Union Building and the Drill Hall. G35 is on the western side near Sullivans Creek. There is a paid parking area corner of Childers St and Hutton St. This is located near the John Dedman Building on the eastern side.