M5: Data Analysis Algorithms & Implementations - Record Linkage¶
This fifth module provides an introduction to identifying and linking entities in different sets together using a algorithm known as record linkage. We will explore the various algorithms for performing record linkage and when to use them.
Pre-recorded Lectures¶
The lectures on record linkage will be done in class this week. However, there might be some follow up videos provided later this week to cover anything missed or bonus videos related to the group project.
Supplementary Resources¶
Useful book: Big Data and Social Science: A Practical Guide to Methods and Tools (Chapman & Hall/CRC Statistics in the Social and Behavioral Sciences) by Foster, Ghani et al. On reserve in Regenstein for the quarter. See Chapter #3 on Record Linkage.
Programming Assignment¶
Programming Assignment #4, due Saturday, February 19th at 4:30pm CDT