Probabilistic-Record-Linkage-MatLab

A probabilistic approach to record linkage using the Fellegi-Sunter framework with a difference matrix.
Updated 4 Jul 2019

A probabilistic approach to record linkage using the Fellegi-Sunter framework, implemented in MATLAB. It can be used with any version of MATLAB. The functions are:

readfile % Read each of the two data files to be linked.
caldiffmatrix(a3,b3); % Calculate the difference matrix: the difference of each row of one data file from the rows of the other.
calcofu(a3,b3); % Calculate the U-probabilities from the frequency of values in the data.
calcweight(A,us); % Calculate the weight of each field from the M- and U-probabilities, assuming M = 0.99.
calcsumofweight(weight); % Calculate the sum of weights for every linked pair.
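To make the U-probability and weight steps concrete, here is a minimal sketch of the usual Fellegi-Sunter field weights on toy data. It is not taken from the repository: the variable names, the toy values, and the choice of base-2 logarithms are assumptions for illustration, and calcofu and calcweight may compute these quantities differently.

```matlab
% Hypothetical toy data: one coded field (e.g. birth month) from each file.
fieldA = [1 2 2 3 3 3];
fieldB = [1 1 2 3 3 4];

% U-probability: chance that two unrelated records agree on this field,
% estimated here from the relative frequency of each value in both files.
vals = union(fieldA, fieldB);
pA = arrayfun(@(v) mean(fieldA == v), vals);
pB = arrayfun(@(v) mean(fieldB == v), vals);
u = sum(pA .* pB);

% M-probability: chance that a true match agrees on this field.
% Fixed at 0.99, as stated in the function list above.
m = 0.99;

% Standard Fellegi-Sunter weights (base-2 logs are an assumption here).
agreeWeight    = log2(m / u);              % contribution when the field agrees
disagreeWeight = log2((1 - m) / (1 - u));  % contribution when it disagrees
```

A rare value gives a small u and therefore a large agreement weight, which is why agreement on a distinctive field counts for more than agreement on a common one.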

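Example:
b3 = readfile('b3.csv'); % second data file
a3 = readfile('a3.csv'); % first data file
A = caldiffmatrix(a3,b3); % difference matrix between the two files
us = calcofu(a3,b3); % U-probabilities
weight = calcweight(A,us); % per-field weights
sumofweight = calcsumofweight(weight); % total weight per pair (named to avoid shadowing the built-in sum)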
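For readers without the CSV files, the same sequence of steps can be sketched on small in-memory matrices. This is an illustrative stand-in, not the repository's code: the toy records, the per-field u values, and the rule of picking the highest-scoring candidate are all assumptions.

```matlab
% Hypothetical toy data: rows are records, columns are numerically coded fields.
a = [1 1990 5;
     2 1985 7];
b = [1 1990 5;
     2 1985 9;
     3 1970 1];

nA = size(a, 1);
nB = size(b, 1);
nF = size(a, 2);

% Assumed per-field probabilities (illustrative values, not estimated from data).
m = 0.99 * ones(1, nF);
u = [0.30 0.10 0.20];

wAgree    = log2(m ./ u);              % weight when a field agrees
wDisagree = log2((1 - m) ./ (1 - u));  % weight when a field disagrees

total = zeros(nA, nB);                 % summed weight for every pair (i, j)
for i = 1:nA
    for j = 1:nB
        agree = (a(i, :) - b(j, :)) == 0;   % zero difference means agreement
        w = agree .* wAgree + (~agree) .* wDisagree;
        total(i, j) = sum(w);
    end
end

% For each record in a, the index of the best-scoring record in b.
[~, best] = max(total, [], 2);
```

In practice the summed weights are usually compared against upper and lower thresholds to label pairs as links, possible links, or non-links; taking the maximum is only the simplest way to read the matrix.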

Cite As

Arindam Kumar Paul (2024). Probabilistic-Record-Linkage-MatLab (https://github.com/arindampaulripon/Probabilistic-Record-Linkage-MatLab), GitHub. Retrieved .

MATLAB Release Compatibility
Created with R2018a
Compatible with any release
Platform Compatibility
Windows macOS Linux



Version Published Release Notes
1.0.0

To view or report issues in this GitHub add-on, visit the GitHub Repository.