Crandore Hub

fuzzylink

Probabilistic Record Linkage Using Pretrained Text Embeddings

Links datasets through fuzzy string matching using pretrained text embeddings. Produces more accurate record linkage when lexical string distance metrics are a poor guide to match quality (e.g., "Patricia" is more lexically similar to "Patrick" than it is to "Trish"). Capable of performing multilingual record linkage. Methods are described in Ornstein (2025) <doi:10.1017/pan.2025.10016>.

Versions across snapshots

VersionRepositoryFileSize
0.4.1 rolling source/ R- fuzzylink_0.4.1.tar.gz 18.9 KiB
0.4.1 rolling linux/jammy R-4.5 fuzzylink_0.4.1.tar.gz 61.1 KiB
0.4.1 latest source/ R- fuzzylink_0.4.1.tar.gz 18.9 KiB
0.4.1 latest linux/jammy R-4.5 fuzzylink_0.4.1.tar.gz 61.1 KiB
0.4.1 2026-04-23 source/ R- fuzzylink_0.4.1.tar.gz 18.9 KiB
0.4.1 2026-04-09 windows/windows R-4.5 fuzzylink_0.4.1.zip 63.8 KiB

Dependencies (latest)

Imports