fuzzylink
Probabilistic Record Linkage Using Pretrained Text Embeddings
Links datasets through fuzzy string matching using pretrained text embeddings. Produces more accurate record linkage when lexical string distance metrics are a poor guide to match quality (e.g., "Patricia" is more lexically similar to "Patrick" than it is to "Trish"). Capable of performing multilingual record linkage. Methods are described in Ornstein (2025) <doi:10.1017/pan.2025.10016>.
Versions across snapshots
| Version | Repository | File | Size |
|---|---|---|---|
0.4.1 |
rolling source/ R- | fuzzylink_0.4.1.tar.gz |
18.9 KiB |
0.4.1 |
rolling linux/jammy R-4.5 | fuzzylink_0.4.1.tar.gz |
61.1 KiB |
0.4.1 |
latest source/ R- | fuzzylink_0.4.1.tar.gz |
18.9 KiB |
0.4.1 |
latest linux/jammy R-4.5 | fuzzylink_0.4.1.tar.gz |
61.1 KiB |
0.4.1 |
2026-04-23 source/ R- | fuzzylink_0.4.1.tar.gz |
18.9 KiB |
0.4.1 |
2026-04-09 windows/windows R-4.5 | fuzzylink_0.4.1.zip |
63.8 KiB |