fuzzystring
Fast Fuzzy String Joins for Data Frames
Perform fuzzy joins on data frames using approximate string matching. Implements inner, left, right, full, semi, and anti joins with string distance metrics from the 'stringdist' package, including Optimal String Alignment, Levenshtein, Damerau-Levenshtein, Jaro-Winkler, q-gram, cosine, Jaccard, and Soundex. Uses a 'data.table' backend plus compiled 'C++' result assembly to reduce overhead in large joins, while adaptive candidate planning avoids unnecessary distance evaluations in single-column string joins. Suitable for reconciling misspellings, inconsistent labels, and other near-match identifiers while optionally returning the computed distance for each match.
Versions across snapshots
| Version | Repository | File | Size |
|---|---|---|---|
0.0.5 |
rolling source/ R- | fuzzystring_0.0.5.tar.gz |
276.6 KiB |
0.0.5 |
rolling linux/jammy R-4.5 | fuzzystring_0.0.5.tar.gz |
358.3 KiB |
0.0.5 |
latest source/ R- | fuzzystring_0.0.5.tar.gz |
276.6 KiB |
0.0.5 |
latest linux/jammy R-4.5 | fuzzystring_0.0.5.tar.gz |
358.3 KiB |
0.0.5 |
2026-04-23 source/ R- | fuzzystring_0.0.5.tar.gz |
276.6 KiB |
0.0.5 |
2026-04-09 windows/windows R-4.5 | fuzzystring_0.0.5.zip |
681.7 KiB |