Crandore Hub

fuzzystring

Fast Fuzzy String Joins for Data Frames

Perform fuzzy joins on data frames using approximate string matching. Implements inner, left, right, full, semi, and anti joins with string distance metrics from the 'stringdist' package, including Optimal String Alignment, Levenshtein, Damerau-Levenshtein, Jaro-Winkler, q-gram, cosine, Jaccard, and Soundex. Uses a 'data.table' backend plus compiled 'C++' result assembly to reduce overhead in large joins, while adaptive candidate planning avoids unnecessary distance evaluations in single-column string joins. Suitable for reconciling misspellings, inconsistent labels, and other near-match identifiers while optionally returning the computed distance for each match.

Versions across snapshots

VersionRepositoryFileSize
0.0.5 rolling linux/jammy R-4.5 fuzzystring_0.0.5.tar.gz 358.3 KiB
0.0.5 rolling linux/noble R-4.5 fuzzystring_0.0.5.tar.gz 360.8 KiB
0.0.5 rolling source/ R- fuzzystring_0.0.5.tar.gz 276.6 KiB
0.0.5 latest linux/jammy R-4.5 fuzzystring_0.0.5.tar.gz 358.3 KiB
0.0.5 latest linux/noble R-4.5 fuzzystring_0.0.5.tar.gz 360.8 KiB
0.0.5 latest source/ R- fuzzystring_0.0.5.tar.gz 276.6 KiB
0.0.5 2026-04-26 source/ R- fuzzystring_0.0.5.tar.gz 276.6 KiB
0.0.5 2026-04-23 source/ R- fuzzystring_0.0.5.tar.gz 276.6 KiB
0.0.5 2026-04-09 windows/windows R-4.5 fuzzystring_0.0.5.zip 681.7 KiB

Dependencies (latest)

Imports

LinkingTo

Suggests