Crandore Hub

fuzzystring

Fast Fuzzy String Joins for Data Frames

Perform fuzzy joins on data frames using approximate string matching. Implements inner, left, right, full, semi, and anti joins with string distance metrics from the 'stringdist' package, including Optimal String Alignment, Levenshtein, Damerau-Levenshtein, Jaro-Winkler, q-gram, cosine, Jaccard, and Soundex. Uses a 'data.table' backend plus compiled 'C++' result assembly to reduce overhead in large joins, while adaptive candidate planning avoids unnecessary distance evaluations in single-column string joins. Suitable for reconciling misspellings, inconsistent labels, and other near-match identifiers while optionally returning the computed distance for each match.

Versions across snapshots

VersionRepositoryFileSize
0.0.5 rolling source/ R- fuzzystring_0.0.5.tar.gz 276.6 KiB
0.0.5 rolling linux/jammy R-4.5 fuzzystring_0.0.5.tar.gz 358.3 KiB
0.0.5 latest source/ R- fuzzystring_0.0.5.tar.gz 276.6 KiB
0.0.5 latest linux/jammy R-4.5 fuzzystring_0.0.5.tar.gz 358.3 KiB
0.0.5 2026-04-23 source/ R- fuzzystring_0.0.5.tar.gz 276.6 KiB
0.0.5 2026-04-09 windows/windows R-4.5 fuzzystring_0.0.5.zip 681.7 KiB

Dependencies (latest)

Imports

LinkingTo

Suggests