Crandore Hub

fozziejoin

Utilities for Joining Dataframes with Inexact Matching

Provides functions for joining data frames based on inexact criteria, including string distance, Manhattan distance, Euclidean distance, and interval overlap. This API is designed as a modern, performance-oriented alternative to the 'fuzzyjoin' package (Robinson 2026) <doi:10.32614/CRAN.package.fuzzyjoin>. String distance functions utilizing 'q-grams' are adapted with permission from the 'textdistance' 'Rust' crate (Orsinium 2024) <https://docs.rs/textdistance/latest/textdistance/>. Other string distance calculations rely on the 'rapidfuzz' 'Rust' crate (Bachmann 2023) <https://docs.rs/rapidfuzz/0.5.0/rapidfuzz/>. Interval joins are backed by an Adelson-Velsky and Landis (AVL) tree as implemented by the 'interavl' 'Rust' crate <https://docs.rs/interavl/0.5.0/interavl/>.
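To make the idea of an inexact join concrete, here is a minimal conceptual sketch in Python of an inner join on string distance. It is not the fozziejoin API and does not reflect how the package is implemented: the function names `levenshtein` and `inexact_inner_join`, the `max_distance` parameter, and the `.x`/`.y` column suffixes are all assumptions made for illustration, and the nested loop is a naive stand-in for the optimized 'Rust' routines the description mentions.

```python
from typing import Dict, List


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (two-row dynamic program)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete from a
                            curr[j - 1] + 1,      # insert into a
                            prev[j - 1] + cost))  # substitute
        prev = curr
    return prev[-1]


def inexact_inner_join(left: List[Dict], right: List[Dict],
                       by: str, max_distance: int) -> List[Dict]:
    """Hypothetical helper: keep every pair of rows whose `by` values
    are within `max_distance` edits of each other (naive O(n*m) scan)."""
    out = []
    for lrow in left:
        for rrow in right:
            d = levenshtein(lrow[by], rrow[by])
            if d <= max_distance:
                # Suffix column names so both sides survive in the result.
                row = {f"{k}.x": v for k, v in lrow.items()}
                row.update({f"{k}.y": v for k, v in rrow.items()})
                row["distance"] = d
                out.append(row)
    return out


left = [{"name": "apple"}, {"name": "banana"}]
right = [{"name": "appel"}, {"name": "bananna"}, {"name": "cherry"}]
matches = inexact_inner_join(left, right, by="name", max_distance=2)
for m in matches:
    print(m)
```

The same pattern extends to the other criteria named above by swapping the distance function, for example a Manhattan or Euclidean distance over numeric columns, or an overlap test on interval endpoints.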

Versions across snapshots

Version  Repository                       File                      Size
0.0.13   rolling source/ R-               fozziejoin_0.0.13.tar.gz  2.4 MiB
0.0.13   latest source/ R-                fozziejoin_0.0.13.tar.gz  2.4 MiB
0.0.13   2026-04-23 source/ R-            fozziejoin_0.0.13.tar.gz  2.4 MiB
0.0.13   2026-04-09 windows/windows R-4.5 fozziejoin_0.0.13.zip     1.6 MiB

Dependencies (latest)

Imports

Suggests