clustringr
Cluster Strings by Edit-Distance
Returns an edit-distance based clustering of an input vector of strings. Each cluster contains a set of strings with small mutual edit distance (e.g., Levenshtein, optimal string alignment, Damerau-Levenshtein), as computed by stringdist::stringdist(). The set of all mutual edit distances is then used by graph algorithms (from package 'igraph') to single out subsets of high connectivity.
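To make the idea concrete, here is a minimal, language-neutral sketch in Python of the general approach described above. It is not the package's code or API: the function names `levenshtein` and `cluster_strings` and the `max_dist` threshold are hypothetical, and plain connected components stand in for whatever connectivity criterion the package applies through 'igraph'.

```python
from itertools import combinations


def levenshtein(a: str, b: str) -> int:
    """Levenshtein distance via row-by-row dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def cluster_strings(strings, max_dist=1):
    """Group strings into connected components of the graph whose edges
    join pairs at edit distance <= max_dist (an assumed threshold)."""
    parent = list(range(len(strings)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Link every pair of strings that is close enough.
    for i, j in combinations(range(len(strings)), 2):
        if levenshtein(strings[i], strings[j]) <= max_dist:
            parent[find(i)] = find(j)

    # Collect the members of each component, keeping input order.
    clusters = {}
    for i, s in enumerate(strings):
        clusters.setdefault(find(i), []).append(s)
    return list(clusters.values())


words = ["cat", "bat", "hat", "dog", "dig", "zebra"]
clusters = cluster_strings(words, max_dist=1)
print(clusters)
```

With this input, "cat", "bat" and "hat" end up in one group, "dog" and "dig" in another, and "zebra" on its own. The all-pairs comparison is quadratic in the number of strings, so a sketch like this is only reasonable for small inputs.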
Versions across snapshots
| Version | Repository | File | Size |
|---|---|---|---|
| 1.0 | 2026-04-09 windows/windows R-4.5 | clustringr_1.0.zip | 434.3 KiB |