Crandore Hub

ngram

Fast n-Gram 'Tokenization'

An n-gram is a sequence of n "words" taken, in order, from a body of text. This package is a collection of utilities for creating, displaying, summarizing, and "babbling" n-grams. The tokenization and babbling are handled by efficient C code, which can also be built as a standalone library. The babbler is a simple Markov chain. The package also includes a vignette with complete example workflows and information about the utilities it offers.
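To make the description above concrete, here is a minimal Python sketch of the two ideas it names: splitting text into n-grams and "babbling" new text with a simple Markov chain. It is a conceptual illustration only, not the package's C implementation or its R interface. The function names (`ngrams`, `build_chain`, `babble`) and the whitespace-based word splitting are assumptions made for this example.

```python
import random
from collections import defaultdict


def ngrams(text, n):
    """Return every run of n consecutive whitespace-separated words, in order."""
    words = text.split()
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


def build_chain(text, n):
    """Map each n-gram to the list of words observed immediately after it."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - n):
        chain[tuple(words[i:i + n])].append(words[i + n])
    return chain


def babble(text, n, length, seed=None):
    """Generate `length` words by walking the n-gram Markov chain (hypothetical helper)."""
    rng = random.Random(seed)
    chain = build_chain(text, n)
    if not chain:
        return ""
    state = rng.choice(list(chain))
    out = list(state)
    while len(out) < length:
        followers = chain.get(state)
        if not followers:
            # Dead end (the n-gram only occurs at the end of the text):
            # restart from a randomly chosen n-gram.
            state = rng.choice(list(chain))
            out.extend(state)
            continue
        nxt = rng.choice(followers)
        out.append(nxt)
        # Slide the window forward by one word.
        state = state[1:] + (nxt,)
    return " ".join(out[:length])
```

For example, `ngrams("a b a c", 2)` yields the three bigrams `("a", "b")`, `("b", "a")`, and `("a", "c")`. Because each follower is stored once per occurrence, drawing uniformly from the follower list samples the next word in proportion to how often it followed that n-gram in the input.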

Versions across snapshots

Version  Repository                        File                Size
3.2.3    rolling linux/jammy R-4.5         ngram_3.2.3.tar.gz  343.2 KiB
3.2.3    rolling linux/noble R-4.5         ngram_3.2.3.tar.gz  343.1 KiB
3.2.3    rolling source/ R-                ngram_3.2.3.tar.gz  243.0 KiB
3.2.3    latest linux/jammy R-4.5          ngram_3.2.3.tar.gz  343.2 KiB
3.2.3    latest linux/noble R-4.5          ngram_3.2.3.tar.gz  343.1 KiB
3.2.3    latest source/ R-                 ngram_3.2.3.tar.gz  243.0 KiB
3.2.3    2026-04-26 source/ R-             ngram_3.2.3.tar.gz  243.0 KiB
3.2.3    2026-04-23 source/ R-             ngram_3.2.3.tar.gz  243.0 KiB
3.2.3    2026-04-09 windows/windows R-4.5  ngram_3.2.3.zip     349.2 KiB
3.2.3    2025-04-20 source/ R-             ngram_3.2.3.tar.gz  243.0 KiB

Dependencies (latest)

Imports