Crandore Hub

cleanNLP

A Tidy Data Model for Natural Language Processing

Provides a set of fast tools for converting a textual corpus into a set of normalized tables. Users may make use of the 'udpipe' back end with no external dependencies, or a Python back ends with 'spaCy' <https://spacy.io>. Exposed annotation tasks include tokenization, part of speech tagging, named entity recognition, and dependency parsing.

README

# Repository for language models

Versions across snapshots

VersionRepositoryFileSize
3.1.0 2026-04-09 windows/windows R-4.5 cleanNLP_3.1.0.zip 3.2 MiB

Dependencies (latest)

Imports

Suggests