Crandore Hub

EnvNJ

Whole Genome Phylogenies Using Sequence Environments

Contains utilities for the analysis of protein sequences in a phylogenetic context. Allows the generation of phylogenetic trees base on protein sequences in an alignment-independent way. Two different methods have been implemented. One approach is based on the frequency analysis of n-grams, previously described in Stuart et al. (2002) <doi:10.1093/bioinformatics/18.1.100>. The other approach is based on the species-specific neighborhood preference around amino acids. Features include the conversion of a protein set into a vector reflecting these neighborhood preferences, pairwise distances (dissimilarity) between these vectors, and the generation of trees based on these distance matrices.

Versions across snapshots

VersionRepositoryFileSize
0.1.3 rolling source/ R- EnvNJ_0.1.3.tar.gz 117.8 KiB
0.1.3 latest source/ R- EnvNJ_0.1.3.tar.gz 117.8 KiB
0.1.3 2026-04-23 source/ R- EnvNJ_0.1.3.tar.gz 117.8 KiB
0.1.3 2026-04-09 windows/windows R-4.5 EnvNJ_0.1.3.zip 145.7 KiB

Dependencies (latest)

Imports

Suggests