GencoDymo2
Comprehensive Analysis of 'GENCODE' Annotations and Splice Site Motifs
A comprehensive suite of helper functions designed to facilitate the analysis of genomic annotations from the 'GENCODE' database <https://www.gencodegenes.org/>, supporting both human and mouse genomes. This toolkit enables users to extract, filter, and analyze a wide range of annotation features including genes, transcripts, exons, and introns across different 'GENCODE' releases. It provides functionality for cross-version comparisons, allowing researchers to systematically track annotation updates, structural changes, and feature-level differences between releases. In addition, the package can generate high-quality FASTA files containing donor and acceptor splice site motifs, which are formatted for direct input into the 'MaxEntScan' tool (Yeo and Burge, 2004 <doi:10.1089/1066527041410418>), enabling accurate calculation of splice site strength scores.
README
<!-- README.md is generated from README.Rmd. Please edit that file -->
# GencoDymo2: Comprehensive Analysis of GENCODE Annotations and Splice Site Motifs <img src="man/figures/GencoDymo2_logo.png" align="right" height="140"/>
[](https://www.repostatus.org/#active)
[](https://www.gnu.org/licenses/gpl-3.0)
[](https://github.com/monahton)
[](https://github.com/monahton/GencoDymo2/issues)
[](https://cran.r-project.org/)
[](https://monahton.github.io/GencoDymo2/)
[](https://doi.org/10.5281/zenodo.15302316)
[](https://cran.r-project.org/package=GencoDymo2)
[](https://cran.r-project.org/package=GencoDymo2)
[](https://monahton.r-universe.dev/)
[](https://monahton.r-universe.dev/)
[](https://github.com/monahton/GencoDymo2/commits/main)
------------------------------------------------------------------------
## π¦ Overview
**GencoDymo2** is an R package tailored for dynamic extraction,
exploration, and comparison of gene annotations from the
[GENCODE](https://www.gencodegenes.org) database for human and mouse
genomes. This tool facilitates:
- Automated retrieval of the latest or specific GENCODE releases
- Visualization and quantification of annotation changes across versions
- Extraction of introns, exons, splice motifs
- Generation of splice site FASTA files for tools like MaxEntScan
It bridges bioinformatics workflows and annotation dynamics, enhancing
reproducibility and comparative studies in transcriptome and splicing
research.
------------------------------------------------------------------------
## π» Installation
You can install the **stable version** of `GencoDymo2` from
[CRAN](https://cran.r-project.org/package=GencoDymo2):
``` r
# Install the stable version from CRAN
install.packages("GencoDymo2")
```
Or you can install the **development version** from GitHub for the
latest features:
``` r
#Install the development version from GitHub
install.packages("pak")
pak::pkg_install("monahton/GencoDymo2")
```
``` r
# Load the package
library(GencoDymo2)
```
------------------------------------------------------------------------
## π Getting Started
To get started, view the vignette:
``` r
vignette("GencoDymo2")
```
Or visit the documentation website:
π <https://monahton.github.io/GencoDymo2/>
π
<https://monahton.github.io/GencoDymo2/articles/GencoDymo2_vignette.html>
------------------------------------------------------------------------
## π Functions Highlights
| Function | Description |
|----|----|
| `get_latest_release()` | Retrieves the latest available GENCODE release per species |
| `compare_release()` | compare annotation statistics between releases |
| `extract_introns()` | Extracts and processes introns from annotation |
| `assign_splice_sites()` | Assign the donor and acceptor splice sites |
| `extract_ss_motif()` | Extract splicing motifs for MaxEntScan tool |
------------------------------------------------------------------------
## π οΈ Development & Contributing
**GencoDymo2** is actively developed. Contributions and suggestions are
welcome!
- π§ Open issues: <https://github.com/monahton/GencoDymo2/issues>
- π¬ Email: <aboualezz.monah@hsr.it>
- π€ Pull requests encouraged!
------------------------------------------------------------------------
## :writing_hand: Author
**Monah Abou Alezz, PhD** β <aboualezz.monah@hsr.it>.
San Raffaele Telethon Institute for Gene Therapy (SR-TIGET)
IRCCS San Raffaele Scientific Institute, Milan, Italy
π [Personal website](https://monahton.github.io)
[](https://saythanks.io/to/monahton)
[](https://linkedin.com/in/monah-abou-alezz-phd-06a948ba)
------------------------------------------------------------------------
## π§ͺ Related Projects
- [GENCODE](https://www.gencodegenes.org)
- MaxEntScan
------------------------------------------------------------------------
## π£ Acknowledgments
Developed as part of ongoing research on lncRNA splicing and gene
annotation evolution.
Special thanks to colleagues at IGM-CNR and collaborators across
splicing research projects.
Versions across snapshots
| Version | Repository | File | Size |
|---|---|---|---|
1.0.4 |
rolling linux/jammy R-4.5 | GencoDymo2_1.0.4.tar.gz |
461.5 KiB |
1.0.4 |
rolling linux/noble R-4.5 | GencoDymo2_1.0.4.tar.gz |
461.5 KiB |
1.0.4 |
rolling source/ R- | GencoDymo2_1.0.4.tar.gz |
461.5 KiB |
1.0.4 |
latest linux/jammy R-4.5 | GencoDymo2_1.0.4.tar.gz |
461.5 KiB |
1.0.4 |
latest linux/noble R-4.5 | GencoDymo2_1.0.4.tar.gz |
461.5 KiB |
1.0.4 |
latest source/ R- | GencoDymo2_1.0.4.tar.gz |
461.5 KiB |
1.0.4 |
2026-04-26 source/ R- | GencoDymo2_1.0.4.tar.gz |
461.5 KiB |
1.0.4 |
2026-04-23 source/ R- | GencoDymo2_1.0.4.tar.gz |
461.5 KiB |
Dependencies (latest)
Imports
- Biostrings (>= 2.72.1)
- BSgenome (>= 1.66.3)
- data.table (>= 1.17.0)
- dplyr (>= 1.1.4)
- GenomicRanges (>= 1.56.2)
- methods (>= 4.4.0)
- plotrix (>= 3.8.4)
- progress (>= 1.2.3)
- RCurl (>= 1.98.1.17)
- rtracklayer (>= 1.64.0)
- tidyr (>= 1.3.1)
- IRanges (>= 2.38.1)
- tools (>= 4.4.0)
- utils (>= 4.4.0)
Suggests
- devtools (>= 2.4.5)
- BSgenome.Hsapiens.UCSC.hg38 (>= 1.4.5)
- knitr
- rmarkdown
- testthat (>= 3.0.0)