eHDPrep
Quality Control and Semantic Enrichment of Datasets
A tool for the preparation and enrichment of health datasets for analysis (Toner et al. (2023) <doi:10.1093/gigascience/giad030>). Provides functionality for assessing data quality and for improving the reliability and machine interpretability of a dataset. 'eHDPrep' also enables semantic enrichment of a dataset where metavariables are discovered from the relationships between input variables determined from user-provided ontologies.
Versions across snapshots
| Version | Repository | File | Size |
|---|---|---|---|
1.4.0 |
rolling source/ R- | eHDPrep_1.4.0.tar.gz |
1.7 MiB |
1.4.0 |
latest source/ R- | eHDPrep_1.4.0.tar.gz |
1.7 MiB |
1.4.0 |
2026-04-09 windows/windows R-4.5 | eHDPrep_1.4.0.zip |
1.3 MiB |
Dependencies (latest)
Imports
- ggplot2 (>= 3.3.3)
- dplyr (>= 1.1.0)
- forcats (>= 0.5.0)
- stringr (>= 1.4.0)
- purrr (>= 0.3.4)
- tidyr (>= 1.1.2)
- kableExtra (>= 1.3.1)
- magrittr (>= 2.0.1)
- tibble (>= 3.0.5)
- scales (>= 1.1.1)
- rlang (>= 0.4.10)
- quanteda (>= 2.1.2)
- tm (>= 0.7-8)
- pheatmap (>= 1.0.12)
- igraph (>= 1.2.6)
- tidygraph (>= 1.2.0)
- readr (>= 1.4.0)
- readxl (>= 1.3.1)
- knitr (>= 1.31)