Crandore Hub

syntheticdata

Synthetic Clinical Data Generation and Privacy-Preserving Validation

Generates synthetic clinical datasets that preserve statistical properties while reducing re-identification risk. Implements Gaussian copula simulation, bootstrap with noise injection, and Laplace noise perturbation, with built-in utility and privacy validation metrics. Useful for privacy-aware data sharing in multi-site clinical research. Validates synthetic data quality via distributional similarity (Kolmogorov-Smirnov), discriminative accuracy (real-vs-synthetic classifier), and nearest-neighbor privacy ratio. Methods described in Jordon et al. (2022) <doi:10.48550/arXiv.2205.03257> and Snoke et al. (2018) <doi:10.1111/rssa.12358>.

Versions across snapshots

VersionRepositoryFileSize
0.1.0 rolling linux/jammy R-4.5 syntheticdata_0.1.0.tar.gz 186.8 KiB
0.1.0 rolling linux/noble R-4.5 syntheticdata_0.1.0.tar.gz 186.7 KiB
0.1.0 rolling source/ R- syntheticdata_0.1.0.tar.gz 152.0 KiB
0.1.0 latest linux/jammy R-4.5 syntheticdata_0.1.0.tar.gz 186.8 KiB
0.1.0 latest linux/noble R-4.5 syntheticdata_0.1.0.tar.gz 186.7 KiB
0.1.0 latest source/ R- syntheticdata_0.1.0.tar.gz 152.0 KiB
0.1.0 2026-04-26 source/ R- syntheticdata_0.1.0.tar.gz 152.0 KiB
0.1.0 2026-04-23 source/ R- syntheticdata_0.1.0.tar.gz 152.0 KiB
0.1.0 2026-04-09 windows/windows R-4.5 syntheticdata_0.1.0.zip 191.2 KiB

Dependencies (latest)

Imports

Suggests