syntheticdata
Synthetic Clinical Data Generation and Privacy-Preserving Validation
Generates synthetic clinical datasets that preserve statistical properties while reducing re-identification risk. Implements Gaussian copula simulation, bootstrap with noise injection, and Laplace noise perturbation, with built-in utility and privacy validation metrics. Useful for privacy-aware data sharing in multi-site clinical research. Validates synthetic data quality via distributional similarity (Kolmogorov-Smirnov), discriminative accuracy (real-vs-synthetic classifier), and nearest-neighbor privacy ratio. Methods described in Jordon et al. (2022) <doi:10.48550/arXiv.2205.03257> and Snoke et al. (2018) <doi:10.1111/rssa.12358>.
Versions across snapshots
| Version | Repository | File | Size |
|---|---|---|---|
0.1.0 |
rolling linux/jammy R-4.5 | syntheticdata_0.1.0.tar.gz |
186.8 KiB |
0.1.0 |
rolling linux/noble R-4.5 | syntheticdata_0.1.0.tar.gz |
186.7 KiB |
0.1.0 |
rolling source/ R- | syntheticdata_0.1.0.tar.gz |
152.0 KiB |
0.1.0 |
latest linux/jammy R-4.5 | syntheticdata_0.1.0.tar.gz |
186.8 KiB |
0.1.0 |
latest linux/noble R-4.5 | syntheticdata_0.1.0.tar.gz |
186.7 KiB |
0.1.0 |
latest source/ R- | syntheticdata_0.1.0.tar.gz |
152.0 KiB |
0.1.0 |
2026-04-26 source/ R- | syntheticdata_0.1.0.tar.gz |
152.0 KiB |
0.1.0 |
2026-04-23 source/ R- | syntheticdata_0.1.0.tar.gz |
152.0 KiB |
0.1.0 |
2026-04-09 windows/windows R-4.5 | syntheticdata_0.1.0.zip |
191.2 KiB |