Crandore Hub

stddiff.spark

Calculate the Standardized Difference for Numeric, Binary and Category Variables in Apache Spark

Provides functions to compute standardized differences for numeric, binary, and categorical variables on Apache Spark DataFrames using 'sparklyr'. The implementation mirrors the methods used in the 'stddiff' package but operates on distributed data. See Zhicheng Du, Yuantao Hao (2022) <doi:10.32614/CRAN.package.stddiff> for reference.

Versions across snapshots

VersionRepositoryFileSize
1.0 rolling linux/jammy R-4.5 stddiff.spark_1.0.tar.gz 44.4 KiB
1.0 rolling linux/noble R-4.5 stddiff.spark_1.0.tar.gz 44.4 KiB
1.0 rolling source/ R- stddiff.spark_1.0.tar.gz 10.3 KiB
1.0 latest linux/jammy R-4.5 stddiff.spark_1.0.tar.gz 44.4 KiB
1.0 latest linux/noble R-4.5 stddiff.spark_1.0.tar.gz 44.4 KiB
1.0 latest source/ R- stddiff.spark_1.0.tar.gz 10.3 KiB
1.0 2026-04-26 source/ R- stddiff.spark_1.0.tar.gz 10.3 KiB
1.0 2026-04-23 source/ R- stddiff.spark_1.0.tar.gz 10.3 KiB
1.0 2026-04-09 windows/windows R-4.5 stddiff.spark_1.0.zip 47.5 KiB

Dependencies (latest)

Imports

Suggests