masque: Structurally Faithful Development Surrogates for Tabular Data
Source:R/masque-package.R
masque-package.Rdmasque turns a single tabular dataset into a structurally faithful
synthetic clone suitable for pipeline development. Experimental-design
columns and the NA pattern are preserved exactly; treatment and
categorical-covariate level vocabularies are optionally aliased; outcome
and numeric-covariate values are re-simulated via a Gaussian copula that
preserves the global covariance structure. A private recipe object
round-trips a pipeline written against the synthetic clone onto the
original data.
Honest claim
masque is not a differential-privacy or anonymisation tool. Its outputs
are development surrogates: structurally faithful enough that pipeline code
runs unchanged, controlled enough that raw values are not exposed in
collaborate mode. See vignette("confidentiality", package = "masque") for
the full threat model and limitations.
Two modes
mode = "local": owner's realistic dev surrogate; preserves names and level vocabularies; may emit observed values; not for external sharing.mode = "collaborate": opaque aliasing for treatment and categorical-covariate levels; within-resolution jitter on numerics; integers stochastically rounded;audit_mask()auto-runs at construction.
Author
Maintainer: Max Moldovan max.moldovan@gmail.com (ORCID) (Adelaide University)