Quickstart
Let’s generate your first verifiable synthetic dataset — right from your terminal. VeriSynth runs 100% locally, needs no GPU, and produces both a synthetic dataset and a cryptographic proof receipt for auditability.1. Install VeriSynth Core
2. Prepare your dataset
Start with any CSV file — for example, a small tabular dataset of patients, transactions, or IoT readings.| patient_id | age | bmi | smoker | hba1c | systolic_bp | diastolic_bp |
|---|---|---|---|---|---|---|
| 001 | 62 | 31.4 | 1 | 7.8 | 148 | 90 |
| 002 | 45 | 28.6 | 0 | 6.1 | 132 | 82 |
| 003 | 33 | 24.1 | 0 | 5.3 | 116 | 74 |
3. Generate synthetic data
Run VeriSynth directly from the CLI:4. Inspect the proof receipt
Each run creates a verifiable JSON proof describing:- Dataset hashes (SHA-256)
- Model configuration (Gaussian Copula)
- Random seed (for reproducibility)
- Privacy and correlation metrics
- Merkle root for integrity checking
5. Explore your synthetic dataset
The generated data preserves the statistical structure of your original dataset — correlations, distributions, and realistic relationships — without containing any real records. Try viewing it in pandas:6. Reproduce your results
Because VeriSynth uses deterministic seeding (--seed), you can always reproduce the exact same synthetic dataset and proof receipt from the same input file.
seed value will yield identical Merkle roots and metrics, proving the process is verifiable and consistent.