CLI Commands
The VeriSynth CLI is a lightweight, Python-based command-line interface for generating synthetic datasets and producing verifiable proof receipts. All commands run offline and are designed to integrate easily with Python, CI/CD pipelines, and local analytics workflows.

Base Command: verisynth
The base command generates synthetic data and outputs both:
- A synthetic dataset (`synthetic.csv`)
- A proof receipt (`proof.json`)
Global Options
| Flag | Description |
|---|---|
| `--input` | Path to input CSV file |
| `--output` | Output directory for synthetic data and proof |
| `--rows` | Number of synthetic rows to generate (default: 1000) |
| `--seed` | Random seed for deterministic reproducibility |
| `--schema` | Path to YAML schema configuration file (optional) |
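
Because the CLI is meant to be called from Python and CI/CD jobs, it can help to assemble the command line in one place. The sketch below is illustrative only: `build_verisynth_command` is a hypothetical helper, not part of VeriSynth, the file paths are made up, and it assumes each flag takes its value as a separate, space-delimited argument.

```python
import shlex
from typing import List, Optional


def build_verisynth_command(
    input_path: str,
    output_dir: str,
    rows: int = 1000,
    seed: Optional[int] = None,
    schema: Optional[str] = None,
) -> List[str]:
    """Assemble an argv list for the verisynth base command (hypothetical helper)."""
    # Flag names come from the Global Options table; the "flag value" form is assumed.
    cmd = [
        "verisynth",
        "--input", input_path,
        "--output", output_dir,
        "--rows", str(rows),
    ]
    if seed is not None:
        cmd += ["--seed", str(seed)]
    if schema is not None:
        cmd += ["--schema", schema]
    return cmd


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    cmd = build_verisynth_command("data/input.csv", "out", rows=500, seed=42)
    print(shlex.join(cmd))
    # To execute it, with verisynth installed and on PATH:
    #   import subprocess
    #   subprocess.run(cmd, check=True)
```

Passing the list to `subprocess.run` rather than a shell string avoids quoting problems when paths contain spaces.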
Schema Configuration
VeriSynth supports explicit field mapping and exclusion through YAML schema configuration files. This gives you fine-grained control over which fields to synthesize and how to handle data types.

Schema Configuration Format
Configuration Options
- `exclude`: List of field names to exclude from synthesis (e.g., IDs, addresses)
- `types`: Explicit type mappings for fields (supports: `int`, `float`, `bool`, `str`)
- `model`: Model configuration, including engine and seed
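
To make the three options concrete, here is a minimal sketch of how a loaded schema configuration might be checked before use. It is not VeriSynth's built-in validation. The `validate_schema` function, the field names, the engine value, and the `engine`/`seed` key names under `model` are all assumptions for illustration. The dictionary stands in for the result of parsing the YAML file.

```python
from typing import Any, Dict, List

# Type names listed as supported in the configuration options.
SUPPORTED_TYPES = {"int", "float", "bool", "str"}


def validate_schema(config: Dict[str, Any]) -> List[str]:
    """Return a list of problems found in a schema mapping (empty if none)."""
    errors: List[str] = []

    # exclude: expected to be a list of field names.
    exclude = config.get("exclude", [])
    if not isinstance(exclude, list) or not all(isinstance(f, str) for f in exclude):
        errors.append("exclude must be a list of field names")

    # types: expected to map each field name to one supported type name.
    types = config.get("types", {})
    if not isinstance(types, dict):
        errors.append("types must map field names to type names")
        types = {}
    for field, type_name in types.items():
        if not isinstance(type_name, str) or type_name not in SUPPORTED_TYPES:
            errors.append(f"{field}: unsupported type {type_name!r}")

    # model: expected to be a mapping (for example engine and seed).
    model = config.get("model", {})
    if not isinstance(model, dict):
        errors.append("model must be a mapping")

    return errors


# Hypothetical configuration, as it might look after loading the YAML file.
example = {
    "exclude": ["patient_id", "address"],
    "types": {"age": "int", "bmi": "float", "smoker": "bool"},
    "model": {"engine": "example-engine", "seed": 42},
}

if __name__ == "__main__":
    print(validate_schema(example))
```

Returning a list of messages instead of raising on the first problem lets a CI job report every configuration issue in a single run.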
Benefits
- Privacy: Exclude sensitive identifiers and PII
- Control: Explicit type handling instead of automatic detection
- Reproducibility: Schema configuration is included in proof receipts
- Validation: Built-in validation ensures configuration correctness