💡 Announcing the MOSTLY AI and Databricks Integration
Read all about it here


Sampling is the random selection of values or complete records based on a defined probability distribution. The generation of synthetic data is based on sampling, where the underlying distribution is learned during the training of the synthetic model and from which records are sampled to create a final synthetic dataset that contains all the properties of the original dataset or meets other specific requirements, i.e. balanced minorities and minority classes.

Ready to get started?

Get started for free or get in touch with our sales team for a demo.