Manage synthetic datasets

To manage a synthetic dataset, you must be its owner or have Editor access to it.

Status of synthetic datasets

In the table below, you can find a list of all possible statuses a synthetic dataset can have.

| Status | Description | Next actions |
|---|---|---|
| New | A synthetic dataset object exists with a default or modified configuration. Generation has not yet started. | Start generation, Delete |
| In progress | Synthetic data generation is in progress. | Cancel generation, Delete |
| Ready | The synthetic dataset has been generated successfully. | Generate, Share, Delete |
| Failed | The synthetic dataset generation started and then failed. | Delete |
| Canceled | The synthetic dataset generation was canceled while still in progress. | Delete |
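
As an illustration only, the status table can be read as a lookup from each status to the actions available for it. The sketch below is not part of any MOSTLY AI API: the names `Status`, `NEXT_ACTIONS`, and `is_allowed` are hypothetical and only mirror the rows of the table.

```python
from enum import Enum


class Status(Enum):
    """Hypothetical enum mirroring the statuses listed in the table."""
    NEW = "New"
    IN_PROGRESS = "In progress"
    READY = "Ready"
    FAILED = "Failed"
    CANCELED = "Canceled"


# Next actions per status, copied row by row from the table.
# This mapping is an illustrative assumption, not a product API.
NEXT_ACTIONS = {
    Status.NEW: ("Start generation", "Delete"),
    Status.IN_PROGRESS: ("Cancel generation", "Delete"),
    Status.READY: ("Generate", "Share", "Delete"),
    Status.FAILED: ("Delete",),
    Status.CANCELED: ("Delete",),
}


def is_allowed(status: Status, action: str) -> bool:
    """Return True if the table lists `action` as a next action for `status`."""
    return action in NEXT_ACTIONS[status]
```

For example, `is_allowed(Status.READY, "Share")` is true, while `is_allowed(Status.FAILED, "Share")` is false, because a failed synthetic dataset can only be deleted.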

Clone a synthetic dataset

You can clone a synthetic dataset in one of two ways.

Steps

  • Clone a synthetic dataset directly from the synthetic datasets page.
    1. From the synthetic datasets page, click the kebab menu of a synthetic dataset, and select Clone.
  • Clone a synthetic dataset after you open it.
    1. From the synthetic datasets page, click a synthetic dataset to open it.
    2. Click the kebab menu in the upper right, and select Clone.

Result

A new synthetic dataset is created. Its name consists of the prefix Clone - followed by the name of the original synthetic dataset.

What's next

You can now reuse the generation configuration from the original synthetic dataset and make any necessary changes before you generate again.

Delete a synthetic dataset

A synthetic dataset contains generative AI models: one tabular model for each table of data and one language model for each column with unstructured text data. Depending on the size of your original data, training new models can take a long time.

To delete a synthetic dataset, you first need to open it.

Steps

  1. From the synthetic datasets page, click a synthetic dataset to open it.
  2. Click the kebab menu in the upper right.
  3. Select Delete.
  4. Click Yes in the confirmation dialog.

Result

The synthetic dataset is now deleted.