Add data

Add data to a generator

While a generator has the New status (created but training has not started), you can add tabular data to it. And you can add data from multiple sources:

  • uploaded files (CSV, Parquet)
  • databases
  • cloud storage buckets
📑

Note You cannot add data to already trained generators.

Add uploaded files

From the web application, open an untrained generator to add tables to it.

  1. On the Generators page, click + New generator.

    MOSTLY AI - Generators page - Click New generator

    Step result: You now have a generator object created in the MOSTLY AI database and the generator is listed on the Generators page.

    The Add data window appears prompting you to add tabular data for your generator to train on.

  2. Click Upload file.

    MOSTLY AI - Generators page - Click Upload file
  3. Under Upload file, drag a local file onto the box or click the box to browse your local file system.

    💡

    If you need a dataset, download one from the Datasets page.

    MOSTLY AI - Generators page - Drag to upload or click to browse
  4. (Optional) Enter a name for the table.
    The table name appears in the list of tables added to the generator. Also, the table name that you provide is what appears in each generated synthetic dataset.

    MOSTLY AI - Generators page - Name the new table
  5. Click Proceed.

Add data from a database

Use a source database connector to add tables to generators.

From the web application, open an untrained generator to add tables to it.

Steps

  1. In the generator, click Add data on the Data configuration page.
  2. Select Connect to source. MOSTLY AI - Generators Add data - 02 - Click Connect to source
  3. Select a database connector. MOSTLY AI - Generators Add data - 03 - Select a source
  4. Select a schema (optional) and table. MOSTLY AI - Generators Add data - 03 - Select a source
  5. Click Proceed.

Result

The database table is now added to your generator.

Add data from a cloud bucket

Use a source cloud storage connector to add tables to generators.

Steps

  1. In an untrained generator, click Add data.
  2. Click Connect to source. MOSTLY AI - Generators Add data from cloud storage - 02 - Select a connector
  3. Select a cloud storage connector. MOSTLY AI - Generators Add data from cloud storage - 02 - Select a connector
  4. Define the bucket path to your table, where Table path is the folder path, and Table name is the filename. MOSTLY AI - Generators Add data from cloud storage - 03 - Define table path
  5. Click Proceed.