Add data to a generator

While a generator has the New status (created, but training has not started), you can add tabular data to it. You can add data from multiple sources:

  • file upload (CSV, Parquet)
  • databases
  • cloud storage buckets

Note: You cannot add data to generators that are already trained.
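
Before uploading, it can help to confirm that a local file is in one of the listed upload formats. The following is a minimal sketch that only checks the file extension. The function name `is_supported_upload` and the extension set are assumptions made for illustration; they do not describe how MOSTLY AI validates uploads.

```python
from pathlib import Path

# Assumption: the two upload formats listed above map to these extensions.
SUPPORTED_SUFFIXES = {".csv", ".parquet"}


def is_supported_upload(filename: str) -> bool:
    """Return True if the file extension matches a listed upload format."""
    # Compare case-insensitively so "DATA.CSV" is treated like "data.csv".
    return Path(filename).suffix.lower() in SUPPORTED_SUFFIXES


print(is_supported_upload("customers.CSV"))   # True
print(is_supported_upload("orders.parquet"))  # True
print(is_supported_upload("report.xlsx"))     # False
```

An extension check says nothing about the file contents, so a file can pass this check and still fail to load as a table.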

Add via file upload

From the web application, open an untrained generator to add tables to it.

Steps

  1. On the Generators page, click + New generator.

    Step result: A generator object is created in the MOSTLY AI database, and the generator is listed on the Generators page.

    The Add data window appears, prompting you to add tabular data for your generator to train on.

  2. Click Upload file.

  3. Under Upload file, drag a local file onto the box or click the box to browse your local file system.

    Tip: If you need a dataset, download one from the Datasets page.
  4. (Optional) For Table name, change the default name of the table.
    By default, MOSTLY AI takes the table name from the uploaded file.
    Info: This name appears in the list of tables added to the generator and in every synthetic dataset created with this generator.
  5. Click Proceed.
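
The Table name step says the default name is taken from the uploaded file. The source does not state the exact rule, so the sketch below only illustrates one plausible convention: using the file name without its directory and final extension. The helper `default_table_name` is hypothetical.

```python
from pathlib import Path


def default_table_name(filename: str) -> str:
    """Derive a table name from a file name.

    Assumption: the default is the file name minus its directory and
    final extension. The actual MOSTLY AI rule may differ.
    """
    return Path(filename).stem


print(default_table_name("customers.csv"))            # customers
print(default_table_name("exports/orders.parquet"))   # orders
```

Under this convention, a file with a generic name such as `export.csv` would produce a generic table name, which is one reason to rename the table in step 4.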

Add data from a database

Use a source database connector to add tables to generators.

From the web application, open an untrained generator to add tables to it.

Steps

  1. In the generator, click Add data on the Data configuration page.
  2. Select Connect to source.
  3. Select a database connector.
    Note: If you do not have one, click + New connector to create a database connector. When you are done, the app returns you to the connector selection.
  4. Select a schema and a table.
    Tip: To search for schemas and tables, type in the Select schema and Select table boxes.
  5. Click Proceed.

Result

The database table is now added to your generator.

Add data from a cloud bucket

Use a source cloud storage connector to add tables to generators.

Steps

  1. In an untrained generator, click Add data.
  2. Click Connect to source.
  3. Select a cloud storage connector.
    Note: If you do not have one, click + New connector to create a cloud storage connector. When you are done, the app returns you to the connector selection.
  4. Define the bucket path to your table, where Table path is the folder path and Table name is the file name.
  5. Click Proceed.
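
Step 4 splits a bucket location into a folder path (Table path) and a file name (Table name). The sketch below shows that split for a full object path. The names `TableLocation` and `split_object_path`, the example path, and whether the bucket name belongs in Table path are all assumptions for illustration.

```python
from pathlib import PurePosixPath
from typing import NamedTuple


class TableLocation(NamedTuple):
    table_path: str  # folder part, entered as Table path
    table_name: str  # file part, entered as Table name


def split_object_path(object_path: str) -> TableLocation:
    """Split a slash-separated object path into folder path and file name."""
    path = PurePosixPath(object_path)
    # PurePosixPath reports "." as the parent of a bare file name;
    # treat that case as an empty folder path.
    parent = "" if str(path.parent) == "." else str(path.parent)
    return TableLocation(table_path=parent, table_name=path.name)


# Hypothetical path, used only to show the split.
location = split_object_path("my-bucket/exports/2024/customers.csv")
print(location.table_path)  # my-bucket/exports/2024
print(location.table_name)  # customers.csv
```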