
Connect to Google Cloud object storage

To use datasets that you keep in Google Cloud storage buckets as a data source for synthetic data, you need to create a Google Cloud connector in MOSTLY AI.

If you want to keep the generated synthetic data in a separate bucket, you need a second Google Cloud connector that points to that bucket.

Prerequisites

To create a Google Cloud bucket connector, you need the contents of your Google Cloud key file and the name of the bucket where you keep your original data.

Download your Google Cloud storage key file

  1. In the Google Cloud console, open the main sidebar menu and select APIs & Services > Enabled APIs & services.
  2. From the sidebar, select Credentials.
  3. Click your service account.
  4. Select the KEYS tab.
  5. Click ADD KEY and select Create new key.
  6. In the prompt, select JSON and click Create.
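
Before you paste the key into MOSTLY AI, it can help to confirm that the downloaded file is a well-formed service account key. The following is a minimal sketch, not part of MOSTLY AI: the function name check_key_text is hypothetical, and the list of required fields is an assumption based on the usual layout of a Google Cloud service account key file.

```python
import json

# Assumption: fields that a service account key file normally contains.
REQUIRED_FIELDS = ("type", "project_id", "private_key", "client_email")


def check_key_text(text):
    """Return a list of problems found in the key file contents (empty if none)."""
    try:
        key = json.loads(text)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc}"]
    if not isinstance(key, dict):
        return ["top-level JSON value is not an object"]

    # Report every required field that is absent or empty.
    problems = [f"missing field: {name}" for name in REQUIRED_FIELDS if not key.get(name)]

    # A key created from the KEYS tab of a service account should have this type.
    key_type = key.get("type")
    if key_type and key_type != "service_account":
        problems.append(f"unexpected type: {key_type!r}")
    return problems
```

To use it, read the downloaded file as text and pass the contents to check_key_text. An empty list means the basic structure looks right; it does not prove that the key is still active or that the service account can access your bucket.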

Create a Google Cloud object storage connector

  1. From the Connectors tab, click Create connector. The Create connector drawer appears on the right.
  2. On the Connect to cloud storage tab, select Google cloud storage.
  3. On the Create Google cloud connector page, configure the connector.
    1. For Connector name, enter a name that you can distinguish from other connectors.

      💡

      A combination of Google Cloud + _BUCKET_ might help you identify this connector among other Google Cloud storage connectors.

    2. For Connection type, select whether you want to use the connector as a source or destination.

      You can select only data source connectors when you create a new catalog.

      Similarly, you can select only data destination connectors when you configure a destination for the new synthetic dataset.

    3. For Key file, paste the contents of your Google Cloud Storage key file.

    4. For Bucket, enter the name of the Google Cloud Storage bucket.

  4. Click Test connection to make sure the configuration is correct. A success message confirms that MOSTLY AI connected to Google Cloud object storage and found the bucket you provided.
  5. Click Save to save your new Google Cloud bucket connector.
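
If the connection test fails, a mistyped bucket name is one thing to rule out. The sketch below is a rough local pre-check that assumes the Bucket field expects the bare bucket name and applies a subset of Google Cloud's published bucket naming rules. The function name bucket_name_problems is hypothetical, and the check cannot tell you whether the bucket exists or whether the service account can read it.

```python
import re

# Lowercase letters, digits, '-', '_' and '.', starting and ending with a letter or digit.
_NAME_RE = re.compile(r"[a-z0-9][a-z0-9._-]*[a-z0-9]")


def bucket_name_problems(name):
    """Return a list of naming problems (empty if the name passes these basic checks)."""
    problems = []
    if not _NAME_RE.fullmatch(name):
        problems.append(
            "use only lowercase letters, digits, '-', '_' and '.', "
            "starting and ending with a letter or digit"
        )
    # Names with dots may be longer overall, but each part is still limited.
    limit = 222 if "." in name else 63
    if not 3 <= len(name) <= limit:
        problems.append(f"length must be between 3 and {limit} characters")
    if any(len(part) > 63 for part in name.split(".")):
        problems.append("each dot-separated part must be at most 63 characters")
    if name.startswith("goog") or "google" in name:
        problems.append("names may not start with 'goog' or contain 'google'")
    return problems
```

For example, bucket_name_problems("my-source-data") returns an empty list, while a value such as "gs://my-source-data" is flagged because it contains characters that a bare bucket name cannot have.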

What's next

You can now use the Google Cloud storage connector as a data source when you create a new catalog.

You can also use the Google Cloud storage connector as a destination.

You can use different types of data sources and destinations for a synthetic dataset. For example, if your data source is a Google Cloud storage bucket, you can deliver the generated synthetic data to any of the supported databases or cloud storage providers.