💡 Download the complete guide to AI-generated synthetic data!
Go to the ebook
August 16, 2023
1m 38s

Getting Started with MOSTLY AI - Configuring Primary Keys in Synthetic Data Generation


In this video, we'll be discussing how to configure primary keys in synthetic data generation using MOSTLY AI's synthetic data platform.

If you're new to MOSTLY AI and you're looking to start synthesizing primary keys, then this video is for you! We'll help you pick the right synthetic data generation method and get started with synthesizing primary keys. After watching this video, you'll be well on your way to generating accurate and privacy-safe synthetic data using MOSTLY AI!

Get started for free here ➡️ https://bit.ly/43IGYSv


[00:00:00] Hi, everyone. In this video, we'll talk about a specific generation method; namely, how you can synthesize primary keys.

[00:00:08] I've provided here a data set, a table that contains an ID column, and it just contains an integer variable that is an ID for this User Table here, and that's a very common thing that you will encounter in datasets out there.

[00:00:24] The platform automatically detects this as numeric, which could be fine, but I want to specifically make sure this is treated as a primary key, so I go here into Generation Method and I pick Primary Key, and then there are four different generation formats that I can pick; Sequential, UUID, UUID dashless, and UUID short.

[00:00:44] To show you what that actually looks like when it's synthesized, I have synthesized this table here with those four different settings. Let's look into that, what that looks like. Sequential is just an integer, you see it here in the second column, starting at 1 and then growing from there, probably like the original data.

[00:01:04] UUID is the universally unique identifier. It's a standard format, very commonly used in databases. It contains 36 characters including four dashes. That's what it looks like.

[00:01:16] UUID dashless is, as the name implies, the UUID without the dashes. Then, UUID short is actually a proprietary hash that the platform creates. That's a little shorter. Depending on the use case, you can pick either Sequential, UUID, UUID dashless, or the UUID short. Thanks for watching.

Ready to try synthetic data?

The best way to learn about synthetic data is to experiment with synthetic data generation. Try it for free or get in touch with our sales team for a demo.