Privacy and security

Privacy & security in MOSTLY AI's Synthetic Data Platform

Privacy-by-design and maximum security for each and every one of your synthetic datasets.

Get started free

No 1:1 relationship to
the original data

In contrast to traditional anonymization techniques, MOSTLY AI uses your original data only as learning material to train Generative AI models. During training, the models learn the patterns, distributions, correlations, and other statistical characteristics of your original data. MOSTLY AI then uses the AI models to generate synthetic data from scratch. Thus a synthetic record cannot be linked back to one specific original data record. Instead it is based upon the input of what was generalized by the model from all original data records.

Model overfitting
prevention

We've implemented a robust mechanism to prevent our Generative AI models from memorizing individual properties and patterns. Our approach involves carefully designed loss functions and validation criteria, all aimed at ensuring generalization and guarding against overfitting. The models only learn general patterns of the original data, but no specific individual data points.

Random draw
synthesis

To generate synthetic data, the MOSTLY AI Synthetic Data Platform generates new samples with random draws against the trained AI models. Let's consider a simplified example with a column like 'Gender,' which has categories like Male, Female, Other, and N/A. The model learns the distribution, like 47% females, 45% males, 7% other, and 1% N/As. During a draw, the chance of 'Male' is about 4-5 times out of 10.

As mentioned, this example is overly simplified as during each random draw, the MOSTLY AI Synthetic Data Platform considers not only the distribution of a single column but also all statistical characteristics and the relationships between each column of the original data.

Get started free

Privacy protection mechanisms

Our commitment to privacy extends to safeguarding against re-identification risks, especially in scenarios involving rare categories, extreme values, and extended sequence lengths.

Learn more in the MOSTLY AI Documentation

Rare category protection

The Platform uses rare category protection for categorical columns, preventing the AI model from being trained with rare values. To maintain the original data's correlation and distribution, we substitute these values with the category "_RARE_".

Extreme value protection

The Platform applies extreme value protection to numerical and date-time columns. Before training, it removes minimum and maximum outliers from these columns to prevent exceptional cases from appearing in the synthetic data.

Extreme sequence length protection

The Platform removes excessive linked records that could lead back to a subject in a subject table. Long sequences can pose a privacy risk, so they are removed before training of the Generative AI model.

Privacy settings by default

In the MOSTLY AI Synthetic Data Platform, all configuration settings to protect data privacy are on by default, so you can rest assured.

Why should you trust MOSTLY AI's synthetic data?

SOC 2 and ISO 27001 certified solution with maximum security

Continuous external audits and legal assessments for compliance

The highest data anonymization standards

Complies with the requirements of GDPR, CCPA, CPRA, HIPAA, PDPA, & APPI

Available for on-premises installations, including in air-gapped environments, or deployed in private cloud infrastructures

Request a demo

Trusted by leading brands and data privacy experts

“We see synthetic data as the foundation for all future data-driven development, as it provides the only GDPR-compliant method for unlocking advanced analytics and insights based on customer data.

This partnership with MOSTLY AI is a logical step on our journey towards increasing customer success and satisfaction. We are preparing the bank for a data- and purpose-driven future in which synthetic data continuously protects and serves customers.”
Dietmar Böckmann
Managing Director, Erste Digital
“Therefore MOSTLY AI Synthetic Data does not contain any personal data within the meaning of Art. 4 (1) GDPR. In accordance with Recital 26 GDPR, customers are therefore permitted to freely process and share these anonymous synthetic datasets, as they are not subject to the rules of GDPR.”
Taylor Wessing
Legal Assessment
“The analysis above allows us to conclude that the data synthetization performed by the MOSTLY AI Synthetic Data Software adequately addresses the risks of identity disclosure, attribute disclosure, and membership disclosure.”
SBA Research
Technical Assessment

GDPR-compliant data anonymization by default

MOSTLY AI's Synthetic Data Platform provides complete anonymization by default. Preconfigured settings for non-expert users eliminate human error. Automatic, state-of-the-art privacy mechanisms ensure that your data is safe and your customers will be protected against threats.

No risk of data breaches & data fines

With MOSTLY AI's synthetic data it is not possible to single out an individual, link records relating to an individual, or infer information concerning an individual. Your customer data is kept safe and secure. Synthetic data helps eliminate the risk of data breaches and privacy fines.

Ready to try synthetic data?

The best way to learn about synthetic data is to experiment with synthetic data generation. Try it for free or get in touch with our sales team for a demo.

Get started free Request a demo

Name	Borlabs Cookie
Provider	Owner of this website, Imprint
Purpose	Saves the visitors preferences selected in the Cookie Box of Borlabs Cookie.
Cookie Name	borlabs-cookie
Cookie Expiry	1 Year

Name	Google Tag Manager
Provider	Google Ireland Limited, Gordon House, Barrow Street, Dublin 4, Ireland
Purpose	Used to control advanced script and event handling.
Privacy Policy	https://policies.google.com/privacy?hl=en
Cookie Name	-

Accept	Google Analytics
Name	Google Analytics
Provider	Google Ireland Limited, Gordon House, Barrow Street, Dublin 4, Ireland
Purpose	Cookie by Google used for website analytics. Generates statistical data on how the visitor uses the website.
Privacy Policy	https://policies.google.com/privacy?hl=en
Cookie Name	_ga,_gat,_gid
Cookie Expiry	2 Years

Accept	LinkedIn Insight Tag (LinkedIn Pixel)
Name	LinkedIn Insight Tag (LinkedIn Pixel)
Provider	LinkedIn Corporation
Purpose	The LinkedIn Insight Tag is a lightweight JavaScript tag that powers conversion tracking, website audiences, and website demographics for LinkedIn ad campaigns.
Privacy Policy	https://www.linkedin.com/legal/privacy-policy
Cookie Expiry	2 Years

Accept	Hotjar
Name	Hotjar
Provider	Hotjar Ltd., Dragonara Business Centre, 5th Floor, Dragonara Road, Paceville St Julian's STJ 3141 Malta
Purpose	Hotjar is an user behavior analytic tool by Hotjar Ltd.. We use Hotjar to understand how users interact with our website.
Privacy Policy	https://www.hotjar.com/legal/policies/privacy/
Host(s)	*.hotjar.com
Cookie Name	_hjClosedSurveyInvites, _hjDonePolls, _hjMinimizedPolls, _hjDoneTestersWidgets, _hjIncludedInSample, _hjShownFeedbackMessage, _hjid, _hjRecordingLastActivity, hjTLDTest, _hjUserAttributesHash, _hjCachedUserAttributes, _hjLocalStorageTest, _hjptid
Cookie Expiry	Session / 1 Year

Accept	Mixpanel
Name	Mixpanel
Provider	Mixpanel S.L
Purpose	We utilize Mixpanel cookies to gather data regarding your usage of our website. This information enables us to comprehend your interests, enhance our products and services, and deliver an improved user experience on our website.
Privacy Policy	https://mixpanel.com/legal/privacy-policy
Host(s)	*.mostly.ai, mostly.ai
Cookie Name	_mp

Accept	HubSpot
Name	HubSpot
Provider	HubSpot Inc., 25 First Street, 2nd Floor, Cambridge, MA 02141, USA
Purpose	HubSpot is a user database management service provided by HubSpot, Inc. We use HubSpot on this website for our online marketing activities.
Privacy Policy	https://legal.hubspot.com/privacy-policy
Host(s)	*.hubspot.com, hubspot-avatars.s3.amazonaws.com, hubspot-realtime.ably.io, hubspot-rest.ably.io, js.hs-scripts.com
Cookie Name	__hs_opt_out, __hs_d_not_track, hs_ab_test, hs-messages-is-open, hs-messages-hide-welcome-message, __hstc, hubspotutk, __hssc, __hssrc, messagesUtk
Cookie Expiry	Session / 30 Minutes / 1 Day / 1 Year / 13 Months

Accept	VWO
Name	VWO
Provider	VWO, Wingify Software Pvt., Heidenkampsweg 58, Hamburg, 20097, Germany
Purpose	VWO allows website owners to conduct A/B testing, create heatmaps, and track user behavior to optimize their website's performance and user experience. We use VWO on this website for our online marketing activities.
Privacy Policy	https://vwo.com/privacy-policy/
Host(s)	mostly.ai
Cookie Name	_vis_opt_exp_#_goal_#, _vis_opt_test_cookie, _vis_opt_exp_#_combi, _vis_opt_exp_#_exclude, _vis_opt_exp_#_split, _vis_opt_s, _vis_opt_out, _vwo_uuid, _vwo_uuid_#, _vwo_ds, _vwo_sn, _vwo_uuid_v2, _vis_opt_exp_#_combi_choose, _vwo_referrer, _vwo, wingify_push_db_status, wingify_push_subscription_id, wingify_push_subscription_endpoint, pushcrew_opt_out, wingify_push_do_not_show_notification_popup, pshcrw_update_subId, wingify_push_subscription_status, wingify_push_subscriber_lang, wingify_donot_track_actions, wingify_do_not_show_chicklet, _wingify_pc_uuid, wingifyEcomData-, wingify_push_gcm_id, wingifyRetrySegment-, wingifySegment-*, pshcrw_v_k, wingify_push_subscriber_id, _vwo_global_opt_out, _vwo_ssm