Fairness and explainability for AI and Machine Learning

Fair AI and explainability challenges in AI and ML development

Gartner predicted that through 2022, 85% of AI projects would deliver erroneous outcomes due to bias in data, algorithms, or the teams managing them.
Biased data is bad for business. From discriminatory hiring algorithms to sexist credit scoring models, numerous fairness scandals prove that the bias damage is both social and financial in nature.
AI regulations are coming across the world. The European Union has already made its proposal to regulate AI and the datasets used for training to enforce the creation of fair AI and safety standards, especially for high-risk use cases.
Regulatory oversight is needed. However, companies using AI are not prepared to demonstrate compliance and offer explainability to regulators.

The status quo in fair AI and AI explainability

There are millions of AI algorithms already in production. Only a small portion of them have been audited for fairness. Most AI engineers still talk about fair AI only in the future tense. Companies putting untested, biased algorithms into production run the risk of getting into serious trouble, not only from a PR perspective but also by making bad business decisions. After all, biased data will lead to biased business decisions, underserved minority groups, and inexplicable results. From faulty pricing models in insurance to suboptimal prediction outcomes in healthcare, algorithmic fairness is still a long way from reality.

The current landscape of fair AI and AI explainability is marked by a stark discrepancy between the growing recognition of their importance and the actual efforts undertaken to address them. While academic conferences, think tanks, and even some regulatory bodies are putting an increasing focus on the need for AI to be both fair and explainable, these discussions often don't translate into actionable steps within organizations.

Many companies are still in the early stages of understanding what it means to implement fair and explainable AI systems. The common practice of simply deleting sensitive attributes like race, ethnicity, or religion from datasets is a glaring example of the superficial approaches that fail to address the root cause of the problem. This not only perpetuates biases through proxy variables but also obfuscates the decision-making process, making it even harder to audit and explain the AI model's behavior.
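The proxy problem described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the records are hand-made toy data, `zip_code` is a hypothetical proxy column, and the "model" is only a majority-vote rule per feature value rather than a real learning algorithm.

```python
from collections import defaultdict

# Toy, hand-made records (hypothetical). "group" is the sensitive attribute;
# "zip_code" is a proxy that happens to correlate strongly with it.
records = [
    {"group": "A", "zip_code": "1010", "approved": 1},
    {"group": "A", "zip_code": "1010", "approved": 1},
    {"group": "A", "zip_code": "1010", "approved": 1},
    {"group": "A", "zip_code": "2020", "approved": 0},
    {"group": "B", "zip_code": "2020", "approved": 0},
    {"group": "B", "zip_code": "2020", "approved": 0},
    {"group": "B", "zip_code": "2020", "approved": 0},
    {"group": "B", "zip_code": "1010", "approved": 1},
]


def drop_column(rows, column):
    """Return copies of the rows without the given column."""
    return [{k: v for k, v in row.items() if k != column} for row in rows]


def fit_majority_rule(rows, feature, target):
    """Learn the majority outcome per feature value (a stand-in for a model)."""
    counts = defaultdict(lambda: [0, 0])  # [count of 0 outcomes, count of 1 outcomes]
    for row in rows:
        counts[row[feature]][row[target]] += 1
    return {value: int(c[1] > c[0]) for value, c in counts.items()}


def positive_rate_by_group(rows, predictions, group_key):
    """Share of positive predictions within each group."""
    totals, positives = defaultdict(int), defaultdict(int)
    for row, pred in zip(rows, predictions):
        totals[row[group_key]] += 1
        positives[row[group_key]] += pred
    return {g: positives[g] / totals[g] for g in totals}


# "Fairness through unawareness": the sensitive column is removed before training.
training_rows = drop_column(records, "group")
rule = fit_majority_rule(training_rows, "zip_code", "approved")
predictions = [rule[row["zip_code"]] for row in training_rows]

# The audit still needs the original group labels to measure the outcome gap.
rates = positive_rate_by_group(records, predictions, "group")
print(rates)
```

Although the rule never sees `group`, its approval rates still differ sharply between the two groups because `zip_code` carries the same information. Measuring that difference also requires the very column that was deleted, which is why dropping it makes audits harder.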

The result is a landscape where algorithmic decisions, although increasingly critical in everything from loan approvals to medical diagnoses, lack both fairness and transparency. This undermines public trust in AI systems and exposes organizations to both ethical scrutiny and legal repercussions. And while tools and methods for auditing algorithms are available, their adoption remains limited, and auditing is often treated as an afterthought rather than a fundamental part of AI development. Consequently, the industry is caught in a cycle of deploying algorithms that neither the creators nor the end users fully understand or trust, perpetuating a status quo that is increasingly at odds with societal demands for fairness, accountability, and transparency.

Synthetic data for fair AI and AI explainability

Good-quality AI-generated synthetic data can reduce bias in training datasets and can thus help to create fair AI systems. Synthetic data also provides the foundations for explainable AI, or XAI. Algorithmic audits can greatly benefit from synthetic data, which is free to share with regulators and provides a window into the workings of AI algorithms. Where sensitive training data cannot be shared further, highly representative synthetic data can serve as a drop-in replacement to help with model documentation, model validation, and model certification.

For example, synthetic data generated by MOSTLY AI's synthetic data platform reduced a racial bias in crime prediction from 24% to just 1% and narrowed the gap between high-earning men and women from 20% to 2% in the US census dataset. Read the Fairness Series to learn more about how fair synthetic data can help with reducing biases!
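A gap like the one between high-earning men and women can be expressed as a difference in positive-outcome rates between two groups, often called the demographic parity difference. The text does not say which metric was used for the reported figures, so the sketch below is only one plausible reading. The function names, column names, and toy rows are hypothetical and do not reproduce the census results.

```python
def positive_rate(rows, group_key, group_value, outcome_key):
    """Share of rows in one group whose outcome flag is 1."""
    members = [row for row in rows if row[group_key] == group_value]
    if not members:
        raise ValueError(f"no rows found for {group_key}={group_value!r}")
    return sum(row[outcome_key] for row in members) / len(members)


def parity_gap(rows, group_key, outcome_key, group_a, group_b):
    """Demographic parity difference: rate(group_a) minus rate(group_b)."""
    rate_a = positive_rate(rows, group_key, group_a, outcome_key)
    rate_b = positive_rate(rows, group_key, group_b, outcome_key)
    return rate_a - rate_b


# Invented toy rows: 3 of 10 men and 1 of 10 women are flagged as high earners.
toy_rows = (
    [{"sex": "male", "high_income": 1}] * 3
    + [{"sex": "male", "high_income": 0}] * 7
    + [{"sex": "female", "high_income": 1}] * 1
    + [{"sex": "female", "high_income": 0}] * 9
)

gap = parity_gap(toy_rows, "sex", "high_income", "male", "female")
print(f"parity gap: {gap:.0%}")
```

Running the same function on an original dataset and on its fairness-adjusted synthetic counterpart would give a before-and-after comparison of this kind, assuming both share the same column layout.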

As for explainable AI, synthetic data can play a critical role in the auditing process. Auditors and regulators often require access to the data used to train a given model in order to validate its performance and assess ethical considerations. Sharing original, sensitive data might not be feasible due to privacy and regulatory constraints. However, synthetic data can be freely shared, as it retains the statistical properties of the original data without the sensitive details. Auditors and teams evaluating trained models can therefore work with synthetic data instead, enabling more transparent and fair AI systems.

Using synthetic data for such audits provides an effective and privacy-compliant way to document, validate, and certify AI models, which is vital in gaining public trust and meeting regulatory standards.
