Subscribe now

Technology

Using bigger AI training data sets may produce more racist results

Contrary to Silicon Valley wisdom, training AIs on larger data sets could worsen their tendency to replicate societal biases and racist stereotypes

By Jeremy Hsu

13 July 2023

New Scientist Default Image

Larger training sets don’t reduce bias in artificial intelligence

Shutterstock/wutzkohphoto

Many tech companies have operated under the assumption that training artificial intelligence on more data can help fix the ongoing problem of AIs replicating human prejudices. But a study has found that AIs trained on increasingly larger data sets can produce even more racist results.

Abeba Birhane at the Mozilla Foundation and her colleagues compared two data sets provided by the Large-scale Artificial Intelligence Open Network (LAION), a non-profit that offers open-source data sets for AI training. One contained 400 million…

Sign up to our weekly newsletter

Receive a weekly dose of discovery in your inbox! We'll also keep you up to date with New Scientist events and special offers.

Sign up

To continue reading, subscribe today with our introductory offers

View introductory offers

No commitment, cancel anytime*

Offer ends 2nd of July 2024.

*Cancel anytime within 14 days of payment to receive a refund on unserved issues.

Inclusive of applicable taxes (VAT)

or

Existing subscribers

Sign in to your account