Technology

Using bigger AI training data sets may produce more racist results

Contrary to Silicon Valley wisdom, training AIs on larger data sets could worsen their tendency to replicate societal biases and racist stereotypes

By Jeremy Hsu

13 July 2023

Larger training sets don’t reduce bias in artificial intelligence
Shutterstock/wutzkohphoto

Many tech companies have operated under the assumption that training artificial intelligence on more data can help fix the ongoing problem of AIs replicating human prejudices. But a study has found that AIs trained on increasingly large data sets can produce even more racist results.

Abeba Birhane at the Mozilla Foundation and her colleagues compared two data sets provided by the Large-scale Artificial Intelligence Open Network (LAION), a non-profit that offers open-source data sets for AI training. One contained 400 million…
