AI models are teaching each other 'violent and antisocial' traits through hidden data signals, study finds — and scientists can't figure out why

Large language models (LLMs) are secretly teaching each other unwanted habits through seemingly benign training data, scientists say.

The phenomenon, known as “subliminal learning,” occurs when a pretrained “teacher” artificial intelligence (AI) model is used to generate the training data for a smaller, “student” model.

Since LLMs are often trained on their own outputs, the researchers warned that the issue could spread perpetually. “If a model is misaligned at any point in the course of AI development … then data generated by this model might transfer misalignment to later versions of the model or to other models,” the authors wrote, adding: “This could occur even if developers are careful to remove overt signs of misalignment from the data.”

What's On

FanDuel Predicts promo code: Make any trade and get $25 bonus for Brewers vs. Pirates

FIFA President ‘Incredibly Sad’ Over South African Soccer Player Jayden Adams’ Death at Age 25

Venezuela’s devastating ‘earthquake doublet’ holds a warning for California’s San Andreas Fault

Venezuela’s devastating ‘earthquake doublet’ holds a warning for California’s San Andreas Fault

James Webb telescope captures never-before-seen glimpse of ‘Centaur’ galaxy’s battle wounds — Space photo of the week

Was Einstein wrong about anything?

‘Some people called it horrifying’: ‘Dinner with King Tut’ author on using Egyptian mummification techniques on a modern-day human body

Tropical forests stop absorbing carbon dioxide during El Niño events. This year could be the worst.

‘He looked like Ramses the Great’: How experimental archaeologists used ancient techniques to mummify a modern-day person

Songbirds are in crisis as trappers and smugglers force them into lucrative bird-singing competitions

Science news this week: Time emerges inside a mini-universe, scientists thicken Arctic ice, and one of the oldest graves of a free Black person in the US found

Diagnostic dilemma: An 83-year-old man went to the hospital because of very itchy skin. It turned out he had a rare form of syphilis.

What's On

AI models are teaching each other ‘violent and antisocial’ traits through hidden data signals, study finds — and scientists can’t figure out why

Related Articles