Researchers are figuring out how large language models work
Such insights could help make them safer, more truthful and easier to use
TO MOST PEOPLE, the inner workings of a car engine or a computer are a mystery. It might as well be a black box: never mind what goes on inside, as long as it works. Besides, the people who design and build such complex systems know how they work in great detail, and can diagnose and fix them when they go wrong. But that is not the case for large language models (LLMs), such as GPT-4, Claude and Gemini, which are at the forefront of the boom in artificial intelligence (AI).
Explore more
This article appeared in the Science & technology section of the print edition under the headline “Inside the mind of an AI”
Discover more
Deforestation is costing Brazilian farmers millions
Without trees to circulate moisture, the land is getting hotter and drier
Robots can learn new actions faster thanks to AI techniques
They could soon show their moves in settings from car factories to care homes
Scientists are learning why ultra-processed foods are bad for you
A mystery is finally being solved
Scientific publishers are producing more papers than ever
Concerns about some of their business models are building
The two types of human laugh
One is caused by tickling; the other by everything else
Scientists are building a catalogue of every type of cell in our bodies
It has thus far shed light on everything from organ formation to the causes of inflammation