
OpenAI’s Jan Leike is trying to ensure superintelligent AI remains on our side

It might just be the most important job in the world.

Illustrated portrait of Jan Leike. (Lauren Tamaki for Vox)


Kelsey Piper
Kelsey Piper is a contributing editor at Future Perfect, Vox’s effective altruism-inspired section on the world’s biggest challenges. She explores wide-ranging topics like climate change, artificial intelligence, vaccine development, and factory farms, and also writes the Future Perfect newsletter.

OpenAI, the maker of ChatGPT, believes it’s on the cusp of transforming our world with powerful AI systems. At minimum, it thinks these systems will fundamentally change how we work and live. At maximum, they could make our world unrecognizable overnight.

To make this go well, instead of catastrophically badly, OpenAI has created what it calls the superalignment team, which tries to understand how to make superhuman AI do what we want, instead of doing its own thing.

The team head is Jan Leike, a machine learning researcher who worked at Google’s DeepMind before joining OpenAI. His team is in a race against time: The goal is to figure out how to align powerful AI systems before unaligned powerful AI systems get developed. (An AI system is “aligned” if it’s trying to do the things that humans want, and “unaligned” if it’s trying to do other things outside our control. A big, unanswered question is how well we can tell what our AI systems are trying to do at all.)

“I think alignment is tractable,” Leike told Rob Wiblin on the 80,000 Hours podcast this August. “I think we can actually make a lot of progress if we focus on it and put effort into it. … Honestly, it really feels like we have a real angle of attack on the problem that we can actually iterate on, we can actually build towards. And I think it’s pretty likely going to work, actually. And that’s really, really wild, and it’s really exciting. It’s like we have this hard problem that we’ve been talking about for years and years and years, and now we have a real shot at actually solving it.”

The basic approach is to develop techniques that align systems slightly more powerful than the ones we have today, safely build those systems, and then use them to help align their successors.
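That bootstrapping loop can be sketched as a toy program. This is purely illustrative: the function names, the integer “capability” levels, and the `align` stand-in are all hypothetical constructs for this sketch, not OpenAI’s actual method or code.

```python
# Toy sketch of iterative alignment bootstrapping (hypothetical, illustrative only).
def align(model, assistant=None):
    """Stand-in for real alignment research: in this sketch we simply
    mark the model as aligned, optionally 'helped' by an assistant model."""
    return {**model, "aligned": True}

def bootstrap_alignment(current_model, target_capability):
    """Repeatedly: align the current model, then use it as an assistant
    to align a slightly more capable successor."""
    while current_model["capability"] < target_capability:
        # Step 1: apply alignment techniques validated at this capability level.
        aligned = align(current_model)
        # Step 2: safely build a slightly more capable successor...
        successor = {"capability": aligned["capability"] + 1, "aligned": False}
        # Step 3: ...and use the aligned model to help align that successor.
        current_model = align(successor, assistant=aligned)
    return current_model

final = bootstrap_alignment({"capability": 1, "aligned": True}, target_capability=4)
```

The point of the loop is that no single leap has to span the gap between today’s systems and superhuman ones; each step only has to align a model slightly stronger than the last.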

Our methodology

To select this year’s Future Perfect 50, our team went through a months-long process. Starting with last year’s list, we brainstormed, researched deeply, and connected with our audience and sources. We didn’t want to overrepresent any one category, so we aimed for diversity in theories of change, academic specialties, age, geographic location, identity, and many other criteria.


Many people justifiably don’t want to gamble the fate of the world on the success of OpenAI’s internal alignment research team (I don’t want to take that gamble myself). But even for those who would like to see technical alignment research accompanied by much stronger external oversight, governance, auditing, and measures to prevent the deployment of potentially dangerous systems, technical work on making AI systems safe will certainly be a huge element of any solution to this pressing challenge.

Sometimes, progress on the technical side can open up new options for political and governance solutions. And I think it’s to their immense credit that Leike’s team openly admits the insane stakes of the work they’re doing, and that they are willing to explain how they intend to do it. Their candor means that other researchers can evaluate their approach and figure out if this approach will get us to safe superintelligences — and if not, what will go wrong.
