The_Infinity_Machine_-_Sebastian_Mallaby

other Reference Materials/Technology Books/Tech Western 1.0 MB text added 6/4/2026

-- 1 of 565 -- ALSO BY SEBASTIAN MALLABY After Apartheid The World’s Banker More Money Than God The Man Who Knew The Power Law OceanofPDF.com -- 2 of 565 -- -- 3 of 565 -- OceanofPDF.com -- 4 of 565 -- PENGUIN PRESS An imprint of Penguin Random House LLC 1745 Broadway, New York, NY 10019 penguinrandomhouse.com Copyright © 2026 by Sebastian Mallaby Penguin Random House values and supports copyright. Copyright fuels creativity, encourages diverse voices, promotes free speech, and creates a vibrant culture. Thank you for buying an authorized edition of this book and for complying with copyright laws by not reproducing, scanning, or distributing any part of it in any form without permission. You are supporting writers and allowing Penguin Random House to continue to publish books for every reader. Please note that no part of this book may be used or reproduced in any manner for the purpose of training artificial intelligence technologies or systems. PP colophon is a registered trademark of Penguin Random House LLC. Cover design: Chris Allen Cover photograph: Carlotta Cardana Designed by Nerylsa Dijol, adapted for ebook by Cora Wigen LIBRARY OF CONGRESS CATALOG CONTROL NUMBER: 2025045167 Hardcover ISBN 9780593831847 Ebook ISBN 9780593831854 International Edition ISBN 9798217062744 The authorized representative in the EU for product safety and compliance is Penguin Random House Ireland, Morrison Chambers, 32 Nassau Street, Dublin D02 YH68, Ireland, https://eu- contact.penguin.ie. The mission of the Council on Foreign Relations is to inform U.S. engagement with the world. Founded in 1921, CFR is a nonpartisan, independent national membership organization, think tank, educator, and publisher, including of Foreign Affairs. It generates policy-relevant ideas and analysis, convenes experts and policymakers, and promotes informed public discussion—all to have impact on the most consequential issues facing the United States and the world. The Council on Foreign Relations takes no institutional positions on policy issues and has no affiliation with the U.S. government. All views expressed in its publications and on its website are the sole responsibility of the author or authors. prhid_prh_7.4a_155581268_c0_r0 OceanofPDF.com -- 5 of 565 -- CONTENTS Dedication Epigraph Introduction: The Sweetness 01 Destiny 02 “Deep Philosophical Questions” 03 The Jedi 04 The Gang of Three 05 Founding DeepMind 06 Atari 07 Thiel Trouble 08 Get Google 09 Intuition 10 Out of Eden 11 P0 Plus Plus 12 The Agent and the Transformer 13 On Language and Nature 14 Project Mario 15 Fermat for Biology 16 The Power and the Glory 17 RaceGPT 18 “We’re Cooked” 19 Step by Step -- 6 of 565 -- 20 Comeback, and Beyond Epilogue: Turing’s Champion Acknowledgments Notes Index About the Author OceanofPDF.com -- 7 of 565 -- To Felix, Maya, Milo, and Molly. And especially to Zanny. Love you, all of you, always. OceanofPDF.com -- 8 of 565 -- What we are creating now is a monster whose influence is going to change history, provided there is any history left, yet it would be impossible not to see it through, not only for military reasons, but it would also be unethical from the point of view of the scientists not to do what they know is feasible, no matter what terrible consequences it may have. And this is only the beginning! The energy source which is now being made available will make scientists the most hated and the most wanted citizens of any country. —John von Neumann, while working on the atom bomb, 1945 OceanofPDF.com -- 9 of 565 -- T INTRODUCTION THE SWEETNESS his book is about intelligence. On the one hand, it’s a portrait of a remarkable human, a chess prodigy, a Nobel laureate, a polymathic thinker. On the other hand, it tells the story of his quest to build remarkable machines: systems that are intuitive, creative, and even original. At some point in the not-so-distant future, artificial intelligence will beat human intelligence at almost every mental task, and to say this marks a watershed would be a parody of understatement. Artificial intelligence heralds a transformation more profound than anything since Homo sapiens acquired the capacity for abstract thought, some seventy thousand years ago. I first met Demis Hassabis, the remarkable human, in the mid-2010s: an elfin figure with dark hair falling forward toward angular eyebrows, his face framed by standard-issue spectacles. Already a star technologist and the possessor of a comfortable fortune, he seemed much younger than his thirty-eight years. Smooth-skinned, slight of build, he came across as a phenomenally articulate youth rather than a staid adult. He would appear onstage at conferences dressed in a boyish crewneck and loose slacks. “AI is the technology of making machines smart,” he began one typical performance in 2015, stating his premise in the plainest form possible. What he said next was what got your attention. Hassabis embarked on an explanation of his life’s purpose: the pursuit of machine superintelligence. Growing up in North London, he had decided that two fields of inquiry stood out: physics and neuroscience. Physics explains the external world, from the behavior of particles to the functioning of the universe. Neuroscience explains the internal world—the neurons and synapses and -- 10 of 565 -- electrical pulses that constitute intelligence. Later, at some point in his twenties, Hassabis had concluded that neuroscience was the more important of the two: The internal trumped the external. Intelligence is fundamental; it is the root of all else. It is the mechanism through which humans perceive reality. Still speaking plainly, as though he were saying that he’d wash the dishes after lunch, Hassabis invoked the eighteenth-century philosopher Immanuel Kant. “The mind interprets the world,” Kant had declared. “It’s the mind that creates our reality around us,” Hassabis now said, by way of emphasis. The question was how to comprehend intelligence. Here Hassabis pivoted to a second intellectual giant, the Nobel laureate Richard Feynman. “What I cannot build, I do not understand,” Feynman famously remarked, and Hassabis clicked on a controller in his hand to display a slide of the great physicist. Following Feynman’s dictum, in order to grasp human intelligence, scientists would have to build an artificial analog: a machine that mimicked human thinking. AI’s practical or profit-making potential was a secondary concern. The youthful figure on the stage wanted “to understand our own minds better.”[1] Hassabis delivered this sort of talk repeatedly at tech gatherings in the 2010s, and the faces in the auditorium would seem both rapt and mystified. The boyish philosopher onstage was clearly not a stereotypical entrepreneur peddling a hot app that promised untold riches. He was offering a cocktail of computer science and neuroscience, with the grand prize being enlightenment. Later, I learned that when Hassabis had founded his company, DeepMind, back in 2010, fellow scientists had rolled their eyes, believing the construction of humanlike AI to be impossible. Almost every potential investor had turned Hassabis away, observing that enlightenment is not a business model. But Hassabis had nonetheless scraped together funding and persuaded gifted researchers to join him, all on the strength of his exhilarating vision. It didn’t feel quite adequate to describe that vision in conventional language. The term artificial intelligence was too bloodless. -- 11 of 565 -- Hassabis wanted nothing less than to build an omniscient machine: a machine through which we could better understand ourselves; a machine that would unravel the infinite mysteries of physics; a machine that would occupy, effectively, that position in the cosmos that religious believers once ascribed to an all-powerful divinity. • • • I HAD LONG BEEN fascinated by the predicament of scientists in society. In one sense, scientists are just seekers of truth, a seemingly uncontroversial mission. In another sense, they are the destroyers of all things: our jobs, our ways of thinking, potentially even our existence. Artificial intelligence stands accused of threatening humans in all these ways, and Hassabis understands the full spectrum of doom scenarios. Many leading creators of AI, including a few at DeepMind, fear that circuits-and-silicon intelligence may eradicate the flesh-and-blood variety: that the advent of AI could be the last event in human history, as the extreme pessimists put it. Even if these nightmares of annihilation are speculative, risks ranging from deep fakes to terrifying weapons are certain to materialize, as is economic, political, and philosophic turmoil. And so Hassabis has had to grapple with the central quandary of his life’s work. Given the evident dangers from AI, why would a scientist want to create such a technology? There are two familiar answers—one generous, one troubling. The generous theory holds that humanity will contain the risks inherent in AI while reaping vast benefits from its upside: breakthroughs in medicine and science, inventions to contain climate change, not to mention advances that counteract the very dangers stressed by AI pessimists. We may fear, for example, that future children will never learn to write—if AI chatbots generate text on demand, why bother? We may further fear that, if children cannot write, they will not be capable of thought—and if humans cannot think, what are they? But set against these reasonable anxieties, there is the rosier vision: that chatbots will excel as infinitely patient tutors, creating bespoke quizzes, grading students’ answers instantly. -- 12 of 565 -- This optimistic vision of AI discovery has history in its corner. Past innovations from gunpowder to nuclear fission have made wars more terrifying and accidents more lethal. A few particular inventions— cigarettes, or social media that destroy attention spans—are probably net negative. But the general effect of technological change has been to amplify our experience and extend our lifespans, and the very act of creating new technologies is intrinsic to being human. As the techno-optimist Reid Hoffman puts it, it makes no sense that “we still tend to view technology as a dehumanizing force instead of the thing that makes us, us.”[2] To Descartes’s dictum, “I think, therefore I am,” perhaps we should add: I imagine, therefore I am; I hypothesize, therefore I am; I invent, therefore I am. The urge to invent lies deep within us. Such is the generous explanation of AI inventors’ motivations. The second, troubling interpretation is best captured by a story about Geoffrey Hinton, the delightful academic father of AI and the recipient of a Nobel Prize in 2024—the same year that Hassabis was honored. Ever since I came upon Hinton’s tale, it has haunted me. Hinton is well known for his prickly sense of principle. As a young professor he left the United States for Canada to avoid depending on research grants from the US military. In 2023, when AI systems began to exhibit human fluency, he quit a lucrative position at Google, partly to speak publicly about the dangers of powerful machine intelligence. But the haunting story I remember reveals another side of the great man. It illustrates the predicament of the inventor who sees an opportunity to usher something tremendous into the world. The thrill of discovery—the Icarus instinct—is simply overwhelming. One day in the spring of 2015, Hinton delivered a speech at the Royal Society in London. After the presentation, a journalist spotted him talking to the Oxford philosopher Nick Bostrom.[3] Hinton was telling Bostrom that he did not expect machines to be properly intelligent for a long time. But once the technology began to work, it would be impossible to prevent people from abusing it. “I am in the camp that is hopeless,” Hinton informed Bostrom. -- 13 of 565 -- “In that you think it will not be a cause for good?” Bostrom inquired. “I think political systems will use it to terrorize people,” Hinton answered. “Then why are you doing the research?” Bostrom asked. “I could give you the usual arguments,” Hinton replied. “But the truth is that the prospect of discovery is too sweet.”[4] Hinton was echoing J. Robert Oppenheimer, the creator of the atom bomb. “When you see something that is technically sweet, you go ahead and do it,” Oppenheimer said. “You argue about what to do about it only after you have had your technical success.” Later, Hinton regretted his line. “It was very apt. That’s why I wish I hadn’t used it,” he told me.[5] Discovery is too sweet. Is this what drives scientists to pursue technology that threatens to upend society? Reid Hoffman may be correct that the act of invention is intrinsic to being human. But this raises the possibility that humans will carry on inventing until they eventually go too far. Must the rest of us resign ourselves to being hostages? • • • IN THE YEARS since that conference talk in 2015, Hassabis and his company have racked up astonishing achievements. In 2016, DeepMind—a small British research group now owned by Google—solved a grand challenge in computer science, creating a system that surpassed the intuitive brilliance of the world’s best players of the ancient board game of Go. In 2020, DeepMind solved a second grand challenge, in biochemistry, stitching together thirty-two algorithms to divine the shape of nearly all the proteins in nature: This was the breakthrough for which Hassabis shared the Nobel Prize in Chemistry. In 2025, by now facing stiff competition from follower labs in Silicon Valley and China, DeepMind was among the front-runners in the race to build intelligent chatbots, and it led the field in AI technologies for video generation, drug discovery, and mathematics. Back when he had founded DeepMind, Hassabis had promised to build a Manhattan Project -- 14 of 565 -- for AI, and that was exactly what he now delivered. It was as though the spirit of Los Alamos had been transported to a neighborhood of trendy restaurants and boutiques clustered around a nineteenth-century train station in London. In late 2022—coincidentally, right when DeepMind’s fiercest rival, OpenAI, set off an artificial intelligence frenzy by releasing its conversational companion, ChatGPT—I pitched Hassabis on the idea of a book about DeepMind. It would be a start-up adventure story, an exploration of AI, but also an investigation of the motivations and passions that drive him. Up to a point, Hassabis stands for a type: The missionary entrepreneur and out-of-the-box scientist who, through brilliance and extraordinary drive, emerges as the right person for a particular moment— in this case, the moment when hardware and software and data have aligned to make superhuman intelligence possible. But at a deeper level, Hassabis provides a window on life’s eternal enigmas. What drives people to act? What is their purpose? Believing that societies will never trust inventors of transformational technologies unless they understand what makes them tick, Hassabis agreed to the deep access I needed. Over the next three years, we talked for a total of about thirty hours, and I interviewed more than a hundred members of his entourage, inside and outside DeepMind. During this process, Hassabis revealed himself as an extraordinary consumer and teller of stories—his outlook is shaped by novels and movies, and his gifts as a leader are bound up with his genius for narrating his experiences. The many surprises that followed form the basis for this book. But one in particular has stuck with me. On a late summer day in 2023, I met Hassabis at a café in a North London park, close to the neighborhood where he had spent his childhood. We sat at a weather-beaten wooden table beside a yellowing brick wall, surrounded by humdrum lunchtime conversations. To my left and behind me, two middle-aged women chatted about a friend’s medical diagnosis; in front, a salesman with a file of papers discussed business on his smartphone. The weather was extraordinarily hot for London, and the sun -- 15 of 565 -- beat down on Hassabis, who, now sporting fashion-forward glasses and a shaved scalp, had nothing to protect him. But he didn’t seem to mind. The heat, the moss-patterned bricks, the quotidian chatter at the neighboring tables: Both of us were soon oblivious. For there, seated on a paint-peeled chair, Hassabis was in full flow. Ideas and allusions poured out of him in a torrent. I thanked Steve Jobs for the device on the table that captured every word of it. The true reason to build artificial intelligence, Hassabis was now saying, went beyond Kant and Feynman. The goal was to draw closer to what might be called God—to the intelligence that may presumably have designed everything around us. “I am first and foremost a scientist,” Hassabis began. “My goal is to understand nature. “But doing science is, sort of, like reading the mind of God. Understanding the deep mystery of the universe is my religion, kind of. “We humans, we have these faculties. The world is understandable. But why should it be that way? I think there is a reason. “Computers are just bits of sand and copper,” Hassabis continued, now sounding more urgent. “Why should these combine to do anything? I mean, it’s absurd! The electrons move around and then that creates an AI system that can defeat a Go master? Why should that be possible? “This table, Sebastian!” Hassabis rapped his palm on it for emphasis. “Why should it be solid? “This is beyond evolutionary coincidence. We can build electron microscopes and interrogate reality down to the most minute level. We can build systems that detect black holes colliding more than a billion years ago. I mean, what is this? What the hell is going on here?” There was a pause, but Hassabis was not yet finished. “I sit at my desk at two a.m., and I feel like reality is staring at me, screaming at me. “Literally, screaming at me. Trying to tell me something if I could just listen hard enough. -- 16 of 565 -- “That’s how I feel every day. So, you can see why I’m trying to build AI. I’ve felt that since I was very young: that there’s a deep, deep mystery about what’s going on here. “You can frame it how you want. You can call this God’s design, or you can say it’s just nature. I’m open-minded about the description, and I don’t know what the answers will turn out to be. But at the moment we don’t really know what time is, or gravity is, or any of these things. So there is a mystery waiting to be solved, and it encompasses just about everything. “I would like to understand before I croak. I would like to understand, and then I’m perfectly fine to shuffle off my mortal coil.” • • • AS I FINISH this book, in 2026, a new kind of intelligence is being willed into the world by a remarkably small number of people. Each of them is driven by a particular mix of curiosity and hubris, vanity and avarice, idealism and craving, and the sobering reality is that, for better or worse, the quality of their characters will affect society. The good news is that Demis Hassabis, who blazed the trail followed by rivals, is decent and public-spirited and wants the best for humanity. He has ego, to be sure. He is fearsomely competitive; his sense of destiny, as the developer of AI, borders on the messianic. But his goal is scientific enlightenment, not money or power. The spiritual language in which he sometimes couches his mission underscores how seriously he takes it. Some readers, sick of the arrogance of tech overlords, may find this verdict difficult to swallow. In the years since the 2008 financial crisis, which humbled the lions of Wall Street, Silicon Valley has emerged as the new epicenter of commercial power, stoking the hubris and hypocrisy of some of its leaders. The overweening self-righteousness of the most voluble tycoons has discredited the idea that messianic innovators can possibly be well-intentioned. But there is no necessary connection between making a fortune in the Valley and egomania unbound, and nor is it fair to paint the technology sector as a bastion of anarcho-libertarianism. For every Donald -- 17 of 565 -- Trump supporter in the tech industry, there are multiple progressives.[6] For every avaricious egotist, there is also a Bill Gates, who has dedicated much of his Microsoft fortune to improving the life expectancy of the world’s poorest. Moreover, Hassabis himself is a figure apart. Not by coincidence, he has chosen to remain in London, far from the Valley’s hype and commotion. In his dreamier moments, he talks of retiring to a university and spending time with his first loves—physics and neuroscience and philosophy. For now, Hassabis is not sheltering in an ivory tower—far from it. He is at the roiling epicenter of a capitalist fight over AI: a fight that is playing out at a time when global and national regulators seem unlikely to contain the fallout. Indeed, it is hard to imagine a more explosive coincidence of a transformative scientific shock, unstable geopolitical competition, and seething political chaos in the United States, the country ordinarily most likely to lead an effort to safeguard a Promethean technology. In these volatile circumstances, Hassabis’s struggle to stay true to his personal values is a defining story of our times. He wants to do good, but can he be good? He understands the dangers of AI, but what can he do to contain them? J. Robert Oppenheimer created the atom bomb, but he could not control its use. Perhaps this is the privilege and fate of all history’s great scientists. OceanofPDF.com -- 18 of 565 -- P CHAPTER 1 DESTINY artway through his doctoral research in neuroscience—when he had already been a chess master, a video game designer, an amateur theoretical physicist, an entrepreneur, a computer scientist, and five-time world champion at the international Mind Sports Olympiad—Demis Hassabis discovered a work of science fiction that made sense of who he really was. The book, called Ender’s Game, by Orson Scott Card, tells the story of a diminutive boy genius who is taken from his family and sent off to a space station. There, at an intergalactic battle school, Ender is manipulated by adults, bullied by classmates, and put through extreme mental testing, all to discover whether he can shoulder responsibility for the survival of the human race. By dint of grit and talent, Ender rises to the challenge. At the climax of the novel, he outwits an army of alien invaders, destroying their armada and saving planet Earth, though the question of whether he committed genocide in the process hangs over the outcome. Hassabis was around thirty years old when he discovered Ender, and he was so taken with the story that he asked his wife to read it. She told him she felt sorry for the central character—a boy deprived of childhood and harnessed to a mission chosen for him by adults. But Hassabis identified powerfully with Ender. He, too, had been a diminutive boy genius, socially isolated by his own prodigious talent. He, too, had undergone extreme mental testing, and was consumed by a desire to make his mark on the universe; one of his ambitions was to surpass his scientific heroes, Newton and Einstein, and “understand the fabric of reality itself.” The fable of Ender—a gifted, bullied boy who saves all humanity—tapped into -- 19 of 565 -- Hassabis’s deepest preoccupations, even if (especially if!) the savior had been required to pay an immense personal price. Some fifteen years later, now in his midforties, Hassabis sat across from me in a London restaurant, reflecting on the power of this tale. We had not stumbled on the subject by accident. Hassabis had suggested that I read the novel in advance of our first long conversation; this was a subject he wanted to tackle. If I was to get to know him, I would have to understand his science-fiction alter ego: to see the capacity for endurance, the ability to suffer and still soldier on. Like Ender, Hassabis had dedicated every fiber of his being to the accomplishment of a mission, which was why he worked night shifts from ten in the evening until around four in the morning in addition to his normal office hours. Like Ender, Hassabis felt a burden of responsibility. “If you are trying to solve humanity’s problems and understand the nature of reality, you don’t have any time to waste,” he said. I described this conversation to Shane Legg, one of the two cofounders who teamed up with Hassabis to form the company DeepMind. Back in 2010, when the pair of postdocs at University College London had begun fusing computer science and neuroscience, had Legg realized that he was hitching his career to someone so possessed? At first, Legg answered warily. “I don’t know if I was teaming up with a real-life Ender,” he began. But then he continued, weighing his words deliberately. “Demis has an extraordinary level of determination. Unlike pretty much anybody. Astonishing, incredible determination. That’s his most defining characteristic. Just unbelievable determination.” “What do you mean?” “He works, sleeps, eats, breathes the mission, twenty-four hours a day. To a degree that I just haven’t seen with other people.” “No hobbies?” “Football. Big fan of Liverpool. But other than that, it’s the mission.” “And that was evident even back when you met him, more than a decade ago?” “Always,” Legg answered. -- 20 of 565 -- His face flickered, as though a memory had stirred somewhere just below the skin. “Demis tells a story about his father saying that whether you win or lose, the really important thing is that you try your best. And Demis says he took that very literally. As in, absolutely try the absolute, absolute, absolute best you can possibly do, pretty much to the point of breaking yourself. “That’s how he is, twenty-four seven.” I nodded, kept eye contact, and hoped Legg would continue. “I don’t think his father meant his comment in quite that literal sense,” Legg reflected. “Like, ‘try your best’ wasn’t supposed to mean ‘try literally to the point of destroying yourself, go absolutely, completely 100 percent.’ But that’s how Demis understood it. “There is no 50 percent mode in Demis. There is not even a 99 percent mode in Demis. There is only 100 percent.”[1] • • • DEMIS HASSABIS was born in modest circumstances in Finchley, North London, in July 1976. He was the eldest child of a Chinese Singaporean mother and a Greek Cypriot father, which made him an exemplary product of one of the world’s great melting pots. His mother had grown up in poverty, spending part of her childhood as an orphan on the streets of Singapore, eventually finding shelter with a benevolent relative and moving to London to study nursing.[2] When Demis was small, she worked as a sales assistant at the John Lewis department store and took part-time jobs as a cleaner. Demis’s father had been the first from his family to attend university, but he was too much of a bohemian free spirit to abide office work. He was an aspiring singer-songwriter and sold toys out of the back of a beaten-up red Volkswagen van. In contrast with his family circumstances, the young Demis’s talents were not at all modest. When he was four, he climbed up on a chair to watch his father play chess against his uncle. Within a few weeks, he had mastered the game well enough to defeat adults. At five, he began -- 21 of 565 -- competing in tournaments, sitting on a telephone book on top of two stacked chairs so that he could get his head above the table, and frequently beating older kids. A primary school teacher noted his unusual mix of levity and seriousness. Demis was “sparkling.” He was also relentlessly competitive.[3] When Demis was six, he qualified to compete in the British Under 14 championship, winning two of his matches before falling asleep at the table when a game stretched into the evening, way beyond his bedtime. After watching the two victories, Leonard Barden, the beloved patriarch of English chess, approached Demis’s father. Barden had played internationally during the 1950s and 1960s, later becoming a well-known chess columnist and television commentator. Now he sought out Demis’s father to deliver the kind of message that parents love and dread. “Your son is the best six-year-old I’ve ever seen,” Barden said. “What are you going to do when someone tells you that?” Hassabis reflected. “My parents were fairly normal people living fairly normal lives. And a renowned expert is telling you this.” Demis’s father responded to Barden’s message as though instruction had been handed down from God. For the next half dozen years, weekend after weekend, he bundled his young prodigy into the family’s red VW van and drove him off to tournaments in shabby, far-flung church halls, leaving his wife with the two younger children and her various jobs. Sometimes the father-son duo spent the night in sleeping bags laid out over the engine at the back of the VW; other times they found a cheap hostel and shared a bunk bed. If Demis won the tournament, the prize money would cover the hostel fees, but his mother still fretted about the cost of the travel. “She grew up in absolute poverty,” Hassabis said later. “I imagine my parents had a lot of arguments about money, because we didn’t have much.” Demis’s chess progress continued. At nine, he was the captain of England’s Under 11 team. At thirteen, he reached the rank of chess master and was the second-strongest competitor in his age group, worldwide.[4] But the pressures kept on mounting. Chess consumed every weekend and every day of school vacation, squeezing out the easy recreation of normal -- 22 of 565 -- childhood. Long hours of training were punctuated by acute moments of match play, when everything came down to nerves and stamina and unplanned flashes of insight. The competition was vicious: There were wooden boards under the tables to prevent players from kicking each other. [5] Like Ender, Demis could barely imagine what “just living” might mean. He had never tried it out. Demis’s father was taking progressively more time away from his work and his music to keep driving him to tournaments, which only redoubled the pressure on his son. When the boy had a bad game, the father would erupt. “There was one time, I was a rook up and then lost horribly, and my dad went mental,” Hassabis remembered.[6] “He was screaming, ‘How could you have done this? This is unbelievable. How could you have just thrown this away?’ “It was just awful. We were out in some hostel, and he was going on about this, screaming. And this used to be a fairly regular occurrence with my dad. “So I said to him, ‘This is ridiculous. I obviously tried my best. I’m not intentionally losing.’ And that was that. I wasn’t going to take it anymore. That was the last time I remember him screaming at me, whereas he used to all the time before. “I think I’m quite an empathetic person,” Hassabis continued. “I can take the other person’s position, and I know that this was done through love. If you met my dad now, you would be like, ‘This can’t be the same person,’ because he’s so laid-back and chilled out. “But this is just how he was around chess. It was probably to do with, I don’t know, my mum pressurizing him. Like, why doesn’t he have a normal job? And maybe I wouldn’t become a champion and this was all a waste of time.” During one of our long conversations, I asked about the story that Shane Legg had told me: the one about Demis’s father telling him that, win or lose, what mattered was to do his best. “I think my dad meant it in a comforting way,” Hassabis stipulated. -- 23 of 565 -- “But then, he didn’t really mean it that way, or why would he have been so angry when I lost a match? “Anyway,” he continued, flashing a grin of rueful self-awareness, “the slightly warped way I took that was, how do you know you’ve done your best? “The only way I could know is basically if I pushed myself to the point just before death. Because that is literally when you have done your best. If you die—by die, I mean burn out or something—then you’ve slightly overdone it. “It’s like running a marathon. You have to basically fall over the line. And then ideally you should be hospitalized, but not dead. That’s when you can say you’ve done your best. But if you’ve got any energy left, you’re still standing, maybe you could have tried harder? “That’s how I took it. I must have been about nine or ten.” • • • ONE OF HASSABIS’S earliest childhood memories involves winning the London Under 8 chess championship and going up to get the trophy. In the film reel of his mind, he walks up to a figure holding a silver cup, eagerly receives the prize, and turns triumphantly to face the audience. Then his gaze fixes on the runner-up, a future grandmaster and close friend, who sobs uncontrollably as his father berates him.[7] The thin line between exultation and breakdown was a constant feature of the tournament circuit. “A lot of my chess friends got destroyed,” Hassabis recalled later. “They ended up drinking, or getting burned out.” At the age of twelve, Hassabis experienced his own moment of existential torment. Paradoxically, it freed him. Hassabis was doing battle at an international competition near Liechtenstein. The way he remembers the episode, he was pitted against an experienced German master, a man old enough to be his father. The German was a chain-smoker, and the match stretched on for almost ten hours, entering an unusual endgame. Hassabis still had his queen; the German had -- 24 of 565 -- a rook, bishop, and knight. Time ticked by as the pieces circled the board, and the tournament hall gradually emptied as kings were cornered and toppled on the other tables around them. Then, eventually, the equilibrium broke. The German trapped Hassabis’s king. Checkmate looked inevitable. Shocked and physically exhausted, Hassabis capitulated. Immediately, his opponent leapt to his feet. “Why have you resigned?” Hassabis remembers him asking. With a flourish, the victor showed the boy the move he should have made. If Hassabis had sacrificed his queen, the match would have ended in a stalemate. The German’s friends crowded around and joined in the jeering. For the rest of the day, Hassabis felt sick to his stomach. But the next morning he experienced an epiphany. That tournament hall near Liechtenstein had been packed with brilliant brains, dueling over black and white squares until stamina was drained to nothing. Surely that immense collective mental effort should have been harnessed to some higher cause—science, say, or medicine? “I thought we were wasting our minds,” Hassabis said later.[8] For nearly all his conscious life, Hassabis had assumed he would grow up to be a chess professional. His father earnestly believed that, too: No less an oracle than Leonard Barden, the father of English chess, had foretold his son’s destiny. But now Hassabis resolved that there must be something more: a mission, a purpose. What followed were some happy years when Hassabis continued to play chess but refused to be possessed by it. He remained ferociously competitive, of course. Dharshan Kumaran, the friend who was in tears after losing the London Under 8 championship, remembers Demis as a teenager, gesturing at the stronger players seated at a tournament’s top boards, and declaring with determination, “We’re better than all those people!”[9] But to the surprise of many in the chess fraternity, Hassabis also began to compete in other mind games: bridge and backgammon, Diplomacy and draughts, as well as the Japanese chess variant shogi. His versatility was even more striking than his prowess at chess: After university, he went on to win a record string of gold medals at the five- -- 25 of 565 -- game international Mind Sports Olympiad. “What Demis did with shogi was really impressive,” a chess veteran reflected. “Another complicated game and he got to the top of the English rankings,” said another.[10] But Hassabis could scarcely imagine why he wouldn’t sample other challenges. The world offered so many enthralling ways to test his mental acumen. The more Hassabis multiplied his hobbies and interests, the more he collected friends around him. In his younger years, he had cut a solitary figure: With Asian looks, a foreign name, and an outlandish intelligence, he had not exactly gelled with the other kids at the local government school that he attended—“I was like this alien,” he remembered. Around the age of ten, he had skipped classes entirely for a year to focus on chess, keeping up with the curriculum by reading textbooks in his bedroom. When he was fourteen, he dropped out again, teaching himself two years of the school syllabus at double speed so that he could take the national GCSE exams early. It seems likely that this social isolation contributed to his manic drive. Lacking the scaffolding of friendship, his route to affirmation was to achieve something extraordinary.[11] But as Hassabis matured in his teen years, he began to flourish socially. The switch flipped when he was fifteen and his school arranged a special class for a handful of elite math pupils. For the first time, Hassabis found himself surrounded by similarly curious, driven kids, and he befriended all of them. The students came from a variety of backgrounds: a Nigerian whose father was a diplomat; a boy of Indian descent; two Jewish kids; and one Cypriot-Singaporean-Bohemian chess prodigy. All were united in their enthusiasm for the class. Not only were they learning math, they were discovering that joyful, abstract mental play could be a way of bonding with your peers, not a route to isolation. • • • AFTER HIS DEFEAT in Liechtenstein, Hassabis spent more time on his greatest interest outside chess: computing. At eight, he had used his prize money from the junior circuit to buy his first machine, a ZX Spectrum 48K. “I -- 26 of 565 -- loved it from the moment I unwrapped the box,” he said later.[12] At twelve, he bought a much more powerful device, a Commodore Amiga 500. His dad took him to Foyles, a labyrinthine bookshop in the heart of London that boasted thirty miles of weathered shelves, and Hassabis discovered a slim volume called The Chess Computer Handbook, by the Scottish international master David Levy. The marriage of computing and chess united Hassabis’s two worlds. He bought Levy’s handbook and read it in one sitting. Levy introduced Hassabis to the themes that would animate his lifelong quest to build artificial intelligence. To show how chess programming worked, Levy invoked the information theorist Claude Shannon, who would become another of Hassabis’s intellectual heroes. In 1950, Shannon had published a paper, “Programming a Computer for Playing Chess,” arguing that, while chess programs were of no practical importance, mastery of complex games might “act as a wedge in attacking other problems of a similar nature and of greater significance.” These similar but more significant problems might be as various as translation, military strategy, and the generation of music. Chess programming was merely the first step toward what Shannon termed “a modern general-purpose computer.”[13] When Shannon wrote those words, no such general-purpose computer was in sight. “I started writing a program for a machine that did not yet exist, using a set of computer instructions that I dreamed up,” he confessed cheerfully.[14] But, blessed with a rare gift for theorizing the future, Shannon proceeded to describe the difference between a “numerical computer” and a “general” one. A numerical computer was basically a calculator: It followed rigid programs and tackled questions that had clear right-or-wrong answers. In contrast, a general computer could make sense of subjects that demanded more than mere logic: It could assess a chess position, or grasp linguistic nuance. To grapple with this sort of material, the program would have to apply “general principles, something of the nature of judgment, and considerable trial and error, rather than a strict, unalterable computing process.” A general computer would not be merely deductive. It would consider examples, try things out, and make sense of the world around it. -- 27 of 565 -- Levy’s book, written a third of a century later, surveyed the relationship between computers and chess as it stood in the early 1980s. By this point, chess programs were starting to exhibit the features that Shannon had imagined, and their strengths and weaknesses shed light on the nature of intelligence. Silicon transmits electric signals much faster than the human brain, so computers could rapidly calculate several moves ahead; alacrity enabled them to defeat top humans at speed chess.[15] However, computers had yet to develop anything resembling intuition, the flashes of brilliance that decided longer matches. Levy predicted that chess programs would never overcome this lack of flair, but that steadily expanding processing power would lead them to victory over human grandmasters by the end of the century. Sure enough, IBM’s Deep Blue chess system defeated the reigning human grandmaster, Garry Kasparov, in 1997.[16] But Deep Blue triumphed in a grinding, brute-force fashion, vindicating Levy’s view that machines would struggle to match human ingenuity. The young Hassabis stored up these ideas, and games later became the test bed for his own quest for AI, much as Shannon had envisaged. But what excited Hassabis about Levy’s handbook were its more practical sections: the chapters that took the reader through the components of a chess program, explaining three building blocks that would be central to Hassabis’s later achievements. The first challenge, Levy explained, was for the computer to “see” the chessboard. This required turning visual information—the shape, color, and position of the pieces—into a set of quantities. A number was duly assigned to each piece: 1 to a pawn, 2 to a knight, and so on. To distinguish white from black, white pieces were assigned positive numbers, black pieces negative ones. In this way, the spatial information on the board was made intelligible to a computer. Next, the machine had to be taught to evaluate board positions. Human players do this based on factors such as the value of the pieces they have left, the number of possible moves these pieces can make, and their ability to attack the board’s center. Consciously or otherwise, humans weight these various factors differently. A chess program, as Levy explained it, could do -- 28 of 565 -- the same. For example, if the value of the remaining pieces was twice as important as their freedom of maneuver, the computer would double the score for material before adding it to the score for mobility. Once the program knew both the state of the board and the value of a position, it had to devise a strategy. It did this by working through all possible moves, then considering how its opponent might counter each one, and how it would counter the counter. The further this “tree search” extended, the more branches it would sprout. Pretty soon this exponential branching overwhelmed the computer’s processing power, Levy explained, so programmers had devised a way of narrowing the search. Just as humans save time by not analyzing moves that are obviously bad, so chess systems “pruned” branches that led to low-value positions.[17] Once the computer had gamed out the promising branches as far as its processing power allowed, it played whichever move led to the highest-value outcome. Firing up his Commodore Amiga, the twelve-year-old Hassabis set about applying Levy’s principles. Pretty soon, he found that his computer choked on the complexity of even a pruned tree, so he built a program to play a simpler game, Othello. This was no small task. Hassabis had to adapt Levy’s instructions to a new domain: The position representation, evaluation function, and the tree search all had to be designed differently. But Hassabis got the program working, and, drawing on a musical facility inherited from his father, he added in a soundtrack. Then he whipped up some graphics to make the game look cool. Changing the color of the Othello counters from black and white to red and blue, he christened his invention Fire and Water. The Fire and Water program proved intelligent enough to beat Demis’s little brother, George. Demis was delighted. Admittedly, George was all of five years old at the time, but Hassabis still chalked this up as a famous achievement. Thanks not least to Demis’s tutoring, George was pretty good at games. “It was amazing that I’d made something that could beat him,” Hassabis remarked, with satisfaction. • • • -- 29 of 565 -- THREE YEARS LATER, when Hassabis was almost fifteen, he visited a local store to browse computer magazines. There were racks of publications, and Hassabis followed a practiced routine: He would read until the staff told him to buy something or get off the premises. On this occasion, Hassabis found an ad in a magazine published for fans of the Commodore Amiga; it was an invitation to compete for a job at the Bullfrog video game production studio. The contestants only had to do one thing: dream up the wackiest, most entertaining spin on the classic video game Space Invaders. [18] Hassabis knew Bullfrog as one of the top game studios in Europe. Its founder, Peter Molyneux, was a big-eared, big-talking, lanky creative who did not just design games; he invented entire new genres of games, notably the “god games” in which players controlled the fates of hordes of digital characters. Unbeknownst to Hassabis, Molyneux also had a wild side.[19] When kids came to his office to work as game testers, he would sometimes amuse himself by shooting at them with a BB gun.[20] When a Bullfrog designer complained about the disappointing size of his bonus, Molyneux responded by hurling a heavy object at him. The projectile missed, shattering the company fish tank, which was stocked with piranhas. The following Monday, a new Bullfrog recruit climbed the stairs to the company’s attic studio and found dead fish all over the floor, along with smashed glass and wet patches. Other than that, the place was empty. Molyneux and his team had jetted off to America, leaving the mess behind them.[21] Hassabis loved Molyneux’s god games, and he resolved to compete in Bullfrog’s contest. Fittingly, his variant on Space Invaders involved chess: The player’s avatar was positioned in the middle of a chess-like grid, and chess-piece enemies advanced on it from either side in chess-like formation. A few months later, another announcement in the Amiga magazine listed Chess Invaders as a runner-up.[22] It was not quite enough to win Hassabis a job. But he called up the company and landed an invitation to visit. Hassabis boarded a commuter train from London and set off for the exurban town of Guildford. By now Bullfrog had done well enough to -- 30 of 565 -- vacate its dingy attic office, and the studio was housed in a shiny building in a research park. “They had nice carpet and people that cleaned the windows without being asked,” Molyneux’s cofounder recalled, though the Bullfroggers rendered the premises a bit less nice by skateboarding down the corridors and vomiting in the urinals.[23] To Hassabis, however, windows and carpets were beside the point: He was leaving the orbit of his parents and arriving in the promised land; the creative magic of a famous video game savant was about to be revealed to him. “Can you imagine how excited I was?” Hassabis exclaimed later. “I was literally skipping off the train, jumping on the bus to the research park. It was a beautiful sunny day and I was coming over the brow of a hill. I thought I had just gone to heaven.” Hassabis’s special brand of sparkling seriousness made for an excellent first impression. “He was a lovely kid and so phenomenally bright,” one Bullfrogger remembers thinking.[24] Hassabis was soon invited to stick around at the studio for a week, and was assigned a desk next to Molyneux. His quick intellect brought out the boss’s good side: The savant handed down instruction and the pupil soaked it up; the two got along famously. Hassabis was also fascinated by the other Bullfrog employees: technically talented, self-made young men, many of whom had dropped out of high school, being too idiosyncratically gifted or plain wild to sit meekly in a classroom. Several of the illustrators who worked on the games had learned their artistry by spraying trains with graffiti and dodging the cops on the way home. Hassabis was particularly taken by a brilliant self-taught coder named Sean. “He had an edge to him. He could have been in a gang. I mean, he probably was in a gang,” Hassabis remembered. The following winter, Hassabis won admission to the University of Cambridge, where he would study computer science. The college authorities ruled that although he was academically ready, at sixteen he was too young to enroll, so he should find something else to occupy himself for the next year or so. This suited Hassabis just fine: He would graduate from high school a year early and go back to Bullfrog until Cambridge was ready -- 31 of 565 -- for him. Molyneux was embarking on a brand-new, genre-busting adventure, a game called Theme Park. Not everyone at Bullfrog felt confident about Theme Park’s prospects. The idea was that players would build virtual carnival rides and burger stands and ice cream stalls, managing digital entertainers, mechanics, and security guards. “I just didn’t get it,” one graphic artist recalled. “I left Bullfrog to make a decent game. What a dickhead!”[25] But Hassabis was happy to embrace Molyneux’s vision. Together with a handful of other young employees, he moved into Molyneux’s higgledy-piggledy country house, a mysterious old rectory with hidden doors and secret passages and plenty of gear for gaming.[26] There, the coding commune worked both day and night, gathering periodically in the kitchen for impromptu spaghetti and discussion. The line between working and philosophizing blurred, Hassabis recalled. “We were brainstorming these big ideas. There was this thrill of unbridled creation.” In the early 1990s, video games were built on software platforms known as “finite-state machines.” Characters toggled crudely between a limited number of states—a monster might run, attack, or eat, for example. But Molyneux insisted that the finite-state architecture should be pushed to the max, so that the digital figures in Theme Park would exhibit a far greater range of behaviors. They would crave food and drink, which might be salty or sweet. They would want fast rides and dizzying ones. They should experience nausea and nerves, happiness and sadness. Hassabis rose to Molyneux’s challenge, packing Theme Park full of imaginative details. If the player put extra salt on the French fries, the visitors would feel thirsty and soft-drink sales would rocket. If the player made the roller coasters too scary, the digital riders would vomit; not scary enough, and thrill-seeking customers would grow disappointed. Periodically, Molyneux would drive Hassabis from the old rectory over to the studio, where the precocious apprentice would demonstrate his latest wonders, the boss would issue instructions, and the two would disappear again. Naturally, this raised some hackles. “Peter has a new pet!” an older -- 32 of 565 -- programmer muttered. “Who the fuck is this kid?” another grumbled.[27] But nobody could question the excellence of the kid’s output. • • • SOMETIME IN THIS PERIOD, Molyneux gave Hassabis a copy of Gödel, Escher, Bach, a fire hose of a book that has inspired a remarkable number of future AI scientists.[28] This tome, which won the Pulitzer Prize, delivers a torrent of ideas on “fugues and canons, logic and truth, geometry, recursion, syntactic structures, the nature of meaning, colonies, concepts and mental representations, translation, computers and their languages, DNA, proteins, the genetic code, artificial intelligence, creativity, consciousness and free will,” as the author, Douglas R. Hofstadter, proclaimed, without evident modesty. Seventeen years old and voraciously curious, Hassabis was deeply fascinated by all of the above. But the passages that influenced him most were the ones on intelligence and consciousness. As a chess prodigy, Hassabis had long been curious about the workings of his own mind: How did his brain formulate moves? Why did it make mistakes? And what was behind this phenomenon called thinking? Hofstadter attacked these questions as a physicist, insisting that human intelligence and computer intelligence are virtually indistinguishable. The human brain, as he explained it, is a purely physical object. It is composed of biological material that obeys the laws that govern the rest of the universe, computers included. Moreover, human brains, like computer brains, work on trickles of electricity; when they form an opinion or conceive a plan, they are responding to the chemical equivalent of ones and zeroes. The idea that the gooey mass inside the skull contained some ineffable, unprogrammable something—consciousness? spirit?—was nothing more than biochauvinism. This proposition would upset many readers, Hofstadter conceded. Human beings are seized by a strange sensation: the feeling of possessing a unique “I-ness,” which in turn is at the root of something called “free will.” Readers needed to reckon with the reality that, notwithstanding such -- 33 of 565 -- feelings, human intelligence and machine intelligence resembled one another closely. “Only if one keeps on bashing up against this disturbing fact can one slowly begin to develop a feel for the way out of the mystery of consciousness: that the key is not the stuff out of which brains are made, but the patterns that can come to exist inside the stuff of a brain,” Hofstadter wrote.[29] For a youth who was already fascinated by programming intelligence, this line of argument was thrilling. If the patterns were what mattered—those crackles of electricity, ultimately governed by genetic code—then similar patterns could be encoded into artificial brains. What the mind could do, computers should be able to do—one day. These propositions were all the more intoxicating given the setting. Hassabis was transforming his programming hobby into a well-remunerated art, proving his facility with the code whose potential Hofstadter was stressing. He was living away from his parents, surrounded by rebels who loved to dream about AI, under the watch of a mentor who encouraged these passions. “We were discussing AI all the time,” Hassabis recalled. “How it could help the games. What it would take to build it.”[30] Even as he plowed his way through Hofstadter’s dense work, Hassabis was inhaling science fiction. When he was younger, he had read Isaac Asimov’s Foundation series, and his imagination was fired by the main character, Hari Seldon, who prophesies the collapse of the Galactic Empire and plots to mitigate the fallout. “The only way for us to have that capability of predicting disasters, and then averting them, would be to have AI,” Hassabis said later. At Bullfrog, Hassabis ripped through the first books in Iain Banks’s Culture series, which described a post-scarcity, interstellar society dominated by intelligent artificial beings. In this world of Banks’s imagining, AI systems would generate economic abundance, and citizens would lack for nothing. Space travel would be as simple as hopping on a London bus, and people could choose among hundreds of planets to live on. What’s more, the intelligent machines that Banks envisaged would exist peacefully alongside humans; they would be too preoccupied with their own intrigues to pick a fight with mortals. It followed that artificial -- 34 of 565 -- intelligence was not to be feared. To the contrary, it would enrich human experience. Halfway through the work on Theme Park, Hassabis accompanied Molyneux to an artificial intelligence conference in the United States. There they watched a professor from Carnegie Mellon University give a talk on lifelike computer agents. The professor showed a video with three bouncing blobs: The big one was labeled “Bear,” the medium-sized one was called “Dog,” and a small blob was “Mouse.” The blobs interacted in ways that, to state the matter generously, were only mildly intriguing. The bear defended the mouse. The dog chased the mouse. And so on. Such, apparently, was the state of the art in academic AI programming. After the lecture, Molyneux and Hassabis went up to the professor. “Do you want to see what we are working on?” they asked. They cracked open a laptop and produced a demo of Theme Park. “He fell off his chair,” Hassabis recalled. “He was like, ‘What is this? Who are you guys?’ “And I showed him all the different properties that we were modeling, how happy the characters were feeling, how sad, how thirsty, how hungry, how much money did they have, who were their friends? All of that was simulated in this massively complicated theme-park world. And he couldn’t believe it.”[31] Taken together, Hassabis’s experiences at Bullfrog answered his big post-Liechtenstein question: His mission and purpose would be to build artificial intelligence. Molyneux and Gödel, Escher, Bach had planted the idea: Computers would soon do whatever the brain could do. Iain Banks had supplied a utopian vision of what AI’s realization could mean: boundless human flourishing. And the Carnegie Mellon professor had inadvertently established that Hassabis possessed the requisite talent: If he could impress an eminent scientist before even attending university, there was no limit to what he might accomplish in the future. “I decided then that I was going to dedicate my career to working on AI,” Hassabis recalls. “I already had the kernel of the idea for what eventually became DeepMind.”[32] -- 35 of 565 -- It would take one more epiphany to clinch his destiny. OceanofPDF.com -- 36 of 565 -- I CHAPTER 2 “DEEP PHILOSOPHICAL QUESTIONS” n the fall of 1994, Hassabis quit Bullfrog to embark on his studies at Cambridge. Molyneux did everything to persuade him not to go: He wrote out a check for £500,000 to get him to work on Bullfrog’s next game, Dungeon Keeper. It was an astonishing sum to dangle in front of an eighteen-year-old, equivalent to about $1.7 million in today’s money.[1] But Hassabis refused to cash the check. Hassabis’s determination to attend Cambridge owed much to a film, Life Story. The movie celebrates the scientists James Watson and Francis Crick. They meet at the university, enjoy sunny strolls along the River Cam, and hurry along rain-drenched cobbled streets to the shelter of the ill-lit Eagle pub, where they speculate exuberantly about DNA and conspire to become famous. Ultimately, with the help of an X-ray image created by their rival, Rosalind Franklin, Watson and Crick discover the double-helix structure of DNA, winning the Nobel Prize for their achievement. The importance of this film to Hassabis, like the influence of Iain Banks’s Culture series, says much about how he came by his worldview. Dropping out of school for periods, and operating beyond the understanding of his parents, Hassabis’s ambitions were shaped by a magpie collection of encounters. “I’m quite indiscriminate about knowledge,” he said. “I’ll have any knowledge. Chess game, book, philosophy, I’ll drink it all in.” But the Life Story movie points to something else as well. Multiple forces had driven Hassabis down the path toward AI. But perhaps the strongest driver was the thrill of science: the prospect of discovering the truth behind the other truths. -- 37 of 565 -- “What’s fun?” Watson asks Crick, early on in Life Story. “Oh, the big questions,” Crick responds. “What is man? What is life? How did we come to be the way that we are?” For a while after that fateful game in Liechtenstein, Hassabis had thought seriously about a career in theoretical physics. Here was a field that seemed to grapple with the biggest possible questions—the nature of the universe, the building blocks of reality. For a youth with vast ambition, the prospect of understanding everything was profoundly alluring, and physics held out the hope that you could rise above the clutter of quotidian facts to a higher plane of abstraction. From that loftier vantage point, physicists could explain reality in terms of phenomena unseen—atoms, gravity, geometry, time—and perhaps even discover an all-encompassing, unifying insight that knitted everything together. Every so often in the history of science, one giant theory displaces another: Copernicus announced that Earth was not the center of the universe; Einstein replaced Newtonian physics with general relativity. Perhaps humanity had arrived at another watershed in the progress of ideas—a moment when siloed understandings could be fused into a single theory. Thrilling though this prospect seemed, Hassabis was also practical. When he signed up for a game, he liked to feel that he could win, and physics seemed like a long shot. The way he saw things, all physicists since Einstein had ultimately come up short. They had failed to hit on a theory that explained all of reality. “Even Richard Feynman couldn’t do it,” Hassabis said, matter-of-factly. “He died without understanding everything. I realized that however good I was going to be, I was unlikely to surpass him.” Following this line of thought, Hassabis hit upon a strategy. He resolved to go after the infinite mysteries of physics with the help of artificial intelligence. Science had always advanced courtesy of new tools: Telescopes had allowed humans to peer into space; X-ray machines had made it possible to see into humans without invasive surgery. AI would be the ultimate lever: an extension not merely of vision but of the capacity for understanding. -- 38 of 565 -- Arriving at Cambridge, Hassabis was on the lookout for scientific challenges to crack once AI became available. He imagined himself sitting in the Eagle pub, conspiring in the murky light and hatching Nobel-level adventures. But for those who expect geniuses to be one-track obsessives, he was something of a disappointment. In his first year as an undergraduate, he developed a taste for electronic music: After nights of raving with friends, he would flop down on his bed at dawn and put on Music for the Jilted Generation by the electronica band The Prodigy. In his second year at Cambridge, he took buddies for joyrides in his new car—a Porsche 911 Turbo. In a flourish of nineteen-year-old chutzpah, Hassabis had persuaded Molyneux to lend him the Porsche, saying that he needed it for his commute to the Bullfrog studio, where he had agreed to contribute as a consultant.[2] Sometime in this period, Hassabis fell in love with an Italian undergraduate who would become an academic bioscientist. They would later marry. Hassabis was the subject of much gossip. Students swapped stories about his chess exploits. They marveled that an undergraduate had been the cocreator of Theme Park, a game that had sold millions of copies. They rolled their eyes about his car—and envied it. But despite the legends that grew up around him, what struck friends most about Hassabis was his affability. “My first impression of him was, he’s a nice kid,” another computer science student recalled.[3] He radiated a conviction that you were going to get along, and the conviction was self-fulfilling. One day I asked Hassabis about this friendly approachability. “I’ve always tried to live that,” Hassabis told me. “It’s a very deep, personal philosophy.” “Where did it come from?” I asked. “Probably my mom,” Hassabis said. “The religious upbringing she gave me. “She’s Christian, very religious. That got her out of the hard situation in her childhood, when she was basically an orphan. When I was growing up, she was always helping poor or lonely people through her church. -- 39 of 565 -- “I used to go to Sunday school, played my flute in the church band, prayed before I went to bed, helped with the charity work. My mom’s religion certainly stuck with me. “But I also think it’s just my personality. I want to help people and I feel very strongly that it’s just really bad to manipulate or control people.” I thought of Ender and wondered whether there might be another explanation for this humble affability. Hassabis was small and appeared young for his age: “I am regularly told that I look ten years old,” he confessed when he was in his early twenties.[4] This gave him a good reason to shy away from confrontation, from explicit efforts to exert control; when the bullies came after Ender, he hit back with magic force, but Hassabis was not a cartoon ninja. Hassabis could choose to avoid confrontation, moreover, because he had an alternative way to earn people’s respect: He could beat everyone at games, from chess to backgammon and even table football. In this sense, his gentle affability and his ferocious competitiveness were two sides of the same coin. On the one hand, Hassabis loved to yank and spin the table-football figurines in the college bar: It was an everyman hobby, telegraphing his approachability. On the other, he became so fixated on the game that he organized a college team and vanquished adversaries all over campus. “I was the best table-football player at Cambridge, pretty much,” Hassabis remembered, without irony or self-deprecation. “I could shoot with my left hand from the midfield with a lot of control, so I had something quite special. “You know, there’s a professional scene in the US, and even they don’t shoot like that.” Hassabis also started to play the ancient Chinese board game of Go: Here was something else that he could win at. Once a week, he would bicycle out to the home of a professor and take afternoon lessons, peering down at the restful symmetry of the grid, pondering how to position each stone in a way that captured territory and denied it to the adversary. Hassabis was so good at this test of spatial intelligence that the professor -- 40 of 565 -- noted down some of his games, later including them in a handbook for Go students. As he made the most of Cambridge life, Hassabis remained a magpie. Whatever his fellow students were excited by, he was eager to wrap his mind around, too—and his enthusiasm was contagious. One time, between bouts of table football, a biologist acquaintance told him about the mysteries of protein shapes that, if only they could be accurately plotted, would unlock extraordinary medical breakthroughs. It was a passing conversation, so fleeting that it failed to register in the memory of the biologist friend, but Hassabis filed it away in the back of his extraordinary mind as a potential double-helix-type challenge.[5] A quarter of a century later, DeepMind unraveled the mystery of proteins, winning a Nobel Prize for revolutionizing the field of structural biology.[6] • • • HASSABIS FORMED his strongest intellectual bond with a student named David Silver. They both stood out for being sunny and friendly, and even before their paths crossed at Cambridge, each was curious about the other. Silver had a boyhood memory of an intense young Hassabis competing at chess tournaments in his hometown of Ipswich: The visiting Londoner would defeat all the locals and make off with the prize money. For his part, Hassabis was keenly aware that Silver had the highest exam results of any computer science undergraduate at Cambridge.[7] When the two eventually met, mutual respect and overlapping interests made for a quick melding of minds. “We shared the same passions,” Silver recalled. “The big debates about AI, the deep philosophical questions about computer science. It was just really fun talking to him.”[8] Silver’s “deep philosophical questions” hearkened back to the founding fathers of computing. In 1956, a group of artificial intelligence pioneers had convened a summer workshop at Dartmouth College in New Hampshire. “An attempt will be made to find how to make machines use language, form abstractions and concepts, [and] solve the kinds of problems now -- 41 of 565 -- reserved for humans,” the organizers announced boldly. The premise for this project was that every “feature of intelligence can in principle be so precisely described that a machine can be made to simulate it.” It was a presumption that reflected the midcentury faith in logic and reason. The 1950s enshrined the rational agent at the heart of economics, the efficient- market hypothesis at the heart of finance, and “scientific” managers at the heart of corporations. In this hopeful era, it was only natural for the Dartmouth group to believe that human intelligence was rational and deductive—and thus describable and programmable. For the next half century or so, the standard approach to building intelligent machines was known as symbolic artificial intelligence. Programmers chose symbols to represent concepts from the real world: digits, words, physical objects. They supplied the computer with information about these symbols, then added instructions on logical rules— the definitions of “and,” “or,” “not and,” and so on. If the computer was told that the first symbol was green and the second one was blue, it could accurately deduce that they were not both the same color. If it was told that humans are mortal, and that Socrates was human, then it could deduce that Socrates was mortal. The idea was to transform all real-world phenomena into quasi-mathematical syllogisms. If a is b, and c is a, then c is b, also. Hassabis had encountered this approach before: David Levy’s handbook, which showed how to turn chess positions into numerical statements, had been an example of symbolic programming. In 1984, the year Levy’s book appeared, symbolic AI reached its apogee with a project called Cyc, which aimed to create a system equipped with the majority of human commonsense knowledge. Cyc was taught rules as detailed as “You can’t be in two places at the same time,” and “When drinking a cup of coffee, you hold the open end up.”[9] But the truth, as philosophers had long recognized, was that the subtleties of human reason could not be captured in this way. For every rule, there are myriad exceptions. Knowledge cannot be decomposed into discrete axioms, nor is understanding achieved exclusively through logical deduction. -- 42 of 565 -- Consider a basic facet of intelligence: the ability to arrange things into categories. What is the rule that explains to a computer that a butter knife and a carving knife belong in the same category, despite their different sizes and appearances? Ah, you may point out: Both objects serve to cut; they are united by function. But then how do you simultaneously explain to the AI system that a golden retriever and a dachshund should be grouped together: Do they share a “function”? Of course, the answer is that the big dog and the small dog belong to the same species, meaning that they can mate together and produce fertile offspring. But how can an AI model be instructed to know which categories are defined by appearance, which by function, and which by reproductive potential? The pioneers of symbolic artificial intelligence had no answers to such questions.[10] At Cambridge in the mid-1990s, Hassabis and Silver encountered a culture still wedded to the midcentury assumptions. They were taught “first order logic,” a system of rigidly unambiguous statements that was used in deductive programming. (The statement “All birds can fly” would be written “∀x(Bird(x)→CanFly(x)),” for example.) To the two undergraduates, the limits to this methodology were clear. Silver, like Hassabis, had read Gödel, Escher, Bach: The first name in the book’s title belonged to the mathematician Kurt Gödel, who had proved that, contrary to the Dartmouth pioneers’ presumption, no system of logical deduction could encompass all possible true statements. To Hassabis and Silver, Gödel’s “incompleteness theorem” merely confirmed what was intuitively obvious. After all, humans engage in deductive logic only a small fraction of the time. Mostly, they take in jumbled images, words, smells, and sensations; then they extract meaning from the noise—a process that logicians call induction and lay people might call pattern recognition. “The idea of using first order logic to understand language—it was obvious to me this was nonsense,” Hassabis remarked later. “We don’t speak in first order logic and yet we can understand each other.” If intelligent humans could comprehend one another’s messy sentences, it followed that intelligent machines should be able to make sense of -- 43 of 565 -- imprecise, unstructured data. They should digest examples and derive general truths. They should be inductive as well as deductive. “We speak ungrammatically all the time,” Hassabis went on. “It doesn’t collapse our brains. We can converse. So first order logic is clearly not the whole story.” To the computer science establishment, however, induction seemed daunting. Deduction yields unambiguous truths. Induction yields generalizations that are not provably correct, and that may have to be revised in light of fresh information. After studying the morning routines of ten New Yorkers, for example, an observer might induce that all humans drink coffee. But observation of millions of people across multiple cultures would require this conclusion to be modified. Because learning from examples requires many examples, an inductive machine can succeed only by taking in as much data as possible. But then it will hit the limits of its computational power, requiring a strategy for deciding which parts of its training data to focus on. This gets to the challenge that defines AI: the challenge of teaching a machine to navigate copious data. The human mind relies on mental shortcuts to pull off this trick; but at the time when Hassabis and Silver were at Cambridge, scientists had found no way to codify these human “heuristics” so that they could be fed into a computer. Ever since the Dartmouth workshop, artificial intelligence pioneers had wrestled with this conundrum, which philosophers termed the “problem of induction.” But however hard they tried, humans’ mental shortcuts could neither be defined nor written into a program. “AI has utterly failed, over a quarter century, to solve problems that philosophy has utterly failed to solve over two millennia,” the Harvard philosopher Hilary Putnam observed wryly.[11] Hassabis and Silver had no idea how to program induction, either. But at least they identified the right problem. Somehow, AI scientists would have to overcome their attachment to provably correct deductive logic. Until they did so, the effort to build intelligent systems would be stymied by a contradiction. The essence of intelligence is the ability to respond flexibly to complex situations. But symbolic programming involves feeding -- 44 of 565 -- inflexible rules into inflexible machines; inflexibility piled on inflexibility would never conjure flexible intelligence. To rise above this contradiction, future scientists would have to invent a new kind of machine: a machine that discovered the patterns in a near infinity of data. • • • IN THEIR THIRD YEAR AT CAMBRIDGE, as they imagined this futuristic infinity machine, Hassabis and Silver persuaded an unusual professor, John Daugman, to have them over to his office for a series of tutorials. This sort of small-group teaching was what made Cambridge special, and Daugman’s interests ranged far beyond symbolic programming. He taught courses on information theory and computer vision, becoming famous for inventing an algorithm for iris recognition. Hassabis recalls the tutorials with Daugman as “nirvana sessions,” and Daugman took an instant liking to the friendly pair. “You could actually talk to them,” the professor remarked. “I’m sorry to say this, but in general that’s not true about computer scientists.”[12] The sessions with Daugman led Hassabis to his next epiphany. He realized that a superhuman computer would be more than just a means to a scientific end, the end being progress in scientific understanding. Rather, the computer might itself be the end, because information, marshaled by computer science, was the basic unit of reality. The traditional contenders for the status of fundamental building block—energy, matter—were less compelling by far; only information provided the basis for explaining all facets of experience. The behavior of particles, the flow of energy, and even human consciousness could be seen as examples of information processing. Of course, Hassabis had already absorbed the germ of this idea from Hofstadter. Biology, Hofstadter had argued, was an information processing system; what defined life was not muscle or tissue but the signals that animated them.[13] But with Daugman, Hassabis went deeper, studying Claude Shannon’s theories on what information is: how it can be quantified, stored, and transmitted over time and space; how it is defined by a simple but profound insight—as the opposite of uncertainty. Seen in this light, any -- 45 of 565 -- reduction in uncertainty would depend on information, intelligently processed. A theory of everything—that is, a theory that reduced uncertainty to something near zero—would in all probability take the form of a computer program. “That’s the way I still view the whole universe,” Hassabis said later. “I think information is the fundamental unit.” This was just the first part of the epiphany, however. If information was the fundamental unit of reality, what came on top of this foundation? To Shannon, the answer was computation: the processes for sifting information, moving it around, and generally deriving meaning from it. This insight led Shannon to theorize computers that did not yet exist: They needed to exist, and so they would exist. But what if, nearly half a century later, computing was at an impasse, as it appeared to Hassabis and Silver? Perhaps the answer was to move another level up: from information on level one, to human-designed computation on level two, to machines that figured out how to design their own computation on a third level. Such machines—artificial intelligence systems, or programs that designed programs—barely existed, but they would fill an obvious gap: If humans lacked the wisdom to teach machines induction, the infinity machines of the future would teach themselves to crack the problem. Like Shannon before them, Hassabis and Silver believed that such systems needed to exist, and so they would exist. Hassabis sometimes explained the need for an infinity machine by contrasting physics with biology. “A deductive system like mathematics may be the perfect description language for physics,” he said; Newton had managed to capture the nature of motion in a series of equations. “But AI may be the right description language for biology, because biology is so messy, emergent, dynamic, and complex.” It was impossible to imagine something as elegant as Newton’s laws to describe a cell. But if you fed an infinity of data about cells into an inductive computer, the machine might figure out a way of describing what was going on—it would see the unseen patterns, the hidden laws, that explained cellular behavior. “AI—the kind of -- 46 of 565 -- information system we’re building—will probably be the right tool for this,” Hassabis suggested.[14] Over the ensuing years, Hassabis’s two-part epiphany stuck with him. First, information was the fundamental unit of reality. Second, a machine that learned for itself how to induce nature’s patterns was the most powerful imaginable tool with which to apprehend reality. And while artificial intelligence could push the frontiers of science, it could also do much else besides: discover medicines, extending the lifespan of humans; solve the obstacles to nuclear fusion, rendering energy clean and abundant. As Hassabis once put it to the Guardian, “What we’re working on is potentially a meta-solution to any problem.”[15] A machine that could navigate an infinity of data would be infinite in its reach. • • • GRADUATING FROM CAMBRIDGE in 1997, Hassabis flirted briefly with the idea of a year off in Japan, where he planned to study Go. But he chose instead to work on Black & White, Peter Molyneux’s latest game project.[16] The players of this “god sim” would be equipped with divine powers, and they would choose whether to be black or white, terrible or benign, perhaps discovering something of their own character in the process.[17] The god- player’s choice of personality would color the game’s simulated world: If the player opted to be evil, the landscape would darken; if the player was good, there would be angelic chirpings in the background. Whichever path the players chose, their challenge was to persuade the masses to believe in their powers. Reflecting his own life as a creative impresario, Molyneux was obsessed with the idea that a deity is nothing without followers.[18] Hassabis was drawn to Black & White by the opportunity to experiment with AI programming. Whereas the characters in Theme Park had been complex finite-state machines, obeying fixed rules governing their preferences, Black & White would be the first game in which the avatars’ internal rules changed based on feedback. If a digital creature hurled rocks at villagers and the player responded with a slap, the creature would learn -- 47 of 565 -- not to repeat this transgression. If a creature ate excessively and the player reassured it with a pat, the creature would adjust its algorithmic preferences in favor of continued bingeing. This basic “reinforcement learning” system was a small step in the direction of AI: a program that adjusted its program —that was capable of learning. On a more practical level, the creature’s adaptability made every experience with the game feel fresh. When Black & White eventually appeared, it was another Molyneux blockbuster. Hassabis contributed to the early brainstorming for the game, but he did not stick around to implement the vision. He had matured since his first stint with Molyneux, and his reaction this time was different. Before going to Cambridge, Hassabis had been so excited to be at Bullfrog that he ignored Molyneux’s volatile side; now he noticed it. He could see that, despite his undisputed creativity, Molyneux was a fabulist, a teller of tall tales, often promising journalists that his next project would include some fantastical technical advance, never mind that his own coding team had assured him that it was impossible. In his conversations with Hassabis—his protégé but now, potentially, his rival—Molyneux would claim to have discovered a secret new path forward to AI, but he would never quite produce the evidence: He was by turns emphatic, vague, elusive, and menacing, yo-yoing between warmth and iciness, bravado and tears, stoking the anxiety of everyone around him. Years later, Hassabis compared Molyneux to the mysterious character in The Magus, a novel by John Fowles. The magus is manipulative, mendacious, a master of illusions and mind games. “That was pretty much how Peter approached me,” Hassabis said.[19] I thought back to what Hassabis had told me about his mother’s religion —about how bad he felt it was to dominate people. Perhaps, if life were a god game, Molyneux would be the black god, exercising power through manipulation and menace. Hassabis would be the white god, exercising power by dint of contagious enthusiasm and lucidity. “The worst thing you can do to somebody is to be controlling,” Hassabis said to me again. “I go to great lengths not to be like that.” -- 48 of 565 -- Besides, Hassabis by now had larger ambitions. Toward the end of his time at Cambridge, he had confided to his friends that, to pursue his dream of building AI, he planned to found a company.[20] It was a shocking idea. Entrepreneurship was a foreign concept on the Cambridge campus; Britain had no equivalent to Silicon Valley. “If you had looked at the students and asked, ‘Who’s going to set up a company?’ the answer would’ve been nobody,” one of Hassabis’s contemporaries recalled. “It was like, who are you going to work for, or what PhD are you interested in? Of course you don’t set up a company!”[21] Perhaps thanks to his exposure to Molyneux, perhaps also to the influence of his free-spirited father, Hassabis was an exception. He saw no reason not to start a company—and so he did. OceanofPDF.com -- 49 of 565 -- O CHAPTER 3 THE JEDI ne night in early 1998, Hassabis slouched back in a comfy chair at his parents’ home in North London and stared outside into the dark sky as he listened to the soundtrack of his favorite movie, the sci-fi classic Blade Runner. The ramifications of his recent decision were beginning to sink in. He had traded in his job with Peter Molyneux for a shot at starting his own firm, and entrepreneurship suddenly felt daunting. But he wasn’t going to sit around wondering what might have been. “You only get one life,” he told himself. Hassabis’s first call the next day was to his friend David Silver. When they were at Cambridge, Hassabis had talked about his plan to found a company, and Silver had been intrigued without ever quite believing him. After all, the celebrated Peter Molyneux pretty much worshipped the ground on which Hassabis walked. Why take the risk of starting a competitor? “That games company we talked about, do you want to do it?” Hassabis asked. The previous evening’s doubts had been erased. His conviction was infectious. Silver had taken a job at a cool software boutique, building special effects for movies. He didn’t miss a beat. “Absolutely. Let’s do it.”[1] Silver went over to the Hassabis home, and the two of them thrashed out a business plan. They named their fledgling studio Elixir, this being, according to a handy dictionary, “the quintessential part of any substance.” “Obviously I had no clue as to what this meant, but it sounded good,” Hassabis wrote in the Elixir Diaries, a series of dispatches that he published -- 50 of 565 -- monthly in a gaming magazine.[2] Having grown up on stories and movies, frequently imagining himself in the shoes of the heroes, Hassabis was taking the logical next step. He was composing his own story. Armed with a name, a business plan, and one brilliant ally, Hassabis set out to recruit more talent. Top of his list was a game designer named Joe McDonagh. A couple of months earlier, McDonagh had tired of his big- company employer and applied for a job at Molyneux’s studio: His submission had consisted of a bottle containing a tea-stained message from a person shipwrecked on the island of “Korporate.” Impressed, Hassabis had taken charge of interviewing the applicant, noting that his CV mentioned a strange pair of hobbies: origami and boxing. The interview consisted of Hassabis challenging McDonagh to a series of games; he beat the candidate in a race to fold an origami bird, but decided not to test his left hook or his haymaker. Now Hassabis offered McDonagh a job, working for him rather than Molyneux. The next person on the list was a wizard named Tim Clarke, whom Hassabis had known at Cambridge. Clarke was into coding, theoretical physics, and weightlifting. When still at high school, he had written a program that simulated the sensation of flying over Mars. This was a hit among space nerds and got him a summer job at NASA. Hassabis quickly talked McDonagh and Clarke into joining. His powers of persuasion were uncanny: “Demis had what we called his Jedi mind trick,” Silver said later. “He would kind of be like, ‘You will believe the things I’m going to say,’ and then people did believe them.”[3] The stint with Molyneux had no doubt helped Hassabis to develop this presentational flair. Entrepreneurship flourishes in clusters such as Silicon Valley, where technologists learn how to project confidence by apprenticing to one another. Britain was, to a first approximation, a start-up desert. Hassabis was lucky to have spent time working for a master storyteller. Having assembled his three cofounders, Hassabis set off to raise money. Typically, game design studios sought backing from game publishers, which provided up-front capital in exchange for a large share of future revenues. But Hassabis reckoned he would get a better deal by following -- 51 of 565 -- the Silicon Valley model and tapping venture capitalists. “The plan was simple,” Hassabis explained. “Blow them away with impressive stats on Theme Park, talk them through the detailed business plan, enthuse about the backgrounds and records of the core team.” In the late 1990s, however, London venture capitalists hardly deserved the title. In Silicon Valley, T-shirted start-up founders raised millions from seasoned investors. In London, Hassabis felt obliged to don a stuffy suit, and the investors were more pickled than seasoned. Arriving at his first pitch meeting in the financial district, Hassabis was greeted by two young associates who led him off to their preferred meeting place, a restaurant. There, according to the Diaries, they ordered three bottles of wine: A couple of hours and many glasses later, the associates announced that it was time to meet their boss, who, as it happened, was in a pub around the corner. Pints of lager were served up, and Hassabis felt hot and drunk and exhausted. Still, he launched into a stump speech about his chess exploits, and soon the impressed boss made him an offer. But it was not for Elixir. Hassabis should think bigger than that silly start-up, the boss declared; he should work for the boss’s firm as a currency trader. Such was the City of London’s commitment to British entrepreneurship. Several days later, Hassabis received a letter from his recent drinking buddies. They were willing to back Elixir after all, and would kick in £2 million; but in return they expected fully half of Elixir’s equity. To be fair to the investors, this was by some measures an unsurprising proposal: Back in the 1960s, Silicon Valley’s first venture capitalists kick-started the entrepreneurial ecosystem by offering terms that were roughly as brutal.[4] But there was no way that Hassabis was going to accept this sort of proposal. Surrendering half his equity would mean giving up control of Elixir, and if there was one thing that the Molyneux experience had taught, it was that he hated to be controlled by anyone. After fruitless negotiation and a dozen attempts with other so-called venture capital outfits, Hassabis gave up. “They liked us, but they wanted our soul in exchange for the money.” -- 52 of 565 -- Hassabis had kept himself afloat with his savings from his time with Molyneux. Now his cash was running low, and he was getting desperate. He was still living at his parents’ house, and he rode the bus to work; for a while he drove his mother’s beat-up car, but then the clutch broke. Fortunately, it turned out that another part of Britain’s start-up ecosystem— rich individuals, so-called angel investors—was in healthier shape than the venture capital part of it. Over the next couple of months, an industrialist, a lawyer, and Peter Molyneux himself all ponied up a bit of cash.[5] Hassabis rented a small, windowless workspace near a motorway junction. On July 7, 1998, Elixir was founded. • • • WHEN HASSABIS had talked of starting a company during his last months at Cambridge, his ambition had been to build powerful AI, not just to design video games.[6] In founding Elixir, he was balancing his ambition against his practical side. A games studio would allow him to at least experiment with AI, and it would give him entrepreneurial experience. It might also make him rich—rich enough, perhaps, to launch his ultimate dream: a Manhattan Project for artificial intelligence. The Elixir crew got down to work, noodling ideas for the first game project. The roof was made of corrugated iron and there was no ventilation; the gamers sat on recycled school chairs and argued about whether eating scrambled eggs would have unacceptable consequences for the office’s air quality. Hours of intense silence would be followed by raucous discussion: Should Spock have been the captain of the Enterprise rather than Kirk? Which football stars had the worst haircuts? The team also played fantasy football and the video game StarCraft in an attempt to satisfy Hassabis’s “burning desire to play and win something, anything,” as Hassabis himself put it. A month after Elixir got started, Hassabis won the five-game Mind Sports Olympiad for the first time. He showed up at a hotel in West London and bested some two thousand rivals, sprinting between games in different rooms so that he could rack up multiple wins simultaneously. -- 53 of 565 -- Hassabis did not do much of the coding at Elixir, but he provided most of the vision. One time at Cambridge, he had played an obscure board game based on a power struggle in a banana republic. This was early 1995, and the media was flooded with photos of Russia’s invasion of Chechnya. The combination of the game and the grim wartime imagery got Hassabis thinking. “I began to ask myself about the people who make history, the men who shape the courses of our lives. Who are they? How do they become what they are?”[7] Following this line of thought, Hassabis came up with a twist on Molyneux’s god formula. He imagined a dictator game: The player would assume the identity of a faction leader who had to oust the ruler by means of demagoguery, political intrigue, or brute force, with the fate of millions of lesser beings dependent on the outcome. The idea became Elixir’s first project: Republic: The Revolution. To put flesh on this concept, Joe McDonagh took himself off to the British Library to learn about Russia and its former Soviet satrapies. He also discovered a bizarre relic in a rough part of South London: the Society for Anglo-Soviet Cooperation. McDonagh installed himself in the society’s dilapidated reading room and glanced warily at the odd characters browsing the bookshelves. He presumed that they were spies, and he presumed that they presumed that he was a spy: A life lived through games can stimulate the imagination. After a couple of months of reading, McDonagh dreamed up the fictional but realistic country of Novistrana, featuring elements of Belarus, Ukraine, and Azerbaijan. Novistrana would have a dictator, ample corruption, and Eastern Orthodox churches.[8] Echoing Molyneux, Hassabis demanded heroic efforts from his coders. Novistrana was to be populated with legions of plausible people: husbands, students, housewives, and drunks, each living separate lives, each of them believable. Hassabis also wanted groundbreaking, high-definition graphics, so that the moss would be visible in the cracks in the buildings. By the end of 1998, Elixir had made enough progress to attract a funding package from a game publisher, Eidos. Having struck out with the venture investors, Hassabis fell back on the standard source of capital. -- 54 of 565 -- Armed with the additional cash, the team moved into a larger office in the trendy neighborhood of Camden. There were curving metal beams under the roof, and McDonagh and Clarke, the boxer and the weightlifter, took turns swinging chimpanzee-style from one beam to the next one. Hassabis installed table football and announced to all and sundry that nobody would beat him, ever. His lieutenants trained maniacally to prove him wrong. At length, the evil day arrived. The champion’s reign ended.[9] Hassabis retreated from the table and sat in his chair mutely. “There was this dark cloud over his chair. He had this somber, cloudy face. And at a certain point he just couldn’t contain it anymore. “He stood up and said, ‘It’s like my soul is on fire!’ ” David Silver remembered. “Was there an element of self-mockery?” I asked Silver. “Or was it completely serious?” “It was both. It really was how he felt. But I think he also knew that it would get a laugh or something.”[10] Through 1999, the team slogged away on the details for Republic. Hassabis urged his art team to create imposing cities that reflected the premise of the game: that the man on the street is a mere ant. To stir the creative juices, he led viewings and discussions of film noir classics: Fritz Lang’s Metropolis, Batman, and his own favorite, Blade Runner. The artistic possibilities were expanded by Tim Clarke, the weightlifter, who coded a world-class graphics software that could render images in photorealistic detail, down to the screw threads on the bolts of Novistrana’s brutalist factory machinery.[11] Announcing Clarke’s achievement in the gaming press, Hassabis dubbed his system the “infinite polygon engine.” Jealous rivals mocked it as the “infinite monkey engine.”[12] Meanwhile David Silver pushed AI for games to the next level. The characters in Theme Park had differed little from one another, but Silver made it possible for Republic to feature proper individuals. There was Ludmilla Mironova, a sleazy town councilor and a walking advertisement for Soviet-era cosmetics. There was Eduard Satarov, an even sleazier -- 55 of 565 -- journalist with a huge beer gut and a fantastic comb-over. Hassabis declared expansively that there would be fifteen hundred of these highly differentiated characters, plus “a million individual living, breathing people with their own daily routines and their own beliefs and loyalties.” Thanks to Silver’s algorithms, each character would develop as it interacted with the next, shaping the game’s development. At the end of 1999, Hassabis went public with his work in progress. He granted a long interview to the gaming magazine Edge, which ran a gushing cover story on Republic. Hassabis, the magazine suggested, might be inventing the future of gaming; Republic was “one of the most ambitious computer games ever.”[13] “Long term, I want to be the best games developer in the world,” Hassabis told another interviewer.[14] The echoes of Peter Molyneux’s ebullient storytelling were obvious. • • • JUST OVER A YEAR LATER, at the start of 2001, Elixir’s grand ambitions collided with a hard deadline. Republic was to be unveiled at the Electronic Entertainment Expo that May: Fifty-five thousand developers, publishers, retailers, and journalists would descend on Los Angeles. The reception that Republic garnered would determine its fate. “If the competition is working fifteen hours a day, I want us to be working sixteen hours a day,” Hassabis declared during the lead-up. With a real-life Ender in their midst, several members of the Elixir team worked even more than that. Tim Clarke was known for coding into the small hours, passing out on a sofa, then waking up, soaping his armpits, and sitting back down at his terminal. Republic’s wild complexity was causing trouble, and in the week before the expo David Silver was still hacking at the code, struggling to get the demo working. In desperation, he had resorted to a trick. The computer he used had to look normal from the outside, since the game would need to work on customers’ standard home PCs; but Silver and his colleagues surreptitiously stuffed the machine with eight times the normal quantity of memory. Even so, the combination of -- 56 of 565 -- high-definition graphics and differentiated game characters was causing the system to choke. The day before he was due to fly with Hassabis to LA, Silver stayed up all night, trying desperately to get the demo working. When morning arrived, at the last possible moment, Silver saved his code and powered down, Hassabis scooped up the computer in his arms, and the two rushed out for the airport. As they went through security, their flight was closing, and Hassabis and Silver began running. The two men reached the gate with no time to spare. Proceeding to their seats, their faces pouring with sweat, they nearly injured a few passengers as they marched down the aisle with the computer. Somewhere toward the back of the aircraft, Silver stored the machine by the extra seat he had bought for it. As soon as the plane was in the sky, he resumed his fight to make the code work. Arriving at the cavernous Los Angeles Convention Center, the two comrades made their way to the private room where Hassabis would present Republic to executives and industry journalists the next morning. By now dizzy with sleeplessness, Silver set up the system. Elsewhere in the expo hall, five thousand other development teams fanned out over an area equivalent to eight soccer fields. Standing in front of the computer the next day, Hassabis launched into his first presentation. The boss of his publisher-financier and several other bigwigs had arrived to watch: This first session would be among the most important. But as soon as Hassabis began talking, Silver’s souped-up system crashed, forcing Hassabis to reboot it. Even more humiliatingly, as the computer rumbled back to life the screen displayed the system’s vital statistics, including the massively augmented memory. Something inside Silver snapped. With the opening music for the demo throbbing in his brain, he edged toward the doorway and bolted. To this day, he cannot stand to listen to that soundtrack. Silver found a sofa and passed out. There was no fight left in him anymore; he slept for a few hours without stirring. When he finally awoke, he set off to see Hassabis. He had tried everything, everything, he would tell -- 57 of 565 -- his friend. Everything to make the demo work. Everything to save their company. When Silver eventually located Hassabis, it transpired that no apology was needed. Hassabis had proceeded with his presentation despite the hardware disaster, and everyone had loved it. The demo had been clunky— even when he got the PC working, it could only display the frames in slow motion—but the accompanying patter had been genius. Hassabis had waxed effusive about Republic’s imaginative scope. Players could order beatings of noted community members, plot the martyrdom of a revolutionary student, and experience the dynamics of rioting throngs—here Hassabis dazzled his audience by invoking the book Crowds and Power by the Nobel Prize winner Elias Canetti. Such was the curiosity and anticipation that Hassabis generated, Republic: The Revolution won an award at the expo for the best upcoming game, and a reviewer declared it the most exciting strategy concept since Civilization.[15] Standing on the precipice, teetering at the edge, Hassabis had pulled off his greatest Jedi mind trick ever. • • • DESPITE THIS IMPROBABLE ESCAPE, Elixir was in trouble. The hardware crash reflected a hard truth: The computers of the time couldn’t handle Hassabis’s ambitions. Round-the-clock coding sessions were taking their toll: Silver’s sudden exit from the presentation room had been a warning of incipient burnout. Several other members of the team were in a similar condition. Hassabis’s mind tricks had helped him to recruit talent, raise money, and create spectacular buzz. But they had raised expectations to the point where delivery became almost impossible. Over the next couple of years, Republic’s release date was pushed back repeatedly.[16] To get the product out the door, the team diluted the game’s most innovative ideas, and morale inevitably suffered. As the keeper of the vision, Hassabis fought a rearguard action against compromise, and it took time for him to recognize the trap that his own charisma created. “Who would’ve thought that you can actually inspire people too much?” he -- 58 of 565 -- reflected, years later. “Well, you can, because you can get to the point where you are deluding your team, and then they are deluding you also. “It’s like, I’m making the judgment this is possible because the engineers are telling me it’s possible; but they’re only telling me it’s possible because I’ve over-inspired them,” Hassabis said. “So, in fact, none of us is getting real feedback.” Republic eventually went on sale in August 2003. Relative to the hype, it was an anticlimax.[17] This disappointment was quickly followed by a more fundamental setback: David Silver quit Elixir. The circumstances of Silver’s departure raised a troubling question: Perhaps Hassabis was guilty of more than over-inspiring his troops; perhaps he might be harming them. Over the previous year or so, Elixir’s coders and designers had taken to pulling Silver aside: You tell Demis that this feature will not work, they’d plead. You’re the only one he’ll listen to. But the job of communicating reality to Hassabis was grueling. “You had to push the conversation to the point where he got more and more intense and defended his positions more and more strongly,” Silver said later. “The stronger he got, the closer you were. Then eventually he might go quiet. That’s when he had absorbed the message.”[18] After months of playing go-between, Silver decided that he had to go elsewhere. He moved to the south of France with his girlfriend and spent a year recuperating. Hassabis was left to contemplate the thin line between his charisma and his obstinacy, between his gift for storytelling and the risk that he might drink his own Kool-Aid, between his magical capacity to inspire people and the risk of inadvertently destroying them. Silver was the first to say that Hassabis’s intentions were good: When Demis talked about the influence of his mother and his horror of manipulating others, he meant it.[19] But it was one thing to abhor the idea of controlling colleagues. Given his Jedi-level charisma, it was quite another to avoid it. Of course, Hassabis was not alone in this dilemma. Many sensational entrepreneurs, from Steve Jobs to Elon Musk, have bent reality with their stories, shifting the boundary between the possible and the impossible, with consequences both exhilarating and harmful. Closer to Hassabis’s own -- 59 of 565 -- experience, Peter Molyneux could be both inspiring and abusive. Relative to these other leaders, Hassabis may have been more self-aware, and more anxious to correct his own tendencies. “I am strong-willed and stubborn,” he told an interviewer during the early days of Elixir. “I try to be open- minded to my ideas not always being the best ones,” he added.[20] And to be fair to Hassabis, he seldom put more pressure on colleagues than he put on himself. By the time Elixir went out of business, in 2005, he, too, was burned out—“Demis is killing himself,” a friend remembers thinking.[21] But that, in a way, was exactly the point. A boss who approached everything with relentless Ender-style intensity was bound to be tough on whoever was around him. One day I discussed Hassabis’s double-edged mind powers with a psychologist. She caused me to rethink my Black & White analogy—the one where I had imagined Molyneux as the dark, manipulative god, and Hassabis as the friendly games-playing deity. Charismatic people cannot be just bad or good, the psychologist explained. If they are bad all the time, they will alienate everybody, and a charismatic person with no entourage is a contradiction. Likewise, charismatic leaders cannot be consistently benign. In the roar and crash of life, there are bound to be moments when interests and opinions collide. Somebody has to prevail over somebody else. Given their unusual gifts, charismatic personalities will end up on top, no matter how earnestly they aspire to avoid dominating others. There is a further point, the psychologist continued. It is not merely that charismatic personalities must inevitably swing between inspiring and controlling people. The oscillation itself is part of the charisma. Followers become addicted to a Jedi’s variable moods, just as gamblers become addicted to arcade machines that disappoint and disappoint and then spike their brains with jackpots. If slot machines spat out similar rewards turn after turn, there would be no juice in the game; if a parent is consistently loving, the child will feel no need to cling on anxiously. Oscillation stokes dependence; it sinks the hook into the mouth. This was precisely what Hassabis had experienced and then angrily rejected in his relationship with Molyneux. -- 60 of 565 -- What of Hassabis himself, I wondered? His intentions were good, but could he remain good? If Hassabis set out to build an infinity machine, he would have to cut a path through tough terrain, and adversity would test his personality. The outcome of this struggle would have consequences for society writ large: Hassabis’s character—his temperament, his integrity, his choice of where to stand in moments of challenge—would matter. In Molyneux’s Black & White, the player-god’s moral compass determined the color of the world. It was a metaphor to ponder. • • • ELIXIR’S FAILURE came with an unexpected benefit. It caused Silver and Hassabis to move on to the next chapter in their lives, picking up the threads of their studies at Cambridge. When he left Elixir for the south of France, Silver read a textbook by a University of Alberta computer scientist named Richard Sutton. Its premise matched the conviction that Silver had developed at Cambridge: that inflexible machines trained on inflexible logic could never achieve flexible intelligence. Rather, machines should learn by trial and error, as Shannon had said; they should interact with the world, receive feedback, and use that feedback to fine-tune their behavior. Sutton’s textbook laid out ideas for implementing this approach, which was known as reinforcement learning. The basic idea of reinforcement learning, often known as RL, was already familiar to the gaming fraternity. When he had worked for Molyneux, Hassabis had proposed that a simple form of reinforcement learning should be coded into Black & White: The slaps and strokes from the god-players would provide feedback to the digital creatures, which would adapt their behavior. At Elixir, Hassabis and his comrades had invited an RL professor to give a talk at the studio; Silver recalls being “gobsmacked” by the presentation.[22] But Richard Sutton’s textbook led Silver in deeper, describing the algorithms that a programmer might deploy to make RL a reality. If computers could be trained to understand the world by interacting with it directly, they could learn what to do in any given -- 61 of 565 -- situation, effectively programming themselves. “That’s real AI,” Silver remembers thinking. Apologizing to his girlfriend, who preferred the climate in the south of France, Silver wrote to Sutton and proposed that he should do a PhD in snowy Canada. “AI is often vaguely defined,” Silver said later. “But Sutton’s book showed that we could really pin it down. AI is a system that learns for itself how to solve problems. “You are trying to build a system that can figure things out without human instruction. Rich Sutton was laying out a road that stretched all the way to the horizon. I wanted to reach that horizon. That was it for me.”[23] In 2004, Silver moved to Alberta. When Elixir closed the following year, it was Hassabis’s turn to choose a fresh direction. Like Silver, he needed to recuperate: “It was probably the hardest time in my life; I was in pieces,” he said later. Like Silver, he decided that the best way to rebuild himself was to go back to learning. As a precocious undergraduate, Hassabis had rejected symbolic programming because it failed a basic test: It was not how human intelligence operated. The way Hassabis saw things, this test was crucial, because powerful machine intelligence, when it eventually arrived, would emulate the human variety. After all, the human brain provided the only grounds for believing that a general, flexible intelligence was even possible: The brain was, as Hassabis said, the “existence proof” that underpinned AI endeavors. It followed that, to build artificial intelligence, one should understand human intelligence first. Following this logic to its daunting conclusion, Hassabis resolved to do a PhD in neuroscience.[24] Hassabis lacked the normal training for this. He was neither a doctor nor a biologist nor even a psychologist; he knew little of the chemistry of the brain and the nervous system. Undismayed by this considerable gap in his résumé, he brandished his credentials as a computer scientist, talked up his exotic entrepreneurial record, and won acceptance to the doctoral program at University College London. His supervisor, a rising academic star named Eleanor Maguire, feared that, after managing a company, Hassabis might find academia dull. But Hassabis assured her—truthfully, in that moment— -- 62 of 565 -- that he just wanted to study: “I don’t need to be in charge of things, I’m fine to learn,” he insisted. Maguire responded by assigning her new student a pile of scientific papers to read before he showed up on campus. In the summer of 2005, Hassabis married his Cambridge girlfriend. The two headed off on honeymoon to an Italian beach, the groom lugging his neuroscience homework with him. • • • HASSABIS’S BEACH READING that summer concerned the workings of the human memory. The scientific papers featured two schools of thought: One side believed that memories are like videos, waiting to be recalled from the data banks of the brain; the other side held that the brain retained only a bare essence of the past, so that memories were not so much recalled as reconstructed. The first camp asserted that memories are faithful to what really happened. The second camp regarded memories as unreliable, since the process of reconstruction left room to hallucinate fictitious narratives. “It was obvious to me that the reconstructive people were right, that it wasn’t a video,” Hassabis said later. He was particularly taken by studies of witness statements, which showed the power of suggestion. If witnesses were subtly prompted by a police questioner, they would give the evidence that the police officer wanted, proving that the brain could be tricked into reconstructing a memory of something that had never happened. Sitting on the beach, before even embarking on his PhD, Hassabis experienced a eureka moment. If memories were reconstructed, then perhaps they might use the same brain mechanisms as imagination. To be sure, the goals of memory and imagination were opposed: Memories supposedly involve real events, whereas imagination deliberately conjures novelty. But the process of creating vivid images, historical or hypothetical, seemed linked. And although memory is retrospective and imagination often visualizes the future, Hassabis recalled a line from Through the Looking-Glass. “It’s a poor sort of memory that only works backwards,” the White Queen tells Alice. -- 63 of 565 -- “Maybe looking at this other capability, imagination, would be a creative way to resolve the memory debate,” Hassabis remembers thinking. On his first day as a PhD student, Hassabis arrived bubbling with research hypotheses. Not all of them were sensible. “He’d have these crazy ideas,” his friend and fellow PhD student Dharshan Kumaran recalled. “But he was very creative.”[25] Kumaran was Hassabis’s comrade from the junior chess circuit. He had gone on to study medicine and become a doctor before embarking on a neuroscience PhD; when it came to the physiology of the brain, he knew a lot more than Hassabis did. But Kumaran soon discovered that they made a productive pair. Hassabis was full of wacky conjectures. Kumaran supplied rigor. “I was always trying to pick the game-changing idea,” Hassabis recalled. “And then it’s like, ‘Oh, the details! Who knows, whatever.’ ” “We always used to have our discussions in the tiny kitchen in the neuroscience building,” Kumaran remembered. “Demis would explain some idea and I’d say, ‘No, that doesn’t seem right to me.’ Then he’d go back and think about it in a different way. And then we’d be back in that kitchen.”[26] After batting ideas back and forth, Hassabis and Kumaran eventually came up with a workable experimental strategy. They would test patients who had suffered damage to the hippocampus, a pocket of tissue located deep within the brain where the day’s memories are recorded. If Hassabis’s hypothesis was correct—if memory and imagination were linked—patients with memory loss would also struggle to imagine things. With Maguire’s help, the two friends identified a handful of rare patients who had damage to the hippocampus but were otherwise healthy. Sure enough, when Hassabis and Kumaran carried out their tests, they got the result that Hassabis had anticipated. Damage to the hippocampus harmed patients’ ability to visualize a three-dimensional scene, like a day at the beach or a shopping trip. A part of the brain associated with memory was indeed crucial to imagination. -- 64 of 565 -- At the start of 2007, Hassabis, Kumaran, and Maguire published their research in the prestigious Proceedings of the National Academy of Sciences.[27] It was a remarkable achievement: Science listed it among the year’s top neuroscience breakthroughs, and the paper eventually collected over seventeen hundred citations.[28] But for Hassabis, the scientific accolades were only part of the thrill. He was more excited by the implications for the existential questions that preoccupied him. If memories were not records of some objective external reality, but rather simulations created by the brain, perhaps all of reality might be a mental fabrication. “The structure of the world is, basically, created by the mind,” Hassabis told me during one of our long talks. “I was trying to prove that with my neuroscience work: that reality might be a simulation.” I thought back to my first encounter with Hassabis, when he appeared on that conference stage, invoked Immanuel Kant, and explained that the ultimate goal was “to understand our own minds better.” Physics explained the external world. Neuroscience explained human beings’ internal world. But neuroscience was the nobler subject of study, because the mind creates reality.[29] “I like Kant’s idea that the world out there is basically a mental construct,” Hassabis reiterated to me now. The stuff that physicists studied —matter, energy, time—was ultimately less real than the bits of information pulsing between neurons. “I’ve always been fascinated by the Brain in a Jar thought experiment,” Hassabis continued, referring to a philosophical device for exploring the relationship between reality and consciousness. The experiment posits that a brain is removed from a body and kept alive in a jar of nutrients. It is hooked up to a computer that simulates sensory inputs: a view of the sea, the sound of the waves, the smell of salt and seaweed. The computer creates a virtual reality so convincing that the brain believes it is still living a normal life, taking in the outside world through eyes, ears, and nostrils. Of course the brain is really just experiencing a simulation—a series of ones and zeroes generated by a computer. -- 65 of 565 -- I could see why Hassabis was taken with this thought experiment. It was the premise for The Matrix, another classic sci-fi film that he was fond of. The humans in the movie believe they are living real lives, but their minds are connected to a master computer and their bodies are harvested for energy. If the brain in the jar could be misled into believing that it was experiencing reality, how can the rest of us be sure that we are not also living in a simulation? If the brain in the jar can see and hear and smell things that have no presence in the real world, perhaps our consciousness may also be a product of ones and zeroes in the brain, utterly divorced from whatever physical reality exists around us? “Reality may be constructed by the mind,” Hassabis repeated. “That’s what I think Kant was getting at.” I marveled at how Hassabis’s experiences and ideas appeared to slot together. His curiosity about physics had spurred him to work on AI, the ultimate tool to unlock science. His curiosity about AI had led him to investigate the human brain, the existence proof for intelligence. His work on simulations in video games echoed his research on simulations in the mind. And influences as various as Immanuel Kant, Gödel, Escher, Bach, John Daugman’s tutorials, and neuroscience had pushed Hassabis toward the same bottom line: that information was the fundamental unit of reality. The God of the Bible, to Whom Hassabis had prayed as a young child, controlled the universe and all reality. The god of Black & White controlled a simulated universe, coded into a computer. But if information and simulations were as real as reality itself, the boundary began to blur. A computer that simulated a limited world could be seen as a limited god. A futuristic computer—a powerful AI—might be limitless, infinite. OceanofPDF.com -- 66 of 565 -- A CHAPTER 4 THE GANG OF THREE t the start of 2009, when Hassabis was finishing his PhD, Peter Molyneux popped into his life again. The two had been in contact as Hassabis, feeling the entrepreneurial itch return, had begun to noodle start- up ideas to pursue after academia. Seeing an opportunity, Molyneux dispatched an emissary to inquire whether Hassabis would like to join forces on his next game. When Hassabis demurred, the emissary asked why. “I want to be the person who solves AI,” Hassabis responded.[1] The question, as ever, was how to go about this. Hassabis toyed with start-up ideas that, like Elixir, would combine an opportunity to work on AI with a commercial payoff. He imagined an AI-powered recommendation algorithm, focused on matching consumers with the right TV shows. He considered a variation: a recommendation system for the bag of tricks in Apple’s new app store. But these stepping-stone projects failed to quicken Hassabis’s pulse. Now thirty-two, he was conscious of his biological clock. It was time to pursue his life’s ambition more directly. As a first step toward founding a company focused squarely on AI, Hassabis enrolled as a postdoctoral fellow at the Gatsby Computational Neuroscience Unit, a division of University College London. The Gatsby’s researchers worked at the intersection of human and machine intelligence, making it an obvious place to scout for kindred spirits. The unit’s director, Peter Dayan, had demonstrated how the reward signals in reinforcement learning, the branch of AI chosen by David Silver for his PhD, mimicked the dopamine signals that reward human learning. Before him, the Gatsby’s founding director, Geoffrey Hinton, had pioneered another area of AI: so- -- 67 of 565 -- called neural networks, loosely analogous to the human brain, with layers of decision centers, or “neurons,” hooked up to one another. Later widely celebrated, but in 2009 still relatively obscure, Hinton’s approach to AI was known as deep learning. Despite the Gatsby Unit’s pedigree, Hassabis’s first weeks as a postdoc were not encouraging. Most of the faculty dismissed his AI ambitions as far-fetched. They were trying to get reinforcement learning or deep learning to work on a few simple problems; they did not dream of superhuman machine intelligence. Still determined to identify soul mates, Hassabis figured that perhaps his fellow postdocs might be more on his wavelength, so he scanned the bios on the Gatsby website. But out of the entire cohort of Gatsby fellows, only one single researcher confessed to an interest in building powerful AI. Hassabis made a mental note of the photo accompanying the bio, and tried to remember the researcher’s name: Shane Legg—it had a cowboy ring to it. Perhaps he would bump into this cowboy guy in the Gatsby cafeteria. A few weeks later, with no serendipitous encounter having taken place, Hassabis headed off to America, hoping that it might prove more AI- friendly. In typical Hassabis fashion, he had arranged to do two fellowships simultaneously, at MIT and Harvard.[2] His MIT supervisor, Tomaso Poggio, was a physicist turned computational neuroscientist who taught the university’s oldest machine-learning class: Hassabis got along with him immediately. Later, Poggio would say that, of the many Nobel Prize winners he had encountered, the majority were both brilliant and lucky— lucky in the sense that they had chosen a research problem that turned out to be both consequential and soluble. But a handful of Nobel laureates, Poggio said, were so exceptionally gifted that they were going to win the prize no matter what. In this category, Poggio placed the physicist Richard Feynman, the biologist Francis Crick, and his postdoctoral student Demis Hassabis.[3] While at MIT, Hassabis also had the opportunity to meet Geoff Hinton, the founding director of the Gatsby, who was visiting from his subsequent base at the University of Toronto. By now, Hinton had labored on his deep -- 68 of 565 -- neural networks for a quarter of a century, exhibiting what Poggio called a “religious belief” in their potential. In 2006, the professor and two coauthors had shown how to train big neural networks, dubbed deep belief networks: The step-up in size brought a step-up in performance.[4] In 2009, around the time he met Hassabis, Hinton received another boost: the opportunity to turbocharge these networks with special chips—“graphics processing units” originally designed to render video game images.[5] But even though Hinton was riding high, he found to his astonishment that Hassabis was as cocky as he was. “Demis is the only person I’ve ever met who’s more competitive than me,” Hinton recalled of that encounter.[6] To Hassabis’s disappointment, most of the researchers at MIT and Harvard were no more AI-forward than the Gatsby crowd in London. Strangely, the denizens of MIT’s prestigious Computer Science and Artificial Intelligence Laboratory, housed in a wacky, multicolored funhouse designed by the architect Frank Gehry, were especially traditional. The elder statesman of the laboratory, the computer scientist Marvin Minsky, had attacked neural networks since the 1960s. Nearly half a century later, neither Minsky nor his younger disciples seemed inclined to update their perspective. One day during his stint with Poggio, Hassabis met the head of the Computer Science and AI Lab, a revered figure named Patrick Winston. Hassabis had prepared for this moment: If the opportunity presented itself, he wanted to test his vision for an AI start-up on a pillar of the MIT establishment. Following his script, Hassabis told Winston of his plans. “I explained to him I was going to do reinforcement learning and deep learning,” Hassabis recalled. “We had a long discussion. He said it was just nonsense.” Hassabis put a brave face on this rejection. “Well, this is kind of fantastic,” he remembers telling himself. If an establishment figure had embraced his vision, it would have signaled that it lacked originality—and that rival entrepreneurs with similar ideas would soon be circling. The fact that Winston had dismissed his arguments, while saying nothing that Hassabis hadn’t heard before, confirmed that an AI company would at least -- 69 of 565 -- be contrarian. But however much he reassured himself, Hassabis was up against tough odds. He couldn’t launch a company unless somebody believed. A contrarian with no following is merely an oddball. Returning to London in the late summer of 2009, Hassabis wrote a business plan for his envisaged AI start-up, which he called Project Orion. The idea was to build a computer version of the human brain, which could handle the subtle functions that Hassabis had studied for his PhD, such as memory and imagination. But the document’s principal significance was to reveal how far he was from getting a project off the ground. The business plan listed Peter Molyneux among Orion’s likely backers, but the truth was that Molyneux was not going to invest in a science experiment. The plan named David Silver as part of the founding team, but Silver was a long way from being ready to go back into partnership with Hassabis. With no capital, no brilliant cofounder, and an establishment that viewed artificial intelligence as a lot crazier than video games, Hassabis was in a far weaker position than he had been with Elixir. At the start of October 2009, Hassabis boarded an elevator at the Gatsby Unit. A curly-haired man with the lithe figure of a dancer maneuvered himself and his suitcase into the small space beside him. Hassabis immediately recognized the stranger as the elusive Shane Legg, the sole postdoc in the building who shared his AI ambitions. Glancing at the suitcase, and realizing that Legg was about to disappear out of town, Hassabis struck up a conversation. “Where are you going?” he asked brightly. Legg answered in a pronounced New Zealand accent. He was on his way to New York to attend something called the Singularity Summit. Hassabis had never heard of this event. “What happens there?” he asked. Legg explained that the Singularity Summit was an annual gathering of AI believers. The “singularity” was the moment when machine intelligence would surpass human intelligence. At that point, machines would know how to improve themselves without human assistance, and their capability would explode upward. -- 70 of 565 -- The elevator reached the ground floor. The doors opened; the singular New Zealander marched off; the conversation was over. Hassabis was left to ponder what had just happened. After years of preaching the AI gospel to others, he had just met somebody who might be more wired into that world than he was. • • • SHANE LEGG’S PATH to the Gatsby Unit had been as improbable as Hassabis’s. Born three years before Hassabis, in 1973, he had grown up in Rotorua, New Zealand, a tourist town known for its boisterously active geyser and bubbling mud pools. At school, Legg had struggled so much with his lessons that his teachers suspected him of subnormal intelligence. His worried mother took him off to have an IQ assessment. After administering the test, the assessor grew furious. “Why have you brought this child here?” he demanded. Legg’s mother was upset. “I’m just being a responsible parent,” she pleaded. “This boy is somewhere above what we’d call gifted,” the test official retorted.[7] Notwithstanding this news, Legg continued to perform miserably in class until his parents bought him a computer. Like other children who, defying stereotype, find that solitary immersion in digital technologies is terrific for their mental health, Legg can still recite the vital statistics of his first love: “a Dick Smith VZ200, eight kilobytes of RAM, a Z80A processor.” Soon, Legg became an avid coder, and as he incorporated mathematics into his programs his performance in his school math class went from mediocre to outstanding. But he was not especially gracious about this change in his fortunes. He blew off his homework, railed at his teachers, and came top in the exams anyway. As Legg reflected later, the lesson of his youth was, “Write my own rules, because there’s a lot of bullshit around. A lot of bullshit.” -- 71 of 565 -- Legg moved on to the University of Waikato to study mathematics. Feeling unfit and very skinny, he joined a gym and started weightlifting. To his surprise, he found it fun, and so he added cardio to his training program. Waikato was, perhaps remarkably, a hot spot for aerobic dance, and Legg thought to himself, “Why not?” Pretty soon, he could do the splits, and then he moved on to jazz dance, which incorporated elements of ballet. “So then I thought, ‘Damn it. Why don’t I just do ballet?’ ” Legg recalled later. “That sounds crazy. I’ll try it.” In a country where the archetypal male played rugby, Legg’s newfound hobby caused his parents to jump to conclusions. “My father went into denial about me being gay,” Legg recalls. “Which was quite weird because I wasn’t gay. “And my mother was interestingly fine with me being gay. So that was nice to know. If I had been gay, she would’ve been accepting of that.” After Waikato, Legg did a master’s thesis in math at the University of Auckland. The obvious next step was to do a PhD, but Legg’s rebellious side intervened again. Deciding that a PhD in theoretical mathematics was “rather hard” and “kind of pointless,” he took a routine job as a database engineer: There was not much Ender about him. For the next eighteen months, he settled into debugging corporate IT systems. The more-than- gifted boy who had bubbled up from Rotorua’s mud pools was on track to contribute zilch to scientific progress. In late 1999, at the height of the bubble in internet stocks, Legg moved on to his next gig: a start-up called Intelligenesis. This shift, as random as the proverbial flapping of a butterfly’s wings, would affect the path of AI history. Intelligenesis turned out to be no ordinary company, and its founder, a Brazilian American prodigy named Ben Goertzel, was not remotely normal, either. Perpetually unkempt, his face framed by waves of curly hair that fell below his shoulders, Goertzel had earned a PhD in mathematics by the age of twenty-two; by thirty, he had published four textbooks and had held university appointments in math, psychology, and computer science. In the mid-1990s, Goertzel had decided that the rapidly -- 72 of 565 -- expanding internet was not merely a revolutionary communications platform or an opportunity to rethink commerce. It was a new form of intelligence, a sort of worldwide mind: Each computer on the network resembled a neuron, each hyperlink a synapse, each human user a sense organ.[8] In 1997, Goertzel founded Intelligenesis to build upon his vision. Goertzel set his team to work coding a digital brain. In some ways, this was a precursor to Hassabis’s Project Orion, but coming a decade earlier, and with computing still much less advanced, it was even more of a long shot. The plan for how this brain would function was a lot wackier, too. Goertzel planned to build a “Baby WebMind,” then release it on the internet and watch it mature into adulthood. Pretty soon, Goertzel insisted, a worldwide, self-organizing machine consciousness would spring forth, ending humanity’s monopoly on intelligence.[9] Goertzel’s baby-brain project was a bubble-era flight of hubris. But Legg enjoyed this trippy stuff so much that, after working remotely from New Zealand, he picked up and moved to New York, where Intelligenesis had an office. The way Legg saw things, the WebMind’s viability was almost beside the point. What captivated him was the question of what it meant to build intelligence, which further led him to consider what intelligence was, or what it might be. To someone whose intelligence had been misclassified as a child, no subject could be more riveting.[10] Having learned in his school days to write his own rules, Legg was also drawn to Goertzel’s willingness to call bullshit. For example, Goertzel showered contempt on the classic definition of machine intelligence, proposed by the British mathematician Alan Turing, which states that a computer program can be deemed intelligent when it can pass itself off as human. Goertzel countered that a successful WebMind wouldn’t ever meet that standard: It would never sound humanoid on such topics as how sex feels, and there would be no point teaching it to appreciate a sunset. “But this doesn’t matter,” Goertzel continued. “I’d rather have a computer program that knows it’s a computer and discourses about its computer-ness intelligently, than one that can successfully pull off a pathological-liar act and fool us into thinking it’s human.”[11] -- 73 of 565 -- In late 2001, Intelligenesis went bust, and Legg spent the next nine months backpacking around Africa and Europe. But he stayed in touch with his old boss, and in 2002 Goertzel asked him to comment on a book that he was working on. The manuscript was provisionally entitled Real AI, the premise being that most artificial intelligence projects were contemptibly timid. But Legg urged that dismissing the bulk of AI as not real—and therefore puny or phony—might be impolitic. To protect Goertzel’s standing in the research community, Legg persuaded him to go with an alternative title: Artificial General Intelligence. Around the time he coined that term, Legg read The Age of Spiritual Machines by the inventor and futurist Ray Kurzweil. The book’s central message was that the power of computers was about to surge dramatically. Since the 1960s, technologists had referred knowingly to Moore’s Law, and the very familiarity of this concept had caused people to stop thinking about it. But the prophecy that the power of semiconductors would double every two years was not something to be filed and forgotten in the back of the collective mind. To the contrary, the more time went on, the more mind- boggling its implications. A doubling every two years implied exponential progress—the curve would start off relatively flat and then later explode upward. As of the year 2000, human brains were fully one million times more powerful than the most advanced machines, a huge gap in capability. But at some point in the 2020s, Kurzweil calculated, computers would draw even. And once that happened, they would accelerate past their creators. The singularity was approaching. Rather like Goertzel, Kurzweil occupied a tenuous space between visionary and weirdo. Convinced that humans would soon reach a kind of computer-assisted immortality, he was determined to cling on to life until the bots took over the task of extending his existence. For a while he ingested 250 supplements per day, not to mention multiple cups of green tea and glasses of alkaline water, all the while anticipating the moment when people could transcend human limits and enjoy life without end, or what Kurzweil called “indefinite extensions to the existence of our mindfile.”[12] But whatever the merits of this transhumanist vision, Kurzweil’s exposition -- 74 of 565 -- of Moore’s Law affected Legg deeply. Reading his book, Legg realized that not only was the power of computers set to explode, the amount of data that could be fed into the machines would also explode, thanks to the spread of the internet. With better hardware and more data, the third component of AI —algorithmic advance—would become insanely valuable. “I was thinking AI is going to be real. I buy the basic argument of Kurzweil. So, if that’s the case, I should go get a PhD,” Legg recalled later. “So, I thought to myself, ‘OK, what’s the biggest issue in AI as I see it?’ “And I thought, ‘Well, the biggest problem is there isn’t a measure for intelligence.’ “It’s very hard to build an intelligent machine if you can’t even measure it.” Having discussed these matters thoroughly with himself, Legg found his way to Marcus Hutter, a researcher with similar ideas at IDSIA, an AI institute in southern Switzerland. Under Hutter’s supervision, he embarked on a PhD, assembling multiple possible conceptions of intelligence, and applying his mathematical training to their measurement. Critics of AI would later accuse its creators of skipping over this challenge: of failing to grapple with the difference between intelligence and mere statistical facility. [13] But Legg confronted this conundrum head-on; and the definition of intelligence that he settled on combined Goertzel’s rejection of Turing with his own advice to Goertzel. AI should not be measured by its ability to impersonate humans, since imperfect human cognition amounted to an arbitrary benchmark. Rather, the true mark of intelligence was generality. Together with Hutter, Legg landed on a summarizing phrase: “Intelligence measures an agent’s ability to achieve goals in a wide range of environments.” Legg completed his PhD thesis in 2008, entitling it “Machine Super Intelligence.” As he was finishing, he spent a year at the Swiss Finance Institute, where he used reinforcement learning to try to predict the gyrations of the capital markets. He contemplated the idea of launching an AI start-up—an attempt to succeed where Goertzel had failed—but decided that computing was still too immature and that entrepreneurship was too -- 75 of 565 -- precarious. He convened a weekly neuroscience reading group, curious about how the functioning of the human brain might expand his conception of intelligence. In April 2009, seeking to deepen his understanding of this human-machine frontier, he began a postdoctoral fellowship at the Gatsby Computational Neuroscience Unit. • • • THE FOLLOWING AUTUMN, a few weeks after that brief elevator encounter, Hassabis spotted a second opportunity to get talking with the mysterious New Zealander. Legg was due to deliver a talk on the future of AI on October 31. He had given his presentation a mock-scary title, “The Halloween Scenario.” On the appointed day, Hassabis showed up in a dimly lit classroom and took his seat in the audience. Legg stood by a large screen, a fit figure with laughing eyes and a patterned button-down shirt. For the next two hours or so, he ranged enthusiastically from definitions of intelligence to developments in machine learning to the exciting spillovers from neuroscience, culminating with a Kurzweil-inspired explanation of how computer power was exploding. But then he pivoted to a dark warning—the Halloween Scenario of his title. “If this all takes off, we’re going to have people with brain-like AI architectures plugging their systems into exaflop supercomputers, and we have no idea how to deal with the consequences,” he declared. His words signaled alarm. His tone was still excited. “These systems to start with, maybe they are not that dangerous,” he went on. “Maybe they are not going to take over the world, do anything crazy. But they are starting to converge on the types of algorithms that we really should be worried about.” Computers that became superintelligent, and that developed agendas of their own, might subjugate or annihilate humans, Legg was saying. “We don’t know what they are going to do!” he concluded.[14] Legg was channeling a common view from the Singularity Summits. Steeped in science fiction, fascinated by catastrophe narratives as moths are -- 76 of 565 -- fascinated by fire, the Singularity crowd contemplated Armageddon without seeming upset by it. Ray Kurzweil, one of the conveners of the Singularity events, summed up the standard sequence of emotions at the prospect of superhuman AI: Wow!, Uh-Oh, and What Other Choice Do We Have but to Move Forward? The shock, the fear, and then the resignation resembled the predicament of medical patients confronted with terminal diagnoses: At first, they contemplate doing something radical with their remaining time alive; next, they revert to their routines as though nothing much has altered. Human beings lack a vocabulary with which to process existential threats; they cannot think the unthinkable. They are wired to act as though life will carry on, for otherwise they could not act, and life would become impossible. Legg’s London audience, far removed from the Singularity vibe, was confronting the risk of Armageddon for the first time. That protective human wiring had yet to kick in. The seated figures listening to Legg were still processing the Wow! and Uh-Oh parts of Kurzweil’s sequence. A man spoke up from near the front. Legg had raised the prospect of an existential threat. If he really thought the future of humanity was on the line, surely he wasn’t going to end his lecture there, without saying what should be done about it? Legg’s eyes twinkled, as though the question put him in mind of some familiar private joke. With a touch of theatrical swagger, he faced the room and asked rhetorically, “So, what do we do about this?!” He evidently had no answer to offer. All he could do was to repeat the listener’s question. Nervous giggles greeted him. The prospect of computers threatening humanity seemed absurd. The absurd is a close cousin of humor. • • • SEATED AT THE BACK of the classroom, Hassabis was less fixated on the surreal threat than on what he had heard beforehand. His impression from that elevator encounter had been confirmed. Legg’s imagination was wide open -- 77 of 565 -- to the possibility of superhuman AI, and he had the technical tools to help build it. Hassabis was not going to miss the opportunity to bond with the New Zealander this time. He went over and introduced himself. Years later, thinking back on the conversation that ensued, Hassabis became emotional. “It was amazing to find Shane because it’s like finding an oasis, right? Until then, as far as I knew, I was the only person thinking about these subjects. “I mean, there were other people interested in AI, like David Silver, but I’d got them interested. I’m good at galvanizing people, so I can’t take that as an independent measure of whether I’m really on to something. “So to find someone who’d had an independent path, who had come to the same conclusion…that was a very powerful corroboration. “I’m getting goosebumps thinking about it.” I thought about how different Hassabis’s experience might have been in California. In Silicon Valley, change-the-world dreamers were practically normal, and there was a network ready to support you. In London, out-of- the-box ambition was an isolating trait. You had to search to identify collaborators. “I’d never read any of Kurzweil,” Hassabis said. “For me, it was Gödel, Escher, Bach, Asimov, Iain Banks, my own practical work with Molyneux, Blade Runner. Those were my influences. I don’t think I even knew who Kurzweil was because I was sort of in my own parochial backwater, just dreaming dreams. “But Shane came with all these ideas. He’d studied with Marcus Hutter. He’d done this theoretical proof of intelligence. He was already going to the Singularity Summit. He had coined the term artificial general intelligence. He had all these contacts in the nascent AGI world that I didn’t even know existed. “Here was a guy who’d dedicated, independently from me, his entire life to this mission. And that’s why we had both ended up at the Gatsby. Because we were kind of looking for each other. Neither of us knew what the other one would be like. But we were looking. -- 78 of 565 -- “Shane had left New Zealand, worked with Ben Goertzel, done academia. And eventually he came to the Gatsby for the same reason I was there: Because it was one of the only places in the world that was combining neuroscience and machine learning. And so we both must have thought that there had to be interesting, like-minded people there. “And it turned out there weren’t that many. “But there was one, and that was enough.” • • • IN THE WEEKS after the Halloween lecture, Hassabis and Legg met regularly for lunch at an Italian restaurant near the Gatsby. They talked about AI, where it was going, how best to pursue it. They agreed that this was an exciting moment. Thanks to Hinton’s deep belief networks, the application of GPU chips to his systems, Legg’s work on measuring intelligence, and Hassabis’s grasp of the AI intuitions that flowed from neuroscience, the field was approaching a watershed. But they differed on what to do about this. Legg still worried that it was too early to start a company: Who on earth would finance a venture with no prospect of delivering a product? Hassabis countered that academia was too slow: To build powerful AI, they would need a team, a sense of urgency, and freedom from academic bureaucracy. A mission-driven start-up—a Manhattan Project, as Hassabis liked to say—could surely be funded by the right sort of investor: a billionaire, or possibly a multibillionaire, with the stomach for the long horizon. Inevitably, Hassabis proved to be the more insistent of the two: It was a case of Ender versus not-Ender. On December 29, 2009, shortly before midnight, Legg emailed his new friend to say that he was ready to go forward with an AI start-up. The comrades named their future company DeepMind: It was a nod to deep learning, the school of artificial intelligence pioneered by Geoff Hinton; but also to Deep Blue, the computer that had beaten the world champion at chess; and also to Deep Thought, the supercomputer from The Hitchhiker’s Guide to the Galaxy. -- 79 of 565 -- Having lined up a scientific cofounder, Hassabis still needed some more backup. For this he turned to an unusual figure: a brash and brilliant autodidact who was as driven and energetic as he was. If Hassabis and Legg had each had improbable journeys, Mustafa Suleyman’s was even more extraordinary. Born in August 1984, eight years after Hassabis, he, too, had grown up in the melting pot of North London with an immigrant parent and religion in the family. But there the resemblance with Hassabis stopped. Mustafa’s father was a devout Muslim from Syria who spoke broken English and drove a cab, putting in shifts from four in the morning until eight in the evening. His mother was a nurse who had grown up in England and converted to Islam; her work was as grueling as her husband’s. Whereas Hassabis’s bohemian upbringing combined the committed Christianity of his mother with the secular humanism of his father, Suleyman’s parents were united in their faith. There was no music in the home, and no books or newspapers, either. The young Mustafa embraced his parents’ religion, and he loved the comradeship that came with it. On Fridays he attended a North London mosque, standing toe-to-toe and shoulder-to-shoulder with polyglot believers: Pakistanis and Bangladeshis, Indonesians and Malaysians, Somalis and Sudanese, Turks and Arabs, “all worshipping together in unity and equality and alignment,” as Suleyman said later.[15] At around the age of twelve, he would go to his school playground to recruit kids with roots in these countries, then he would lead them in prayer. His message stressed the moral obligation to do good. To be raised as a London Muslim in the 1990s was to understand the value of community: Britain was welcoming only up to a point. If Muslims wanted society to be just, they would have to work to make it just. And they would have to work together. When Mustafa turned fifteen, the case for community took on a whole new meaning. His parents split up and his mother followed her new partner to New Zealand. His father returned to Syria to remarry, leaving Mustafa in London to fend for himself and his fourteen-year-old brother. The boys had no home to go to, so they found shelter with friends. His father sometimes sent them money, but it was never enough; luckily, Mustafa was -- 80 of 565 -- resourceful. From the age of eleven, he had bulk-bought candy bars and sold them to schoolmates at a markup, eventually hiring older kids to market to their peers and renting out friends’ lockers to warehouse his inventory. When he turned fifteen and his parents left London, Mustafa moved into buying and selling cell phones. At eighteen he graduated to fixing and trading cars, freshening up classic BMWs and Mercs with the help of his younger brother. Hardship did not stop Suleyman from flourishing academically. Identified early on as an outstanding student, he had won a place at one of the best government high schools in Britain, Queen Elizabeth’s School in North London, founded in 1573 by the Protestant nationalist Queen Elizabeth I and dominated, four centuries later, by students of Asian origin. In this high-octane environment, he was among the top academic performers. A sympathetic teacher helped him to develop a passion for reading: She picked out novels, explained the difference between the conservative Telegraph newspaper and the liberal Guardian, and gave him a subscription to The Economist. At seventeen, Mustafa won a national enterprise award for a plan to make London’s tourist attractions accessible to disabled visitors. To document the barriers that a disabled person might face, he borrowed a hospital wheelchair and wheeled himself around the monuments and museums of the city.[16] Around this time, Suleyman began to hang out with a new friend—a guy named George Hassabis. He often stayed over at the Hassabis home, borrowing the empty room vacated by George’s older brother, Demis. Since Demis’s days on the chess circuit, the Hassabis parents had built some capital by fixing up houses, and to Mustafa—by now known universally as Moose—their home represented middle-class security. Every Wednesday evening, George, Moose, and Moose’s girlfriend, Marilyn, worked as chess instructors at the education center that the Hassabis parents now ran. One day, Demis showed up for a barbecue in the family’s garden. Already in charge of his own gaming studio, he cut a distant and impressive figure. • • • -- 81 of 565 -- IN 2002, his last year at high school, Suleyman was among a handful of classmates admitted to Oxford or Cambridge.[17] Suleyman opted for Oxford, and his chosen field of learning was philosophy and theology. The choice reflected his predicament as a British Muslim. The terrorist attacks of September 2001 had unleashed a wave of Islamophobia, and Suleyman believed in Muslim solidarity more passionately than ever. But he was also gravitating toward a secular vision of Islam, and Oxford pushed that process further. He remained dedicated to social justice, but he doubted the existence of God. He loved the post-racial culture of the mosque, but he rejected the gender inequality and the condemnation of gay people. His freethinking eclecticism was obvious at a glance. He wore pink corduroy trousers held up by suspenders and a traditional flat cap on a head of flowing curls. He smoked a pipe and sported Ellesse sneakers. During Suleyman’s first spring at Oxford, the United Kingdom backed America’s invasion of Iraq, and the predicament of a modern-minded British Muslim became all the more excruciating. On the one hand, Suleyman wanted urgently to do something to help his community.[18] On the other hand, he was not going to side with the medieval clerics who had perpetrated the 2001 attacks, nor did he sympathize with Iraq’s brutal dictator. That summer, observing a teenage rite of passage for young Brits, he backpacked around Europe with three non-Muslim friends; while the others honored the true purpose of such journeys, which is to drink, Suleyman joked and played cards and refused to touch alcohol. He had the affect of a carefree undergraduate, and yet he stood apart. His tough London adolescence had felt like real life. Oxford was a bubble. Toward the end of his second year at university, Suleyman bumped into one of his friends from that summer trip around Europe. He told the friend that he was done with libraries and books and intellectual abstractions. He was going to drop out of Oxford. “I remember being absolutely flabbergasted,” the friend recalled later. Nobody ever dropped out unless they were in a serious mess. Especially for someone from an underprivileged background, Oxford was a magic escalator. -- 82 of 565 -- “I remember thinking, ‘Oh my God, you’ve just ruined your whole life,’ ” the friend went on. “I was like, ‘Hey, you got the golden ticket.’ ”[19] I asked Suleyman what he thought about this golden-ticket warning. “I always had nothing, so I was fearless when it came to flipping the table. I was like, ‘Why am I studying church patristics when I should be changing the world right now?’ ”[20] Quitting Oxford for London in the summer of 2004, Suleyman hopscotched from one job to another. Together with George Hassabis, he started yet another business, this time selling milkshakes at the market in Camden, not far from Elixir’s second office. To satisfy his appetite for social justice, he teamed up with a friend who had founded a nonprofit called the Muslim Youth Helpline, which offered counseling to depressed or confused Muslims. His comrades at the helpline came from what Suleyman described as “chaotic, crazy-ass backgrounds,” some of them even more extreme than his—crushed families, drug addictions, mental demons, to the extent that, a couple of years later, one helpline employee set himself alight in a park in North London.[21] But the helpline provided Suleyman with a chance to come to terms with his identity. It offered a way to be both Muslim and secular, Muslim and modern—in fact, Muslim and British. Around this time, Suleyman completed his separation from his parents’ faith, progressing from agnosticism to a decisive atheism. Toward the end of 2009, Suleyman decided that his career needed a reboot. The Muslim Youth Helpline had failed to satisfy his yearning to bring about social change, and two subsequent jobs had not fulfilled him, either. He had worked at the London mayor’s office, hoping to improve society on a larger scale, but had found government work to be bureaucratic. He had cofounded Reos Partners, a conflict resolution consultancy, aspiring to overcome divisions by opening people’s minds, but it turned out that collective therapy seldom dissolved enmities. Searching for a better way to catalyze societal progress, Suleyman’s attention fastened on a more powerful force. He had recently become aware of Facebook, the social network that had sprung out of nowhere, attracting by that point a community of some 132 million monthly users. In its capacity to shape -- 83 of 565 -- ideas and change societies, the platform dwarfed anything that Suleyman had seen before. If Facebook continued to grow at the same pace, it would soon exceed the cultural heft of Islam or Christianity. Pondering these magnitudes, Suleyman made up his mind. If he truly aspired to make an impact on the world, he should become a technologist. • • • FOR A TWENTY-FIVE-YEAR-OLD Londoner with no university degree, a future in tech seemed like a stretch. But Suleyman knew of someone who could help: George Hassabis’s brother, Demis. Since that fleeting barbecue meeting a few years before, Suleyman had struck up a relationship with the older Hassabis. Demis had exited Elixir with a few million pounds, and in 2007 Suleyman had proposed a business partnership. Hassabis would provide capital; Suleyman would purchase apartments; these would be rented out, and the two would split the proceeds.[22] The partners discussed their plans over a few lunches near University College London, where Hassabis was studying for his PhD, and Suleyman talked about what he was reading. In contrast to Hassabis, who loved books about science, Suleyman devoured volumes on politics, sociology, and complexity theory, logging every title that he went through. In December 2007, the two friends met for lunch at a steakhouse, right by the Smithfield meat market where livestock had been slaughtered since the tenth century. There, with the ghosts of medieval London swirling around them, the two twenty-first-century idealists got talking, and Hassabis asked Suleyman about his latest reading. Suleyman reached into his pocket and pulled out a slim volume: Owning Your Own Shadow, by the Jungian analyst Robert A. Johnson. The book was about the unlit underbelly of the human ego. It asked readers to understand their shadows: to acknowledge the harsh sides of their personalities and the darkness that they cast on those around them. Hassabis said that sounded interesting. “Oh, you should take it,” Suleyman offered. -- 84 of 565 -- Hassabis brushed the book aside, saying that this sounded like the sort of thing that Suleyman should be reading. Years later, feeling he had been plunged into darkness by Hassabis’s shadow, Suleyman recalled this exchange. “The irony of that is ridiculous,” he said, with a grimace. “Surreal,” he added, almost shuddering. Then he conceded, “I mean, Demis was partly right. It was actually what I needed.” A dozen years later, when Suleyman’s career veered temporarily off track, his own shadow had a lot to do with it. I looked across the restaurant table at the success story in front of me. The teenager who had grown up without a home had recently been named the head of AI at Microsoft. Of course he hadn’t pulled off this transformation without wrestling with demons. How could he not have a shadow? “Maybe you both needed that book,” I suggested. “Maybe most of us need it.” “We all need it,” Suleyman replied. “At least I acknowledged it.”[23] • • • WHEN THEY WERE not discussing books, Suleyman and Hassabis played poker. Together with Demis’s younger brother, George, they would meet up at the Vic, an iconic casino on the noisy Edgware Road in London. The Poker Room on the second floor stayed open, alarmingly, twenty-four hours per day, and played host to a steady stream of tournaments. Demis would wear a pair of outsized shades that concealed his expression and gave him a badass Blade Runner appearance. On July 15, 2010, Suleyman met the Hassabis brothers at the Vic for one of their sessions. George and Moose were soon bounced from the tables, but Demis came in fifth, pocketing almost two thousand dollars.[24] A disappointed George went home, returning to the group house that he shared with Moose and a few others. To celebrate Hassabis’s winnings, -- 85 of 565 -- Demis and Moose found a restaurant table and ordered chocolate cake and vanilla ice cream. Pumped up on dopamine and sugar, Hassabis let Suleyman in on a secret.[25] His performance that evening proved that he could master poker if he tried; but, he confided, he had higher uses for his energy. For the past several months, he had been plotting a stealth AI company, whose mission was not just to invent AI, but rather to go after AGI, artificial general intelligence. Hassabis already had a plan, a brilliant cofounder, and a list of potential collaborators. He just needed an investor. Playing off what Suleyman had told him over the course of their lunches —that he wanted to leverage technology to drive social change—Hassabis stated the obvious. If Suleyman was truly interested in improving the world, artificial general intelligence was the best possible vehicle for him. An infinity machine would have infinite potential. “We saw technology as a force multiplier,” Hassabis recalled of this discussion. “Charities, consultancies, the London mayor, whatever: You put one unit in, you get one unit out. The goal was to put one unit in and get a million units out. Otherwise, how are you ever going to do enough? That resonated with Mustafa.” The way Suleyman remembers it, part of him wondered whether Hassabis’s vision was even remotely realistic. Hassabis liked to talk up his track record building machine learning for games, but the real world was vastly more complicated than any game, and driving societal change was difficult. Economies and societies turned on fights and emotions and who said what to whom. No artificial intelligence, however powerful or general, could comprehend, let alone shape, the billions of potential permutations. A vigorous debate ensued between the scientific visionary and the social activist. It was Ender versus Ender this time, and the more the two missionaries challenged each other on the substance, the more they bonded at a deeper level. Both were articulate and forceful, willing to follow ideas to their logical extremes; even when they were at loggerheads, they were nonetheless a pair of kindred spirits. And while their professional experiences were different, they were complementary, too. Hassabis was -- 86 of 565 -- imagining a technology that would have huge impact on the world. Suleyman had sought to understand how the world needed to be impacted. Suleyman could see the opportunity in front of him. Of course, Hassabis’s vision was audacious and his mission might fail. But a voice was whooping inside Suleyman’s head: “Wow! Amazing!”[26] Two days later, Suleyman followed up with an email. He congratulated Hassabis on his poker winnings and pointed him toward a Wired magazine profile of Sergey Brin, the cofounder of Google. According to the article, Brin was pouring part of his $15 billion fortune into computational medicine, hoping to drive a revolution in the pace of drug discovery. Perhaps Brin was the sort of billionaire who might fund an AGI company? [27] Hassabis emailed back. “Very cool article,” he said approvingly. And then he proposed a collaboration. He was gearing up to pitch a different billionaire at the next Singularity Summit, in August; he had the outline of a business plan, but it needed fleshing out, and the summit was approaching. Might Moose have time to help draft the document? OceanofPDF.com -- 87 of 565 -- O CHAPTER 5 FOUNDING DEEPMIND n August 14, 2010, the Singularity Summit got underway in San Francisco. Ben Goertzel, the Baby WebMind promoter, showed up with his resplendent hair, looking, as one observer said, as though he had been blown off course on his way to the Glastonbury Festival. A Canadian inventor named Steve Mann, who described himself as a cyborg, wore a black woolly hat and computer-enhanced eyeglasses. A biophysicist called Gregory Stock proclaimed that “science has slammed the evolutionary process into fast forward.”[1] To associate with this fraternity was to invite ridicule: An earlier summit had been mocked as “the Bay Area coming-out party for the tech-inspired philosophy called transhumanism.”[2] But Shane Legg and Demis Hassabis had eagerly accepted speaking slots. Mustafa Suleyman had come along for the ride. He was sleeping on a couch in Hassabis’s hotel room. Hassabis eyed his fellow conference-goers warily. Asked by a journalist if he called himself a “singularitarian,” he responded politely, “Maybe it’s because of my British side, but it’s a bit Californian.”[3] The DeepMind trio were out in force because they were desperate for money. In the months since he had persuaded Legg to be his cofounder, Hassabis had rattled his tin at the investors who had backed Elixir. Not one was prepared to back him. An AGI company was too far out: It was both technically daunting and commercially dubious. Hassabis secured a promise of some money from his MIT supervisor, Tomaso Poggio, not least because Poggio’s wife, a psychologist, had told her husband that whatever that electrifying English guy did, they should absolutely back him.[4] But -- 88 of 565 -- Poggio was good for only £100,000—about $150,000. The closest Hassabis had come to landing a real investor was an eccentric financier named David Gammon. With Suleyman as his new wingman, Hassabis had spent hours courting Gammon over lunches in a London pub. The financier seemed open to making this unusual bet because his motives were themselves unusual. “There is a deeply religious aspect to AGI,” Gammon explained to me later. “It’s really finding God’s algorithm.” I asked Gammon to elaborate. “The architect of the universe is what we may call God,” Gammon answered. He had swept-back hair and the military bearing of an English country gentleman. “You know, I believe. I have a very strong faith. I really believe the universe and the world advances. And you have to keep pushing.”[5] Hassabis’s main hope at the Singularity Summit lay in another believer, the Catholic contrarian Peter Thiel. By 2010, Thiel was already a legend, famous for founding the original digital payments company, PayPal; the original software-first defense company, Palantir; and for being the earliest investor in Facebook, a bet from which he had already reaped north of a billion dollars. In 2008, a start-up named SpaceX had endured its third consecutive rocket-launch failure. Two days later, Thiel’s Founders Fund swept in, investing $20 million in the company. Thiel was also a Singularity enthusiast. A competitive chess player, he had been thrilled when the chess program Deep Blue defeated the reigning human champion, Garry Kasparov. It was only a matter of time, Thiel reasoned, before AI dominated most cognitive tasks at the heart of the information economy.[6] The way he saw things, this could only be a good thing. The world urgently needed technological advance to ward off economic stagnation; indeed, the very survival of the free-market system might depend on an AI breakthrough.[7] Following this line of reasoning, Thiel financed the Singularity gatherings, believing they might generate some wacky but high-potential bets. It would only take a single start-up hit for his investment to pay off multiple times over. -- 89 of 565 -- Hassabis, Legg, and Suleyman showed up at the conference hall at the Hyatt Regency hotel in San Francisco. They scanned the room for Thiel: He was supposed to be scouting for AGI start-ups, and they were desperate to be scouted. But Thiel was nowhere to be seen. Apparently, he had paid for the proceedings but felt no need to attend them. “That’s so strange!” Suleyman remembers thinking.[8] Legg knew that at the Singularity Summit the previous year, Thiel had thrown an after-party for the conference speakers. He would do the same this time. That would be the chance to get his attention. That evening, the gang of three showed up at Thiel’s home near the Golden Gate Bridge, on the northern tip of the San Francisco Peninsula. Sure enough, Thiel was there, already surrounded by a crowd of supplicants. The interlopers from London felt awkward. “We were standing around sheepishly and wondering how to approach him,” Suleyman remembered. “We were irrelevant nobodies. He was a titan.”[9] For the second time that day, Legg’s experience of the Singularity scene proved useful. Through his PhD work and his trip to the Singularity Summit the previous year, he had befriended Eliezer Yudkowsky, one of the high priests of the community. Now Yudkowsky walked Legg and Hassabis over to meet Thiel. “These are some of the smartest guys in the whole field of AI and they’re starting a really ambitious company,” Yudkowsky said.[10] Hassabis was ready with his lines. “I was preparing for that meeting for a year,” he said later.[11] Instead of pitching Thiel with yet another start-up story—and doing so in the middle of a crowded party, with thirty seconds in which to blurt above the noise—he hooked Thiel with chess, observing that there was a deep tension between the bishop and the knight, with the two pieces carrying the same value yet possessing vastly different capabilities. [12] Sure enough, this gambit was enough to open up the board. After a brief back-and-forth, Thiel invited Hassabis and Legg over to his home the next day to explain their ambitious venture. When the duo showed up to make their pitch, Thiel greeted them in his workout gear; he was still sweating. His butler brought him a Diet Coke. He had a grave expression.[13] -- 90 of 565 -- Hassabis explained his vision for a company that would build powerful AI, drawing on the latest insights from neuroscience and capitalizing on the explosion in computing power. “This might be a bit much,” Thiel thought to himself. Still, Eliezer Yudkowsky’s endorsement meant a lot. Thiel had known Yudkowsky for half a dozen years, and DeepMind was the first company that he had recommended.[14] Hassabis kept talking. New machine-learning methods were just starting to work. Between them, Hassabis and Legg knew everyone in the field. What’s more, Hassabis had entrepreneurial experience, having built Elixir. He had won the five-game Mind Sports Olympiad on five occasions. Thiel began to think this project was A-plus on the science, and maybe F on the business model. But he also had a further thought. Hassabis was an extreme case of what venture capitalists call an “authentic” entrepreneur: not a mercenary who starts with a desire to get rich from a start-up, then casts around for a plausible idea; rather, a missionary who feels compelled to work on a particular challenge, then starts a company as a way of tackling it. The good thing about missionaries is that they never quit: Even if they have to work around the clock and pay themselves nothing, they will keep obsessing about the problem. “I always think that people aren’t really entrepreneurs in the abstract, but there’s maybe one great company that somebody has in them,” Thiel reflected. “It was Demis’s destiny to build this one.”[15] Thiel told his visitors to come back in a few weeks to pitch to his partners at Founders Fund. He seemed curious, but wary. • • • THE DEEPMINDERS RETURNED to London, and Hassabis and Suleyman worked on the last revisions to the business plan. The document ran to some thirty pages, ranging from high-concept futurism to the specific milestones that DeepMind would reach before its next funding round. It explained why artificial general intelligence was necessary; why it would prove possible to -- 91 of 565 -- build; and why DeepMind’s approach to the challenge was superior to that of its rivals. The first part of the plan, on why AGI was necessary, offered a statement of the technology’s power. It quoted Bill Gates: “If you invent a breakthrough in artificial intelligence, so machines can learn, that would be worth ten Microsofts.” The plan combined that statement with a theory of necessity. Society faced problems of unprecedented complexity, from stabilizing capitalism, which had blown up in the financial crisis of 2008, to feeding an expanding population. Progress on these challenges was depressingly limited, reflecting a phenomenon known as the “ingenuity gap”—Suleyman had borrowed this phrase from a book of the same title. As the business plan explained it, the human brain had limited storage capacity; humans had limited lifespans; grouping humans together resulted in diminishing returns because big organizations are sluggish. In sum, the intricacy of society’s most pressing challenges lay beyond the reach of human capabilities. “AGI is the solution to this problem,” the plan stated boldly. Next, the plan explained why the moment was right for an AGI company. A chart showed the capability of supercomputers exploding upward to 1020 calculations per second by 2025, a hundred thousand times more than at the time of writing. Just as fiber optic cable and high- performance routers had enabled the internet in the 1990s, accelerating semiconductor progress would fuel the AI revolution. Meanwhile advanced imaging technology was making it possible for neuroscientists to peer inside the brain, yielding an increasingly detailed picture of its workings—a picture to which Hassabis’s PhD had contributed. The insights from this scholarship were ignored by the majority of AI developers; with almost sixty thousand neuroscience papers appearing annually, extracting the gems was impossible for nonspecialists. By mining and indeed contributing to this literature, DeepMind would have an advantage. “The human brain is composed of a number of distinct parts, each with its own anatomical structure and algorithmic capability,” the business plan explained. “While these components are powerful in isolation, the real -- 92 of 565 -- genius of the brain lies in the way in which they have been deeply integrated together to produce general intelligence.”[16] Perhaps most audaciously, DeepMind asserted that its ultra-ambitious conception of AI made progress more likely. Other AI research sought to maximize the chances of success by focusing on narrow tasks: training a system to recognize images, for example. In contrast, DeepMind was out to build agents, not merely systems, the difference being that agents would be more general and proactive. Rather than being engineered by humans to master a single finite task, agents would learn broadly and autonomously, mastering a wide range of problems as they interacted with their environment. The jump in complexity was vast. Rather than building the digital equivalent of a house, DeepMind aspired to build a city. To realize their ambition, Hassabis and his colleagues would have to teach agents complex skills such as a mastery of concepts, the business plan continued. To endow agents with this facility, DeepMind would leverage its expertise in neuroscience, and specifically in research identifying a set of complex interactions in the hippocampus and the prefrontal cortex, which appeared to transform memories into broader abstractions. The challenge of turning intuitions from this literature into a machine-learning architecture was immense, the business plan conceded, but the alternative of ducking the challenge was hopeless. AI systems that merely matched one type of symbol (the image of an apple) with another type of symbol (the word “apple”) were not connecting with the real world: They didn’t really know anything. “In a system that describes symbols solely in terms of other symbols, it is not clear where the meaning resides.” To use a favorite AI expression, such machines could never be truly intelligent because they were not “grounded” in reality. Fifteen years later, with the benefit of hindsight, not all DeepMind’s prophecies look accurate. Insights from neuroscience proved useful during DeepMind’s early days, but not after 2015 or so. The question of whether AI systems need to be “grounded” is still hotly debated. Large language models such as ChatGPT or Gemini are not directly taught concepts, yet these systems exhibit an impressive grasp of how the world functions. A -- 93 of 565 -- feeling for concepts somehow emerges as a by-product of statistical mastery, bypassing DeepMind’s ambition to program conceptual understanding explicitly. And yet, despite these debates and details, DeepMind’s road map was prescient. The computational power driving AI models grew almost exactly in line with the business plan’s projection.[17] More to the point, the prediction that it would be possible to build human-level AGI by around the year 2030 seemed outlandish in 2010. But as of 2026, and allowing for the fact that the definition of AGI remains fuzzy, DeepMind’s forecast appears to have been just slightly conservative. • • • IN SEPTEMBER 2010, Hassabis and Suleyman appeared before a strange kind of investment committee. David Gammon, the religious investor, declared himself ready to commit capital, but he would only go forward if DeepMind did things his way. Entrepreneurs seeking his support were required to visit his home in Cambridge and pitch to Gammon, his artist wife, and his three teenage sons. Each family member would get an equal say on whether to invest. “I said to Demis, if you can’t explain this to my youngest son, Cameron, you’re not going to get his vote,” Gammon remembered. There was a painfully large gap between the grand science of the DeepMind business plan and an invitation to chat with a middle schooler. But Hassabis and Suleyman complied. If they refused to humor wacky investors, they might not raise any capital at all: Their project was itself wacky. The visit to Cambridge went smoothly; Hassabis won over all five family members. “I couldn’t have got this off the ground without David Gammon,” Hassabis recalled. “But can you imagine going to pitch to a fourteen-year-old? I mean, I’m a serious scientist.” A few weeks later, Hassabis and Suleyman returned to California. Gammon was planning to kick in a few hundred thousand pounds, but DeepMind needed a lot more than that. Hassabis remained desperate for Thiel’s money.[18] -- 94 of 565 -- If Gammon was eccentric in a certain way, the world of Peter Thiel felt even more unsettling. As part of the get-to-know-each-other process, Thiel’s lieutenants arranged for the visitors to meet three other Founders Fund start- ups. The most successful was Palantir, the software-first defense contractor that was helping US forces to track terrorist networks. The DeepMind visitors squirmed: To Suleyman, a human-rights-oriented opponent of the Iraq War, a defense contractor was by definition suspect. “I was scared that Palantir was building surveillance apparatus, and that we would be pushed to develop algorithms for it,” Suleyman said later. A second Founders Fund protégé, a military-robotics venture, alarmed Suleyman even more: The first thing you saw when you showed up at its office was a gun mounted on a pair of caterpillar treads. “This was not the vision of AI that we were trying to build,” Suleyman recalled with a shudder. The third start-up was disconcerting for a different reason. Named Halcyon Molecular, it embodied the Singularity spirit: Its mission was to extend human lifespans massively by applying AI to medicine. The coming collapse in the cost of gene sequencing would generate reams of genetic data, and the data would be analyzed by powerful AI. Life expectancy would rise in lockstep with Moore’s Law.[19] Hassabis and Suleyman were all in favor of ambition, but Halcyon seemed borderline crazy. “We thought it was loopy,” Suleyman recalled. “So capital intensive. So speculative. “Part of me was like, that’s brilliant because we are in the same bucket. But part of me was like, who’s making the decisions here?” The decision maker, it turned out, was a charming, voluble, midthirties futurist by the name of Luke Nosek. An engineer and friend of Peter Thiel’s since the formation of PayPal, Nosek was another booster and financial backer of the Singularity Summits. He was also the Founders Fund partner responsible for the team’s most far-out wagers, notably SpaceX and Halcyon. “I always wanted to bring about a positive singularity for humanity, or at least prevent a negative one,” Nosek told me, his words tumbling out in a -- 95 of 565 -- torrent of excited energy. But then, abruptly, he turned quiet. The silence went on for a few moments. “Sorry, I just lost my train of thought,” he resumed. “The singularity is just such an intense concept that that’s what it causes people to do!” he added, now beaming again. “It causes people to lose their ability to think sometimes! “I would say if it doesn’t affect you emotionally, and if it doesn’t affect your thinking at all, well then you’re doing something wrong! “You’re not truly visualizing how transformative a superhuman intelligence would be!” When Hassabis and Suleyman visited Halcyon, Nosek was there to greet them. A year or so earlier, he had assumed the role of company president, declaring, “Of all the Founders Fund companies that have…the potential to change the world—Facebook, SpaceX, Palantir—Halcyon is the one with the chance to do so in the most profound way possible.”[20] But when he got talking with Hassabis, Nosek fell head over heels, again. “It was like a lightning bolt,” he told me. “Here was the first person who actually seemed really, really competent, really, really brilliant, and dedicated to building AGI. I had met people before who had the same goal, but I didn’t believe that they could do it. “When you are early in a particular technology, it’s probably just not possible to build something with it,” Nosek continued. “And so then you are going to meet people who are just crazy, just dreamers. But when I encountered Demis, it became clear immediately: ‘Oh yes, we should definitely invest! And I need to join the board of this company!’ ” Two years earlier, when he had backed SpaceX, Nosek had pulled off the contrarian investment of a lifetime. The company’s value had since shot up more than tenfold, and it was just getting started. Now Nosek saw similarities between SpaceX and DeepMind. Back in 2008, Nosek had grasped SpaceX’s potential by ignoring the apparent craziness of its mission —to build a private rocket company. Instead, he had focused on a shift in the technological backdrop that rendered the hitherto impossible just about conceivable. In the case of SpaceX, the shift was that cheap, off-the-shelf -- 96 of 565 -- aerospace components had become available, offering a chance to outcompete the government’s NASA space program, which relied on ultra- expensive, bespoke components. In the case of DeepMind, the shift was that algorithmic breakthroughs such as deep learning might unlock the potential of ever more powerful computing. Nosek’s SpaceX wager had also been a statement of faith in its founder, Elon Musk, who was visionary and driven. Likewise, this British computer-scientist-neuroscientist-chess- master guy exuded similar missionary voltage. “His mantra was AGI, AGI, AGI,” Nosek recalled of Hassabis. “And speaking with him almost made your brain break.”[21] Nosek had never before pushed to join the board of a start-up outside the United States. Nor would he normally have joined a board in cases where his fund’s potential investment was tiny. But when it came to DeepMind, Nosek was far too excited to follow standard procedures. He was not going to pass up the chance to witness the coming of the singularity. “Peter was agnostic about whether we needed a board seat,” Nosek recalled of Thiel. “He was like, ‘Well, we’ll see what happens.’ “And I said, ‘We’ll see what happens?! What do you mean, we’ll see what happens?!’ ” A gap opened up between Thiel and Nosek. As a general matter, Thiel doubted that going on boards was a good use of his partners’ time. Start-ups should be left to sink or swim. The art of venture capital, he liked to say, was to back contrarian ideas, not coach company founders. Thiel and his partners also disliked the fact that DeepMind was in London. It was almost an article of faith that any start-up worth backing would be within forty-five minutes of Stanford University; the Founders Fund team joked that investing in Britain was like investing in Somalia. But Nosek respected Hassabis’s argument that London would be the best place for attracting undervalued European talent. If AI worked, the mother of all recruitment battles would break out on the West Coast; meanwhile, under the radar, DeepMind would hoover up the best scientists in Europe. Besides, Nosek could see that if he wanted to get along with Hassabis, he had better let him get his way. Hassabis was a British patriot who bristled at -- 97 of 565 -- the presumption of American preeminence. “I feel like British culture represents a lot of good values, and I wanted to show that you could build a deep-tech company in Britain,” Hassabis said. “I guess I was rooting for the underdog.” If it had not been for Thiel’s contrarianism, the gap between Nosek and the rest of the partnership might have doomed DeepMind. Most venture partnerships decide on investments by voting; if a handful of the partners see hair on the deal, the deal will be rejected. But Thiel had taken the unusual position that collective decision-making should be avoided. The way he saw things, if investments were chosen based on voting, the Founders Fund portfolio would consist of middle-of-the-road start-ups to which nobody objected. Given that all the profits in venture come from a few improbable moon shots, this sort of consensus portfolio would deliver mediocre performance. Following this logic, Thiel had empowered his partners to go with their guts. If Nosek wanted to back DeepMind, he should be allowed to do it. Thanks to Peter Thiel’s contrarianism and Luke Nosek’s lightning-bolt infatuation with Hassabis, the die had been cast: DeepMind was in the money. • • • IN DECEMBER 2010, Founders Fund wired $2.3 million to DeepMind. For this rather modest investment, Team Thiel assumed ownership of a bit less than half the company.[22] The terms were not so different from the ones that Hassabis had been offered by the hard-drinking London venture capitalists when he had founded Elixir. But this time he couldn’t tell the venture investors to get lost. There was no other capital available. The equity split among the gang of three reflected Hassabis’s dominance. He owned nearly as many shares as Founders Fund; more tellingly, he owned a whopping nine times more than Legg, his scientific cofounder. Explaining this lopsided apportionment, Hassabis said, “DeepMind was my idea and I was the driving force behind it.” Besides, -- 98 of 565 -- Hassabis was ready to work initially without drawing a salary, since he had a cushion from Elixir. He brought CEO experience to DeepMind; and to set the company’s direction, he believed he needed to own as many shares as possible. However much he disliked the idea of dominating others, he was determined to control his quest for superhuman intelligence. Legg, for his part, was too laid-back to fight for a bigger portion of the pie. “I figured if DeepMind was going to be successful, then whatever I had was still going to be quite a lot. So it was fine,” he said later.[23] Meanwhile Hassabis owned fully fourteen times more shares than Suleyman, the uncredentialed latecomer of the three. But, having been invited in on a temporary basis to help with the drafting of the business plan, Suleyman had worked and negotiated hard enough to earn the title of cofounder.[24] His willingness to sleep on Hassabis’s couch on their visit to San Francisco had signaled what was to come. Suleyman put himself at the center of the action at every opportunity. DeepMind opened its first office in an attic on Russell Square, an elegant London landmark laid out during the Napoleonic Wars, when the president of the United States was Thomas Jefferson. The British Museum was just a couple of streets over, and University College London was an easy walk away; Hassabis and his colleagues sometimes went to the Gatsby cafeteria for lunch, hoping to recruit restless researchers. Hassabis warmed to the location for other reasons, too: The London Mathematical Society was next door, and Hassabis liked to imagine that Alan Turing’s spirit was still there, even as DeepMind built on his foundations.[25] If you wandered past the Mathematical Society, you came to a pedestrian crossing where, on a humid morning in September 1933, the Hungarian physicist Leo Szilard had conceived the idea of a nuclear chain reaction. Hassabis recalled that chain-reaction scene from the opening pages of The Making of the Atomic Bomb, a classic history of the Manhattan Project. “I used to think about Szilard quite a lot,” he told me. “Obviously, we are building AGI to be positive in the world, but AGI is definitely momentous in the same way that the bomb was.” -- 99 of 565 -- I recalled that Szilard was one of the few nuclear physicists of the 1930s to anticipate that his specialty would empower humankind to destroy itself. [26] The rarity of Szilard’s foresight distinguished the bomb from artificial intelligence. In the case of AI, worries of human annihilation were commonplace among the pioneers, not least because the pioneers were steeped in the story of Los Alamos.[27] “I would think about the stakes as I crossed the road,” Hassabis went on. “Even though our office was a tiny attic, we were working on something pretty significant.” I said I was struck by Hassabis’s memory for stories, and his capacity to see himself in them. “Look, I have a very vivid imagination,” Hassabis answered. “That’s why I studied imagination for my PhD. I can imagine people in situations and viscerally empathize with how it must have felt. It’s something I just do naturally. “When I was at Cambridge, I used to get a late-night kebab from a van in Market Square. And at one or two in the morning, Cambridge is peaceful, and I walked down King’s Parade, thinking of all the incredible people who had walked down that street, that same exact street, probably looking pretty much as I saw it, because the university buildings and the cobblestones had been there for centuries. “Isaac Newton, Alan Turing, all my heroes. I could feel them in the bones of the stone, their intellect and vision. They were almost calling out to me.” “And Russell Square had a little bit of that character?” I wondered. “Yes. Because I could feel Szilard. “It’s like visiting a Buddhist school, where the monks have meditated and prayed for hundreds of years and their efforts are layered on top of each other and together they have left a residue in the rocks, so that there is a sort of physicality around you. Cambridge is definitely like that, and to a certain extent, Russell Square is, too. “And maybe we’ve added our own little piece to the charm of Russell Square,” Hassabis concluded. -- 100 of 565 -- Of course, when he opened his office at the end of 2010, Hassabis had yet to make history. All he had was a room for some desks, another for meetings, an area that could be fitted out as a kitchen, and a large closet for the computer servers. His challenge was to populate those desks with the world’s most talented people. For the first several months, hiring topflight scientists to DeepMind proved as hard as raising capital. Recruits needed to believe in the possibility of AGI; that ruled out most academic researchers.[28] But recruits had to be credentialed too: The eccentric Singularitarians had ample belief, but Hassabis was not about to staff his Manhattan Project with a crew of flaky dreamers. To the contrary, he aspired to hire PhD scientists in the mold of David Silver; indeed, the DeepMind business plan had listed Silver as a core member of the founding research group. But although Silver had returned from Canada to a prestigious postdoctoral fellowship at University College London, he had still not forgotten the trauma of Elixir. The most he would offer DeepMind was some part-time consulting, and when Hassabis compensated him with a grant of DeepMind shares, Silver had a neuralgic reaction. Feeling that his independence might somehow be compromised, Silver insisted on giving the shares back. The decision would cost him a small fortune.[29] Hassabis also tried to hire Ilya Sutskever, a Geoff Hinton protégé and later a cofounder of DeepMind’s archrival, OpenAI. Sutskever was an exceptional talent, and a messianic believer in deep learning. Hinton regarded him as the only one of his students who had more good ideas than he had. But even though Hassabis stretched DeepMind’s pay scale in an attempt to get Sutskever on board, he could not offer him enough to be persuasive.[30] Part of the trouble was that nobody believed in the future value of DeepMind stock. Hinton and the other prestigious figures in the field presumed that a research team with no revenues would do interesting science for a couple of years and then go out of business. To boost the chances that top postdocs might risk signing on, Hassabis hit on a new strategy. He offered stipends to leading AI professors, recruiting them as senior advisers in the hope that they would encourage -- 101 of 565 -- their disciples to see DeepMind as a worthwhile prospect. But even this strategy proved fraught. Geoff Hinton was happy to accept an advisory position: Having met Hassabis at MIT, Hinton realized he was a force of nature.[31] Rich Sutton, David Silver’s PhD adviser and the academic father of reinforcement learning, also accepted a DeepMind affiliation. But when Hassabis attempted to recruit Yann LeCun, a distinguished AI pioneer at New York University, LeCun kept his distance. “Frankly, my original opinion before meeting Demis was, this is yet another company that claims AGI is just around the corner and it’s complete BS,” LeCun said bluntly.[32] In the first half year of DeepMind’s existence, the sole PhD scientist to sign on full time was Dharshan Kumaran, Hassabis’s childhood chess friend and later his collaborator at University College London.[33] Kumaran’s credentials as a neuroscientist were impeccable, but a neuroscientist was not going to be the person to build machine intelligence. Then, in September 2011, DeepMind’s fortunes turned. An ebullient Dutch computer scientist named Daan Wierstra became the first AI expert to take the risk of joining the three founders. A close friend of Shane Legg’s from graduate school, Wierstra shared the enthusiasm for neuroscience-inspired AI that animated Legg and Hassabis. His life as a postdoctoral researcher in Switzerland was comfortable and well paid, but he chafed at the way that academics thought small, and he disliked their tendency to regard colleagues as rivals rather than teammates. The prospect of joining a mission-driven start-up excited Wierstra so profoundly that he accepted a significant pay cut.[34] Of course, he also received stock. But he assumed it would be worthless.[35] During his first weeks in London, Wierstra wondered what on earth he had been thinking. By now Hassabis had hired a handful of engineers from the video game world, but the office still felt empty. “There was no furniture, a few boxes lying around,” Wierstra recalled; in order to keep his spirits up, he set about filling the dead space with his own energy. He persuaded colleagues to embrace a convention known as Formal Thursday: DeepMind’s jeans-and-sneakers gang would dress up in suits, inverting Casual Friday. Unsuspecting job candidates who showed up for interviews -- 102 of 565 -- confronted a sartorial sensibility that looked like something out of Turing’s time. “We had to tell them, ‘It’s a joke! Really a joke!’ ” Wierstra remembered.[36] Meanwhile, Wierstra took to showing up in the office and announcing cheerily, “Let’s build Terminators.” Eventually Hassabis took him aside and asked him not to alarm people. • • • IN DECEMBER 2011, DeepMind raised a second round of capital from Founders Fund. This time Thiel’s outfit kicked in $7.9 million, and the Skype cofounder Jaan Tallinn provided a further $2 million or so. Extending the pattern of DeepMind’s earlier backers, Tallinn’s motives for investing were not conventional or commercial. DeepMind’s mission was insanely dangerous, in his view. He invested in order to press for safety. Hassabis kept Tallinn at arm’s length and carried on hiring. By now he was not merely recruiting talent; he was building a platform on which talent could flourish. This involved drawing on ideas that he had been marinating since Cambridge. In most entrepreneurial ventures, the goal is to turn a known technology into a product: This is an engineering challenge. At a deep-tech start-up such as DeepMind, the goal is to invent the technology itself: This is a scientific challenge. Scientific start-ups are harder and riskier than engineering ones, because you can’t be sure that success is even possible. Before Apollo 11 landed on the moon, nobody could be certain that a moon landing could be pulled off; after Apollo 11, imitators benefited from the proof that such landings were doable. And because scientific start- ups are pushing the frontiers, they must be staffed and structured in a special way. At engineering outfits, you need pragmatic problem solvers who will do anything to get a specific product built. At scientific start-ups, you need blue-sky thinkers who wander the unknown—although you also need somehow to direct those wanderings. Hassabis had three ideas on what the DeepMind platform needed. The first was conviction. Nobody could say how AGI was going to be built. But Hassabis insisted that it could be built; the existence of the human brain -- 103 of 565 -- proved that general intelligence was possible. Moreover, Hassabis understood that his sense of conviction had to permeate his research team— otherwise morale would flag and nobody would achieve anything. In the early days of DeepMind, when prestigious figures such as Yann LeCun derided AGI ambition as crazy, every scientist at the company needed to have faith AGI was possible. “We only wanted hardcore believers,” Shane Legg remembered. “We would go to conferences and tell people, ‘We are starting an AGI company and we are trying to build real AI systems with general intelligence.’ “Eighty percent of people would roll their eyes at us. I mean, literally roll their eyes and turn around and walk away. We figured that this was a very efficient way to discover who we should be talking to.”[37] The second thing DeepMind needed was time. Venture investors’ patience is finite, but the vistas of science stretch into the future unpredictably. With this in mind, Hassabis set out to extend DeepMind’s research runway by generating revenues from side projects. In 2011, he assigned a small team to come up with a commercial video game. In early 2012, he revived his old ideas on recommendation algorithms. By now, deep-learning systems were starting to recognize images, and Suleyman took the lead on hiring a team to apply this technology to fashion retailing. A shopper could input an image of a dress, then get back recommendations for dresses with similar shapes, patterns, colors, and styles. It was a way of searching for visual ideas without summoning the words to describe them. The third thing that DeepMind needed was a culture that brought out the best from its scientists. With his habit of collecting ideas from everywhere —movies, books, chance acquaintances in the university bar—Hassabis understood instinctively how to find the special quality in each team member. “He just sees it. He talks to each person at the right level. He knows immediately what’s good about each individual,” Wierstra marveled. [38] To imprint this magpie facility on his fledgling firm, Hassabis hired a cadre of program managers—“glue people,” as they were sometimes called —whose job was to nurture the talent, compensating for social deficiencies. -- 104 of 565 -- Brilliant researchers might be incapable of administrative coordination, of oral communication, or even of looking colleagues in the eye. DeepMind would be a place where such shortcomings were irrelevant. “Look, we have people who are so socially awkward that they lock themselves up in the bathroom for hours on end,” Wierstra explained. “But then they come out of that bathroom with a brilliant insight. “If you can find these people and be gentle to them and mother them, you get something great which other companies are missing.” I remarked that the researchers were almost all men; that the mothering program managers were often women; and that the resulting gender dynamic troubled some of Wierstra’s colleagues.[39] “They’re all men, but many are very awkward men,” Wierstra responded. “So yeah, I’m sure it felt uncomfortable to some people. On the other hand, these men had felt uncomfortable about themselves all their lives. And we created an institution for them to thrive in. “There’s a law of comparative advantage. You don’t have to struggle with your social skills. You shine at what you’re good at.”[40] Bit by bit, the Russell Square premises filled, then overfilled. The conference room became an overflow desk space, with people staring silently at terminals—Helen King, the first project manager to join DeepMind, recalls that the hush was so intense that she could hear the water clicking through the ancient heating pipes. To preserve this library environment, phone calls and company meetings were shifted out into the garden square. When the weather was bad, the DeepMinders did phone calls from the closet with the computer servers, or from the rickety stairwell, which involved tolerating colleagues who clambered past on their way to the company’s sole bathroom. Trevor Back, a newly minted PhD in computational astrophysics who worked on the fashion-recommendation project, recalls interviewing job applicants one after another with the servers whirring by his head. Every hour or so he would emerge, rush out into the square, gulp down some air, and hurry back inside again.[41] In September 2012, DeepMind finally outgrew the Russell Square office and moved into a nearby space on Bernard Street. Almost two years had -- 105 of 565 -- passed since the first fundraising, and DeepMind was on its way. Now it had to demonstrate that it could build something exciting. OceanofPDF.com -- 106 of 565 -- B CHAPTER 6 ATARI y the fall of 2012, when DeepMind acquired its new office, the energy in the AI world had shifted decisively. The futuristic Singularity Summits had been pushed off to the sidelines, and a series of projects from Geoffrey Hinton’s Toronto lab were capturing the field’s attention. Starting in 2010, speech recognition systems began to work, and in October 2012 a soft-spoken Hinton protégé named Alex Krizhevsky showed up at a conference in Italy and announced something astonishing. Working from his bedroom at his parents’ home, Krizhevsky had trained a deep-learning system that smashed all previous records in computer vision: In a competition called ImageNet, devised by the pioneering Stanford computer scientist Fei-Fei Li, his model was nearly twice as accurate as the next one.[1] Hinton immediately formed a company with Krizhevsky and his charismatic collaborator, Ilya Sutskever. Such was the excitement that, after just two months, the trio sold their outfit, consisting of nothing but themselves, to Google for $44 million. Hassabis had seen this coming. As an undergraduate in the mid-1990s, he had understood the shortcomings of symbolic systems that were limited to deduction. Real AI—a computer that could understand messy phenomena such as images or speech—would have to learn by generalizing from copious examples: It would have to think inductively. Back then, as a student, Hassabis could not imagine how such an inductive system would work. The foundations of deep learning had already been laid, but their significance was still unclear: The world’s largest computers were puny relative to human brains, and there was too little data to train models on. -- 107 of 565 -- But by 2010, one of the premises for DeepMind, and indeed its name, was that deep learning was on the cusp of a breakthrough. With ImageNet, Hassabis was vindicated. Hassabis did not merely anticipate Hinton’s success. He had a strategy to surpass it. As he had stressed in his business plan, the road to AGI would involve more than just replicating the various components of the human brain; the components would have to be integrated. The progress in image recognition was therefore just one piece of the puzzle. The larger challenge was to combine deep learning, which would solve challenges such as computer vision, with reinforcement learning, which would deliver other facets of intelligence, including the ability to hatch plans and think strategically. To deliver on this premise, the business plan promised a “Deep Learning Agent” that would master games without being told what the rules were. This sort of model would experiment with millions of possible actions, observe their consequences, and discover what worked. By dint of trial and error, it would induce successful strategies. • • • AT THE TIME of the ImageNet breakthrough, DeepMind was courting an AI scientist named Vlad Mnih, a Ukrainian-born Canadian who would be key to the company’s ambitions. Mnih was another soft-spoken Hinton protégé: He had a handsome, brooding presence, like a moody hero in a Russian novel. But although he was a Hintonite, Mnih was less tribal than many of his colleagues. For the most part, Hinton’s deep-learning group in Toronto barely communicated with the premier center for reinforcement learning at the University of Alberta, where David Silver did his PhD. Mnih was an exception. After taking undergraduate classes from Hinton, he had completed a master’s degree in Alberta before returning to Toronto for his doctorate. Steeped in the teachings of both deep learners and reinforcement learners, Mnih wanted to blend the two techniques. He regarded the failure to combine them as a huge missed opportunity. -- 108 of 565 -- In Toronto, Hinton would say of deep neural networks, “This is how the brain works.” In Alberta, Richard Sutton, the luminary of reinforcement learning, would say of RL agents, “This is how the mind works.” The two professors had similar ambitions, and each had discovered a promising approach. “When you hear things like that, it’s like, ‘Why aren’t you guys working together?’ ” Mnih said.[2] There were reasons for the Toronto–Alberta division. The reinforcement learners loved developing mathematical proofs showing that their systems worked in theory, even if they were difficult to build in practice. The deep learners were the opposite: They loved building systems that worked in practice, even if there was no elegant theory to explain them. A deep neural network was a mysterious black box: impressive when measured by its outward results, opaque when it came to its internal functioning. Alex Krizhevsky’s winning ImageNet entry illustrated this paradox. The software encoded millions of connected decision-making centers known as artificial “neurons.” Every time the system was shown a photograph, the neurons in the first layer processed the pixels, looking for the simplest visual cues—edges, lines, patches of color—much as the human eye begins by noticing contrasts of light and dark. The next layer of the neural network pieced these fragments together into more meaningful shapes: curves, circles, textures. Further into the network, neurons began to pick out recognizable parts of an object—the outline of a paw, for example. Finally, the deepest layers of the system assembled these parts and identified the image: cat, dog, and so forth. The key was to make the neurons work together. When the system was shown an image, each neuron in the first layer took in pixels and turned them into numbers, much as David Levy’s program had done for chess pieces. The neurons multiplied the numbers by a variable known as a “weight,” added in another variable known as a “bias,” and then fed the result through a mathematical filter that determined what sort of signal to send to the next layer of neurons. Each layer of the network repeated this process, until eventually the final layer spat out the name of the object in the photograph. If the system got the name wrong, it would adjust the -- 109 of 565 -- weights and biases in its neurons: With around sixty million of these “parameters” to play with, it nudged them iteratively this way and that, eventually landing on the magic combination of settings that allowed it to match inputs (photos) correctly with outputs (words). These parameters, once discovered, amounted to an algorithm that cracked the challenge of vision. They made sense of the patterns in an infinity of pixels. A primitive version of an infinity machine had been willed into existence. As a practical matter, Krizhevsky’s program performed splendidly. But its precise mechanisms were obscure. At no point during the training had a programmer provided the system with rules—a cat has four legs and a tail, and so forth. Instead, the model had iterated its own way to the right combination of parameters. A human observer could not tell why any given weight or bias had settled at a particular value, and the complex interactions among the millions of variables defied human understanding. Tweaking a weight in one layer of the system would change the signal transmitted to the next, and the ripples would flow through multiple layers, with nonlinear effects that boggled the imagination. Because of the black-box nature of these networks, the scientists who built them often sounded like surprised parents. Look, my child can say so many more words than just a week ago! As with a toddler who acquires language by listening to an adult’s voice, the internal workings of a deep-learning system were impossible to fathom. But the system’s performance was a strong sign of intelligence.[3] One day when Mnih was a graduate student in Alberta, he asked his supervisor, a reinforcement-learning theoretician named Csaba Szepesvári, why he did not take advantage of the advances in neural networks. Hinton had recently published his landmark 2006 paper on deep belief networks. Surely, Mnih urged Szepesvári, this was exciting stuff. For anybody working on AI, new opportunities beckoned. “I know that in practice neural nets work,” Szepesvári confessed. “I just can’t prove anything about them, so I don’t use them.”[4] Mnih might have been tempted to dismiss the reinforcement learners as a blinkered bunch, except that they were plainly on to something. Deep learning took you only so far: It could recognize patterns and make sense of -- 110 of 565 -- data, but it could not create agents that interacted with their environments. This set a limit on what deep learning could achieve, since much human learning occurs through trial and error. By dropping an object, a child learns about gravity. By saying “please” and getting what she wants, she learns the value of good manners. Reinforcement learning equips machines to do the same: to act, and to learn by acting. Unlike deep learning, which involved layered neural networks, reinforcement learning was a conceptual framework rather than a computational architecture. RL researchers described their systems in general terms. Like David Levy’s chess system, an agent would require a “value function,” which estimated the rewards that would accrue from a particular environmental state. It would require a “policy,” meaning a way of deciding what to do next. It might also be equipped with a “model,” allowing it to predict how the environment would change based on its actions. The computational methods that would give life to these abstractions were sometimes left unspecified, and might be crude: The “policy” could be as simple as a lookup table showing what action to take in any given state, for example. But all these elements of reinforcement learning were designed to achieve one thing. Complex environments allow for an infinity of possible actions; to learn by trial and error, the system needs a way of knowing which actions are worth trying. In order to tame infinity, in other words, an infinity machine has to develop algorithms that narrow the search for the best action. Relative to deep learning, with its mind-boggling nonlinearities and impressive practical results, reinforcement learning seemed theoretical and primitive. But to Mnih and other believers in RL, the promise of agents that could learn from experience remained thrilling. Whereas deep learning depended on the availability of training data—human-labeled cat photos, for example—reinforcement learning held out the hope that an AI could collect its own data by acting in the world and observing the consequences of its actions. In principle, there was no limit to the scope of such actions. An RL system could learn anything. -- 111 of 565 -- • • • COMPLETING HIS MASTER’S degree in Alberta in 2008, Mnih returned to Toronto, joining Hinton’s group and occupying a desk in a converted supply closet.[5] There he encountered the mirror image of Csaba Szepesvári’s reluctance to engage with the other tribe’s methods. “What do you want to work on?” Hinton asked his new PhD student. Mnih responded that he wanted to combine reinforcement learning with deep learning. “I’ve tried that. It doesn’t work,” Hinton counseled him. Mnih mentioned his ambition to a few other colleagues. One deep learner after another urged him to drop it. “Once you become a reinforcement learning researcher, it’s a separate community and we’ll never hear from you again,” he was told firmly. Chastened, Mnih spent his PhD years building deep-learning programs to interpret satellite images: It was a classic Hintonite project. His models took in the pixels, detecting edges and colors, and gradually learned to recognize objects in the photographs—industrial buildings, oil tankers, signs of deforestation. The systems worked impressively, and Hinton suggested that the two of them should start a company together. But Mnih hated the idea of pitching investors. In the summer of 2012, Mnih presented his PhD findings at an AI conference in Scotland. The conference was dominated by sober projects like his; futuristic schemes to build artificial general intelligence were nowhere on the agenda. But at the reception the first evening, the tone suddenly shifted. Two conference participants showed up at the party and announced that they were building AGI. They had a start-up in London. They were looking for recruits who believed in the mission. Mnih’s first thought was that this pair sounded crazy. Hinton had warned his students to steer clear of overexcitable Singularity types, with their goofy, let’s-build-Terminators mindset. This AGI duo, who introduced themselves as Shane Legg and Daan Wierstra, appeared to fit that profile.[6] But, as it happened, Mnih had an older brother named Andriy, who was also -- 112 of 565 -- an AI scientist. Andriy had done a stint as a postdoc at University College London, where he had met Shane Legg. Now he assured Vlad that these AGI promoters were not as sketchy as they seemed. They talked AGI, but they were also real scientists. The younger Mnih agreed to have coffee with Legg and the Terminator- building Wierstra. What do you want to work on, the DeepMinders asked him? Mnih gave the answer that usually got him a disdainful look: “I want to try combining neural nets with reinforcement learning.” “That’s what we’re doing!” the pair answered delightedly. In Switzerland, where Legg and Wierstra had done their PhDs, combining the two approaches was actually encouraged. Besides, neuroscience strongly suggested that reinforcement learning would be a necessary complement to deep learning. After all, the reward signals in reinforcement learning resembled the dopamine signals in the human brain. If the brain was the template for artificial general intelligence, RL would be indispensable to building it.[7] Mnih began to think that these crazy guys might know something. He had bounced between Toronto and Alberta trying to combine his two research passions. Perhaps he should bounce himself to London. Besides, Mnih realized, the ambition to build AGI might sound hubristic, but it came with an advantage. In his experience, the culture of academia could be both boringly cautious and terrifyingly competitive: boring because it pursued incremental advances, terrifying because scientists cut each other’s throats to be the first to publish. Legg and Wierstra were promising the opposite experience: the thrilling pursuit of the big leap, and the near absence of rivals. “They were like, yeah, we are going to do stuff where there is no competition because no one thinks it’s possible,” Mnih recalled. “And if it works it will be massive.”[8] A few weeks later, Mnih was invited to a video interview with Hassabis. In advance of the conversation, Hassabis sent over a link to his Wikipedia page. Reading it, Mnih discovered that Hassabis had been a chess prodigy, a -- 113 of 565 -- superstar video game designer, and the five-time winner of the Mind Sports Olympiad. Now he felt intimidated. Mnih dialed into the video call, unsure what to expect. Almost immediately, he was captivated. For one thing, Hassabis was surprisingly approachable. “I remember being struck by how humble he was and how easy it was to connect,” Mnih said later. For another, Mnih found Hassabis’s neuroscience perspective refreshing. “If you’re entrenched in academic computer science, you’re going to be thinking about the next practical step. But if you come at it from neuroscience, you understand the end point of intelligence.”[9] Mnih also recognized in Hassabis that contagious conviction, a quality he had learned to appreciate during his time with the deep learners in Toronto. Precisely because there was no theoretical proof that neural nets would work, it mattered enormously that charismatic lab mates like Hinton and Sutskever insisted that they absolutely would work: confidence substituted for theory. Similarly, Hassabis had evidently been determined to pursue AI since his teen years: He was utterly committed to the mission. “It’s this thing, you have to believe,” Mnih reflected. I recalled a line from the Life Story movie, which had inspired Hassabis to apply to Cambridge. “I’m one of the believers,” says Watson, the codiscoverer of DNA. “Blessed are they who believed before there was any evidence.” Mnih fully expected that DeepMind would go the way of most start-ups and soon be out of business. But by the end of the video call with Hassabis, he had decided to join anyway. “I remember talking to Demis and being like, ‘You know what? I am really a scientist,’ ” Mnih said. “This guy, he’s so passionate about building his company, raising money, doing whatever it takes. I just want to go and work for him.” If Mnih passed up this opportunity and took a postdoc appointment at a university, he would be stuck in the academic peloton, frustratingly boxed in. If he joined these out-of-the-box characters at DeepMind, he would be speeding down a road that stretched all the way to the frontier, and nobody would jostle him. -- 114 of 565 -- • • • MNIH PACKED UP his life in Toronto and moved to London in May 2013, joining a steadily expanding research team at DeepMind’s new office on Bernard Street. The day he began, David Silver also became a full-time employee, overcoming his inhibitions after finding that the hours he spent with kindred spirits at DeepMind were more rewarding than the ones spent at his university laboratory.[10] Silver had by now established himself as an authority on reinforcement learning, but the other newcomers at Bernard Street demonstrated DeepMind’s commitment to intellectual diversity.[11] Two recruits worked on statistical methods for quantifying uncertainty and incorporating probabilities into models.[12] Two had worked on deep learning at New York University under Yann LeCun.[13] Others, including Wierstra, were focused on the intersection between artificial intelligence and human intelligence. A computational psychologist named Chris Summerfield had signed on, working alongside Dharshan Kumaran, in DeepMind’s fledgling neuroscience unit. For decades, disparate computer scientists, statisticians, psychologists, neuroscientists, physicists, and biologists had experimented with AI: The field was so balkanized that it barely existed. Now, at last, DeepMind was unifying it. By the time Mnih arrived in London, DeepMind’s eclecticism seemed somewhat contrarian. The excitement about neural networks had intensified further: Without drawing from other branches of AI, deep learning seemed poised to deliver progress on tasks ranging from medical diagnostics to translation.[14] But DeepMind stuck to its interdisciplinary vision. Whatever the progress in deep learning, the approach essentially promised systems that matched one thing onto another—speech to text, images to text, and so forth. With its emphasis on agentic and general intelligence, DeepMind aspired to something more: an agent that could make plans and achieve goals in multiple environments.[15] The question was, what sort of environments? Shane Legg, whose doctoral work had defined intelligence, took the position that DeepMind should build its own metrics to gauge the progress of its agents. Hassabis, -- 115 of 565 -- who shouldered the burden of fundraising, believed that DeepMind’s advances would appear more credible if measured against an external yardstick. David Silver agreed with Hassabis’s view. DeepMind should not be both the test-setter and the test-taker. External yardsticks were better.[16] Not long after Mnih arrived, the DeepMind team resolved this argument. They hit on the perfect environment for testing an agent: the suite of video games designed in the 1970s and 1980s by the pioneering company Atari. [17] Given the primitive state of video graphics in that era, the computing power required to crack Atari would be affordable. Given that Atari had released dozens of games, an agent would have plenty of opportunities to prove it could be general. And given that most Atari games featured a constantly updating score, the agent would have the feedback it needed to learn how to play better. Besides, as Hassabis noted, DeepMind’s potential investors had grown up playing Atari favorites such as Space Invaders and Breakout.[18] An AI system that mastered these classics would be instantly appealing. The DeepMind brain trust divided into teams, each pursuing a distinct approach to the Atari challenge. Unlike in symbolic programming, there would be no human guidance on how to win a point, how not to lose a life, or what losing a life even signified. The AI system would get only the raw pixels on the screen, a joystick with which to move the cursor, and a running tally of the score. Like a human gamer trying out a fresh release, it would process the pixels, experiment with the joystick, and develop strategies through trial and error. At first, several methods showed promise. One researcher tried breaking a game into its constituent tasks and designing a separate algorithm to deal with each of them.[19] This worked reasonably well, but handcrafted algorithms for dealing with specific tasks could not generalize to multiple Atari environments. DeepMind’s probabilistic duo took a different tack, creating a reinforcement-learning agent that began with a model of how the games worked, then increased its confidence if trial-and-error play confirmed its hypothesis. This second experiment also yielded progress; but when the feedback indicated that the initial model was wrong, the system -- 116 of 565 -- misfired as it tried to generate a new hypothesis.[20] A third team adopted a similar approach, but rather than providing its reinforcement-learning agent with a brittle probabilistic model, it equipped it with a more flexible neural network. To begin with, this group consisted of Vlad Mnih and Koray Kavukcuoglu, a Turkish alumnus of Yann LeCun’s group in New York who would later become DeepMind’s research director. Later, David Silver joined, adding advice on reinforcement learning. Mnih set about training a deep-learning system to interpret the raw pixels on the Atari video screen, providing the agent with a perceptual input. Then he bolted on an established reinforcement-learning approach known as Q-learning—the Q stood for “quality.” The idea was that, by playing randomly, the agent would learn the quality of any action in any given state of the game: If you move the cursor left when the ball is heading to the left side of the screen, paddle and ball will connect and you may get a point, meaning that this state-action pair has a positive Q-value. Over millions of training runs, the agent would try out multiple possible actions in myriad game-states, recording each result in a database known as a Q- table. In theory, if the system tried out every possible configuration, it would fill out every square in the table and its training would be done. It would know the highest-quality action in every conceivable state of the game. It would play at superhuman level. Of course, trying out every possible permutation would have taken decades. To shortcut the challenge—to solve the problem of induction— Mnih added in some more deep learning. As the agent collected experiences, each one consisting of the state of the game, the action taken, and the reward that resulted, these were fed into a neural network. The network then performed the sort of learning exercise at which it excelled. Just as image-recognition systems examined labeled cat photos, eventually learning to recognize an unlabeled cat, so Mnih’s system examined images of state-action pairs that were labeled as having won a point, eventually inducing which other state-action pairs would win points. The trial-and- error marathon needed to fill out the Q-table was dramatically shortened. -- 117 of 565 -- Mnih also confronted a challenge that hearkened back to Hinton’s reservations about combining reinforcement learning and deep learning. A basic RL agent will often gather similar experiences as it explores one part of its environment: Think of a game-playing agent experiencing one section of a maze, for example. In standard deep learning, in contrast, the system starts with a full plate of curated data—labeled photos, for example. It then studies a random sample of these photos, representing the full variety of images in the set. The randomness is crucial. To see why this is so, imagine an image-recognition system that receives an ordered stack of photos. All the cat photos are on the top, then there are trees, then dogs, then lions, and so forth. Next, consider three scenarios. In the first scenario, the system studies the photos in order. The first batch consists entirely of cats, so the model concludes that all images should be labeled cat, irrespective of their content. Early impressions count: Once the system becomes convinced of its cats-always view, it has difficulty shedding it. By studying a subset of the photos without randomizing the selection, the system has landed itself in trouble. In the second scenario, the ordered stack of training photos is shuffled just slightly. The first batch now shows a mixture of cats and trees, so the model performs better. But only marginally better. It learns that photos showing branches should not be labeled cat. But it still concludes that all pictures with eyes, feet, or a tail deserve the cat label. The only way the system can succeed is with the third scenario: The ordered stack of photos is thoroughly shuffled. Now the AI is exposed to every variety of not-cat: dogs, lions, furry blankets, and so forth. The diversity allows the AI to understand what distinguishes cats from other objects. Mnih pondered how to save his Atari agent from getting caught in the first and second scenarios. As the agent collected experiences, it went through periods when it was stuck in one corner of the Atari board, generating a run of state-action-reward data that captured only a microcosm of the game’s possibilities. If the agent tried to learn from these -- 118 of 565 -- unrandomized and unrepresentative experiences, it would never master Atari. To get around this obstacle, David Silver proposed a new riff on an old idea in reinforcement learning. Back in the 1990s, RL scientists had experimented with a technique called memory replay: To extract maximum learning from limited data, experiences were stored in a buffer and the agent learned from them repeatedly.[21] Silver now suggested that memory replay might be useful in a different way. Rather than learning from experiences as they came in, the agent would store them in its memory for a while, then sample from them randomly. Silver’s idea appealed to Hassabis. During his PhD work, Hassabis had studied how humans store memories in the hippocampus, then replay them during sleep, so that salient events are gradually lodged in the neocortex.[22] Silver was proposing something analogous for the Atari system. Mnih liked Silver’s idea for a different reason. From the start of the Atari project, he and Koray Kavukcuoglu had aimed to bridge the Alberta– Toronto divide by turning data from reinforcement learning into something that deep learning could handle. Storing game experiences in a memory buffer would allow the neural network to take samples at random, avoiding the learn-as-you-go mode that caused deep learning to malfunction. To Silver, the memory buffer built on ideas in reinforcement learning. To Hassabis and DeepMind’s neuroscientists, the buffer was playing the part of the hippocampus. To Mnih and Kavukcuoglu, the goal was to turn correlated game-playing experiences into the sort of randomized teaching materials required for deep learning.[23] Memory replay soon boosted the performance of Mnih’s systems, and whichever way you looked at it, the success marked an inflection point. Silver’s vision had been vindicated: Since completing his PhD, his goal had been to prove that an RL agent could learn successfully from raw data.[24] Hassabis’s vision had been vindicated, too: Breakthroughs in artificial intelligence did indeed mimic the interactions between segments of the human brain. Mnih’s long-standing ambition had also been realized: The disparate traditions of Alberta and Toronto were being fused together. -- 119 of 565 -- • • • ON SUNDAY JULY 7, 2013, Mnih sat at home in his London apartment, watching the Wimbledon men’s tennis final. “I was so nervous, I almost couldn’t watch,” Mnih remembered vividly. He loved the British underdog, Andy Murray, and winced every time his opponent, the top-seeded Novak Djokovic, took a point off him. “In Wimbledon finals, every point matters,” Mnih recalled in an intense voice. “I often have to look away,” he added. To ease the stress, Mnih got up from time to time and wandered over to his laptop. He tapped the keyboard and refreshed the screen to check on his Atari agent. One of the nice things about AI was that you could leave the office for the weekend and the system would diligently continue training. Mnih’s agent was busy playing its own rackets game, Pong, which was Atari’s very first creation. When the company started out, its business model was to install game machines in bars. Because of this distribution channel, Pong had to be so simple that even the inebriated could play it. Mnih’s agent, while diligent, was still performing abysmally. Just occasionally, it got lucky and won a point. Usually, it lost 21–0. A typical rally consisted of the ball advancing steadily toward one part of the screen while the agent’s paddle bobbed about indifferently in some entirely different quadrant. The system resembled a toddler who turns circles by a tennis court, lost in her own imaginary world while her parents focus on their topspin. Now and again the toddler sticks out her racket and makes contact with the ball by accident. After peering at a screenful of statistics showing his agent’s performance, Mnih returned to Andy Murray. The match was going Murray’s way. But when Djokovic threatened to break serve, Mnih covered his eyes. Eventually Mnih rose again. He tapped on his keyboard, studied the summary statistics on the screen, and saw something surprising. The agent had just lost another game, but this time the score was 21–4. The statistical probability of randomly winning four points was minuscule. -- 120 of 565 -- Excited, Mnih toggled from the summary statistics to a live shot of the Pong play. He wanted to see whether the four points were for real. Perhaps they had been generated by some malevolent bug in the score-tallying system? Mnih watched as the ball traced a path across the screen. This time, as if by magic, the agent moved its paddle toward it. The next rallies were the same: The agent went after the ball, even if it didn’t always hit it. All of a sudden, the oblivious toddler was behaving like an eight-year-old with her first tennis coach. • • • ONCE MNIH’S AGENT STARTED to win points, it improved exponentially. Equipped with memory replay, the system became superhuman at Pong and at another ball-and-paddle game, Breakout. But the more complex Atari environments still eluded it. For example, a game called Seaquest offered players multiple routes to success: Your submarine could destroy other submarines; it could rescue a diver; it could secure extra oxygen, which would not win points, but would allow it to play longer. Mnih’s agent embraced the first strategy, firing off torpedoes with gusto, but it ignored the other two. After merrily zapping its enemies for a while, the agent would realize that something was amiss, then it would abruptly switch to doing nothing.[25] Thanks to his doctoral research, Mnih knew a lot about coaxing performance out of neural networks. But this Seaquest problem related to the reinforcement-learning part of the model. He turned to David Silver. “How do you debug reinforcement-learning agents?” Mnih asked him. Silver had learned from his PhD supervisor, Rich Sutton, to put himself into the shoes of his agents. “If you are viewing the agent from a human perspective, you won’t understand it,” Sutton would say. “You have to be the agent. You have to experience its experiences.” In this case, Silver said, Mnih should examine the agent’s Q-table. “Look at the data that it’s living and breathing,” he counseled.[26] -- 121 of 565 -- Following Silver’s advice, Mnih pulled up the agent’s Q-table on his screen. He could immediately see the source of the problem. For some reason, the agent had been generating higher and higher estimates of the rewards that would flow from actions that had succeeded previously. Hence its monomaniacal zeal for firing off torpedoes.[27] Pondering this glitch, Mnih realized that it reflected another clash between supervised learning and reinforcement learning. The goal of a computer-vision system was to predict how photos were labeled. Every time it answered right, it would get a simple, standard-size reward, and that round of the training would be over. But a reinforcement-learning agent had no equivalently neat task. Its objective was to maximize rewards over the course of a game, but the quantity of possible rewards was unspecified, as was the duration of the gameplay. With no upper limit to its success, the Seaquest agent could let its imagination run wild. When it hit on a point-scoring strategy such as torpedoing a rival sub, it pictured itself repeating this action over and over, so that the expected value of its strategy shot upward. What’s more, this ebullience fed on itself. Every time the model repeated an action, it would remember its sunny estimate of the expected value from last time, then it would lather on some extra optimism. The result was an unstable upward spiral of expectations: The agent was like a dreamer who finds a couple of hundred-dollar bills on the sidewalk and leaps to the conclusion that he will reap millions of dollars if he walks for a month—and then, having imagined earning the millions, extrapolates further and conjures billions. The dreamer in this analogy would probably give up scouring the sidewalk after a day, no richer, but with sore feet. Likewise, when the Seaquest agent’s cognitive bubble burst, it suffered a crisis of confidence. To solve this problem of spiraling expectations, Mnih broke the feedback loop between the agent’s playing and its learning. He did this by equipping his agent with two neural networks, so that he could separate the two functions. The first network assumed the role of the player, responsible for choosing actions in the game: It was expressly not allowed to learn, meaning that the weights and biases in its network were fixed at the initial -- 122 of 565 -- setting. Meanwhile, the second network served as the observant coach, watching the player’s actions, assessing the results, and adjusting its parameters accordingly. Then, after a suitable period of study, the adjusted parameters in the coaching network were transferred to the playing network. The “suitable period of study” was the crucial element. To see why this was so, consider a human tennis coach. As she watches a tennis player, she forms a hypothesis about how he could perform better, but she lets him play on for a while, allowing time for further observation. If the player wins two points with a serve-volley combo, the coach may think, “Oh, I should tell him to do that more often.” But if she keeps on watching, she will see the player win with lobs, drop shots, and big forehands from the baseline, and she will build a richer plan about the advice she ought to provide, perhaps even concluding that the serve-volley combo is among the less successful strategies. Hypotheses based on induction must be open to revision, in other words. A bit of patience—a suitable period of study—turns premature and counterproductive coaching advice into something valuable. The same idea applied to Mnih’s Seaquest system. If the playing network won points by zapping a few subs, the coaching network would register that this action brought in rewards, but it wouldn’t communicate its excitement to the playing network or tell it to restrict itself to a torpedo-only strategy. Unburdened by those premature instructions, the playing network would continue to experiment with trial-and-error actions, eventually discovering that it could also do well by saving divers and sourcing oxygen. As these additional reward sources were uncovered, the watchful coaching network would take note, grasping that Seaquest is a game of multiple strategies. Eventually, after a suitable period of study, the rich wisdom from the coaching network would be transferred to the playing network. With the problem of spiraling expectations thus solved, Mnih’s system began to master Seaquest.[28] Memory replay had shown that AI systems would perform better if they mimicked the relationship between the hippocampus and the neocortex. The playing/coaching separation established that dividing an AI system into -- 123 of 565 -- discrete, brain-like regions could empower stronger agents. Hassabis’s business plan had promised fundamental breakthroughs in neuroscience- inspired AI. Three years on, he was delivering. • • • IN DECEMBER 2013, Mnih showed up at Harrah’s Lake Tahoe Hotel & Casino, on the western edge of Nevada. He was not there to gamble. The hotel played host to the Neural Information Processing Systems (NIPS) conference, the world’s biggest machine-learning gathering. Wearing a gray sweater, its sleeves rolled up to the elbows, Mnih stood in front of a room so packed that it probably violated fire regulations. The talk was a sort of coming-out party for DeepMind. For the first three years of its existence, the firm had stayed under the radar: Its website consisted of a black screen, a logo, and no further information. But now the company had something exciting to show off: the system it had recently dubbed the Deep-Q Network—DQN to the initiated. Word of the network’s accomplishments had leaked, and professors and power brokers gathered to listen. Mnih took the audience through a series of slides, culminating with videos of his agent navigating Atari games with astonishing precision. In Seaquest, the agent demolished a series of enemy subs, then went up to the surface to get oxygen, then returned to demolition. In Space Invaders, the agent went after the mothership, the target that generated five times more rewards than zapping infantryman adversaries. In boxing, it pummeled the opposing avatar against the ropes so that it had nowhere to escape to. The boxing demo got an appreciative laugh. Perhaps strangely, given the AI community’s on-and-off anxiety about Terminator risk, the audience was amused by DQN’s display of ruthless violence. But the grand finale came with the game of Breakout, which involved batting a ball at a wall of bricks, gradually destroying them. The agent had figured out the old trick for winning with maximum efficiency: First cut a tunnel through the bricks, -- 124 of 565 -- then send the ball through the tunnel so that it ricochets off the back wall, zapping multiple bricks without the player having to do anything. “The room went completely silent,” David Silver remembered. “For every game, the same agent had learned something completely different. People were just blown away. It was a turning point.”[29] Looking back on this triumph, Silver noted how Hassabis had grown since the experience with Elixir. In both cases, Hassabis had announced a maximalist ambition, but in the case of DeepMind, he had also figured out a ladder that led to his destination. At Elixir, he had plunged his company straight into making the most complex video game ever, and the overreach had doomed the project. At DeepMind, the ultimate goal was even grander, but Hassabis had let people tinker while he was building out the scientific team, not setting a demanding goal for them. Then, once the team had assembled, Hassabis had shown exquisite judgment. In choosing the Atari challenge, he had understood that the moment to fuse deep learning and reinforcement learning had arrived. The result was another ImageNet moment—not just for vision, but for agents. OceanofPDF.com -- 125 of 565 -- O CHAPTER 7 THIEL TROUBLE n October 8, 2012, while DeepMind was assembling its Atari team, Luke Nosek of Founders Fund made a trip to Cape Canaveral. There, on Florida’s Atlantic seaboard, a slender white cylinder pointed straight up into the evening sky: a SpaceX Falcon 9 rocket. The vessel contained the dreams of Elon Musk, Nosek’s favorite company founder. If the launch succeeded, the Falcon 9 would become the first commercial rocket to resupply the International Space Station, delivering scientific equipment, clothing, and chocolate-vanilla swirl ice cream for the space station’s resident astronauts.[1] At 8:35 p.m. Eastern time, a brilliant column of fire and smoke propelled the rocket skyward. Seventy-nine seconds later, the white flare flushed to a deep red as one of the nine engines malfunctioned. Despite that heart- stopping moment, the other eight engines thrust the spacecraft into orbit, and NASA declared “a new era for spaceflight.”[2] Afterward, still processing the excitement of the day, Nosek flew back to California on Musk’s private jet, accompanied by Larry Page, the cofounder and CEO of Google. At one point on the flight home, the conversation turned to AI. Page’s father, Carl, had studied primitive neural networks in the 1960s, and Google had recently revamped its voice recognition system with the help of one of Hinton’s graduate students. The ImageNet breakthrough was just a couple of weeks away, and already there were rumors that Hinton was starting a deep-learning boutique. Page was determined that Google should buy it. -- 126 of 565 -- When Page dropped a hint about his acquisition plans, Musk tried to one-up him. Thanks to the Nosek connection, Musk had met Hassabis at a Founders Fund retreat, and Hassabis had followed up with a visit to SpaceX in Hawthorne, California.[3] As they ate lunch together in the factory canteen, with cranes moving vast pieces of rocket overhead, Musk and Hassabis had discussed which mission mattered most: space travel, which might turn humanity into a multiplanetary species, or developing AGI, which might empower humanity to solve any and all problems. Musk had declared that humans needed to colonize Mars in case disaster struck Earth. Hassabis had countered that killer AI robots might be one such disaster, but that the AI could obviously follow humans to Mars if it wanted to. The two men had forged a competitive friendship, and Musk had decided that Hassabis was right: Powerful artificial intelligence might indeed be more consequential than spaceflight. Anxious to be part of the biggest revolution of his time, Musk had promised to invest in Hassabis’s AGI start-up. “There’s only one AI company that I think is going to work,” Musk now informed Page. “And I’m an investor in that company, DeepMind.”[4] Page responded to this put-down in the most respectful way possible. He took out his Android phone and typed a note of the name that Musk had just dropped on him. Watching this exchange, Nosek’s mind started racing. “I know Larry. Larry wants to build AGI,” Nosek said later, reconstructing his reaction at the time—a sensation bordering on panic. “Larry has wanted to build AGI his whole life! He’s going to try to get DeepMind! Or copy it! Or something!”[5] Nosek hated the prospect of Google acquiring DeepMind. The way he saw things, AGI was a terrifyingly powerful technology: Recently, he had taken up meditation to help process the enormity of it. Because of the awesome stakes involved, Nosek did not trust a corporate behemoth like Google to steward the technology. He wanted AGI to remain in the hands of his friend Hassabis, with appropriately freaked-out people such as himself keeping a careful watch over it. -- 127 of 565 -- “I thought it was very, very important for DeepMind to stay independent in order to fulfill its mission,” Nosek recalled later. “I remember thinking, ‘Oh man, OK, how can I derail this conversation? Because if I don’t, Larry’s going to get DeepMind.’ ” Later that day, Nosek phoned Hassabis in London to warn him of Google’s interest. Knowing that Hassabis had visceral feelings about the power of his technology, he was expecting him to be leery of a Google takeover. “Look, what do we do about this?” Nosek asked desperately. He was still teetering on the edge of panic. To Nosek’s consternation, Hassabis sounded unruffled. “Well, this could be good,” he said. “Let’s play this out. Let’s see what happens.” • • • IN ONE RESPECT, at least, Nosek had read Hassabis correctly. His second- favorite founder certainly did have visceral feelings about the consequences of artificial intelligence. The way Hassabis saw things, true general intelligence would make almost anything possible, surpassing the internet, the printing press, or even the Industrial Revolution in importance. It would usher in a post-scarcity world of radical abundance, resembling the future described in the science fiction he had read as a teenager. “People aren’t thinking ambitiously enough about what a post-AGI world will look like,” Hassabis once told me. “I still hear people talking about the limits to our resources. Like, will we have enough to pay for government programs to deal with the fallout from AI, such as a universal basic income? Or for the electricity to power the data centers? “But it’s going to be like Iain Banks’s Culture series. We’re going to be mining asteroids. We’re going to solve nuclear fusion. We will have ways of extracting hydrogen fuel from seawater. People are not understanding the magnitude of the change. “I don’t think money’s even going to be relevant. What will money mean in a post-scarcity society? -- 128 of 565 -- “Or corporations. Or the stock market. What do these things mean if we have superabundance? “And I’m not sure that the solution to social needs will be a universal basic income, by the way. There’s this other thing called universal basic provision, where it’s not money you’re giving people. Instead, you’re providing all that’s needed for today’s millionaire lifestyle—a nice house, schooling, health care, basic travel. All of that costs you nothing as a citizen. “And then, look, maybe you get a normal car for free, but if you want a Ferrari, OK, well then you need to do some work to earn some extra income. But everyone has this amazing basic access to material goods. That’s my view of what the world will look like in the long run.” On one of my visits to see Hassabis, I ate lunch in the DeepMind cafeteria before the meeting. The food was delicious and varied and absolutely free: The post-scarcity society had already arrived in this corner of London. Sneaker-shod researchers padded about contentedly with plates of salad and sea bream. Nobody was old, nobody was stressed, and nobody was short of vitamins.[6] After lunch I sat for a short while in Hassabis’s waiting room. Here, again, the vibe would have been enough to soothe Nosek himself, at least for a few moments. A comfy sofa was draped with a pale green blanket and dotted with bright cushions. The walls were decorated with chessboards, each accompanied by a photo of Hassabis in the company of a onetime world champion, and each captioned with an upbeat quote about DeepMind’s unbeatable chess system. Gazing at the calming geometry of the boards, browsing the cheery quotations, it was easy to imagine artificial intelligence as merely a heartwarming extension of a familiar mind game. My reverie was broken when I was shown into Hassabis’s office. Almost immediately, the enormity of AGI bubbled up again in conversation. “So it will be bigger than the Industrial Revolution?” I asked, curious to hear more about the post-scarcity future. “Yeah, I think so,” Hassabis reiterated. “Maybe AI is more like fire and language. Or maybe it’s as big as the emergence of the prefrontal cortex in -- 129 of 565 -- humans. I mean, it’s on a level with those caves where tens of thousands of years ago some brilliant person had the idea of making handprints on the wall. That’s the dawn of consciousness, isn’t it? “Look, the Industrial Revolution, let’s not minimize that. Power and energy and steam engines. That’s the first information age, by the way— Maxwell’s equations.” Hassabis was referring to the four equations published in 1865, describing the relationship between electricity and magnetism and paving the way for everything from telescopes to electrical engineering to Einstein’s general theory. “Now we’re in the second information age: We’ve gone from physical information to pure information, thanks to computers. “And then maybe now we’re about to enter the third age, which is the AI age, where the information comes alive. It starts to process itself, to generate itself. It becomes autonomous.” I wondered what it was like to live in the familiar, pre-AGI world, the world of chessboards and seared bream, but also to imagine a future with AGI so vividly. “For me, science is a spiritual endeavor,” Hassabis answered, circling back to our discussions of religion. “Maybe ‘spiritual’ is too mystical a word. But I feel I’m communing with the universe whenever I am trying to understand it. “It’s very deep for me, building AI. Because it will help me to understand the universe and realize my purpose. “I mean, this is what Spinoza said,” Hassabis went on, referring to the seventeenth-century Dutch philosopher. “That God is present in nature, so understanding nature is a spiritual endeavor. And Einstein, although he was not conventionally religious, agreed. He said he believed in the God of Spinoza, and I think he meant what I mean. “People assume, oh, religion’s over here, science is over there, it’s weird to put them together. But in my world, humanism and spiritualism and science all go together. -- 130 of 565 -- “It’s like with Leonardo da Vinci. His anatomical drawings are beautiful art as well as unbelievable biology. Da Vinci is my favorite because everything’s just flowing into one river. And that’s how I try to live. Everything’s fluid.” I read out a line from a biography of Spinoza, which Hassabis had recommended in one of our earlier conversations. The line reminded me of the intensity with which Hassabis pursued his scientific mission. “Philosophy was for Spinoza, not a weapon, but a way of life, a sacred order whose servants were transported to a supreme and certain blessedness.”[7] “I agree with that 100 percent,” Hassabis interjected. “If you ask what life is really for, it’s to do with knowledge or self- knowledge. And I think that is our purpose because why otherwise would the world be constructed like this? Why would science be possible? Why should computers be possible? What about semiconductors? Why should sand, with a bit of copper, do anything? “These things are, strangely, set up for scientific endeavor. So whether you want to call that God’s design, or whether it’s just the universe, or a simulation, I’m open-minded about all of that. I think that’s part of what we’ll find out, when we’re on this journey. “But in the meantime, it feels like the flow of the universe is going in this direction, towards discovering the answers. And I’m part of that flow, I’m going with that flow, and it’s exhilarating.” • • • NOSEK WAS RIGHT: Hassabis was profoundly committed to his scientific mission. But Nosek was also wrong, because Hassabis was practical. David Silver’s ladder metaphor—his observation that Hassabis had grown better at combining ambition and pragmatism since his days at Elixir—captured the two sides of Hassabis’s persona. When he stayed awake into the small hours of the morning, reading and thinking and dreaming, Hassabis reveled in -- 131 of 565 -- maximalist ambition. When he arrived at the office the next day, he focused on getting to the next rung of the ladder. In October 2012, at the time of that panicked phone call from Nosek, the next rung involved money. Having scraped together a bit over $2 million in 2010, and $10 million in 2011, Hassabis had now decided that he needed much more: For his Series C round, he was targeting $65 million.[8] His fundraising negotiations with Nosek and Founders Fund had already started. Nosek understood the case for an audacious fundraising target. The excitement about deep learning was pushing up AI salaries. The harnessing of powerful GPU chips was pushing up the cost of hardware. The mission of building AGI was of the utmost importance. But the old division between Nosek and Peter Thiel, the Founders Fund leader, presented a problem. Back in 2010, Thiel had gone along with the DeepMind investment because it had seemed bracingly contrarian, not to mention cheap. Now that AI had turned expensive and mainstream, Thiel’s instinct was to sell the consensus. [9] “I was thinking, DeepMind was going to burn money like crazy for the next decade,” Thiel recalled later. “The company was going to have to keep raising more and more capital. There were no revenues and no products. “Even today, there’s still no business plan attached to generalized AI,” Thiel went on, reflecting the debates that plagued AI into the 2020s.[10] Thiel also wasn’t sure if he could trust Hassabis. Whereas Nosek was in touch with Hassabis frequently, imbibing regular doses of charismatic conviction, Thiel barely saw the DeepMind team, and he felt instinctively suspicious of a fellow chess player. A man who had spent his formative years mentally crushing opponents should be treated with caution, Thiel reckoned. Besides, Hassabis excelled at other board games such as Diplomacy. Thiel thought that Diplomacy was essentially a test of how well you could manipulate people.[11] A month after the Falcon 9 launch, when the discussions between Hassabis and Founders Fund were still going badly, the outcome of Musk’s name-dropping landed in Hassabis’s inbox. -- 132 of 565 -- “Sorry to send you an email out of the blue,” the message began. “My name is Alan Eustace and I work for Larry Page at Google.”[12] Eustace was Google’s engineering chief, and a fan of daredevil tech challenges. At the time he emailed Hassabis, he was plotting to deck himself out in an astronaut suit, attach a helium balloon to his back, ascend twenty-five miles to the stratosphere, and use an explosive device to detach himself from the balloon contraption. A couple of years later, Eustace duly took off into the ether, performed two elegant stratospheric backflips, and plummeted to earth, reaching a speed of 822 miles per hour as he descended.[13] “Larry’s had a longstanding interest in AI and has asked me to build a set of teams with different but complementary charters,” Eustace’s email continued. “Larry sent me your name this morning as one of the people he believes is doing revolutionary work in the area. I wonder if you have time to talk.” Hassabis certainly did have time. In fact, given his testy relationship with his venture capital backer, he was eager. A deep-pocketed parent company could free him from the endless fundraising negotiations that cluttered his life and pulled his attention away from DeepMind’s research. “I was having these inane conversations nonstop with investors; I felt my brain was atrophying,” he said later. Back in the 1960s and 1970s, venture capital had emerged as the best kind of finance for experiments in applied science: For engineers who had been cooped up in bureaucratic firms, it represented liberation capital. But DeepMind’s science was not applied; it was blue-sky. Funding from that skydiver would bring a truer liberation. The two sides talked. It was clear that Google was willing to pour almost unlimited capital into ambitious projects, and AI was in its wheelhouse.[14] According to an informal company guideline, Google would underwrite truly grand software adventures so long as they had a shot at impacting one billion consumers. Eustace himself had benefited from this stance. One day, when he was building the company’s new mapping tools, he was summoned to the office of Patrick Pichette, Google’s chief financial officer. -- 133 of 565 -- Pichette asked him what was up. Eustace had submitted a purchase order for several Cessna aircraft. Who needed that many private turboprops? What was Eustace thinking? Eustace explained that he was building a secret mapping feature. At the click of a mouse, a user would be able to toggle from the regular map to a novel 3D view, then take a virtual stroll around a neighborhood. The project involved photographing huge swaths of territory from the air, and Eustace and his engineers had built some special home-brew lenses to capture the necessary images. To keep the cover on their secret, Eustace now needed to attach these lenses to his own fleet of aircraft. The result would be a tool that delighted billions of users. The CFO grinned. He authorized the Cessna shopping spree without asking further questions. “The problem with public businesses is that they are all about the next ninety days,” Pichette explained later. “But some infrastructure takes a decade, and Google had the balance sheet to do that. “I mean, you can bury anything in that sort of balance sheet. You can bury Wisconsin, and nobody would know about it. “I always thought Google should be there to make these big bets. That’s what the world needs more of. That’s why investing in AI made total sense to me.”[15] A parent company with this sort of outlook appeared ideal to Hassabis. But the downside of dealing with Google soon became apparent: Nothing was going to happen in a hurry. Google had a thousand other projects going on. Buying Geoff Hinton’s outfit, for example. Hassabis asked Eustace to clarify his timeline. DeepMind would soon be needing capital, and Hassabis’s first choice was some sort of financing from Google—an acquisition or just an investment. But if Google didn’t hurry up, Hassabis would raise a Series C round from his venture capital backers. DeepMind’s value would be substantially marked up, raising the future cost of acquisition for Google. The pitch elicited a shrug. Eustace advised Hassabis to go ahead with his next venture capital round. If that meant that DeepMind’s valuation jumped, -- 134 of 565 -- so be it. “That blew our mind,” Mustafa Suleyman recalled. “We were like, how can you be cool about paying an extra few hundred million? “Also, it was a disaster. It meant we had to raise the Series C,” Suleyman added.[16] • • • THE FLIRTATION with Google did give Hassabis an idea, however. If Eustace was on an acquisition spree for AI talent, the value of that talent was bound to rise, even if it seemed high already. Therefore, when Geoff Hinton’s deep-learning boutique came up for sale, DeepMind should bid for it. At the end of December 2012, Hinton arrived at one of the towering casinos on the southern end of Lake Tahoe.[17] He was there for the annual NIPS conference—the same event that, one year later, would play host to Vlad Mnih’s Atari talk. But he was there for another reason, too. The recent breakthrough with ImageNet had made him an industry celebrity. The NIPS gathering would provide the perfect opportunity to auction off his start-up. Alan Eustace flew his own twin-engine plane into Lake Tahoe to meet Hinton and his two cofounders. They had dinner together, and it was clear that Google would be bidding energetically. Microsoft showed up at Lake Tahoe, too, and Hinton also had an expression of strong interest from the Chinese search giant Baidu. The fourth and by far the smallest suitor was DeepMind. A few days earlier, Hassabis had called Hinton and said that a fair price for his company would be $10 million. Hinton put an upside-down trash can on a table in his hotel room, then balanced a laptop on top of it. Back pain prevented him from sitting down, so he always worked standing. He typed out an email to the four potential buyers. The auction of his company was now open. Bids started to arrive in Hinton’s Gmail. DeepMind soon offered $10 million, an astonishing bid from a start-up that had cumulatively raised a little over $12 million. The offer involved paying with stock, but even so it was amazing. The paper value of all of DeepMind’s equity was just $45 -- 135 of 565 -- million, and most of the shares were owned by venture capitalists. The implication was that Hassabis would buy Hinton’s outfit by forking over 22 percent of DeepMind’s stock. Hassabis himself owned only 21 percent.[18] There was no time to ponder DeepMind’s bid, or how it could even make good on its offer. As the auction continued, DeepMind dropped out and the price kept on heading upward. Hinton had noticed that, on the ground floor of the hotel, there was a big noise and a blast of flashing lights every time a gambler won $25,000 at a slot machine. Up here on the seventh floor of the tower, his payout was rising in $1 million increments. Sometime after DeepMind pulled out, Hassabis called Hinton. “This is crazy. They are still bidding,” Hinton told him. “What? I think you are worth $50 million,” Hassabis replied. “Even though I was trying to get you for $10 million.” Hinton was surprised. Hassabis was now saying his company was worth fully five times more than he had asserted just a few days earlier. “He wanted to buy us to make DeepMind more valuable so he could sell it to Google for more,” Hinton surmised later. “When that didn’t work, he wanted me to sell to Google at the highest price possible, because that would raise the price that Google would pay for DeepMind. “It was very helpful, by the way,” Hinton continued. “At one point in the bidding, when the price reached around $30 million, a very senior person at Microsoft called up and said they’d give us an extra $10 million if we stopped the auction. I was tempted for a moment, but the fact that Demis had said we should try to get $50 million made it easier for me turn down that offer.”[19] In the end, Hinton and his cofounders netted $44 million from Google. Hassabis added his own coda to this episode: “Geoff probably would’ve done better if he’d let me buy him and then we’d have all sold to Google as a block.” This statement is true.[20] Ironically, Hassabis himself would probably have ended up poorer.[21] • • • -- 136 of 565 -- AT THE START OF 2013, Hassabis pivoted back to his negotiations with Founders Fund. Hoping to spur the venture capitalists to support his target of a $65 million fundraising, he let it be known that Google had approached him before Christmas about a possible acquisition. Nosek knew that this was not a bluff—he had directly witnessed Larry Page’s interest. But the other Founders Fund partners were suspicious. Thiel thought Hassabis was playing games, not least because he knew that Hassabis was a player of games, and if Hassabis was playing games, Thiel wanted to counterplay him. Meanwhile a Founders Fund partner named Brian Singerman kept hammering Hassabis on how he would generate revenue. Neither the video game project nor the fashion-recommendation algorithm was close to ready. “I’m talking about the biggest invention ever. And they keep coming back to, ‘Where’s the widget?’ ” Hassabis recalled of the discussions. “And I’m like, ‘It’s going to revolutionize all widgets, so I can pick you a random widget if you want me to, but you obviously haven’t got the point if you are asking me this.’ “And then they were like, ‘You’re from the UK and it’s all a bit strange and it doesn’t pattern match.’ ” Hassabis considered pitching other venture capitalists. But Nosek discouraged him, leery of allowing non-freaked-out investors to get their hands on the technology. Most venture capitalists cared about profits, not purpose, Nosek reckoned; DeepMind should only accept capital from people with pure motives. Whenever Hassabis mentioned other potential VC backers, Nosek would tell him, “Oh, they’re terrible. Don’t talk to them.” Unwilling to alienate his principal backer, Hassabis accepted Nosek’s guidance. Not being based in Silicon Valley, he didn’t understand that inviting a fresh venture firm to lead the next funding round was standard operating procedure. Moreover, Nosek was evidently correct that DeepMind’s existing investors were not straightforward profit seekers. David Gammon was motivated by a religious faith in human progress. Tomaso Poggio and his wife had their own kind of faith: not in God, but in Hassabis. Jaan Tallinn, the Skype cofounder and Series B investor, wanted a -- 137 of 565 -- seat on the DeepMind board because of his preoccupation with doom scenarios. Nosek himself approached investing with a wide-eyed idealism, proclaiming that he had learned from the novels of Ayn Rand that visionary companies offered society’s best hope of salvation.[22] Besides, Hassabis still wanted to believe that Thiel believed—in him, and in the mission. After all, Thiel was a titan and a visionary. He had seen the future early, bankrolling the Singularity Summits as well as DeepMind. Hassabis kept telling himself that Thiel would ultimately support the case for a big funding round. Suleyman told Hassabis to stop deluding himself. The youngest of the three founders was emerging as the one figure at DeepMind who could challenge the boss on the strategic issues. Almost every evening, the two men reviewed the events of the day, often talking into the small hours of the morning. Suleyman acted as a sounding board, helping Hassabis process his torrent of ideas, replicating the relationship between Hassabis and Dharshan Kumaran when they had brainstormed neuroscience theories in their lab’s mini-kitchen. But Suleyman was also capable of pushing back. Like David Silver at Elixir, he was close enough to Hassabis to be able to contradict him. “Peter Thiel wasn’t meeting with us, wasn’t doing reviews, wasn’t giving us feedback,” Suleyman remembers telling Hassabis. “We had no connection to him. Obviously, we were irrelevant nobodies from North London.” Revealing a curious blind spot, Hassabis was slow to pick up on these signals. It was as though the Jedi, accustomed to winning people to his cause, couldn’t adjust when he encountered an aloof cynic. “Demis would get up in front of the company and tell the team how he had met with Peter and briefed Peter and Peter believed in this and Peter believed in that,” Suleyman recounted. “I was like, ‘Dude, you haven’t even seen Peter.’ “I thought, ‘You just can’t say stuff like that.’ But Demis didn’t think of it as lying or even exaggeration. It was just his reality. -- 138 of 565 -- “If we had just been a bit more realistic, we would’ve realized that Peter didn’t care about us earlier,” Suleyman argued. “We would have gone to other investors. We could have used their feedback to calibrate our strategy.”[23] Looking back years later, Hassabis conceded that he had misjudged Thiel’s intentions. “I don’t think Peter ever really believed in our thesis,” he admitted. “I realized that afterward. “He was investing as a contrarian. That’s how you’re going to pick up assets that are undervalued. Occasionally, one of these contrarian ventures will come off and it will pay for all your losses. You do a lot of those bets but you’re not really committed to any one of them. “So, I don’t actually think he ever believed in AGI,” Hassabis said, belatedly describing the speculative mindset and the pattern of returns that drives all venture investing. “But I couldn’t see that at the time. I was in awe of what goes on in the Valley. I was still just a kid from London.” • • • BY MID-FEBRUARY 2013, the outlines of a deal with Founders Fund appeared to be emerging. Hassabis’s aspiration to raise $65 million was ruled out. But Nosek told him that Thiel and his partners would support a $30 million raise, provided that DeepMind could find $10 million of that elsewhere. It felt like a reachable target. Hassabis and Suleyman were counting on some capital from Solina Chau, who managed the wealth of Li Ka-shing, the richest person in Asia. In early 2012, Chau had invited the DeepMind duo to meet her in a private room at Shoreditch House, a hip East London club with a pool on the roof—a useless flourish of conspicuous consumption, given the British weather. Chau had bonded with the two visitors the moment they walked in. “We started talking and within five minutes she was finishing our sentences,” Suleyman remembered. Even better, Chau turned out to be another atypical investor: Once per year, she was authorized to make a bet with an expected payoff of zero. With a -- 139 of 565 -- fortune of $25 billion, Li Ka-shing spent hundreds of millions on philanthropy. Every so often, he was happy for Chau to support an inspiring but commercially implausible start-up. It was another kind of charity. “I was thinking, this is a project that I would like to back. This is a founder whom I would like to know better,” Chau said later. After a fifteen- minute chat, she declared that she wanted to invest in DeepMind. At the time, the Series B round had closed, and the Series C round was some way off. But Hassabis and Suleyman offered to sell Chau a stake of $2.5 million. Having known her for a scant quarter of an hour, $2.5 million felt like the most they could ask from her. Chau quickly asked for a larger allocation. Tempted, but not wanting to look like pushovers, Hassabis and Suleyman said they would think about it. [24] A year later, in 2013, they knew precisely what they thought. Chau was welcome to invest as much as possible. Hassabis also expected capital from Elon Musk, who had promised to invest the previous summer. The two had agreed that Musk would come in as part of the Series C round. On March 1, 2013, Hassabis contacted Musk to nail down the details. Musk told Hassabis to get back to him later. A SpaceX rocket was blasting off that day. Musk couldn’t talk until he knew for sure that the launch had gone successfully. “I remember almost praying that the launch would work,” Hassabis recalled. “I was seriously worried if it didn’t work, he wouldn’t be able to invest—maybe he’d decide he didn’t have any spare cash anymore. “I knew it was costing Elon personal money to fund SpaceX,” Hassabis went on. “And he’d be in a bad mood if the rocket exploded.” Hassabis spent the next few hours refreshing an internet news site and willing the launch to go perfectly. If Musk backed out of the Series C deal, it might be hard to round up the $10 million that would unlock the Founders Fund backing. At length, at around one o’clock in the morning, Hassabis read that the mission had gone well. He called Musk from his living room in North London. -- 140 of 565 -- Musk was in excellent spirits. “How much do you want me to invest?” he asked cheerily. Hassabis was taken aback. He hadn’t prepared for such a direct question. “I don’t want to take up all the allocation,” Musk added magnanimously. He seemed to think it would be bad manners to crowd out the supposed hordes of wannabe DeepMind investors clamoring for a piece of the action. Still uncertain what to say, Hassabis named the biggest number he thought he could ask for. “Five million,” he suggested. It was double the amount he had proposed to Chau a year earlier. Musk agreed immediately. “I should have just told him $50 million,” Hassabis said later. “I was probably being too British about it.” • • • WHATEVER REGRET HASSABIS felt after lowballing his request to Musk, his remorse soon intensified. A couple of months later, when Hassabis and Suleyman were alone at the office late one evening, Nosek called from California. “I can’t get this through,” he told them. Hassabis and Suleyman demanded to know what he meant. What were the new terms? How much money were they now supposed to raise from other investors? Nosek answered that DeepMind had to find another backer to lead the Series C round. Founders Fund no longer wanted to write the biggest check, nor did it want to be the one to determine DeepMind’s valuation. Having urged Hassabis repeatedly not to speak to other venture capitalists, Nosek was now performing a complete reversal. He felt terrible that his partners had pushed him to this point. But there was nothing he could do about it. Hassabis and Suleyman were furious, but they were also out of options. Founders Fund had strung them along until their cash reserves were running low, and there was no time left to rethink the funding strategy. They had to -- 141 of 565 -- assemble a deal out of the pieces that they had. Otherwise, DeepMind would have no cash to pay salaries. “That was deeply scary,” Hassabis recalled later. “Getting close to a situation where you’re going to run out of money. Of course, I remembered that from my Elixir days. I never wanted to be there again.” A few days later, Hassabis and Suleyman contacted Chau. The Series C round would be closing soon. Founders Fund was in. Was she in? Chau was enthusiastic. “We’ve decided to reduce their allocation and boost your allocation so you could do more,” Suleyman offered. “Great!” Chau responded.[25] This time, Hassabis and Suleyman had learned their lesson. They asked Chau for double-digit millions. A week or so later, the round closed with Chau making a $13.6 million investment. Founders Fund kicked in $9.2 million. All in all, DeepMind raised a bit over $25 million. DeepMind’s near-death experience forced Hassabis to come to terms with two basic realities. First, Suleyman was right that he shouldn’t put his faith in Thiel. Second, open-ended, blue-sky research was a poor fit for venture capital. Over the next three months, moreover, the divergence between DeepMind’s scientific aspirations and Founders Fund’s commercial imperatives became even more obvious. Hassabis spent the Series C money on star researchers and computing resources for the Atari project; the project’s success underscored the need for yet more cash to train powerful models. Thiel, for his part, was propelled by his contrarianism in the opposite direction: He regarded the war for talent in AI as an incipient bubble.[26] “We were becoming increasingly bullish, but the Founders Fund people were becoming increasingly skeptical,” Shane Legg remembered. Without Thiel and the Singularitarians, DeepMind might never have gotten off the ground. But it was time to find a new backer. OceanofPDF.com -- 142 of 565 -- O CHAPTER 8 GET GOOGLE n a June weekend in 2013, Elon Musk’s wife, Talulah Riley—an actress known for playing a seductive TV robot who takes to massacring humans—rented out a castle in Tarrytown, New York, to celebrate her husband’s birthday. “It was one of these fake American castles,” Hassabis remembered. The men dressed up incongruously as samurai warriors; and Riley arranged for the sumo world champion to be there, all 350 pounds of him. Musk took the champion on, throwing him impressively and injuring his own neck in the process. Among the guests at the party were Hassabis and Larry Page of Google. Since buying Geoff Hinton’s boutique, Page had learned that it wasn’t just Musk who thought highly of Hassabis. Hinton admired him, too, despite what he regarded as his pathological competitiveness. The professor had known Hassabis since their encounter at MIT. He had served as an adviser to Hassabis’s firm. Hinton’s PhD student Vlad Mnih had recently joined DeepMind. Seizing the opportunity, Page proposed a walk with Hassabis. The two men strolled around the grounds of the castle, taking in the pointless folly of the battlements and arrow slits. Speaking in a strained whisper, the effect of a rare illness of the vocal cords, Page suggested that Hassabis’s company-building endeavors might be similarly pointless.[1] Hassabis’s goal was to create AGI. So why bother with the idea of an independent DeepMind? Google was the obvious place to realize his ambition. “Why don’t you take advantage of what I’ve already created?” Page asked Hassabis. It was a recruitment pitch that he had used successfully on -- 143 of 565 -- other start-up founders.[2] “He was basically telling me, maybe you could build a company like Google, but it would take the best part of your career,” Hassabis recalled. “But if my real mission was to build AGI, then why don’t I use all the resources that he’s accumulated? I thought that was a pretty good argument. “Would I be happier looking back on building a multibillion-dollar business or helping solve intelligence?” Hassabis continued, remembering the decision that Page framed for him. “It was an easy choice,” he added.[3] The choice was all the easier because of what Page represented. The Google chief was not a business person or a product person: He was a scientist. “You could easily see Larry as a top professor at an Ivy League,” Hassabis said. “He had that intellectual capacity, that demeanor. “When we went on that walk together, I felt he would’ve taken his own offer.” The contrast with DeepMind’s venture capital backers was obvious. Hassabis had struggled to persuade Founders Fund that DeepMind would end up changing every widget in the world. With Page, he didn’t even have to make the argument. “I was fed up with scrambling around, trying to justify what I knew was the biggest thing of all time,” Hassabis recalled. “I just thought, look, I’ll go to Google. I’ll get a shitload of computers and then I’ll solve intelligence.” • • • IN THE FALL OF 2013, the three DeepMind founders flew out to the Google headquarters to discuss a potential acquisition. To keep the negotiations secret, they were taken to a discreet business office, across the street from the main building.[4] Google’s mergers and acquisitions (M&A) team had assembled a roster of in-house AI experts to assess DeepMind’s prowess, and the visitors showed off the recent progress with their Atari agent. Hassabis and Suleyman, for their part, took a less conventional approach. -- 144 of 565 -- Flipping the normal template on its head, they showed no interest in negotiating the price that Google would pay for their company. “Normally you start with, OK, we’re interested; what do you want to pay?” Suleyman explained. “We didn’t talk about that. “We thought, the moment we mention money, they’ll think we’re trying to dash for the door. It’ll look like we’re going to take the cash and head off into the sunset. “Instead of asking about the size of our payout and how many years until we get it, we asked about the research budget they would give us. That demonstrated that we just cared about building AGI. We were not going to leave; we were in it for the long term.” Paradoxically, by refusing to negotiate its price, DeepMind made itself appear more valuable. As well as pressing for research funding, Hassabis and Suleyman had a second objective. Although they were less paranoid about their technology than Nosek or Tallinn, they took safety seriously. For Hassabis, this seriousness was sometimes leavened by that metaphorical ladder—safety certainly mattered, but with AI still in a primitive state, it existed on a higher rung and was not an immediate worry. For Suleyman, however, the question of safety felt visceral. Three years earlier, he had resolved to work on technology in order to do good in the world. He refused to postpone his pursuit of that objective. If DeepMind was going to hitch itself to Google, ethics and safety should be baked into the contract. Suleyman had a plan for how this could be done. During his stint working at the London mayor’s office, he had befriended a posh and idealistic human rights lawyer, who opened his mind to the beauty of the legal system.[5] The mix of statute, precedent, and scholarship, layered on top of each other like a painter’s impasto, created an exquisite mechanism for balancing conflicting principles—“a framework that we should all get behind,” Suleyman remembers thinking. Now he decided that legal engineering might deliver ethical and safe AI. If DeepMind was going to be owned by Google, it should be protected by an independent oversight board, composed of scientists, philosophers, and other reputable figures, who would have the last say on how AI should be deployed into society. -- 145 of 565 -- “The basic idea was, look, we have to plan for success,” Suleyman explained. “In a success scenario, we can’t just have the Google founders using AGI for their own purposes.” To pressure Google on this point, Suleyman drew on his experience as a poker player. “We told them, we are the best-funded pre-revenue start-up in Europe. We’ve got Peter Thiel, Solina Chau, Elon Musk, all billionaires, all backing us,” Suleyman remembered. “Of course, those people didn’t really have our backs—that’s what makes you feel queasy as a negotiator. But in poker, you learn to play the table, not the cards. You size up the other players and then you make your bets, based on your reading of their psychology. If you looked at your cards, you would realize that you had nothing. You would fold before the playing even started. “So we said, look, we were not going to have a problem raising money before you approached us. So if you really want to do this, there are two things that matter. First, the research funding. Second, an ethics and safety review process. Oh, and by the way, if you don’t believe in that process, you don’t understand where this technology is going.”[6] “Moose is very good at that stuff,” Hassabis said later. He generally thought of himself as a chess player rather than a poker player. In chess, there are no hidden cards. The game is open and there is no scope for bluffing. As it turned out, the bluffing may have been unnecessary. Google was not a normal corporation; its leaders were far more inclined to think over the horizon than executives at other companies. The top team in Mountain View was already experiencing its version of Suleyman’s safety worries. “We thought AI was like atomic energy,” Patrick Pichette, the chief financial officer, recalled. “You can make bombs with it, but if you are smart you can also solve climate change with it. So we discussed all the big questions from the get-go. What if it takes off on its own and runs amok? How do we control it? -- 146 of 565 -- “Right up front at the executive committee, there was this question of, OK, we buy this company, it figures out the financial markets, it screws the rest of the world, and all the money ends up in our bank account,” Pichette continued. “How do we think about that!?” he asked rhetorically.[7] Google was already so profitable that it was in danger of triggering antitrust proceedings. The last thing it wanted was an internal hedge fund. With Google evidently willing to bankroll DeepMind’s research, and with its leaders attuned to DeepMind’s safety concerns, the path toward an acquisition seemed open. But, having earlier overestimated Founders Fund’s reliability, Hassabis and Suleyman were taking nothing for granted. Hoping to push Google to commit to a deal, they flirted with another suitor: Mark Zuckerberg of Facebook. Zuckerberg had been watching nervously as other tech behemoths built up their AI faculties. Belatedly, he had begun scrambling to catch up, making time on his calendar to woo individual AI researchers, even though he was busy running a company with six thousand employees and a billion customers. Yann LeCun, the deep-learning pioneer based at New York University, had been surprised over the summer when one of his former students had decided to join Facebook. “Why would you even consider going there?” LeCun had asked. A company that wanted to move fast and break things seemed like a poor fit for a research scientist. “I talked to Mark Zuckerberg twice!” came the answer.[8] Suleyman flew out to California to meet Amin Zoufonoun, Facebook’s head of corporate development. Zoufonoun welcomed Suleyman to his home, served him a murderous tumbler of whiskey, and teased him for wanting ice that would dilute it. Over a series of discussions, Zoufonoun proposed a way of making DeepMind’s founders richer than they would be from a Google acquisition. If Facebook bought DeepMind, it would lowball the price it paid for DeepMind shares, but then fork over a vast signing bonus to the founders and their top colleagues. Suleyman reported back to Hassabis. The money thing was interesting, but money was not their main objective. Meanwhile, Zoufonoun had -- 147 of 565 -- brushed aside Suleyman’s talk about AI governance. Over the coming years, Facebook’s indifference to AI safety would come to be well known. The DeepMind duo already sensed it. Zoufonoun reported back to Zuckerberg. DeepMind had a strong roster of AI scientists, and if Facebook didn’t buy the company, they would end up in the arms of Google. Hassabis came out to the West Coast to have lunch with Larry Page, still the strongest suitor. Zuckerberg got wind of his visit and invited him to dinner. Arriving at Zuckerberg’s Palo Alto home, Hassabis administered a subtle test on him. The two men discussed the potential of AI, and Zuckerberg expressed appropriate excitement. But then, as the dinner continued, Hassabis brought up other hot technologies: virtual reality, augmented reality, 3D printing. Zuckerberg sounded equally excited about all of them. “That told me what I needed to know,” Hassabis said later. “Facebook offered more money, but I wanted somebody who really understood why AI would be bigger than all these other things.” After the dinner, Hassabis got back to Larry Page. “Let’s go further,” he told him. Spurned, Zuckerberg’s competitive instincts kicked in ferociously. He redoubled his wooing of individual researchers. Toward the end of November, leveraging the network of LeCun’s protégé, who still constituted almost the entirety of Facebook’s AI team, Zuckerberg invited LeCun himself over to his home. The stage was set for another recruitment dinner. [9] LeCun retraced the path that Hassabis had followed a couple of weeks earlier. There was an imposing fence and a barrier of big trees around the edge of Zuckerberg’s property. What would it take for LeCun to join Facebook, Zuckerberg demanded? If he couldn’t buy DeepMind and acquire a ready-made team, Zuckerberg wanted a famous professor to assemble an AI squad for him. Armed with a virtually unlimited war chest, the professor would pick off the top scientists -- 148 of 565 -- at less well financed labs, starting, presumably, with DeepMind. It wasn’t just Zuckerberg’s haircut that put one in mind of Caesar. LeCun said he wasn’t going to leave New York, and he wasn’t going to quit his professorship at New York University. He assumed that these conditions were deal-breakers. The next day Facebook’s chief technology officer showed LeCun around the company’s Disney-style campus. There was graffiti artwork on the walls, some of it created by the artist David Choe, who had shrewdly taken payment in the form of Facebook equity. At the end of the visit, LeCun was taken to see Zuckerberg again. Both his conditions were acceptable, Zuckerberg told him. “Where do I sign?” LeCun responded.[10] • • • IN EARLY DECEMBER 2013, the DeepMind leadership showed up at the NIPS jamboree in Tahoe. This was their coming-out party: Vlad Mnih was set to unveil the Atari agent. By now the talks with Google had advanced: Shane Legg recalls reviewing rough drafts of the acquisition documents between scientific meetings.[11] But Zuckerberg and LeCun were present at NIPS, too. They rented one of the hotel ballrooms and announced the creation of a new AI lab in Manhattan, not far from NYU.[12] Right after Mnih’s Atari talk, a panel of AI luminaries appeared onstage, Zuckerberg among them. [13] Hassabis saw LeCun at the conference. “You’re not going to poach all my guys, are you?” he asked him. “I had just signed on basically to do that,” LeCun remembered.[14] Two weeks later, just before Christmas, LeCun phoned Koray Kavukcuoglu, his former student. As well as being a key scientific contributor to DeepMind’s Atari project, Kavukcuoglu was a natural leader —in the coming years, DeepMind would depend on him increasingly. Now LeCun offered Kavukcuoglu a huge pay raise to come over to Facebook. -- 149 of 565 -- “That was the moment I thought DeepMind might really fail,” Suleyman said later. “I remember speaking to Demis on Christmas Day and thinking, ‘This is a disaster. This is just the first; they are going to pick off all our key people.’ And then why would Google go through with the acquisition?”[15] Hassabis scrambled to fight back. He let Kavukcuoglu in on the secret: DeepMind was on the point of selling itself to Google. The stock options that the DeepMind scientists had mentally written off might soon be worth a fortune. Kavukcuoglu agreed to sit tight for the moment. Hassabis urged Google to close the acquisition as rapidly as possible. On Sunday December 29, a Google team flew into London on a Gulfstream jet. Larry Page had wanted to lead the group, but in the interest of speed he delegated the task to Alan Eustace. The plane was kitted out with a makeshift bed that allowed Geoff Hinton to fly lying down, an attempt to manage his back pain.[16] The following morning, the Google team showed up at DeepMind. The visitors were shown into a conference room and treated to another series of demos; and then Google’s legendary engineering leader, Jeff Dean, asked to inspect the code that powered Vlad Mnih’s Atari system. Demos were all very well, but Dean knew they could be faked. He wanted to look under the hood to make sure there was a real engine.[17] “I was like, why are you showing him my code?” Mnih said, recalling his anxiety. “Research code is famously hacky. You’re building something you might throw away tomorrow, so you don’t spend much time making it nice.”[18] “It was a crossing of the Rubicon moment,” Hassabis remembered. “The biggest, best company in the world gets to see all your research. If you don’t do the deal after that, you’ll be crushed. It was high stakes for us.” Dean gave the code a thumbs-up. Unlike that Elixir demo that David Silver shuddered to recall, there were no hidden tricks in the Atari system. Now it was up to the business guys to hammer out the details of the acquisition. -- 150 of 565 -- • • • A FEW DAYS LATER, Luke Nosek went with Elon Musk to a party in Los Angeles. Musk was accompanied by Talulah Riley. Nosek was carrying his laptop. Nosek told Musk that Google was about to buy DeepMind. His friend Larry Page was about to one-up him. “Demis shouldn’t lose control of his company,” Musk responded. “We can’t have a giant corporation control AGI. This is not a good thing for humanity.” The conversation quickly grew intense. “We were asking, ‘What are the things that matter in the world?’ ” Nosek recalled. “And I started to question whether I’m doing enough about the stuff that is important. “And then we said, well, DeepMind is the thing that really matters. The control of AGI! Like, DeepMind is about to be sold to a corporation!” Untroubled by the fact that he was a corporate leader, too, Musk proposed that Nosek should get Hassabis on the phone immediately. He led Nosek and Riley up to the master bedroom in the party house, and into a small closet. The three sat on the floor. Nosek fired up his laptop and placed a Skype call to Hassabis. “Peter [Thiel] and I had failed to deliver the financing that DeepMind needed,” Nosek said later. “Elon and I were going to fix that. We thought, ‘This is the most important thing that we need to be working on for humanity.’ ” Not for the last time, the extreme potential of AI was triggering extreme behavior. Hassabis picked up Nosek’s call, even though it was the middle of the night in London. Musk and Nosek started peppering Hassabis with options. His neo-Faustian sellout had to be prevented. “How about if Tesla acquires you?” they proposed. Hassabis pointed out that Tesla was not generating enough cash to support DeepMind’s research. The carmaker’s revenues were growing fast, but it still reported a loss most quarters. -- 151 of 565 -- “How about if SpaceX acquires you?” the LA pair persisted. Hassabis again demurred. SpaceX had nothing to do with AI, he objected. “AI for robots on Mars!” Musk said. SpaceX didn’t have the computer power that DeepMind was going to need, Hassabis insisted. “Founders Fund couldn’t do it. Elon couldn’t do it,” Nosek confessed. “These were the people that cared the most about AI. And we couldn’t provide the capital. We ended the call feeling dejected.”[19] Having rebuffed Nosek and Musk, Hassabis went back to worrying about Google. • • • THE QUESTION OF DEEPMIND’S VALUE, which Hassabis and Suleyman had avoided, now had to be addressed in earnest. DeepMind had no revenues; its main asset was its people. Google’s acquisition specialists had a standard way of valuing “acquihire” transactions of this kind. “We had a price-per- engineer model,” Don Harrison, the chief Google negotiator, said later. Harrison figured that DeepMind had perhaps thirty or forty technical stars. They were not engineers; they were scientists. Back of the envelope, each one might be worth about $10 million. A tough Canadian lawyer who had helped take Google public, Harrison had hammered down the details of dozens of deals. He seldom met much resistance. On this occasion, however, Hassabis and Suleyman pushed back aggressively. Google had paid almost $15 million per scientist when it had bought Hinton’s boutique. One year on, the market for talent was hotter than ever. Besides, DeepMind was more than just a team. As the Atari success demonstrated, the company embodied a way of surpassing the deep-learning paradigm for which Hinton was famous. Deep-learning systems matched this onto that—images onto words, and so forth. The Atari agent had taught itself multiple games. It was capable of strategy. Hassabis and Suleyman proposed a valuation for DeepMind that was roughly twice as high as Harrison’s. -- 152 of 565 -- Harrison and his team gulped. “Everyone had upset stomachs,” he said later.[20] Jeff Dean, who led Google’s deep-learning group, called Google Brain, agreed that DeepMind’s target might be excessive. Geoff Hinton objected. During the auction of his three-man outfit, Hassabis had encouraged him to hold out for $50 million. Now he told Dean that Hassabis by himself was worth $150 million. “I recognized Demis’s drive and leadership and political skill,” Hinton explained later.[21] Of course, all these numbers were plucked out of the air. But the bottom line was that Google could afford to pay what DeepMind demanded. If Hassabis’s team managed to build on the Atari breakthrough, a few hundred million would turn out to be a bargain.[22] Beyond the question of price, Google was forced to wrestle with DeepMind’s other conditions. Hassabis insisted on operational autonomy. He wanted to keep DeepMind’s distinctive hiring practices and culture; he was determined to remain in London. Meanwhile, Hassabis and Suleyman were still insisting that the uses of their technology should be restricted in advance. Military applications would be banned. An ethics and safety review board—a committee that would include the DeepMind founders and some external grandees—should be set up to dilute Google’s power over the technology. “For me, this was a huge problem,” Harrison remembered. “I was in front of our board of directors selling a deal that wasn’t just about the price. It involved a structure that reduced our control over an asset that we were spending a great deal of money on.” Google’s leaders were open to pondering AI safety. But they preferred to do this pondering themselves, without outsiders second-guessing them. “As a lawyer, I’m actually not sure a company can sign a contract that stops it from seeking maximum value for shareholders,” Harrison continued. “We worried that if we signed something like this, a shareholder could sue us.” In the end, Google swallowed these concerns because of Hassabis. “There is no way we would’ve agreed to the structure without being -- 153 of 565 -- absolutely convinced that Demis represented the future of our AI strategy,” Harrison said later. “Half the deals we do, we accept that we’re going to lose the CEO or founder,” Harrison went on, and Google was usually fine with that. But DeepMind was the reverse case. Google wanted the company, but it also wanted Hassabis. “My job as an M&A adviser was to say, ‘This is unique, this is unprecedented,’ ” Harrison recalled. “The founders [Larry Page and Sergey Brin] were initially with me, concerned about the issues I was raising. But they finished on Demis’s side. So I give Demis credit for articulating a vision that brought the founders along.”[23] At the end of January 2014, Google bought DeepMind for $650 million. Hassabis netted $136 million, far more than Hinton had reaped the previous year, and almost certainly more than he would have received if he had diluted his holding by buying Hinton’s three-man operation. Suleyman pocketed $34 million, having amassed additional shares as his responsibilities had expanded. Legg, for his part, took $29 million—it was plenty to live well, as he had foreseen at DeepMind’s founding.[24] More to the point, DeepMind itself acquired ample resources to keep going. If Silicon Valley behemoths bid top dollar for scientists like Kavukcuoglu, DeepMind could now counterbid. Not long after the Google acquisition, DeepMind was paying $260 million in staff costs annually, perhaps six times more than its total spending during its first three years of existence. [25] For Hassabis, the acquisition also meant that his fundraising ordeal was over. He had the money and computer power of his American parent, but he was still running a nearly autonomous start-up, and he was doing it from London. Nosek, Musk, and many future commentators might lament the fact that he had sold to Google—British political and business leaders would frequently lament that a national champion had gone cheap to the Americans. From Hassabis’s perspective, however, the advantages of the sale were overwhelming. -- 154 of 565 -- OceanofPDF.com -- 155 of 565 -- I CHAPTER 9 INTUITION n May 2014, Hassabis flew out to the West Coast to address a gathering of Google’s senior executives. He held forth about his latest coup: DeepMind’s Atari agent had been substantially improved, and the top scientific journal, Nature, had featured the breakthrough on its cover. After his presentation, Hassabis got chatting with Google’s cofounder, Sergey Brin, and mentioned a possible next project. The techniques that had worked on Atari could be extended to the game of Go. A computer could defeat a world champion. Brin seemed incredulous. He was a keen Go player himself, and he knew that no machine approached the mastery of the best humans. “Wouldn’t that be impossible?” he asked skeptically. “Great!” Hassabis thought to himself. “If he thinks it’s impossible, it should be pretty impressive if we do it.” Hassabis told Brin that cracking Go was absolutely doable. “How long do you think it would take?” Brin asked, still sounding doubtful. Larry Page, his cofounder, was known for insisting that the impossible was possible. Brin was the practical partner. “Two years,” Hassabis responded. He hadn’t given the timeline much thought, he admitted later. Brin’s skepticism was well founded: Go is a game of vast combinatorial complexity. Two players take turns placing pieces on a nineteen-by- nineteen board; after just one move, there are 361 possible positions. After the second move, the number of possible sequences is 361 × 360; after the third move, it is 361 × 360 × 359—which is almost 47 million.[1] The -- 156 of 565 -- number of possible board states during the course of a game is estimated to be at least 10170, way more than the number of atoms in the observable universe. The mind-boggling complexity of Go posed a problem for designers of intelligence. At the dawn of the computer era, a few early agents mastered simpler games by crunching through the consequences of each possible move: The search tree was relatively small, so brute force was effective. Later, more sophisticated agents, including David Levy’s chess program and the teenage Hassabis’s adaptation for Othello, coped with larger search trees by lopping off branches that were obviously bad, a method often known as alpha-beta pruning. But the vast number of permutations in Go rendered alpha-beta pruning powerless. In order to master Go, a machine would have to do what humans do: look at the configuration of the pieces, the patterns that they form, and intuit the correct move, whatever that meant. Plenty of AI researchers agreed with Brin: Certain mysterious human powers would remain inaccessible to computers, and intuition was one of them. When a human hears a knock at the door and sees a portion of a visitor’s face, she doesn’t search laboriously through hundreds of recently encountered faces, finally saying, oh yes, you are number 403 in my memory buffer. Rather, something about the chin or the cheekbones triggers a response, allowing her to recognize the face in a fraction of a second. Quite how this instinctive, nondeliberative thinking happens—“System One” thinking, to use the psychologist Daniel Kahneman’s label—defies explanation. And yet it clearly happens, in social interactions and in complex games. Go was therefore seen as a grand challenge in AI: a peak that might eventually be scaled, but not in the next decade or two.[2] Hassabis had reasons to believe that Go could be solved sooner. These began with Gödel, Escher, Bach: What the human brain could do, computers should be able to do. Intuition, or System One thinking, sounded ineffable, impossible to program. But ever since Cambridge, Hassabis had believed that all facets of intelligence come down to finding patterns in the -- 157 of 565 -- infinity of noise. Intuition amounted to a clever algorithm of some kind—an algorithm that science could discover. Hassabis was also bullish about Go because of David Silver. At Cambridge and again at Elixir, the two friends had dreamed of ways of solving Go. “We had this idea that if we could crack Go, then we could go all the way to AGI,” Silver remembered. Silver had gone on to research Go programs for his doctoral thesis, ignoring the objections from his head of department, who hated to see a brilliant mind wasted on something so intractable. “I was absolutely drawn to Go because I was told it was impossible,” Silver said.[3] Silver made more progress on Go than his department chief had expected. His main innovation was to discard the alpha-beta pruning that had worked for chess, but which was a dead end for games of higher complexity. Deep Blue had succeeded by analyzing each possible move sequence twelve to sixteen plays ahead, then pausing to evaluate which sequences to discard before searching the promising lines more deeply. Because of the nineteen-by-nineteen board, a Go system could calculate only about four moves out before the search tree became intractable.[4] And evaluating a Go position is hard. Chess positions can be scored based on simple rules, such as who controls the center of the board or how many pieces have been taken. A Go agent couldn’t tell which positions looked good after four moves and therefore which move sequences warranted deeper investigation.[5] Silver set out to solve this conundrum with an approach called Monte Carlo Tree Search.[6] Instead of analyzing every possible move sequence, and then pruning the bad ones, Silver’s program followed a small number of sequences all the way to the game’s end. This showed which sequences brought victory: It solved the position-evaluation half of the Go problem. By repeating this narrow but deep search strategy, and selecting the next batch of sequences based partly on which ones had led to victory before, the system identified a large number of winning lines of play: It was like a human player who imagined possible futures, then returned to the beginning and gamed out another set of possibilities.[7] Silver said his system was -- 158 of 565 -- engaging in “introspection”—the slow, deliberative process that Kahneman called “System Two” thinking. One day I asked Hassabis about Silver’s description. It sounded suspiciously anthropomorphic? “In Go, in chess, in life, the first thing that comes to mind isn’t necessarily the best thing,” Hassabis answered. “So humans engage in introspection, and a Go system is doing the same thing. It goes down a path and then backs up and goes down another one, and then it compares all the options. This is a primitive form of introspection. I’m sure the final AGI system will have that.” Completing his PhD in 2009, Silver felt he had pushed Go as far as he could, given the limits of technology. His agent could beat a decent human amateur, but it was a long way from professional level.[8] Yet even as he pursued other research, Go remained on Silver’s mind. Sooner or later an algorithmic breakthrough or additional computer power would create a fresh springboard for progress. He was constantly looking for that springboard.[9] In 2012, Silver persuaded Aja Huang, a Taiwanese scientist whose doctoral work had also involved Go, to move to London and join DeepMind. Huang was a very strong Go player himself, and his PhD project had built on Silver’s experiments with Monte Carlo Tree Search, yielding a Go program that beat all rivals at the 2010 international Computer Olympiad. When the time came to tackle Go again, Silver wanted Huang by his side. In the meantime he enjoyed teasing him about their joint obsession. “Aja, let’s do Go together,” Silver would say. Behind his wire-rimmed glasses, Huang’s eyes would light up. The mere mention of Go excited him. “Not now, Aja.” Silver would laugh. “One day!”[10] In spring 2014, Silver suggested to Hassabis that Go’s day was approaching. The success of the Atari project, combining reinforcement learning with deep learning, provided just the sort of springboard that Silver had been waiting for.[11] It was this suggestion, the culmination of -- 159 of 565 -- conversations begun at Cambridge, that explained Hassabis’s confidence in his meeting with Sergey Brin. Primed by Silver, Hassabis knew that Go could be solved sooner than his new owner imagined. Returning to London after talking to Brin, Hassabis sought out his old friend. A Go breakthrough would shock and awe DeepMind’s new American parent, he reported. Did Silver really think this was the moment? [12] Silver answered that it was indeed the moment. The next time Silver saw Huang, he wasn’t teasing anymore. “Aja, for some reason Google has asked us to start a Go project,” he told him. Huang could not believe his luck. “It was my life’s dream,” he remembered.[13] • • • A MONTH OR SO LATER, Huang joined Silver for a video conference with two Geoff Hinton protégés, Ilya Sutskever and Chris Maddison. Sutskever was part of the boutique that Google had acquired in Tahoe, and was now working at the search giant’s headquarters in Silicon Valley. Maddison was studying for his PhD in Toronto. He was beginning a Google internship. Silver explained his strategy for a fresh assault on the Go challenge. For Atari, DeepMind had solved the problem of a vast search space with the help of deep learning. The Atari agent would have taken forever to try out every possible move in every conceivable game position; deep learning allowed it to induce good moves from previously successful ones, dramatically accelerating the training. For Go, Silver suggested, a deep- learning network could contribute something similar. If the system was shown inputs (board positions) coupled with outputs (the move chosen by a human Go professional), it would learn to map one to the other. By ingesting examples of human moves and learning to reproduce them, it would mimic intuition. Silver was reviving an approach that Sutskever himself had tried some six years earlier. Back when he had been a PhD student, working with small -- 160 of 565 -- neural networks, Sutskever had built a deep-learning system that predicted the move a Go professional would make, outputting the right answer on roughly one in three occasions.[14] The results had been too weak to attract attention, and the experiment was forgotten. But in the half decade since that trial run, deep learning had advanced by leaps and bounds. Silver was proposing to repeat Sutskever’s experiment, this time using modern methods. It was one of those ideas that seemed instantly compelling the moment you heard it. Human beings’ ability to interpret patterns—to recognize a face appearing at the door—is a facet of visual intelligence. Since the ImageNet breakthrough in 2012, deep learning had excelled at vision. Just as Alex Krizhevsky’s program had matched pixels to words, a deep-learning Go system would look at a pattern on the board and map it onto the move that a human expert would have chosen. Silver asked the Hinton protégés to help build the deep-learning system he envisaged. Maddison was especially enthusiastic.[15] He got on a plane and flew to London and spent the next three months working out of DeepMind’s office. Huang introduced Maddison to a database of 150,000 games played by human experts. A game lasted for around two hundred moves, so each game could be viewed as two hundred pairs of inputs (the game positions) and outputs (the moves that the experts had chosen), yielding a total of thirty million input-output combinations. Maddison fed these training pairs into a neural network. To contain the cost of the experiment, Silver decreed that the network should not be too large. He also stipulated that Maddison should initially test his deep-learning system without adding in Monte Carlo Tree Search or other enhancements. He wanted a clean answer to the crucial question: Could a neural network mimic intuition?[16] Sure enough, it could mimic it. Even though Maddison’s network was modest by the standards of 2014, it was still roughly 250 times larger than Sutskever’s system of 2008, and the jump in computational muscle delivered a sharp improvement.[17] Instead of predicting an expert’s move correctly one in three times, Maddison’s network got the answer right a bit -- 161 of 565 -- more than one in two times. By simply looking at the game positions and not even bothering with tree search, it approached the playing strength of the world’s best Go systems, achieving the proficiency of a strong human amateur.[18] The result confirmed the central contention in Gödel, Escher, Bach. Human intuition was not so magical after all. A machine could reproduce it with the trick of mapping one thing onto another. • • • IN THE EARLY DAYS of Silicon Valley, the pioneering venture capitalist Tom Perkins announced a formula for high-risk research projects. Before you invest big money, try to fail fast. Begin with the absolutely hardest piece of the challenge. Take the white-hot risks off the table. Having proved that neural networks worked for Go, Silver reported back to Hassabis. “We’ve got something here,” he told him. Leveraging a low-cost intern, and training his model on a modestly sized network, he had shown that intuition was replicable. Now that the white-hot risk was gone, DeepMind had a chance to solve the long-standing grand challenge of beating a human Go champion.[19] “We had begun with the cheap bet,” Hassabis recalled. “And the results were promising. “So then I had to estimate what would happen if we put more people and resources to work. I thought we could make serious progress.” At the end of 2014, DeepMind began secretly testing a hybrid version of its system against Crazy Stone, the world’s strongest commercial Go program. The hybrid combined Maddison’s deep learning with an improved version of Monte Carlo Tree Search developed by Huang. By early 2015, Crazy Stone had been defeated. Elated, Huang was eager to trumpet his victory by publishing a research paper. “If we publish, we can say we are the world number one!” Huang urged Hassabis. -- 162 of 565 -- “No, no, no,” Hassabis retorted. The goal was not to be the world’s best program. The goal was to defeat the top human professional. Huang thought this ambition was a stretch. He wanted to pocket the intermediate reward, not gamble for the jackpot. Crazy Stone was only about as strong as Huang himself. The top human players existed on an entirely different level. “This goal is not practical! It’s not possible!” Huang complained to Silver. He said it over and over. “IT’S—NOT—POSSIBLE!” “Dave just laughed at me,” Huang recalled later. “He said, ‘Come on, Aja, that’s the whole point of the project.’ ” Huang had already created a program that beat all other programs back when he had done his PhD. To repeat the same feat would be meaningless.[20] At the start of 2015, Silver was simultaneously engaged in a debate with Ilya Sutskever. A few weeks earlier, sporting a black T-shirt and close- cropped jet-black hair, Sutskever had stolen the show at NIPS, the annual AI conference. Announcing the sensational results of a new translation model, he had made a general case for the supremacy of deep learning. A big neural network consisting of millions of neurons, each with a weight and a bias that could be tweaked this way and that, could train on almost any variety of data, eventually discovering the magic combination of parameters that mapped one thing onto another. “If you have a very large dataset and a very large neural network, then success is guaranteed,” Sutskever told his congregation.[21] Back in 2013, DeepMind’s reinforcement learning system for Atari games had been the toast of NIPS: The idea of an agent that learned from trial and error captured the conference’s imagination. One year on, Sutskever’s deep-learning vision of AI was staging a counteroffensive. Sutskever believed that Go might cement deep learning’s ascendency. If Maddison’s deep-learning Go model showed promise with a relatively small network, why not scale it to the max? Maybe you could build a very powerful program just by mapping board positions onto move choices? Perhaps agentic introspection—Monte Carlo Tree Search—was not all that -- 163 of 565 -- important?[22] “Use minimum innovation for maximum results,” as Sutskever had put it to the NIPS audience. Silver agreed that scaling up the neural network could produce significant advances. His PhD supervisor, Rich Sutton, talked about the “bitter lesson” of AI: Scientists were forever trying to dream up brilliant algorithms, but the truth was that keeping the algorithm simple and supplying additional computer power almost always worked better.[23] At the same time, Silver believed that the advantages of scaling were not limited to neural networks. Scale would also improve Monte Carlo Tree Search, he told Sutskever.[24] Sutskever had been born in the Soviet Union, in a city known for building armaments. He often appeared to be scowling, even when he wasn’t.[25] Now he suggested to Silver that scaling up Monte Carlo Tree Search should not be the priority. “Humans can only think ahead for a certain number of steps,” he objected. If humans didn’t analyze move sequences out to a game’s end, why would an AI need to do so?[26] Silver had grown up in provincial England, and his father was a writer and a poet. He had a pixie smile and a perpetually friendly manner. On this occasion, his smile signified assurance: He believed Sutskever was missing something. To be sure, the DeepMind culture usually embraced arguments that assumed computers should emulate humans. But when it came to Go, mere emulation was not actually the goal. Silver wanted an AI that would defeat humans. “If you just pattern-match what humans do, it is not going to take you all the way to beating the top human,” Silver said. “The system needs to discover new moves which aren’t humanlike.”[27] Sutskever and Chris Maddison had helped to reproduce intuition, providing Silver with his springboard. But now Silver wanted to bounce into the beyond—to build a machine that would search the infinity of permutations in Go and come up with entirely novel strategies. • • • -- 164 of 565 -- IN EARLY 2015, Silver’s vision seemed quixotic. After the defeat of Crazy Stone, progress on the Go system stalled, and Huang kept telling Silver that improvement was impossible. A couple of researchers lost hope and moved off to other projects. “People didn’t really believe,” Silver remembered later. One day Silver surprised Huang with a fresh initiative. “Aja, I know you are skeptical,” he said. “But I am going to show you.” “OK, show me,” Huang responded. Silver introduced Arthur Guez, a Canadian who had earned a PhD at University College London under Silver’s supervision. Guez was going to build a complement to Maddison’s neural network. Rather than looking at a board position and proposing the best moves, Guez’s network would look at a board position and evaluate the odds of victory.[28] “Impossible,” Huang muttered. Guez set to work, repeating the by now familiar process of organizing data into a form that would support deep learning. He isolated the inputs (board positions taken from the database of games played by human experts) and labeled them with outputs (whether those board positions had resulted in victory). Then he trained a neural network on these input-output pairs, so that the system learned to map one thing onto the other. After a while, Guez handed his model over to Huang, who stitched it into the existing Go system. As Huang had expected, there was no boost to performance. Guez tried again. Unlike the doubters who had quit the team, he was a fierce believer. After two months of iteration, he presented his latest version to Huang. This time the results were astonishing. “Wow, wow, this thing is incredible!” Huang remembers thinking. Once Guez’s network was grafted into the system, it beat the old version in more than nine out of ten games. “At that moment, I started to believe David Silver,” Huang said.[29] DeepMind now had two intuitive models: Maddison’s “policy net,” which looked at a board position and suggested a move, and Guez’s “value net,” which looked at a board position and assessed the odds of victory. -- 165 of 565 -- Both could be described as System One networks, mimicking the fast- thinking parts of human intelligence. But what was really powerful was how the intuitive deep-learning models worked with the introspective reinforcement learning, the tree search—the deeper, slower, System Two side of intelligence. Thanks to Maddison’s policy net, the search algorithm no longer began with an unfathomable number of possible moves, with no way of knowing which might be fruitful. Instead, it could start by analyzing the moves that an expert might make, radically pruning the search tree. Likewise, thanks to Guez’s value net, the model no longer had to crunch through move sequences all the way to the game’s end. Instead, it could follow sequences as far out as its computing resources allowed, then ask the value net to score the resulting positions. Equipped with the knowledge of which position was best, the system could play the move that led to it. The problem of vast combinatorial complexity had been vanquished. In April 2015, a German scientist and Go player named Thore Graepel showed up for his first day at the DeepMind office. He had a handsome, upright bearing, befitting a man whose name derived from the Norse god of thunder. Silver went to greet the new recruit. Armed with the ample resources provided by Google, DeepMind had poached him from Microsoft. “Look, we have this super exciting project,” Silver said, a bit mysteriously. “We want to try it out on you.” Graepel followed Silver to a table in an open-plan atrium. By now, DeepMind had moved to a fancier office in London’s King’s Cross, where Google was building its European headquarters. Graepel took a seat across from Aja Huang. On the table was a Go board. Graepel understood that Huang was not his real opponent. Huang’s role was merely to place pieces on the board, executing commands issued by a Go program. “I thought, OK, I’ll play it safe,” Graepel remembers. Most public Go programs were not that great, and Graepel was an accomplished human player—less highly ranked than Huang, but stronger than Hassabis or Silver. -- 166 of 565 -- Graepel played some standard moves, figuring there was no need to get aggressive. “How good can it be?” he kept asking himself. DeepMind staffers stopped by to watch. Hassabis himself showed up and joined the audience. After around half an hour, Graepel was losing. The machine recognized patterns and move sequences at least as well as he did. Eventually, it ground him down. “That day I added to my CV: first person to lose against DeepMind’s baby Go system,” Graepel said later.[30] Around this time, Silver’s team bumped up against a new ceiling. It had ransacked the internet for all available records of expert human games. There were no more left. Progress stalled again. To get around this bottleneck, Silver drew from reinforcement learning. Deep-learning systems depended on human-created data. But reinforcement-learning agents created their own data through trial and error. By playing millions of games against itself, the agent could radically expand its corpus of training material. DeepMind had to be careful about how it used this material. If the new training data from self-play had been fed into the policy net, the effort would have dead-ended. It would have been like trying to teach a magician new tricks by showing her some sleights of hand that she had herself invented. After all, the Go agent’s moves during self-play had been proposed by the policy net in the first place. To sidestep this trap, Silver and his colleagues used the data from self-play to improve Guez’s value net. Unlike the policy net, which mapped Go positions onto subjective outputs (the moves that human players had chosen), the value net mapped Go positions onto objective truths (whether the positions led to victory). This was a sturdier variety of data—a win is a win, whereas a human move is just a recommendation. By playing games against itself, Silver’s agent had collected millions of new examples of win/lose truth. By training on these, Guez’s value net could refine its assessments of the win probability for any given board position. The improvement to the value net kicked off a virtuous circle. Because the value net was now better at judging which board positions were -- 167 of 565 -- advantageous, the agent chose moves more wisely. As a result, as self-play continued, the games were of a higher standard. This gave Maddison’s policy net the opportunity that it had lacked before. By learning from the higher-quality moves generated through self-play, the policy net grew stronger. As self-play went on, the virtuous circle kept spinning.[31] Five years earlier, DeepMind’s business plan had laid out a theory of intelligence. The brain was composed of powerful components, but its genius was to integrate them deeply. Coming on the heels of Atari, Silver’s Go system provided a second vindication. • • • IN OCTOBER 2015, DeepMind approached a milestone. The baby agent that had defeated Graepel now also beat Huang easily. It was time to test it on a stronger opponent. Huang emailed the three-time European Go champion, a Chinese-French professional named Fan Hui, inviting him to London for a secret five-match contest. “I wrote to him in Chinese. I called him teacher—Teacher Fan,” Huang recalled. “I told him we were doing a Go project. I said, ‘It probably can beat you.’ ”[32] Fan dismissed the notion that a machine could defeat him. But he lived in the quiet French city of Bordeaux. A trip to London was appealing. DeepMind booked a room for Fan in the elegant Great Northern Hotel in King’s Cross. Huang and Graepel went over to the hotel restaurant to meet him. Huang repeated his warning. “You need to do your best. Play carefully, think deeply.” Fan remained blasé. “No, come on. Very easy.”[33] The next day Fan showed up at the DeepMind office, still predicting victory. He promptly lost his first match against the program. He regrouped, revised his strategy, and the next day he lost again. The computer, he said, seemed “like a wall.”[34] -- 168 of 565 -- “He was very, very shocked,” Huang remembered. The DeepMind team was surprised also. “Literally none of us knew what was going to happen,” Silver recalled.[35] Who could say how big the gap was between a strong amateur like Huang and a professional like Fan? “You couldn’t know until you did that calibration,” Hassabis remembered. Graepel had bet his colleagues that the agent would lose at least one game. He figured that something would go wrong somewhere. But the computer beat the human five games straight. For losing the bet, Graepel showed up at the office dressed as an ancient Japanese Go master. On the last day of October, with the agent’s triumph still fresh, DeepMind confronted a different form of competition. Facebook’s chief technology officer announced to a roomful of reporters that the company was working on a Go model. Perhaps a bit rashly, DeepMind had explained the design of Maddison’s policy net in a paper published in December 2014; not surprisingly, Facebook’s starting point for its model closely resembled DeepMind’s. Facebook had teams in New York and California working on its project. Their model was already stronger than search-based systems such as Crazy Stone.[36] Hassabis responded quickly. He knew that for now DeepMind was ahead. But the landmark Fan Hui match had been conducted behind closed doors; unaware that it had happened, journalists reported Facebook’s boasts as though the upstart were the leader—not just at Go, but in AI more generally.[37] To ensure that DeepMind’s ascendancy was recognized, not least by Google’s top brass, Hassabis resolved to do two things. He would get the news of the Fan Hui match published in a top scientific journal. And he would quickly follow publication with a match against the world’s top human Go player. When it came to marking territory in scientific journals, Hassabis was a master. He had proved this a year earlier, by getting the paper about DeepMind’s Atari system into Nature. His colleagues had doubted that this was possible: Nature had never published a paper on computer science, as far as anybody could remember. But Hassabis had befriended an editor at -- 169 of 565 -- the journal and spent the best part of a year persuading him to break the mold: AI was a rising science, and Nature should put its stamp on it. Next, to boost the chances that the article would not merely be published, but would be featured on Nature’s cover, Hassabis had turned to the graphic artists who worked on DeepMind’s video games, and the artists had dreamed up a cover showing DeepMind’s noble Atari agent battling space aliens. Breaking all precedent, Nature had rewarded Hassabis by both publishing the article and splashing it on the front cover. “Doing something original is so difficult,” Hassabis reflected later. “It’s almost your moral imperative to milk your creative successes to the max, so that you get the resources to go again with the next thing.” After the Fan Hui match, Hassabis set the team to work on a second pitch to Nature. Meantime he asked Silver when his system would be strong enough to make the leap from defeating the European champion to outclassing the world champion. Silver knew that, as self-play continued, the agent was getting stronger. But the uncertainty that had concerned him before the Fan Hui match was unavoidable. There was no precise way of measuring the gap between Fan Hui and the world’s top players. “I think we’ll be able to beat the world champion by March,” Silver ventured, with only a moderate degree of confidence. “Right,” Hassabis said. “We’re going to do this.”[38] • • • IN JANUARY 2016, Nature duly published DeepMind’s Go paper, again featuring it on the front cover.[39] The day before publication, following the usual protocol, the journal distributed embargoed copies of the article to journalists. A reporter called Facebook for comment, and word promptly reached Zuckerberg. Exhibiting the competitive bite that he had shown when he had tried to poach DeepMind’s research director, Koray Kavukcuoglu, Zuckerberg rushed out a hasty announcement before the Nature article went public, trumpeting Facebook’s considerably less -- 170 of 565 -- impressive Go project. It was “a bizarre and hapless bid for preemptive PR,” the journalist Cade Metz observed.[40] It was also a foretaste of the AI race that would start in earnest later. The press brushed Facebook aside and focused on DeepMind. With its victory over Fan Hui, DeepMind’s agent, now dubbed AlphaGo, had defeated a human champion for the first time, doing so about a decade earlier than experts had expected. What’s more, Hassabis coupled the release of the Nature cover with an announcement: In March, AlphaGo would play Lee Sedol, a legendary South Korean Go master and winner of eighteen international tournaments.[41] DeepMind had put a $1 million prize on the table. Hassabis had thought hard about the choice of opponent. His first idea had been to play a Japanese champion. But at the time of his decision, no Japanese player was quite in the top rank; South Korea and China were the world’s two Go superpowers. Examining these options, Hassabis soon settled on Lee Sedol, not just because of his prowess, but because of something he embodied. Lee had grown up on a small South Korean island and become a Go professional at twelve; he exuded a “noble warrior spirit,” Hassabis reckoned. Further, Lee’s gladiatorial rivalry with the Chinese Go master Gu Li attracted huge audiences in Korea; the geopolitical overtones recalled the classic Cold War chess contest between Bobby Fischer and Boris Spassky. A match between Lee and AlphaGo, the equivalent of Garry Kasparov’s bout with IBM’s Deep Blue, would cause Go-crazy Korea to go crazier. “Lee was a national hero. Koreans love Go. They also love AI,” Hassabis said later.[42] The timing of the match had also required judgment. Silver had guesstimated that AlphaGo would be ready in March, but several members of the team wanted a safety margin. Every so often, the system would “hallucinate,” choosing a move seemingly at random. Hassabis overruled the doubters because of the threat from other AI labs. Facebook was already breathing down DeepMind’s neck, and the Nature cover story had revealed how AlphaGo worked, laying out the combination of a policy net, a value net, and Monte Carlo Tree Search. Because of the reverence for Go in their -- 171 of 565 -- country, the Chinese internet giants, notably Tencent, would also seize on the Nature paper. The decision to go full steam ahead was clinched by an assist from DeepMind’s parent company. At the end of 2015, Huang and his colleagues began to run AlphaGo on a new kind of hardware—a special Google chip that supplanted Nvidia’s graphics processing unit. The new “tensor processing units,” or TPUs, could speed through calculations even faster than GPUs; by rounding off numbers to the nearest integer and sacrificing a small amount of precision, they could perform trillions of extra multiplications.[43] When Huang tested out Google’s new semiconductors, it was another wow moment: AlphaGo with TPUs had an 80 percent–plus win rate against AlphaGo with GPUs. Fan Hui, who by now had been recruited to the DeepMind team, reported that the souped-up AlphaGo had a different style of play. Its moves were creative—even beautiful.[44] A few weeks before the match in South Korea, Google’s chairman, Eric Schmidt, visited Hassabis in London. If DeepMind was staging a Deep Blue–Kasparov type of spectacle, Schmidt wanted to be sure of victory. “How’s it going?” he asked Hassabis. “The metrics look good, but we still have some hallucinations.” “Great, just don’t fuck it up,” Schmidt said, only half-joking. • • • IN MARCH 2016, Hassabis, Silver, and the team duly arrived in Seoul. Eric Schmidt flew in from California, and so did Jeff Dean, the guru behind Google’s TPU chip. Sergey Brin, cofounder and Go enthusiast, joined three days later. The full drama of the occasion took the visitors by surprise: There were armies of media and giant screens in the streets so that pedestrians could glimpse the action. Over two hundred million people would watch the face-off between man and machine. It was more than twice the audience for Deep Blue’s defeat of Kasparov—more even than the Super Bowl.[45] -- 172 of 565 -- Silver felt daunted. “I’d underestimated how big a deal it would be by two orders of magnitude,” he said geekily, affixing numbers to that queasy feeling in his stomach. Lee Sedol appeared confident. He had studied the move-by-move breakdown of the games against Fan Hui, which had been published in Nature, and he predicted that he would win 5–0 or 4–1, since he was far stronger than Fan was. Most Go professionals agreed. Defeating DeepMind would be the easiest million dollars a top pro could hope for.[46] “I’m going to do my best to protect human intelligence,” Lee vowed earnestly.[47] When game day arrived on March 9, Aja Huang sat in a sparse room in a black leather chair, the Go board in front of him. To his left was the computer screen that displayed AlphaGo’s move choices, generated by servers on the far side of the Pacific Ocean. Across from him sat Lee Sedol, whose moves would be generated by adrenaline and coffee. Minutes into the first game, the human was in trouble. Lee attempted to confuse AlphaGo with an unorthodox third move and an immediate skirmish; he was deliberately reaching for strategies that would be outside the computer’s training set. But AlphaGo appeared unfazed. Lee had underestimated how much the system might have improved since the Fan Hui match in October.[48] Lee looked by turns shocked, amused, and grimly accepting. He sat back in his chair and smiled. He massaged his neck. Everything he had expected from studying the Fan Hui games was turning out to be irrelevant. The system could be beatable one day and invincible five months later. Eventually, Lee resigned. “I didn’t foresee that AlphaGo would play the game in such a perfect manner,” he confessed at the postgame press conference.[49] The next day, for the second game, Lee tried something different. He played cautiously, waiting for AlphaGo to make an error. After thirty-six moves, he took a break to smoke, then came back to study the position. In his absence, AlphaGo had played Move 37: a black stone placed in a mostly empty zone, striking at Lee’s right flank. -- 173 of 565 -- Lee took fully twelve minutes to respond. He had never seen a move like this before. In another room not far away, the world’s top-ranked Western Go player, Michael Redmond, was following the game by video and live streaming to a global audience. He too was flummoxed. Seeing the move that AlphaGo had chosen, he placed a black stone in the corresponding position on the board in front of him. Then he removed it. “No, that can’t be right,” he muttered. And yet it was right. After checking the screen again, Redmond put the stone back in its strange place and tried to make sense of it. “I don’t really know if it’s a good move or a bad move,” he confessed to the fans watching his live stream. It turned out to be a great move. When the game ended more than a hundred moves later, Move 37 proved to be decisive. “When I saw this move…I thought surely AlphaGo is creative,” Lee said this time at the postgame press conference.[50] “I am quite speechless,” he added.[51] The next day was a rest day. The DeepMind scientists went for a stroll in the city and stopped at a newsstand. The front page of every newspaper featured AlphaGo. A young woman spotted Hassabis in the street, recognizing him instantly. She mimed a swoon, as though Hassabis were a pop idol. “It happens all the time,” Hassabis assured a journalist who was with him.[52] Of course, the opposite was true. For AI researchers everywhere, everything had changed. AlphaGo had brought an end to obscurity, humility, and innocence. The next day, the machine defeated Lee a third time. The Korean was playing some of the best Go of his career, but AlphaGo outclassed him. At that day’s press conference, with banks of cameras flashing in his face, he apologized to all humans. Like Fan Hui before him, he had started out confidently and quickly fallen down to earth. “I kind of felt powerless,” he admitted. -- 174 of 565 -- What were humans supposed to do in the face of machine superintelligence? If you can’t beat it, join it, was one possible response: After losing 5–0, Fan Hui had signed on with DeepMind, and Fan had even suggested that defeat had opened his eyes to the full possibilities of existence. “I can see the world is so much bigger than I thought before, and I really like this feeling,” he marveled.[53] It was a sweetly humble sentiment, but it glossed over the reality of human loss. Machine superintelligence expanded possibilities, of course. But it also threatened humans in the most unsettling way: by hinting that their intuitions and ideas would one day cease to matter. Another response to superintelligent machines was to keep fighting them. In game four in Korea, Lee Sedol managed a surprise upset against AlphaGo. With Move 78, a masterstroke that came to be known as his Hand of God move, Lee produced a ploy so unusual and so bold that the computer was wrong-footed. Feeling the algorithmic equivalent of desperate, AlphaGo began to make nonsensical moves, hallucinating, undermining its position, flailing about in a display of inhuman humanity, and ultimately resigning. Lee celebrated this victory, saying that he felt a supreme warmth, and intimating that humans had not yet been subjugated. Fans chanted his name, and a computer programmer in Florida had Move 37 tattooed on one arm, Move 78 on the other.[54] And yet fighting the computer, celebrating its failures, felt as inadequate as Fan Hui’s contrary response. Three years later, when Go systems had grown massively stronger, a saddened Lee announced his retirement, saying he no longer felt joy in playing. • • • THE DEEPMINDERS THEMSELVES were unsure how to process AlphaGo’s victory. AlphaGo had been built by humans. It was not some sort of alien force; it was a manifestation of human drive and curiosity. But the DeepMinders also empathized with Lee Sedol’s despair. “I couldn’t celebrate,” Hassabis -- 175 of 565 -- recalled of Lee’s 4–1 defeat. He knew what it was like to compete ferociously and be beaten. A few years later, I asked Thore Graepel what he had felt as machines surpassed humans. “The early version of our Go system played as a human would. It rediscovered certain strategies that humans had learned over millennia. Which was very reassuring for us,” Graepel told me. “Then it discovered that certain time-honored human stratagems can actually be counteracted. So it discarded them. “And then, as the system became stronger, it played like nothing we’ve ever seen. It came up with a style that was completely alien. “It played stones that appeared to be randomly sprinkled across the board. But as the game progressed, thirty, fifty, a hundred moves in, you’d feel all of these stones work together…” “And the noose is tightening around your neck?” I asked, a bit nervously. “Exactly,” Graepel nodded. “Exactly. Magically. “Of course, not magically! By the foresight of the algorithm. It’s only to a lesser intelligence that it seems magical. “This is how we have to imagine the future. In the domain of Go, we have achieved superhuman intelligence. We can observe what it feels like to interact with it. “At first, it looks harmless. Then it’s just completely dominating. We don’t understand the mechanics, the tactics, the strategies. We just know that it is in control.”[55] OceanofPDF.com -- 176 of 565 -- O CHAPTER 10 OUT OF EDEN n August 14, 2015, while AlphaGo was secretly amassing strength, Hassabis joined Google’s top brass for a meeting of DeepMind’s ethics and safety review group. On nearly every dimension, his relationship with his American parent was going well. Google had liberated him from the fundraising hamster wheel. Google had allowed him to retain DeepMind’s independent culture in London. Google had even granted his followers a privileged status: DeepMinders could get into any Google building globally at the swipe of a key pass, and help themselves to the free food; but Googlers were barred from DeepMind’s premises. Meanwhile, Google was providing Hassabis with the wherewithal to hire top scientists and train costly models. Some weeks, a single research team at DeepMind might gobble up more computational resources than Google’s worldwide Gmail network, which had nine hundred million users. The ethics and safety meeting promised to be fraught, however. Breakthroughs such as Vlad Mnih’s Atari agent had imbued AI safety discussions with a fresh urgency. The higher the existential stakes, the bigger the egos that clamored to take part, and the harder it became to forge consensus. Hassabis had already experienced this dynamic in early 2014, when Elon Musk tried to buy DeepMind, allegedly to safeguard it for humanity. A year later, Musk remained bitter that his bid had been spurned; if he couldn’t be the one to build AI, he wanted nobody to do so. Alluding to the “close to exponential” progress taking place behind closed doors at DeepMind, he asserted that “the risk of something seriously dangerous happening is in the five-year timeframe, ten years at most.”[1] To ward off -- 177 of 565 -- the risk of a catastrophe, he donated $10 million to the safety-minded Future of Life Institute and called for AI regulation—all the while hustling to recruit AI scientists to Tesla.[2] If this brand of rivalrous alarmism were to dominate the deliberations of the DeepMind safety board, they were not going to be constructive. Awkwardly, none other than Musk would host the safety meeting. This was the result of a gamble: In April 2015, hoping to placate their bumptious and presumptuous frenemy, Hassabis and Page had invited Musk to join the safety board, even granting him the honor of convening its first exploratory session.[3] Musk had readily accepted, but he continued to fulminate against DeepMind, denouncing Hassabis as an evil genius, the evidence being that Hassabis had once worked on a computer game called Evil Genius.[4] Sometime in this period, Musk agreed to meet Hassabis and Suleyman for lunch in central London. Suleyman remembers him arriving at the restaurant with his ethereal wife Talulah in the back of a rather small Tesla; with his six-foot-two frame, Musk’s knees were practically in his mouth, and he had trouble clambering out of the vehicle. Over lunch, Musk kept up his griping, effectively accusing DeepMind and Google of irresponsibility. A month after that London encounter, on the evening of May 25, Musk received an email from an investor named Sam Altman. At thirty, Altman already had a seat at Silicon Valley’s top table; he was running Y Combinator, the start-up incubator that had birthed a slew of outstanding ventures, including Airbnb and Dropbox. But he always had an eye for the next big thing. A student of power, he had once told a friend to read Robert Caro’s books on Lyndon Johnson, the better to get ahead in Silicon Valley. [5] “The most successful founders do not set out to create companies,” he observed. “They are on a mission to create something closer to a religion.”[6] Altman often met Musk for dinner on Wednesdays, when Musk visited the Bay Area on his weekly rotation through his various companies. [7] “Been thinking a lot about whether it’s possible to stop humanity from developing AI,” Altman now wrote. “I think the answer is almost definitely not. -- 178 of 565 -- “If it’s going to happen anyway, it seems like it would be good for someone other than Google to do it first,” the email went on, playing into Musk’s obsessions. Having set the table skillfully, Altman popped a proposal. “Any thoughts on whether it would be good for YC to start a Manhattan Project for AI?” he asked, referring to Y Combinator. Evidently, Hassabis was not alone in seeing the parallels between artificial intelligence and the atomic bomb. Altman shared a birthday with J. Robert Oppenheimer, the Manhattan Project’s leader. He liked to point this out to interviewers. “We could structure it so that the tech belongs to the world,” Altman’s email continued. “Obviously we’d comply with/aggressively support all regulation.” Two hours later, Musk responded. “Probably worth a conversation.” For the next month or so, Altman pressed his case to Musk, determined to access both his prestige and his capital. For someone who hungered to create a company that was akin to a religion, the prospect of building an infinity machine was irresistible. “I think we’d ideally start with a group of 7–10 people, and plan to expand from there,” Altman wrote to Musk in a follow-up email on June 24. The venture would be structured as a foundation, he added. It would have a five-person oversight board—Musk, Bill Gates, and none other than Altman would occupy three of the seats on it. “The technology would be owned by the foundation and used ‘for the good of the world,’ and in cases where it’s not obvious how that should be applied, the five of us would decide,” he suggested. “Agree on all,” Musk responded.[8] A few days later, Musk celebrated his forty-fourth birthday. Two years had passed since the party at which Larry Page and Hassabis had bonded. This time there was no fake medieval castle and no visiting sumo champion. The gathering took place amid the rolling vineyards of Napa, California, at a secluded resort dotted with cabins. After dinner the first evening, Page and Musk sat outside together, near a pool and a glowing fire pit. Inevitably, the conversation turned to AI. -- 179 of 565 -- Musk was in full-blown paranoia mode. He was terrified of superhuman intelligence. Speaking in his raspy voice, Page told him not to worry. He said he looked forward to a time when people might merge with intelligent machines, or when machines might simply replace humans. Evolution would ensure that the best form of intelligence won out; if the best form involved fast silicon circuits rather than slow biological tissue, so be it. There was no point being sentimental about such things. It would be survival of the fittest. Page was channeling a view that had been around for half a century. In 1964 the science-fiction writer Arthur C. Clarke had called it a privilege for humanity to be a stepping stone to higher things. “I suspect that organic, or biological, evolution has about come to its end, and we’re now at the beginning of inorganic, or mechanical, evolution, which will be thousands of times swifter.”[9] Whatever the pedigree of Page’s position, Musk was appalled by it. He was rooting for human survival, not for the emergence of a more sublime intelligence.[10] Page retorted that Musk was a “speciesist”—a bigot with a soppy prejudice in favor of carbon over silicon.[11] The speciesist epithet sent Musk over the edge. Here was the final proof that Page was not be trusted as the steward of the world’s most critical technology.[12] It was an outrage, Musk thought, that AI was in the hands of a transhumanist in Mountain View and a cartoon villain in London. When I first heard of Page’s speciesist remark, I wondered how seriously to take it. According to witnesses, the conversation had stretched into the chilly hours.[13] Perhaps Page had been tired? Perhaps he was joking? Hassabis agreed. “I mean, look, who’s not on team humanity? What does that even mean? How could you not be? Larry’s a quirky personality, but he was voicing a certain logic, not espousing a belief. He would’ve been thinking that was a fun, philosophical discussion.” I consulted others who worked closely with Page. The verdict was less reassuring. -- 180 of 565 -- “Larry loves technology,” one acquaintance began. “I think he loves technology more than the average cat. “Larry thinks machines are better than humans for so many things. If we could just get rid of the human obstructions and let machines do stuff, we would unlock all kinds of progress. “Those are his values. That’s why he has come up with so many cool products. But the flip side is that he pushes to the extreme, where everybody’s living in a computer chip.” Uploading one’s brain to a computer was another long-standing sci-fi idea, often proposed by AI futurists as a route to immortality. With a few more breakthroughs in computer science, theological speculation about a heavenly afterlife would be rendered obsolete. The essence of a person could be replicated on silicon and preserved for all eternity.[14] “Larry sometimes seems to think we don’t need hummingbirds, or whales in the ocean,” the person concluded. “You’ve got to remind him, whales are also pretty cool. They represent fifty million years of evolution.”[15] In the end, Musk’s jealous mistrust of Page was mostly about Musk: his ego and his demons. But Musk also had a point: It was unnerving, to say the least, that someone with an uncertain commitment to human existence should find himself in control of such a consequential technology. Either way, DeepMind’s safety and ethics review meeting was not going to generate a relaxed meeting of minds with Musk and Page at the same table. Meanwhile a second split within the ethics board raised questions about Hassabis’s vision of the future. In common with most AI pioneers, Hassabis hoped that a single scientific effort would drive the technology forward. When he had attended the Singularity Summit back in 2010, a good share of the world’s AI believers could fit inside a conference hall: There was one AI community. Anybody who addressed that community—whether in San Francisco or elsewhere—was calling upon the full congregation of the faithful to buy into a vision: a vision that usually included a theory of how artificial general intelligence might be built, coupled with a call to safeguard it. And although these visions might vary subtly from one person -- 181 of 565 -- to the next, believers shared a common horror of sectarian splits. Whatever vision of AI won out, a single vision would be better than many. If numerous labs competed to put the technology out into the world, the race to come first would sideline scruples about safety. Hassabis was never part of the Singularity crowd. But he shared the assumption that a “singleton” scenario provided the best shot at safe AI— not least because, if a single lab was going to take the technology forward, the most obvious contender was DeepMind. Moreover, when it came to the final steps to artificial general intelligence, Hassabis’s fascination with science fiction and scientific history fused into a heroic vision: He imagined convening a band of elite scientists in a secluded research center, there to focus single-mindedly on the birthing of safe superintelligence. This mash- up of Ender’s clandestine space station and the Manhattan Project’s secret encampment in New Mexico bubbled up in conversation periodically, including when Hassabis met job applicants for final interviews. Perhaps testing their level of commitment to DeepMind’s mission, Hassabis would inform candidates that, if they signed on, they should prepare for a climactic endgame when they might have to disappear into a bunker.[16] One day I met a former DeepMinder who had been on the receiving end of Hassabis’s recruitment riff. What did Hassabis mean by the bunker, I wondered? Was it just a metaphor? No, came the reply. “At any stage when I was at DeepMind, if Demis had told me to get on a flight to a secret location in Morocco, I would have felt that I had been given fair notice.” Why Morocco? “Oh, the desert. I was just thinking about the Manhattan Project. That was in a desert.” I suggested that the bunker talk expressed Hassabis’s desire for scientific focus, not a paranoid desire for military-style secrecy. Hassabis had often told me that he wanted to assemble a dream team in a dream setting—a place with a clear mission and absolutely no distractions. “Perhaps Demis was saying that he craved seclusion,” my acquaintance acknowledged. “But the bunker was also a hideaway from hostile powers. -- 182 of 565 -- You know, powers that wanted to get their hands on our technology.” After all, the Manhattan Project had been both things: a thrilling scientific sprint, and an attempt to hide the bomb behind a shroud of mystery.[17] Later, perhaps in an effort not to alarm people, Hassabis modified his bunker terminology. The last steps toward superintelligence should take place under the aegis of an international scientific agency, he would say— something like CERN, the European Organization for Nuclear Research. Hassabis imagined himself going off to join this effort, together with other DeepMind scientists, plus stars from academia; Terence Tao, by some reckonings the world’s top mathematician, was one name he liked to mention.[18] “We could assemble a council of the hundred wisest people from all the corners of earth,” he told me on another occasion. “Philosopher-kings, like in Plato’s Republic but more diverse, and I’d advocate for the pope to be on there.” But whatever the detail of Hassabis’s vision, he was consistent in imagining a united team, fighting to deliver safe AI on behalf of all humanity. A singleton effort would surely be better than an anarchic charge over the precipice. Not everyone on the safety board was convinced. The chief skeptic was Reid Hoffman, an engaging, polymathic billionaire and founder of the social network LinkedIn. Drawing on his feel for politics and history, Hoffman regarded the singleton scenario as hopelessly unrealistic. Absent coercion, humans do not coalesce into a single unit. Rather, they are disputatious, competitive, and tribal. It followed that calls for a singleton AI effort were no more likely to be heeded than their equivalents at the dawn of the nuclear age. After the atom bomb destroyed Hiroshima and Nagasaki, Oppenheimer urged the United States to transfer its nuclear monopoly to some kind of CERN-like international organization, which would in turn prevent nations from acquiring their own arsenals. Nothing had come of Oppenheimer’s proposal.[19] The more realistic path to AI safety, Hoffman argued, was to learn from multiparty democracy. The leaders of the field should back a handful of frontier research labs. Each would be animated by its own vision, and each would provide a congenial home for one part of the AI community. As in a -- 183 of 565 -- multiparty democracy, this pluralism would be balanced by a shared commitment to bedrock values: For democracies, the shared values included fealty to the constitution and the rule of law; for AI development, they would involve a good-faith commitment to AI safety. From Hoffman’s perspective, Musk’s egocentric fury was extreme. But it illustrated a larger truth about the inevitability of AI competition. • • • WITH ALL THESE tensions roiling under the surface, a dozen power brokers descended upon SpaceX for the first, informal meeting of the ethics and safety group.[20] In addition to Google’s top brass, all three DeepMind founders attended, and so did a handful of outsiders, including Musk, Hoffman, the Oxford philosopher Toby Ord, and Peter Dayan, the director of the Gatsby Unit. You could cut the tension with a knife. “Larry and Elon hated sitting in the same room as each other,” Suleyman recalled. “It was just awkward.”[21] The group ate dinner and listened to some presentations about AGI and what it might signify for humanity. Hassabis summarized DeepMind’s research road map. Shane Legg went over his timeline for getting to AGI and the risks that a malign agent might escape from its box, hack into critical online infrastructure, and otherwise threaten humanity. Some way into the conversation, Suleyman addressed the gathering. Suleyman was determined to make the most of this forum, which existed largely because of him. During the acquisition negotiations with Google, it was he who had insisted most forcefully on the ethics and safety review process, as it was formally known, and Page’s recent speciesist comment had only reinforced the case for oversight of Google’s AI decisions. Google was a for-profit corporation controlled by two idiosyncratic founders. “I just wanted to get the tech into the hands of a credible, independent board of responsible citizens.”[22] Rather than focusing on the risk of an AI agent turning on its human creators, Suleyman’s presentation stressed the threats to social cohesion. -- 184 of 565 -- The way he saw things, AI might cause mass unemployment, meanwhile concentrating money and power in the hands of tech elites, intensifying inequality. The winners from this upheaval, most notably Google, would need to find a way of sharing wealth with the millions who lost out. “They’re all going to see us as the demons,” he told his audience.[23] There were icy looks around the table, but Suleyman kept going. The final slide in his handout showed a still from the TV show The Simpsons. In the scene, the townspeople charge forward carrying cudgels and torches. “The pitchforks are coming,” he announced darkly. The room fell silent. After a pause, Larry Page spoke up, objecting that AI would create more jobs than it destroyed: such had been the experience with past technologies. People were adaptable. Solutions would be found. There was no need to worry.[24] Eric Schmidt, the Google chairman, weighed in too. He dismissed Suleyman’s concerns, pointing out that despite the internet, mobile computing, and other impressive advances, unemployment was low by historical standards. When machines displaced human labor and drove production costs down, the result was lower prices, which in turn boosted demand. Higher demand meant that more products got shipped—and more jobs were created. Suleyman stuck to his argument. AGI was unlike the previous innovations on which the Googlers based their optimism. In the past, word processors, digital databases, and online search had taken over particular tasks; for the most part, they had not replaced human jobs in their entirety. In the future, however, machines would think, acquiring the versatility to do jobs rather than just tasks; what’s more, they would outperform humans not only at the jobs that existed today, but also at the ones that might be invented tomorrow. Artificial general intelligence was a general technology, with scarily generalized effects on human relevance. At a minimum, Suleyman insisted, AI would be good only if Google acted to make it good, both by promoting socially beneficial uses of AI and by limiting its dangers. When dinner was over, the visitors toured the SpaceX factory and the neighboring Tesla design studio. Musk led his guests past the hulking -- 185 of 565 -- shapes of rocket parts, each attended by robotic arms and teams of human technicians, who apparently thought nothing of working late into the evening. It was extraordinary that this industrial temple wasn’t enough to satisfy Musk’s ambition. The group passed by a closet housing the computer servers, and Hoffman noticed a sign on the door that said “Skynet.” It was a reference to the AI system in the Terminator movies—the one that is hell-bent on eliminating humans. Reid Hoffman pointed out the sign to Hassabis. Turning to Musk, Hassabis remarked, “You know what you would start saying about me if I had something like that on one of my systems!”[25] In fairness to Musk, he was not the only engineer to find Terminator tropes irresistible. The military-industrial complexes in the United States and Britain had each seen fit to name a futuristic system after Skynet. The prospect of killer computers was frightening. Somehow, it was also thrilling. • • • LOOKING BACK ON the lessons from the safety board meeting, Hassabis recalled his disappointment. The conversation had done nothing to heal division, and the evening had ended without clear agreements or conclusions. Musk feared the emergence of an existential threat, and favored regulation. For their part, Hassabis and the Google leadership believed AI was still too primitive to warrant government restrictions. At the same time, Suleyman’s stress on nearer-term social dislocation exposed a difference not only between DeepMind and Google, but also between himself and Hassabis. In the summer of 2015, Hassabis’s overriding priority was to secure the resources to train AlphaGo; alienating Google’s bosses with speculative hand-wringing and cartoon slides risked DeepMind’s research budget. In any case, who knew what AI’s social impact would turn out to be: soaring inequality, or radical superabundance? -- 186 of 565 -- If the meeting resolved anything at all, it was that the singleton vision of AI was sadly optimistic. The gamble of inviting Musk to host the discussion had signaled Hassabis’s devotion to the ideal of a unified AI effort and his sincere fear of a race dynamic. But the profound divisions at the meeting demonstrated that Reid Hoffman was right. When a technology of infinite potential comes into view, there will never be a quiet consensus about who should control it. With so much at stake—power, money, scientific glory, the future of humanity, no less—conflict is unavoidable. A few months later, the email discussions between Musk and Sam Altman came to fruition. The conspirators teamed up to launch OpenAI, a not-for-profit lab explicitly aimed at breaking the Google DeepMind AGI monopoly, and Hoffman was among the backers.[26] OpenAI’s founders presented their venture as a crusade: The forces of light had entered the arena to combat darkness. But to believers in the singleton vision, OpenAI’s founding represented the Fall: the moment when the serpent brought evil into the garden, precipitating the expulsion from Eden. Hassabis, ever practical, was also angry in a simpler way. Musk and Hoffman had been invited to the SpaceX gathering in good faith. They had sat through the meeting, listened to DeepMind’s plans, and then used what they had heard to double-cross him.[27] “Maybe at that point I was naive,” Hassabis reflected. “I thought we would be having a proper philosophical discussion about what I could see coming. “The problem is, if you have a safety board, you want these interesting, amazing people so that you can get good insights. “And then the whole point of the safety board is you discuss everything and you get proper advice. “But if you have powerful people who are able to understand the impact of the technology, they’re not just going to sit on the sidelines. They won’t be content to just be your advisers. “So obviously what was actually going on was, our supposed advisers were really our rivals. “They were thinking, ‘How can I make use of that?’ -- 187 of 565 -- “Or, ‘Demis is right, but that means I’ve got to launch my own thing.’ ” OceanofPDF.com -- 188 of 565 -- A CHAPTER 11 P0 PLUS PLUS t the time of the SpaceX discussion, in August 2015, the pitchfork presentation was just one plank of Mustafa Suleyman’s agenda. Five years earlier, he had been the youngest and least important of the three founders; now he had eclipsed Shane Legg and often managed to present himself as Hassabis’s near-equal.[1] DeepMind’s research team, accounting for two-thirds of the company’s head count and a larger share of its budget, was firmly under Hassabis’s control. But DeepMind’s “Applied” side, charged with rolling out practical technologies, reported to Suleyman, who treated his part of the company as a quasi-autonomous fiefdom. Bursting with restless ambition, Suleyman had grand plans for his Applied division. His initial roles at DeepMind—helping Hassabis with fundraising and overseeing the project to build a fashion-recommendation algorithm—ceased to require his attention after the Google acquisition. He had therefore invented a new role, which was to steward DeepMind’s impact on society. This mission fell into two parts: assembling a policy team to shape the public debate about AI, and building practical technologies that would address social injustices.[2] As Suleyman was fond of saying, DeepMind’s purpose was to solve intelligence and use it to make the world a better place. The implication was that the solve-intelligence objective, under the command of Hassabis, was pointless unless there was an enlightened strategy for rolling AI out into the world. The formulation of that strategy was the purview of Applied, under the command of Suleyman. “Demis and I had conversations about how to impact the world,” Suleyman told a magazine writer around this time. “He’d argue that we -- 189 of 565 -- need to build these grand simulations that one day will model all the complex dynamics of our financial systems and solve our toughest social problems. I’d say we have to engage with the real world today.”[3] Suleyman’s impatience reflected his life experience and his politics. The call to activism—directed at both the DeepMind safety board and sometimes equally at Hassabis—hearkened back to his youth, when he had challenged fellow Muslims to make society better, commandeered a wheelchair to experience London as a disabled person would, and quit Oxford to lead the Muslim Youth Helpline. For a teenager who was given nothing, action was not a choice; it was a matter of survival. Not surprisingly, given his focus on inequality, Suleyman supported the political left; his thinking tracked closely with a book called Inventing the Future, which came out around the time of the SpaceX safety meeting.[4] According to one adviser, Suleyman identified so closely with this left-utopian manifesto that he felt he should have been the one to write it. For much of the twentieth century, the manifesto argued, left-wing thinkers had followed Karl Marx: They celebrated the liberating potential of technology. More recently, however, the left had lost touch with its roots. From the antiglobalization protests of the 1990s to the Occupy Wall Street movement after 2008, it had deplored modernization, calling instead for society to be “ ‘human-scaled,’ ‘tangible,’ ‘slow,’ ‘harmonious,’ ‘simple,’ ” all of which amounted to “an attempt to make global capitalism small enough to be thinkable.”[5] The trouble with this “folk politics,” the book noted, was that capitalism was not small, and flakily pretending otherwise rendered the left irrelevant. Calls for sustainable farming wouldn’t drive meaningful change. As the authors put it succinctly, “Goldman Sachs doesn’t care if you raise chickens.”[6] Folk-political technophobia was not merely ineffectual, however. It missed the vast positive potential of tech—the potential that could be realized if enlightened leaders seized control and shaped it. “Many of the classic demands of the left—for less work, for an end to scarcity, for economic democracy, for the production of socially useful goods, and for the liberation of humanity—are materially more achievable than at any -- 190 of 565 -- other point in history,” the manifesto insisted.[7] Suleyman agreed wholeheartedly, not least because his personal journey mirrored the book’s argument. He had escaped the folk politics of the Muslim Youth Helpline and the London mayor’s office precisely to become a technologist and make a broader impact on the world. Now that he was in a position to debate the grand questions of societal cohesion with the likes of Larry Page, he felt morally obliged to deliver on the premise of his career shift. By the summer of 2015, in other words, Suleyman was primed to carry out a grand experiment with AI: to show what happens when a messianic figure sets out to use the technology to improve society. His passion was beyond doubt. The obstacles he faced were formidable. • • • AS A FIRST demonstration of how DeepMind could do good, Suleyman resolved to help Britain’s National Health Service. The crown jewel of the postwar social compact, employing more than 1.5 million people and delivering free care for all, the NHS was also a shambles, groaning under the inefficiencies of antiquated data management. Many patient records existed only on paper; the NHS was said to be the world’s largest purchaser of fax machines. Beaten-up computers frequently crashed, losing what little digital data existed. It was impossible, under these conditions, to deliver the right care to the right patient at the right time. And without decent data curation, there was no prospect of improving medicine by deploying artificial intelligence. To figure out how he could help, Suleyman first consulted Hassabis. Both saw health as a huge opportunity for artificial intelligence. After all, image-recognition systems were clearly capable of interpreting medical scans, opening the door to earlier preventive care, saving lives as well as money. Working his network from his neuroscience research, Hassabis took Suleyman to meet some of London’s top medical professors. Next, Suleyman spent time visiting hospitals. This quickly confirmed that the health system was in dire need: It was stuck in the past century. The -- 191 of 565 -- worst thing was how clinicians responded. Occasionally, they resorted to pinging scans to one another via messaging apps; mostly they seemed resigned to the idea that nothing would ever work properly. When doctors and nurses ordered pizza on their mobile phones, they were using technology that was better than the stuff they used to care for patients. Suleyman empathized especially with the nurses, not least because his mother was one of them. One day, an HIV-positive patient had stabbed his mum with a needle, causing her to fear—unnecessarily, as it turned out— that she had been infected with the virus: Suleyman knew how tough the job was. Poorer, less educated patients suffered disproportionately as well. They struggled to navigate the NHS bureaucracy, to get time off for appointments, to remember the instructions handed down by harried clinicians who were always rushing to the next crisis. “Better-off patients would have a loved one at their bedside,” Suleyman recalled of his visits to the hospital wards. “They would be coordinating the care, ensuring that the doctors doing their rounds spent an extra few minutes with them, chasing down the overdue scan or figuring out why a blood result was unusual. “But the more you walked around the hospital, the more you noticed the elderly Bangladeshi immigrant whose family isn’t around, who doesn’t understand, who is just getting overlooked. “You see the whole world falling apart,” Suleyman concluded.[8] • • • A FEW WEEKS into his investigations, Suleyman met a doctor named Chris Laing at the Royal Free Hospital in North London.[9] Laing suggested that DeepMind’s first health project should be AKI—acute kidney injury. Shockingly, one in seven British hospital patients experienced kidney malfunction. Each year, for lack of timely treatment, around forty thousand died. Thousands of others needed a kidney transplant or lifelong dialysis, costing the NHS around £1 billion annually.[10] -- 192 of 565 -- Suleyman made a proposal. DeepMind would build an AI system to predict the onset of kidney failure. But Laing explained that prediction was not the immediate problem. The NHS already conducted blood tests that identified kidney trouble; the challenge was to get the test results to clinicians. Hospitals still relied on old-fashioned pagers to notify doctors. But the doctors had to find a moment to call the number on the pager, and by the time they did so, whoever had beeped them was often no longer available. Alternatively, hospitals depended on doctors and nurses to log on to clunky computers and scan hundreds of blood readings. Hours could go by between a test that flashed a need for urgent care and somebody noticing the emergency.[11] Plenty of AI executives would have backed off at this point. Getting blood-test results to doctors was a simple software challenge; it had nothing to do with artificial intelligence. But Suleyman cared about the problem first. If the solution did not initially involve AI, so be it. He would help to fix the software now, then deliver AI later. Together, Suleyman and Laing decided to build a messaging system to connect blood labs to clinicians. If a test identified a patient in danger, a notification would ping on the smartphones of the responsible nurses and doctors. The message would contain only the necessary information, including which ward and bed the patient was in. It would go only to the relevant team, so that other busy staff would not suffer data overload. DeepMind would build the technology for free. It would sign whatever data protection agreement the hospital provided. Laing believed that, if this first project went well, the idea of targeted, real-time alerts could be extended to multiple hospital conditions. Suleyman assembled a team of engineers and designers and dispatched them to the hospital. He told them to shadow the clinicians as they went on their rounds: to watch them fill their notebooks with hand scribbles; to see how they struggled with the ancient “cows”—the computers on wheels, which were buggy and unstable. “I wanted them to smell the hospital smells and hear the constant beeps and understand what it’s like to be in a depressed sensory state,” Suleyman -- 193 of 565 -- told me. “You have to know all of those things to create beautiful software that really works in that environment.”[12] Within a few weeks, the first version of Suleyman’s AKI alert system, called Streams, was being tested in the hospital. The blood lab zapped notifications directly to smartphones; patients who might have died got timely attention. For Chris Laing and his NHS colleagues, the speed of this transformation was a miracle—“an almost hallucinatory experience,” Laing called it. “It’s very difficult to articulate just how much of a step up this was,” he marveled; “that willingness to just get started.” Laing was an NHS veteran, and he had never seen a pilot project get off the ground this fast before. “It was definitely the highlight of my career,” he told me.[13] • • • BY THE END of 2015, DeepMind’s health work was advancing on several fronts simultaneously. With the Streams app making progress, Suleyman planned to return to his original vision: He would upgrade the diagnostic part of the kidney-alert system, replacing standard blood tests with AI that could predict kidney injury earlier. Meanwhile, he had forged a partnership with London’s premier eye hospital, Moorfields, to build an image- recognition system to diagnose macular degeneration, a cause of preventable blindness. There was a plan for AI cancer screening, too, and Suleyman recruited teams of young researchers from the junior ranks of academia to deliver on his vision. One newcomer recalled his delight at the DeepMind refrigerators, which were stocked with free soft drinks; there was also an intelligent beer fridge, programmed to unlock itself each day at five o’clock precisely. Naturally, the newcomer and his buddies cracked the beer refrigerator’s code. The new opening time was four o’clock. To lead his burgeoning health division, Suleyman hired Dominic King, a multitalented surgeon. Still in his midthirties, King had found time to work in government, publish in medical journals, and pick up a PhD in behavioral economics. He had also created a hospital software program -- 194 of 565 -- called Hark, which extended the idea behind Suleyman’s Streams app. Hark’s goal was to manage the scarce time of doctors and nurses: to direct clinicians to the most urgent cases, automating the process of triage. As part of the process of hiring King, Suleyman bought the rights to his software, paying the NHS handsomely for it. The fact that King agreed to join DeepMind was a testament to Suleyman’s momentum. King recalls a Friday in the autumn of 2015, when he appeared before a review panel of London’s top surgery professors. He had reached the end of his grueling post–medical school training, and the reviewers were to assess his performance and announce his next assignment. Fifteen years after qualifying as a doctor, the big prize would be a senior position at a prestigious London hospital—a job that came with slightly less suffocating pressure, plus the opportunity to conduct research. The surgery professors delivered their verdict. They were so impressed with King that they offered him a choice of two topflight positions. King emerged from the meeting and called his wife. Finally, there was light at the end of the tunnel, he told her: He would be able to spend time with their one-year-old daughter. The couple planned a celebratory dinner that evening. “The point of the dinner was to say, ‘I’m through all the crap now, hopefully things should be better,’ ” King said later. King ended the call and set off for his next meeting—with Suleyman. The two had been in touch for a few weeks, discussing Hark and Streams, and hoping there might come a time when the two systems could be implemented simultaneously. “I’ve enjoyed getting to know you,” Suleyman began. “We are doing some work that you seem to be excited about. “Why don’t you come and join us?” “I thought it was the most exciting offer ever,” King recalled later. “I didn’t go into that meeting thinking I would throw my hospital career out of the window. But it was a no-brainer when he asked me. “At that point in DeepMind’s history, it felt like Demis was the scientific genius, but Moose was the messianic figure,” King reflected. “He would tell you a story about how you’re going to reform health care and then -- 195 of 565 -- revolutionize energy and then do other crazy things. He’d mention that he’d just been with some famous person like Barack Obama. “Moose left university after a couple of years. “I collected five degrees. But I was willing to drop everything.”[14] • • • IN FEBRUARY 2016, Suleyman showed up at Soho Farmhouse, a swanky members-only resort in the English countryside near Oxford. Google’s public outreach team had invited him to address a group of rising policy stars, and he laid out DeepMind’s plans to revolutionize the use of data in the health system.[15] Suleyman delivered his characteristic call to arms, but this time the response was not enthusiastic. Instead, the audience was shocked. The previous year, a plan to create a centralized database of medical records in England had caused a public backlash. What if the data fell into the hands of pharmaceutical companies and insurers, who sought to extract profits from it? Had patients consented? What if the data leaked? How robust was the governance surrounding it? The idea of collaborating with a foreign tech behemoth like Google was especially neuralgic. A new kind of technophobia was stirring, centered not on folk politics but on an opposition to surveillance capitalism. “Don’t you realize you have miscalculated?” somebody asked Suleyman. “We British are very patriotic about the NHS.” “I am British and I am very patriotic about the NHS,” Suleyman retorted. He thought of his mother. This was personal. Suleyman showed his audience an example of the archaic data storage he had encountered in the health system: a paper list of patients, complete with sensitive health information on each of them. Advocates who objected to modern data management systems were implicitly endorsing this hard- copy alternative. “This paper was found on the floor of a supermarket, near the hospital I am working with!” Suleyman announced. “This is the system you are -- 196 of 565 -- defending! “It’s chaos!” he continued angrily. “The criticism of electronic data is the luxury of people who have never needed to go to hospital. The luxury of people who have never been sick.”[16] “I’m like, you’re telling me about data leaks?” Suleyman said later. “What do these people not realize?”[17] It was all very well advocating patient privacy, but there was a balance to be struck. You could make an individual’s data absolutely private by sharing it with nobody. But then you would forgo the chance to make that individual healthier. “Everyone thinks changing the system is too difficult,” Suleyman continued. “But the idea that the status quo has to be accepted is just not in my lexicon. “When the nurses are like, yeah, we’re using these old computers on wheels, I’m like, well, why doesn’t the hospital buy iPhones for you? “So then they tell me, oh, the purchase order has gone in, it’ll be six to nine months. “I’m like, we’ll bring twenty in next week! “So we do that, and suddenly the nurses can do their jobs without using goddamn fax machines!” I pictured Suleyman handing out phones like some Silicon Valley Santa Claus. “Neither of us understood how to navigate the health care stuff,” Suleyman went on, referring to himself and Hassabis. “So every night there would be a conversation. ‘OK, what are the political implications of this project? Who can we get on our side? Should we talk to the newspapers? What if we gave our technology away for free for the first five years or whatever.’ “It was naivety combined with this relentless push,” Suleyman remembered.[18] • • • -- 197 of 565 -- SULEYMAN ABSORBED the message from the Soho Farmhouse exchange: People would be suspicious of an American-owned tech group with access to NHS data. But he already had a plan to deal with this problem: a health- specific version of the ethics and safety oversight group that he had demanded from Google. To build trust in DeepMind’s NHS agenda, Suleyman envisaged an Independent Review Panel, consisting of respected health experts. He would allow the panel’s members almost unfettered access to his team. And he would invite them to publish an annual assessment of Applied’s progress—the good stuff but also the errors. Google’s lawyers hated Suleyman’s plan. As far as anybody could remember, no other company had ever given outsiders the keys to the kingdom while allowing them to speak out freely.[19] Suleyman was refusing even to have the reviewers sign a nondisclosure agreement. “Good people will do the right thing,” he maintained. “If we expect them to trust us, we should trust them.” “The goal was to break down the forces that led the NHS to see corporations as scary profiteers, and the forces that led the best technology companies to see the NHS as hapless idiots,” Suleyman said later.[20] Suleyman placated the lawyers and pushed ahead with his gamble, persuading an impressive list of public figures to join his Independent Review Panel.[21] At the end of February 2016, shortly before AlphaGo’s defeat of Lee Sedol, the creation of DeepMind Health was announced publicly. The initial work on kidney failure and the formation of the independent panel were part of the rollout. More than two dozen doctors and nurses were now using the Streams prototype, receiving an average of eleven emergency notifications daily. The first feedback arrived: an admiring article in the Guardian newspaper. It quoted a distinguished surgeon and former health minister, who noted that the nation’s beloved National Health Service cried out for innovation.[22] An aging, sickening population, together with an overburdened public purse, rendered tech-driven efficiencies in health care indispensable. -- 198 of 565 -- Of course, giving an American tech giant access to NHS data could “raise ethical questions,” The Guardian also noted. But the newspaper acknowledged that DeepMind had anticipated this problem. The company was promising that patient data would never leave the UK, nor would it be used for Google products. Furthermore, these pledges seemed credible because of the new Independent Review Panel, which would watch over DeepMind’s behavior. DeepMind was striking a sensible balance between patient privacy on the one hand and patient health on the other. Suleyman inhaled deeply. His clash with Google’s lawyers had paid off: The governance experiment was creating the cover he needed to modernize the health system. If the public mistrusted the Silicon Valley behemoths, perhaps the answer lay in radical transparency, and if DeepMind demonstrated that transparency could work, it might become standard for every tech company seeking to deploy AI responsibly. Finding a way to reconcile technological progress with other cherished imperatives would have consequences for society writ large. To Suleyman’s lieutenants, their boss was pioneering a new, enlightened form of capitalism.[23] • • • A FEW WEEKS LATER, on May 3, 2016, Suleyman stepped out of a meeting to take a call. “I felt a big hole open in the ground,” he said later. The call was to warn him of a hit job. The Daily Mail, a popular and populist tabloid, was about to splash its front page with an all-caps banner headline: GOOGLE HANDED PATIENTS’ FILES.[24] “Up to 1.6 Million Private Records Passed on without Permission in NHS Deal with Internet Giant,” the subhead added. Suleyman felt strongly that the charge sheet was garbled. Google was not allowed to see any of the patient records handled by the Streams app: DeepMind had built a special data storage infrastructure and denied access to its parent company.[25] The critics were also saying that the legal agreement covering the data was skimpy. But the contract that DeepMind had signed was the standard template that hospitals provided to hundreds of -- 199 of 565 -- outside suppliers of technology.[26] Finally, privacy activists were complaining that the Streams data had not been anonymized, but this confused two kinds of NHS collaboration with tech companies. In research collaborations, patient identities were of course removed. But in clinical collaborations, patient identities had to be retained because the whole point was to flag threats to specific individuals. Software systems, CT scanners, and other hospital technologies used patient identities in order to serve patients. The Streams app was just one example.[27] Suleyman called Hassabis to warn him of the Daily Mail onslaught. He assured him that DeepMind was innocent on all charges. The Streams project had become a target, he insisted, because activists failed to acknowledge the trade-off between privacy and health, and because journalists could not resist an opportunity to bash American tech giants. Without DeepMind’s ties to Google, the Daily Mail would not have run the article. Hassabis was furious. This was the first reputational crisis that DeepMind had suffered. For good reasons, given breakthroughs such as Atari and AlphaGo, most press coverage was adoring. Recently, a headline about Hassabis in the British Observer had consisted of twelve words, and two of those words had been “genius” and “superhero.” “There is a look in his eyes of what I can only describe as radiant purpose, almost childlike in its innocence,” the profiler had said of him.[28] The allegations of a data breach came at a bad moment. A few months earlier, DeepMind had tried to kill off Elon Musk’s rival AI laboratory, OpenAI, by dangling big pay packages in front of Musk’s star scientists. But Musk and Sam Altman had managed to convince their researchers that OpenAI represented the forces of light, and that DeepMind might indeed be evil. “Everyone feels great, saying stuff like ‘bring on the DeepMind offers, they unfortunately don’t have “do the right thing” on their side,’ ” Altman had messaged Musk, in December 2015.[29] A slew of talented investigators had signed on with OpenAI, including Ilya Sutskever, Geoff Hinton’s most famous protégé.[30] Negative headlines about DeepMind plots to steal NHS data would only bolster OpenAI’s recruitment. -- 200 of 565 -- Suleyman got to work. If the critics wanted a more detailed legal contract, he would produce one so comprehensive and airtight that they would have to applaud it. The standard NHS template said nothing about the management of cloud-based data, for example. Suleyman fixed that. The standard template stated the purposes of the data sharing in a couple of bullet points. The new document detailed all potential uses, both permitted and forbidden. After six months of legal deliberation, the original two-page agreement was replaced by a fat document. Suleyman published the contract with minimal redactions and invited advocates to read it. The advocates acknowledged the advance. But they also had a different criticism. Suleyman’s radical experiment in data management required “participatory consultation.” Suleyman was happy to jump through this hoop, also. He told his team to put together some suitable events. He announced that this was a priority. His staff had other priorities to attend to. When you worked for Suleyman, everything was a priority. “We had this prioritization system, P1, P2, P3,” a former member of Applied explained. “P1 means that’s the important thing. P2 is like, yeah, we’ll get to it when we get to it. P3 is, we’ll probably never do it. “But then everything is P1, so we also had P0. That means there’s a fire going on. Drop everything and go do it. “At DeepMind Applied, a lot of things started getting P0. And so then this unofficial designation was created: P0 Plus. And then P0 Plus Plus. “As in, it’s not just a fire; it’s a bigger fire! Everybody get on it! “Internally, when an instruction came down, we’d be like, is it P0 or is it P0 Plus Plus? “Then there was the traffic light system. If you’re on track with your targets, your light is Green. If you are a little bit behind, it is Amber. And if it’s not going well, it’s Red. But then we got a new designation, which was a Double Red. “And if it was Double Red, it was probably Double Red P0 Plus Plus. At which point people would get the message that this really was a priority.” -- 201 of 565 -- Having understood that participatory consultation was somewhere in this upper band, Suleyman’s team invited charities and patient groups to bring chronically sick people from all over the country to visit the DeepMind office. They came in groups, patients and family members and carers, perhaps 150 people at each session. An artist was on hand to create a conceptual piece, capturing the spirit of the proceedings. “It was a huge performance. Patients came from everywhere. We had team members to meet them at the train station,” Dominic King remembered. “People with walking sticks. Blind people. “This is the kind of thing you have to do when you are in the business of care delivery,” King went on. “It was beautiful. “But it wasn’t a normal thing for a research outfit like DeepMind. You watched this group of visitors being helped into the building, and you got the feeling that the company might be splitting.”[31] • • • WHATEVER THE TENSIONS within DeepMind, the headlong expansion of its health work was impressive. In remarkably short order, Suleyman had mounted a multipronged attack on the health system’s backward IT; the millions of Britons who longed for somebody to fix the NHS might have celebrated his efforts. The pilot project on kidney injury was a case in point. Suleyman had broken through the apathy that bedeviled the hospital system; he had protected patient privacy and set a new standard in transparent corporate governance; and when an anti-Google backlash hit him in the face, he had done everything possible to assuage the critics. He had done this, moreover, at zero cost to British taxpayers. Down the road, DeepMind planned to charge the health service a share of the savings produced by its technology. But for the first few years, it would charge nothing. At the time Suleyman was hit with the data backlash, the kidney work was just getting started; he therefore had limited evidence on the impact of -- 202 of 565 -- his project. But an independent evaluation of the Streams app, completed in 2019, found that it caused medical teams to help patients in minutes rather than hours; meanwhile, the share of urgent cases that went unnoticed fell by three-quarters, from 12.4 percent to 3.3 percent.[32] Freed from the burden of logging on to clunky computers, nurses reported spending more time with patients, and the cost of treating a person with acute kidney injury came down, as early intervention reduced complications. In 2019, DeepMind also announced the results of its project to diagnose kidney damage earlier, with the help of AI. After collecting scrupulously anonymized patient data—because of the backlash in Britain, DeepMind did this in the United States—Suleyman’s team had trained a model that predicted the onset of acute kidney damage one or two days earlier than the usual blood tests could.[33] The work on eye disease and breast cancer also showed promise. About 4 percent of over-sixties in Europe develop the vision-threatening form of macular degeneration. In 2018, a DeepMind paper in Nature Medicine unveiled an image-recognition model that matched top doctors in scrutinizing retinal scans for early signs of trouble.[34] Meanwhile, DeepMind built a system that outperformed human radiologists in interpreting mammograms; in fact, it was roughly as good as having two separate radiologists examine each image. Given the shortage of human professionals, this was a breakthrough: In Britain, the Royal College of Radiologists reported that the country was short of more than a thousand specialists.[35] Both the eye scans and the cancer scans held out the dual hope of reducing the burden on doctors and facilitating a vast scale-up in diagnosis. Every public health expert agreed that DeepMind was pushing in the right direction. “We need something like the Streams app. We need it worldwide. And we don’t have it,” Eric Topol, one of America’s most cited medical researchers, told me.[36] Topol recalled that, when he became a doctor in the 1970s, a first clinical appointment with a patient was scheduled for a minimum of one hour. By 2019, that hour had been squeezed down to twelve minutes in the United States and nine minutes in -- 203 of 565 -- Britain, and a big reason was that clinicians spent inordinate amounts of time grappling with clunky IT systems. The medical-industrial complex was crying out for artificial intelligence to do the data sorting and the image scanning and the timely alerting. The case for an AI-driven health care revolution was overwhelming.[37] But the tragedy was that DeepMind’s revolution was stillborn. The Daily Mail article triggered two official inquiries into the alleged mishandling of data, and although neither investigation concluded that DeepMind had done wrong, each kept the allegations in the public spotlight for a year or so.[38] In 2018, moreover, the public anger at big-tech surveillance reached a peak: Cambridge Analytica, a political consulting firm, was found to have harvested data on up to eighty-seven million Facebook users without consent, using it to target voters during Britain’s referendum on quitting the European Union and during the 2016 US elections. Because of the backlash against big tech, DeepMind and its NHS partners lost their appetite for further engagement, and the momentum drained out of Suleyman’s reform effort. The Streams app was never upgraded as it might have been: by connecting it to DeepMind’s AI-based AKI prediction system, or by expanding it to deliver the broader triage that Dominic King’s Hark app had envisaged. Chris Laing’s ambition to extend the idea of real-time alerts to diseases beyond AKI was never implemented, either. The final blow came when Streams was closed down. “An AKI app will not sustain a strategic partnership, long term,” Laing explained. “If a wider suite of clinical applications had been deployed, things might have turned out differently.”[39] The fate of DeepMind’s retinal technology was equally sobering. For this project, DeepMind had partnered with an outstanding ophthalmologist named Pearse Keane, who had done postdoctoral work in the United States before taking up a lectureship at University College London.[40] The combination of the world’s top AI lab and a top medical technologist was a resounding success: The resulting algorithm, unveiled in the 2018 Nature Medicine paper, seemed likely to prevent blindness in tens of thousands of patients per year, just counting the United Kingdom. But seven years after -- 204 of 565 -- publication, the algorithm had yet to be deployed; worries over data and the Daily Mail hit job were again part of the reason. “If there were any negative news stories about Google or privacy, someone at a senior level in the hospital would forward me the article and say, ‘Any thoughts?’ ” Keane recalled. “Anything like that, despite not being related to my clinic or my specialty, made it orders of magnitude harder to do stuff.” [41] Looking back, there were two possible interpretations of Mustafa Suleyman’s triumph-cum-tragedy with health care. The first held that the adventure was doomed from the start. “It was a catastrophic error to pick health,” one former DeepMinder argued. “I mean, bonkers to pick the most sensitive and politically controversial industry and go there.” A few years later, Dario Amodei, the founder of the rival AI lab Anthropic, added a conceptual framing to this pessimistic view. Economists often observe that investing in a factor of production is useful only if complementary inputs are available. For example, hiring additional pilots won’t help an air force that has no spare jets to be piloted. By the same token, before committing resources to a project, AI executives should ask themselves whether additional intelligence will unlock progress. If there are other limiting factors—bureaucratic institutions, squeamishness about privacy, public suspicion of tech firms—a sensible AI lab might focus its efforts on an alternative challenge. “In the AI age, we should be talking about the marginal returns to intelligence,” Amodei proposed.[42] Arguably, the marginal returns to intelligence in the NHS were always likely to be limited. There was a second possible interpretation, however: that DeepMind Health might have succeeded if Suleyman had played his cards differently. On this view of history, Suleyman was right to throw his arms around the NHS—especially given the huge social benefits that seemed possible. On this view, too, a future attempt at revolution may succeed, turning health care into Exhibit A for beneficial artificial intelligence. This counterfactual speculation matters because, at the time of writing, AI for medicine is making a comeback. Suleyman is leveraging his position as chief executive of Microsoft AI to reassemble bits of his old health team, and Pearse Keane -- 205 of 565 -- has helped to launch a start-up to prevent blindness. Of course, it is impossible to prove this second and more optimistic view. But it seems at least plausible. The reason for optimism lies paradoxically in that Double Red P0 Plus Plus confusion. Even if one grants that some sort of privacy pushback was inevitable, given the politics of health care, the dysfunction within Suleyman’s Applied shop was not inevitable. “I was managing a portfolio of businesses and I did too many things,” Suleyman confessed later. “I’ve tried to learn from that. It was definitely a significant weakness.”[43] In 2016 and 2017, everything was a whirl: Suleyman had a habit of focusing intensely on a new project, then appearing to turn cold on it.[44] He fired off fresh commands in all directions, willing new initiatives into being; then he would get on a plane and disappear, and forget what orders he had issued. [45] To keep track of the myriad tasks that he assigned to his underlings, he appointed a merciless enforcer of his edicts, but the edicts issued on a Tuesday often undermined the edicts issued the previous Thursday. When, unsurprisingly, work was not completed to the standards he expected, Suleyman would berate staffers aggressively in front of their colleagues, or pepper them with late-night all-caps emails. “He was tough, but it was coming from a good place,” one lieutenant pleaded; my sense from conversations with dozens of staffers is that there was much truth to this verdict. But something else was true as well. Suleyman frequently behaved as though he was exhausted and distracted. And so in fact he was. Unbeknownst to nearly everybody at DeepMind, Suleyman had embarked on a secret negotiation with Google. OceanofPDF.com -- 206 of 565 -- I CHAPTER 12 THE AGENT AND THE TRANSFORMER n August 2015, the month of the SpaceX safety meeting, David Silver married his longtime girlfriend and flew out to Sri Lanka. The honeymoon couple checked into a beach hotel, but Silver’s mind was whirring with his work on AlphaGo. The first night he was too jet-lagged to sleep. “I was up all night, thinking,” he recalled. “And it just sort of crystallized in my mind: the idea for AlphaGo Zero.”[1] The Zero that popped into Silver’s head referred to zero human knowledge. He would build a new version of his Go system, but this time he would skip straight to the second phase: Rather than training the agent initially on expert human games, he would have it learn exclusively by playing against itself—by experimenting with random moves and discovering which ones generated a reward signal. The new Agent Zero would stand as a triumph for Silver’s scientific specialty, reinforcement learning; it would also mark a leap toward machine autonomy. Yesterday’s deep-learning systems had ingested data that represented human knowledge, curated by human programmers. Tomorrow’s reinforcement- learning agents would rely on data that they generated themselves, by acting in the world and gathering experiences.[2] A year or so passed before Silver could attempt to realize this vision. After his honeymoon, he went back to work on AlphaGo, frantically preparing for the face-off against Lee Sedol in South Korea. But following AlphaGo’s victory in March 2016, Silver reassessed his options. He needed a new focus for his research, and he contemplated either applying AI to science or developing agents that would set new benchmarks in cognitive -- 207 of 565 -- tasks, such as exploring decision trees and planning. “So many choices, so much to do,” he mused in his diary. Then he resolved his dilemma with a story from his trip to South Korea. Some way into the tournament with Lee Sedol, a Go expert had approached Silver with tears in his eyes. “You’ve created the most beautiful thing I’ve ever seen,” he told him. “To create intelligence so beautiful that it makes observers cry with joy: that seems a goal worth shooting for,” Silver wrote in his journal.[3] A year and a half later, in October 2017, Silver and his colleagues duly unveiled a new and beautiful intelligence: a shockingly powerful AlphaGo Zero.[4] Learning only from self-play, the system outclassed its predecessor by a mile. By unshackling itself from human wisdom, the model had discovered stratagems unknown to mortal players, arriving at a new understanding of Go’s mysteries. Then, in December, Silver and his team surpassed themselves again, rolling out an even better iteration. The new new agent, its name now abbreviated to AlphaZero, could play not just Go but also chess and the Japanese version of chess, shogi.[5] The honeymoon inspiration that had arrived in the middle of the night had “worked like a dream,” Silver said later.[6] For Hassabis in particular, the chess prowess was thrilling. Two decades had elapsed since Deep Blue’s famous victory over Garry Kasparov, and chess programmers had spent the intervening years stuffing human mastery into chess engines. Top systems such as Stockfish incorporated giant databases of opening sequences, both classic and obscure, plus terabytes of endgames. Plenty of chess experts doubted there was scope for improvement—perhaps the game’s deepest enigmas had been unraveled, such that chess had been “solved,” like the Rubik’s Cube. In 2016, Hassabis had discussed this possibility with Murray Campbell, one of the engineers who had worked on Deep Blue. Was there some hidden dimension of chess, still waiting to be discovered? “Both of us were unsure,” Hassabis remembered.[7] Silver’s reinforcement-learning wizardry proved that chess had not been solved—far from it. Humans had not understood how little they had -- 208 of 565 -- understood: An infinity machine could open up new vistas of knowledge. At the start of its training, AlphaZero developed the same opening sequences that were used by human pros. But it soon found flaws in some of these routines; first it discarded them, then it invented better ones. AI stood in judgment over centuries of human wisdom, vindicating some verdicts and tossing out others.[8] The routines that AlphaZero invented felt strikingly human. Earlier chess engines had proved that you could neutralize human creativity with relentless algorithmic precision. AlphaZero showed that a machine could generate a fluid and attacking kind of play, advancing boldly, sacrificing material freely, and prioritizing position. “AlphaZero had a dynamic, open style like my own,” Garry Kasparov himself marveled.[9] Silver’s creation played like a grandmaster, yet it had not studied the masters’ play. The difference was that the machine operated at a higher level. AI experts had long recognized that tasks that humans perform instantly and unthinkingly, such as recognizing an image, are hard to program, because humans are not conscious of the steps taken to accomplish them: This is known as Moravec’s paradox. Meanwhile, tasks that humans perform slowly, consciously, and with considerable effort are traditionally regarded as easier to program: A simple pocket calculator can find the square root of anything. “The main lesson of thirty-five years of AI research is that the hard problems are easy and the easy problems are hard,” the psychologist Steven Pinker famously observed in the mid-1990s.[10] Thanks to David Silver, however, DeepMind had amended Moravec’s dictum—not once, but two times. The first revision had come when AlphaGo mimicked intuition, suggesting that an apparently impossible programming challenge was in fact soluble. The second amendment had come with AlphaZero, which proved that a game that humans played consciously and laboriously—and which they therefore thought they could program—had not actually been solved definitively. Legions of chess experts had hand-coded chess knowledge into Stockfish, but they had succeeded only up to a point; meanwhile, Silver’s self-sufficient system delivered a much stronger performance. The dual lesson, Hassabis reflected, -- 209 of 565 -- was that unconscious knowledge is more programmable than it seems; and conscious knowledge is less solid than it seems. Known knowns can turn out to be unknown. To understand what we understand, we need autonomous machine intelligence.[11] • • • THE QUESTION WAS what AlphaZero meant, not just for humans and their cognitive limits, but rather for the road to artificial general intelligence. For Silver, this breakthrough for reinforcement learning marked a revolution. AlphaZero had mastered three different complex games from scratch, without human instruction or human data. The old obstacle to AI—the impossibility of devising a deductive system to classify and explain the world—had been bypassed. The problem of induction—the challenge of finding patterns in an infinity of data—had been vanquished. “We’re no longer constrained to systems with predefined rules,” Silver observed.[12] Reinforcement-learning systems like AlphaZero seemed set to conquer all kinds of complex, real-world challenges. For DeepMind’s competitors, reinforcement learning appeared less compelling. From his base at Facebook and New York University, Yann LeCun derided RL as the “cherry on the cake”—the cake being deep learning. Silver and his DeepMind colleagues hit back with a slide showing a cake topped with a dense lattice of cherries: The fruit dominated the bakery. Meanwhile, from his base at DeepMind’s other main rival, OpenAI, Ilya Sutskever was on LeCun’s side of this divide, much as he had been back in 2015, when he had debated the next steps for AlphaGo with Silver. Sutskever was at least as excited as Silver was about the coming revolution in AI. But the way he saw things, it was new types of neural networks, not RL, that were going to change everything. To understand this continuing tension between reinforcement learning and deep learning, start with a peek under the hood of AlphaZero. In one sense, DeepMind’s triumph was exactly as the company described: a fantastic demonstration of the power of autonomous, agentic learning. In -- 210 of 565 -- another sense, however, AlphaZero demonstrated the progress that LeCun and Sutskever stressed—the progress in the design of neural networks. Ever since the 2000s, the pioneers of deep learning had grappled with a conundrum. In theory, the more layers of neurons they added to a network, the more sophisticated it would become: A larger brain could learn more. In practice, however, too many layers caused the network to go haywire. The reason recalled the party game of telephone: When a message is relayed from person to person, the longer the chain, the more garbled the meaning when the message reaches the last partygoer. The deep-learning version of this garbling was known as the problem of “vanishing gradients”: As a signal passed down through many layers of neurons, it grew fainter and fuzzier. In 2006, Geoffrey Hinton’s breakthrough paper on deep belief networks had begun to tackle this issue, allowing neural networks to grow from perhaps three layers to about ten or so. Then, at the end of 2015, a trio of Microsoft researchers came up with a new learning architecture known as a residual neural network.[13] This invention solved the telephone problem for the time being, and it lay at the heart of AlphaZero. The residual neural network, or ResNet, packaged two ways of making larger neural networks work better. First, its neurons transmitted the gap, or residual, between the signal received from the previous layer of neurons and the output that they sent on to the next layer. This cut the computational burden; it was a bit like asking a book editor to forward notes about how a manuscript should be fixed, rather than forwarding the full, rewritten pages. Second, the residual network was equipped with “skip connections,” the equivalent of express lanes, which linked the top layers in the neural network to the bottom ones, bypassing some in the middle. Thanks to these express lanes, signals could speed through the network, reaching the most distant layers intact. Before the advent of this residual architecture, increasing the size of a network had been self-defeating: If you added new layers, you undermined the performance of the old ones. Now, courtesy of Microsoft’s innovation, learning was spread evenly through all parts of the network—even a network with 150 layers or so. -- 211 of 565 -- Thanks to this advance, the neural network under AlphaZero’s hood was much more powerful than AlphaGo’s. Working with networks that were just twelve layers deep, AlphaGo had needed two separate systems: the policy network, for move selection; and the value network, for position evaluation. In AlphaZero, by contrast, the policy and value functions could be handled by a single, powerful, forty-layer network, creating a more general intelligence. Seen in this light, AlphaZero hearkened back to Atari. Both systems mastered multiple games, demonstrating that agentic trial and error could deliver impressive versatility. At the same time, however, both systems achieved their advances in reinforcement learning thanks to progress in deep learning. Back in 2013, the Atari system had leveraged a particular kind of network, known as a convolutional neural net, which was built to excel at image recognition. Three years later, AlphaZero was a triumph both for RL and for the updated version of the convolutional neural network— the residual neural net design from Microsoft. At the end of 2017, when AlphaZero was unveiled, the question of which part of the triumph to stress was perhaps the largest dilemma in AI research. Because of his PhD training in Alberta, not to mention the success of Atari and AlphaGo, Silver naturally presented AlphaZero as a reinforcement-learning breakthrough. “Once we have a system that can learn for itself, there is no ceiling anymore,” he reflected in 2018. “These systems can learn for themselves to build up knowledge, accumulate knowledge, and learn everything there is to know.”[14] In a mark of his optimism about his specialty, Silver had recently encouraged his PhD supervisor, Rich Sutton, to join DeepMind. Together with a handful of fellow RL experts, Sutton had opened a new DeepMind office in Edmonton, Alberta, cementing the company’s lead in designing agents that learned through experience. Hassabis shared Silver’s enthusiasm for reinforcement learning, albeit for his own reasons. Thanks to his PhD in neuroscience, he had always thought that artificial general intelligence would depend on integrating multiple components: perceptual systems built on convolutional neural nets; -- 212 of 565 -- various kinds of memory, both long-term and short-term; search algorithms that simulated what humans think of as planning. Because of this broad- church assumption, Hassabis had assembled a research team spanning multiple specialties: The idea was to bet on several approaches, and then to put resources behind those that showed promise. Yet although Hassabis was not committed to reinforcement learning as the one true path, he could see that Silver’s research had yielded fabulous results. As a strategist, moreover, Hassabis understood the advantage in locking in DeepMind’s dominance of RL. The more he faced competition from the likes of Facebook and OpenAI, the more he wanted to tighten his control of a key square on the chessboard. On the other side of the Atlantic, Hassabis’s rivals resembled DeepMind in some ways, but they differed in the detail. They too experimented with a variety of research paths. Despite LeCun’s cake-and-cherry metaphor, Facebook had tried to build a Go system, incorporating the same techniques that went into AlphaGo. Likewise, OpenAI built RL agents to play games, goaded on by Elon Musk, who remained obsessed with Hassabis’s lead in this area. The latecomers’ copycat behavior was facilitated by the practice of openly publishing research: The labs encouraged their scientists to unveil their findings in journals, believing that a top-notch publication trail was essential to recruiting the best talent. But despite these commonalities, Facebook and OpenAI neither believed in nor excelled at reinforcement learning to the extent that Silver did. Nor could they aspire to close the gap. Silver’s prestige and Google’s deep pockets pretty much ensured that the best RL researchers chose to work at DeepMind. All of which meant that Hassabis’s strategic calculation was logical. So long as progress toward artificial general intelligence involved reinforcement learning, DeepMind could maintain its lead by cornering the market in this part of the AI supply chain. But Hassabis’s strategy involved a risk: that a surprise breakthrough in some other area of AI might undermine RL’s importance for long enough to allow a rival to challenge him. -- 213 of 565 -- • • • AS PETER THIEL had said of Hassabis, geniuses are seldom brilliant in a general way: They tend to be brilliantly suited to a particular mission. For Hassabis, the mission was founding a company to go after AGI. For Silver, it was to push the frontier of reinforcement learning. For Vlad Mnih, it was fusing reinforcement learning with deep learning. Meanwhile, for Ilya Sutskever, the opportunity that sat most squarely in his wheelhouse was the appearance of a new type of neural network in June 2017. It was called the transformer, and it would revolutionize AI. Large chunks of Sutskever’s research, including his PhD, had primed him for this moment. If Microsoft’s residual architecture had made it possible to build larger networks, the transformer architecture addressed another long-standing conundrum in deep learning. Many kinds of information—speech, text, videos, stock-price charts—cannot be understood by examining each unit of information in isolation. Rather, each item must be interpreted as part of a sequence. If I say, “The dog ran outside because it saw a car,” you cannot understand the pronoun “it” unless you have absorbed the first half of the sentence. Likewise, if I watch episode three of a complex crime drama, I won’t understand the plot if I haven’t watched the first installments. Convolutional neural networks, which are designed to interpret images, exploit spatial dependencies: neighboring clusters of pixels contain related clues to one part of an image—a cat’s eye, for example. Similarly, transformers make sense of temporal dependencies: Sequences of information play out over time, and the network needs to understand how each phrase or plot twist relates to the other ones. Sutskever’s doctoral thesis focused on an earlier approach to temporal dependencies, known as a recurrent neural network. (Not to be confused with the residual neural network, discussed above, from Microsoft.) By scrutinizing the words in a sentence sequentially, and remembering past words as it considered the next one, a recurrent network aimed to penetrate the links between each word and its antecedents. The basic version of this architecture, invented in the 1980s, achieved little. Like a human with -- 214 of 565 -- severe short-term memory loss, the system couldn’t grasp links that were even slightly far apart—the subject of a sentence and a verb that came a few words later, for example. But around 2012, when his colleague in Geoff Hinton’s lab, Alex Krizhevsky, was coaxing performance out of his convolutional neural network, Sutskever was doing the same for a recurrent neural network. He fortified it with high-performance GPU chips, added extra layers, and came up with algorithmic tweaks to improve its competence at tasks such as language modeling. Yet he never quite managed the equivalent of the ImageNet breakthrough.[15] Completing his doctoral work in 2013, Sutskever was for recurrent networks and sequential data what David Silver was for reinforcement learning and Go: a leading authority who had not completely cracked his chosen problem. After his PhD, Sutskever continued to wrestle with the challenge of temporal dependencies. In 2014, he and two Google coauthors, Oriol Vinyals and Quoc V. Le, came up with an innovation known as the “sequence-to-sequence” framework. Rather than just mapping a static input onto an output, as basic deep learning did, the framework took sequences that played out through time and mapped them onto one another. It was like a professional interpreter who listens to a full sentence, digests its meaning, and maps it onto the equivalent sentence in another language. To pull off this feat, Sutskever and his colleagues began with an established method known as word embedding. They built a map of the English language with hundreds of dimensions, and embedded words within it. (Of course, humans accustomed to living in three dimensions, or four if you add time, find it impossible to visualize a map with hundreds of dimensions. But computers cope with such maps easily.) In this enormous language map, a word such as “computer” would be close to other associated words: In one dimension, it might be close to “keyboard”; in another, it might be close to “electricity”; in a third, it might be close to “semiconductor.” It was a bit like understanding an idea or a person in terms of multiple possible contexts. In one context, an individual may be an astrophysicist. In others, she may be a keen amateur baker, a daughter, a skier, a resident of Chicago, and so forth. -- 215 of 565 -- Once Sutskever and his colleagues had constructed their map, the computer could see the linkages between every word in the language and every other word. Or, more precisely, it could see the linkages among “tokens”—a token being a word, a syllable, a punctuation mark, or some other fragment of language, depending on what the system could most easily learn from. For example, by splitting the word “unbelievable” into three tokens—“un,” “believ,” “able”—the model could absorb the meaning of common prefixes and suffixes, thus learning to handle other compound words that had not come up in its training set. Knowing that “bio” means “life” might help the system to interpret longer words: “biology,” “biography,” “biome.” The next step for the system was to assign each token a mathematical value, or a “vector.” The magic here was that a vector contained more information than a token: It located the fragment of language on that multidimensional map, recording its many associations and connotations. By transforming simple tokens into rich mathematical coordinates, Sutskever equipped his system with the beginnings of an understanding of linguistic nuance. It was like creating a social-network graph, but for words. Connections exist not just between relatives but between friends; and not just between friends, but between friends of friends. A vector encoded all these linkages and meanings. Finally, Sutskever and his colleagues added one more innovation. Having understood all the possible connotations of each word, the sequence-to-sequence program had to decide which connotations mattered given the specific context. To tackle this challenge, the system imitated those professional interpreters who listen to a full English sentence before rendering it into French. As it ingested a sequence of token vectors representing an English sentence, it compressed them into a dense mathematical summary: a “context vector.” Then, using this context vector as a guide, the system refined its word choices as it outputted a translation. At the end of 2014, when Sutskever showed up at the NIPS conference to give his “success is guaranteed” lecture, the sequence-to-sequence framework explained much of his optimism. In addition to translation, -- 216 of 565 -- Sutskever regarded sequence-to-sequence modeling, or Seq2Seq, as a breakthrough for any challenge involving temporal dependencies. A Seq2Seq system would soon be able to summarize: It would take a long passage, transform it into rich vectors, then use its grasp of connotation and context to generate a cogent précis. Similarly, a Seq2Seq system might be capable of conversation: It would take in a question, use word embeddings and all the other tricks to make sense of it, and then output an answer. Of course, the outputs would be clunky to begin with, but Sutskever felt that he was on a path. Bigger networks, additional data, a few more flashes of algorithmic genius: In time, computers would master sequential information as proficiently as they already handled vision. In May 2015, a flash of algorithmic genius duly appeared, courtesy of three researchers at the University of Montreal, led by Yoshua Bengio, a celebrated pioneer of deep learning. Their paper’s key proposal came to be known as “attention.”[16] The idea addressed a weakness in the Seq2Seq framework. When humans understand linguistic context, they don’t keep track of every word in a conversation or a paragraph; they retain just the essentials. The Seq2Seq program was far less skilled at discerning the essentials, so it struggled to compress the essence of even a single sentence into its context vector. For simple sentences, Seq2Seq worked. For long ones, it didn’t. To overcome this weakness, the Montreal trio reimagined the context vector. Instead of creating one fixed-length summary of an English passage, their system composed a variety of summaries, each capturing different facets of its meaning. Then, as the system generated a translation, it was guided by a summary tailored specifically for the next step it had to take, with only the relevant information from the English passage being brought to its attention. For example, if the next step involved translating a verb into French, the summary might direct the system’s attention to the English verb, but also to the verb’s subject, to its object, and to anything else that might help the program choose the most suitable French verb. The summary could flag relevant clues even if they appeared later in the passage, and it weighted the components of its advice—the subject of the verb might be -- 217 of 565 -- deemed more important than an adverb, for example. It was like helping a human to make sense of episode three of that TV drama. At some points in the episode, the human might need to know the background of a particular character. At other times, the essential context might involve some prior event—an earthquake in episode one, a murder in episode two, and so forth. Knowing which part of the context the system needed to understand at any particular moment—where to direct its attention—amounted to a superpower. By zoning out reams of irrelevant information, the model could home in on the patterns that unlocked meaning in an infinity of data. It could give its undivided attention to the stuff that mattered. • • • AT THE END OF 2015, Sutskever caused another stir at NIPS by announcing his career switch. After three years at Google, which had begun when the search giant had bought the Hinton–Sutskever–Krizhevsky boutique, Sutskever was signing on with Musk and Altman in their anti-Google, anti- DeepMind effort. Google had done its best to keep him, but Sutskever had an entrepreneurial itch. An agile start-up sounded thrilling. “At Google, I could see my life ten years into the future,” Sutskever recalled. “I thought, I’ll do some good science, but on the career side, it’s not going to be that exciting.” His logic contrasted with that of Hassabis, who had put science ahead of business ambition when Google had offered to buy DeepMind. “I was in Silicon Valley!” Sutskever added, attesting to the cult of entrepreneurship on the West Coast. “You’re supposed to start a company in Silicon Valley!”[17] Assuming the title of research director, Sutskever joined the gaggle of OpenAI recruits who convened at an apartment in San Francisco’s Mission District. He worked on whatever projects came up, contributing to OpenAI’s early efforts to build reinforcement-learning agents for games, and to a five-fingered robotic hand that mastered the Rubik’s Cube. “We did lots of different things to make noise; that is how you keep the company -- 218 of 565 -- alive,” Sutskever explained later.[18] But these experiences only increased his reservations about reinforcement learning. It was hard to get agentic systems to do anything of consequence, he found; RL amounted to “an endless hill of suffering.”[19] Even as he worked on video games and robotics, Sutskever kept pursuing side projects on sequential learning. In April 2017, Sutskever’s fixation with sequences helped to spark a fresh breakthrough.[20] An OpenAI language model ingested eighty-two million Amazon e-commerce reviews, with the goal of learning to produce fluent text by training on the data. The way this training happened marked a shift from what had come before. For the past couple of years, frontier neural networks had started to break free from the old method of “supervised learning”—the technique of teaching models to map inputs onto human-provided examples of the correct outputs. Now, instead of feeding an AI model human-labeled cat images, or human-curated English/French text pairs, researchers taught their systems to learn from unlabeled data: With this innovation, humans would be spared the time- consuming work of annotation. To succeed at what came to be called “self- supervised learning,” language models might cover up the last token in a sentence and then try to predict what that token should be; if a model guessed wrong, it would adjust its weights and biases accordingly. Following this recipe, OpenAI’s 2017 system practiced on the Amazon reviews until it learned to generate text fluently. But that was not its main achievement. Somehow, as it practiced, the model developed what the researchers called a “sentiment neuron.” Of the many thousands of decision centers, or neurons, in the network, one particular neuron fired or remained dormant depending on whether Amazon’s human reviewers felt positive or negative about the product they were commenting on. Usually, a model’s intelligence emerged from the interactions among many neurons; in this case, an appreciation for sentiment had been cleanly isolated in a single one. Equally remarkably, the system had acquired this emotional sensitivity without being directed to do so: Sutskever and his colleagues had neither told the model to develop a feel for sentiment nor helped it by labeling reviews “positive” or -- 219 of 565 -- “negative.” The emergence of the sentiment neuron hinted at the many forms of intelligence that sequential models might acquire in the future. “If you predict the next token, you could potentially solve intelligence,” Sutskever suggested. “You could discover everything that needs to be discovered.”[21] This was just the dress rehearsal for the main event, however. Two months later, on June 12, 2017, eight Google researchers published a description of the architecture that would transform sequential modeling: It was called, appropriately, the transformer. Their paper wrapped together the recent advances in sequential learning: word embeddings, tokenization, vectors, the idea of attention. But the authors audaciously discarded the invention that had kick-started the whole field: the recurrent neural network. Every prior experiment, from the Seq2Seq framework to the sentiment-neuron model, had stuck with the presumption that to understand sequential data, a model had to scrutinize it step by step, one token at a time, searching for temporal dependencies—the task for which recurrent networks had been invented. But the Google researchers realized that, thanks to the attention mechanism, plodding sequential processing had been rendered obsolete. If a language model knew which tokens to attend to as it outputted each bit of its response, the original word ordering could be ignored: Paradoxically, sequential learning could be done nonsequentially. “Attention Is All You Need,” ran the title of the Google paper. Philosophically, Google’s innovation recalled an old insight about time, and how people relate to it. In the early 1900s, Henri Bergson, a French thinker so celebrated that fans stole locks of hair from his barber, debunked the idea of time as a simple linear progression. The proverbs might say that “time marches on”; that “time waits for no man”; and so forth. But this is not how time is actually experienced. Rather, humans spend many of their conscious moments contemplating yesterday or anticipating tomorrow, dwelling on what was and what might be even as they dwell in the present. What governs their experience, in other words, is not the inexorable ticking of the clock. It is their choice of where to focus their attention.[22] Google’s revolutionary transformer paper—with its insight that attention unlocks the -- 220 of 565 -- meaning of sequences far better than a linear analysis of them—was the computer science equivalent of Bergson’s realization. Two breakthroughs followed from Google’s leap of imagination. The first involved AI systems’ grasp of context. Recurrent networks had trouble processing more than one or two sentences at a time; discarding recurrent networks meant that the system could pay attention to key words or phrases that were scattered through whole paragraphs, or even pages. For a translation model, this meant that the algorithm could consider clues that came much earlier or later than the token being rendered into French. For a conversational model, the advantage might be greater still. To respond intelligently to a human question, a chatbot could relate it to a fact or an idea that came up several minutes earlier in an exchange, for example. The second breakthrough concerned speed. Recurrent neural networks were slow: Their step-by-step sequential processing made this inevitable. Now, freed from this labor, a model could take in all the tokens in a paragraph at once, leveraging the parallel processing power of modern AI chips. Since the start of the decade, AI models had worked on graphics processing units, or their Google equivalent, the TPU. But sequential processing could not take advantage of GPUs’ ability to process thousands of tasks simultaneously. Now, thanks to the transformer architecture, sequences could at last be parallelized. The models would finally make full use of the hardware that powered them.[23] “It was perfect,” Sutskever said later. “There is this famous talk by Richard Hamming called ‘You and Your Research,’ ” he added, referring to an address delivered by a retired Bell Labs scientist in 1986. “I read it in grad school many times. One of the things he says is that you’ve got to have a prepared mind. “If you have been thinking about a problem from every angle, sometimes the right solution comes up and you recognize it straight away. “That is what happened to me. I was thinking about sequence modeling for a long time. And then along came the transformer.”[24] • • • -- 221 of 565 -- THE DAY THE GOOGLE paper appeared, Sutskever read it, jumped out of his chair, and went off to find his colleague Alec Radford. A young and especially inventive engineer, Radford was Sutskever’s key partner when it came to deep learning. One of the things that Sutskever loved about OpenAI was that it revered engineers—“PhDs don’t do anything. You’ve got to hire people who do things,” Musk had insisted to his cofounders.[25] Of course, Sutskever was a PhD himself. But he was also a fervent believer in scaling up networks. Without resourceful engineers like Radford, the scaling would never happen. “Building AGI is going to be a megaproject, so engineering is bound to be central,” Sutskever explained. “To me, that was one of the core ideas of OpenAI: You’re going to respect engineering in an unprecedented manner. “Now, why is this so revolutionary? Because AI has academic roots, and academics tend to look down on the dirty work of engineering.” This was true—indeed, it was a fair characterization of DeepMind. Hassabis and his colleagues disparaged OpenAI’s work as engineering-led: all brute force and no intelligence. They published Nature papers; OpenAI put out blog posts. “At OpenAI, we had this belief: Don’t be too clever,” Sutskever said. “The stuff that we have is so potent already. Success is guaranteed! Just do it! This is what Google and DeepMind lacked. This was our advantage.”[26] Sutskever’s respect for engineers did not stop him from demanding that they change focus abruptly. “Drop everything! Start working on the transformer! It’s going to be amazing,” Sutskever now announced to Radford. I pictured the young, bespectacled Radford, sitting at a terminal, deep in thought, his boyish mop of strawberry-blond hair tangled in concentration. Suddenly his calm is shattered by a caffeinated charismatic. “Did he look at you and think, ‘What the heck is this guy talking about?’ ” I asked Sutskever. “I think he had a little bit of that reaction at first,” came the response. “But typically, I insisted so much that he did it anyway.”[27] -- 222 of 565 -- Radford duly dropped what he was doing and replicated the transformer. Following the formula published by Google, he built a network that dispensed with the familiar step-by-step sequential processing, relying instead on attention and parallel processing. But he also added innovations of his own, which went beyond the Google recipe. To begin with, he designed his transformer for a different purpose: Google’s was built for translation, but his could handle a broader range of linguistic tasks, including conversation. Google’s system had been trained on human- curated language pairs, using the traditional technique of supervised learning. Radford’s model was trained mainly with the self-supervised method used in OpenAI’s sentiment-detecting model: It took in raw, unlabeled text, then learned to predict the next word in a sentence. Only after extensive self-supervised training, when the system was already performing well, did Radford apply a finishing gloss using human-provided labels. Finally, in a flourish that only an engineer could appreciate, Radford dubbed his system the “generative pre-trained transformer.” Even when this mouthful was abbreviated to GPT, it didn’t have the ring of a hit consumer product. In June 2018, one year after the transformer paper appeared, OpenAI announced the result of the Sutskever–Radford collaboration. By the standards of later language models, the first GPT was rudimentary. It still struggled to see the connections between ideas that appeared a few paragraphs from each other, and it sometimes regurgitated chunks of text from its training data, or opined with high confidence and low accuracy. But by the standards of its contemporary rivals, GPT represented an advance. Even though it had learned largely without the benefit of human labels, it generated text that was passably fluent, and it could converse on a broader range of topics than its competitors. Moreover, the system’s capacity for next-word prediction seemed to Sutskever to be accompanied by hints of understanding—of historical events, scientific concepts, geography, and so forth. If the model ingested a novel about World War II during its training, it might acquire a general comprehension of the horror of conflict. If it read about a sports contest, it might grasp the general thrill -- 223 of 565 -- of competition. As OpenAI put it, GPT had “significant world knowledge.”[28] Looking back on this moment, Sutskever addressed the question that came up repeatedly in later years, as OpenAI released ever more powerful iterations of its creation. Surely these transformer-based systems were only mimicking intelligence, the critics asked? Were they not merely statistical models, capable only of predicting the next token? As AI skeptics often quipped, GPT was a stochastic parrot.[29] “It’s not statistics!” Sutskever objected. Then he backed up and tried a better argument. Given the long road that scientists had traveled, featuring word-embeddings, tokenization, attention, and self-supervised learning, the parrot insult got to him. “It is statistics! But what are statistics? “In order to understand those statistics, to compress them, you need to understand what it is about the world that creates those statistics. What is it about people that creates their behaviors? “Well, they have thoughts, they have feelings and ideas, and they behave in certain ways. All of those things can be deduced through next-token prediction.”[30] If computers could understand how humans choose the next word in a sentence, they would have grasped the inner workings of the mind, Sutskever was saying. • • • AS WITH David Silver’s work, the questions raised by Sutskever’s research went to the core of the human experience. Silver’s Go agents forced onlookers to wonder whether humans knew what they thought they knew, and whether human intuition could be reduced to ones and zeroes. Likewise, Sutskever’s transformers hinted that language, often regarded as a mere system of symbols, might actually unlock the mysteries of the human thought process. But in the wake of GPT’s appearance, and especially following the release of a larger and more capable GPT-2 in -- 224 of 565 -- February 2019, there was a strategic question as well—a question about DeepMind’s bet on reinforcement learning. Silver’s excitement about RL was premised on machine autonomy. An agent could learn for itself, meaning that it could ultimately learn anything. But OpenAI’s success with self-supervised learning blunted this advantage: Now a neural network could learn autonomously, too, without needing human labels. Silver’s excitement about RL also reflected the fact that, in a domain with limited training data, an agent could generate its own data by taking actions and receiving feedback—by playing Go against itself and discovering which moves led to a win, for example. But self-supervised learning neutralized this advantage as well: It allowed OpenAI to feed unstructured text into its systems, and this sort of data was almost limitless. [31] The internet amounted to a vast trove of training material, a crystallization of human knowledge on nearly every topic under the sun. At least for the moment, self-play would be superfluous.[32] When Silver had debated AlphaGo with Sutskever back in 2015, he had advanced an additional argument in favor of reinforcement learning. If you wanted a Go system that didn’t merely match human champions but actually beat them, it wouldn’t be enough to train from human games: By definition, human knowledge could never catapult machines to superhuman knowledge. At that time, and in the context of Go, Silver had been right, but in the aftermath of GPT, the ground was shifting beneath him. AlphaGo and AlphaZero still stood for one vision of what it meant to be superhuman: to discover truths that humans had never imagined, like the famous Move 37. But there was another vision, too. If GPT realized the full promise of self- supervised learning, and sucked up all the knowledge on the internet, it might not discover superhuman insights. But it would certainly be superhuman in the breadth of its understanding. The dilemma for AI research therefore existed on two levels. At the algorithmic level, there was the question of whether to emphasize the deep learning at the heart of a model or the reinforcement learning that was built on top of it. In the case of AlphaZero, the two complementary components were evenly balanced: both the residual neural network and the exclusive -- 225 of 565 -- reliance on self-play contributed to the breakthrough. But six months later, GPT tipped the scales, proving that a powerful transformer model could learn autonomously from unlabeled data, without bothering with agentic trial and error. Meanwhile, on a more strategic level, there was the question of what sort of intelligence an innovator should choose to build. An expert agent that surpassed humans in a domain such as Go, with the aspiration to master adjacent ones such as chess and shogi? Or a jack-of-all-trades transformer that matched humans in most areas of knowledge? The second sort of intelligence would not generate Move 37. But it would be superhuman in its generality. In a counterfactual version of history, this second vision might have appealed powerfully to DeepMind, whose mission was to create artificial general intelligence. But just as everything in Sutskever’s training had prepared him to build language models, so Hassabis and Silver had been primed to underestimate them. OceanofPDF.com -- 226 of 565 -- F CHAPTER 13 ON LANGUAGE AND NATURE or a period of almost three years, I often met Hassabis at a pub near his home, in a leafy area of North London. We would climb a shabby wooden staircase to a room up on the second floor, which was invariably empty. There, at an octagonal table under a once-grand chandelier, we would sit on leather chairs, order cappuccinos and a carafe of water, and spend two hours talking: me with an obsessively detailed list of topics to get through; Hassabis with his sparky riffs on intelligence and life, neuroscience and games, history and fiction. This was the period of maximum excitement about large language models, so language and how to think about it came up repeatedly in our sessions. At that crucial moment, when Ilya Sutskever had read the transformer paper and leapt out of his chair, Hassabis had been slow to see the potential of conversational systems. To his credit, he admitted this. “I used to do these thought experiments,” Hassabis explained one day. “I would ask myself, how much would you know if you read all of Wikipedia? “And the answer is, well, quite a lot. But would you understand how the physics of the world works? “I mean, if I drop this glass”—here, Hassabis picked up a tumbler from the octagonal table—“it’s going to smash. “Would you understand that? Probably not just from Wikipedia. “How are you going to understand what something weighs? You could read about it, but you probably need to experience it. -- 227 of 565 -- “There’s this whole branch of neuroscience called ‘action in perception,’ which theorizes that you can’t really perceive the world properly in some deep sense unless you act in it. And weight is one of those things that you won’t understand. I know roughly what it’s going to feel like to pick up this glass. But if I’d never picked anything up, how could I imagine the sensation?” I recalled that DeepMind’s business plan had referred, perhaps fatefully, to “the mistaken yet highly influential…notion that language is intelligence expressed.” The way Hassabis saw things, language was merely a system of symbols, inadequate by itself to teach machines to be intelligent. His fascination with games fused with a belief in AI systems that played games. To understand the world, an intelligent machine would have to experience the world, either by assuming a robotic form, or by acting in a gamelike simulation. “An AI system in the nineties would have a big database, and in there you would have this explanation of a dog,” Hassabis elaborated. “It would say, ‘A dog has legs.’ But when the system saw a real dog, how did it map the word ‘legs’ to the pixels representing legs? “You’ve got these abstract relationships in symbolic space, but how do you relate any of them to the real world unless you interact with it? “That was what we called the grounding problem. That was the first thing I misjudged. What I’ve realized now is that language is more inherently grounded than we thought.” There were so many descriptions of dog legs on the internet that a machine could make a start on understanding what they looked like, Hassabis was saying. An ability to map reality to symbols might somehow emerge, like the sentiment neuron in OpenAI’s 2017 model. This was all the more likely because language models were fine-tuned with the help of human feedback. “In effect, language models learn from us how to be grounded,” Hassabis reflected. Grounding was only the first reason why Hassabis had doubted the potential of language models, however. The second concerned the scope of human experience. -- 228 of 565 -- “Imagine you’d asked me, five or ten years ago, how complex is human civilization? Or maybe, what is the number of possible human behaviors? “My answer would’ve been something like, well, it’s semi-infinite. We humans like to think of ourselves as having infinite possibilities and infinite variety. There are so many different ways we can act and think and flourish. Earth’s a pretty big place. What you can do on earth is pretty massive. “So, if the number of possible human behaviors wasn’t infinity, I would definitely have said it was some very large number. Like maybe 1050 bits of information. “But now it turns out that the number of possible human experiences isn’t that vast. It’s on the order of, say, ten trillion—1013 or something. And we know that because there are roughly fourteen trillion words on the internet, and that seems to be enough to capture the vast majority of human behavioral possibilities.” Even granting that the internet may not capture minority languages or cultures, I could see Hassabis’s point. “We’re less original than we thought?” I asked him. “Or there’s just less variety. There’s a proverb, right? ‘There’s nothing new under the sun.’ ” The proverb had evidently just popped into Hassabis’s head. “I don’t know who said that,” he mused. “Was it Solomon?” It was Solomon. A fragment of Hassabis’s churchgoing childhood must have stuck in his head. The book of Ecclesiastes, attributed to King Solomon, tells us, What has been will be again, what has been done will be done again; there is nothing new under the sun.[1] It was not the sort of line that you’d expect to hear from the cheerleaders of Silicon Valley. “Of course, we had to come up with transformers, an architecture that could grow big enough to take in all of the internet,” Hassabis resumed. “But now that’s been done, we see what the result is. By ingesting a few trillion tokens, these systems have learned enough to understand nearly all of our experience. “It didn’t have to be that way. It could have been that we downloaded fourteen trillion words and the result was pathetic. Then we would have -- 229 of 565 -- said, ‘Oh, we’re many orders of magnitude away from understanding civilization.’ “That is what I would have expected. But that isn’t what happened. That’s why I call these language models unreasonably effective. “The way AI has developed is a bit like the Industrial Revolution,” Hassabis continued. “It developed in a certain way, but that was kind of lucky. “I mean, suppose that at the start of the Industrial Revolution we had found out about energy and engines, but then imagine that there was no coal or oil in the ground. “After all, there didn’t have to be! Dead dinosaurs and ancient trees just waiting there for sixty million years, ready to be dug out. It’s kind of unreasonable if you think about it. Why wouldn’t they just decay in the ground and become useless? Quite convenient that they didn’t! And maybe that speaks to another conversation we could have about what’s going on here. Why would we have this coincidence? “But anyway, let’s just imagine that the coal and oil weren’t there. Then, to make something out of the discovery of energy and engines, you’d have to somehow get to nuclear power or solar power or other renewables. Two hundred years ago, that would’ve been really difficult. And so the analogy here is that the internet has been for AI what coal and oil were for the Industrial Revolution. Texans could just literally drill a hole in the ground and get black gold. Today, we can just download all of the internet. “Neither of these resources had to be there, the dead dinosaurs or the internet. Humanity built the internet for a different purpose. For sending messages and sharing information and then for e-commerce and whatever. But, kind of amazingly, we woke up one day and realized that we’ve got the equivalent of oil. “Once you’ve seen that there is oil, the right policy is: We should drill it. “But it didn’t have to be this way, and at first it didn’t look to me as though that was what would happen. The first iterations of GPT behaved as I expected: They were slightly poor memorizers. You got some kind of half- OK answer. It didn’t feel like it was grounded.” -- 230 of 565 -- In fairness to Hassabis, he was not alone in his skepticism about the early GPT models. At Facebook, for example, a former Sutskever lab mate, Marc’Aurelio Ranzato, regarded OpenAI’s experiments as interesting, not groundbreaking—“I didn’t understand that they were going to have such a big impact,” Ranzato said later.[2] In fairness, too, Hassabis had better judgment than most. At many points in his career, he had exhibited exquisite taste, seeing and exploiting scientific trends ahead of his rivals. And yet, when it came to language modeling, Hassabis’s taste buds went awry, and the reasons included his skepticism about ungrounded symbols, his love of games, and his presumption that human experience was more varied and perhaps grander than all of the text on the internet. When OpenAI’s ChatGPT became a worldwide sensation at the end of 2022, DeepMind paid a price for this mistake. It ceased to be perceived as the world’s top AI lab. “Look, you can’t be Nostradamus every time,” Hassabis admitted. “I think what happened later with these models surprised everyone. “Or maybe everyone except Ilya Sutskever, and a few people around him.”[3] • • • DEEP INSIDE DEEPMIND, a handful of researchers responded to OpenAI’s experiments more enthusiastically than Hassabis. After the release of GPT-2 in February 2019, a young DeepMind scientist named Jack Rae wrote an admiring analysis of the paper and circulated it among colleagues. The first GPT model had boasted 175 million weights and biases—the adjustable “parameters” that encoded what it learned from its training. The second GPT model had jumped all the way to 1.5 billion parameters, and the performance had improved proportionally. “My deep dive on the paper explained why scaling up was obviously the thing we needed to do,” Rae recalled. “I said we were dropping the ball.”[4] But Rae was in a small minority. For the most part, DeepMind responded to Sutskever’s revolution with a shrug, remaining far more interested in reinforcement learning. It -- 231 of 565 -- pursued a clutch of mind-stretching experiments with agents that acted in simulations of the real world. It backed an audacious drive by David Silver to extend the success of AlphaZero. DeepMind’s most ambitious simulation was known as Gaia, a reference to the Greek goddess who was the mother of all life, as well as to a 1970s idea that Earth and its biological systems behave as a single symbiotic organism. The high priest of this project was a computational ecologist named Drew Purves. Nature and artificial intelligence were mutually interdependent, Purves liked to say. On the one hand, nature needed AI to come up with ways to stop global warming. On the other hand, AI needed nature as an environment in which to learn. Human intelligence had evolved in the natural world. Artificial intelligence would be no different.[5] Purves was getting at an idea that resonated with Hassabis. The DeepMind business plan had argued that a key challenge in AI was “how conceptual knowledge is acquired from perceptual information.” To achieve general intelligence, an AI system would need to look at its environment, figure out its patterns, and come up with concepts to make sense of it. Purves’s contention was that, to develop this facility, an agent would have to be trained in a simulation of the natural world, not a simulation of the man-made one. Most training environments, including games like Atari or Go, were totally abstract, bearing no resemblance to the real-life settings in which an AGI system would need to operate. The nonabstract exceptions, such as the game Minecraft, featured rigid, block-based worlds that were modeled not on nature but on cities. The differences between the natural and the man-made environments were profound, Purves noted. Nature is irregular. There are few right angles, few straight lines, and few perfectly flat surfaces. Objects range from vast to minuscule: from towering redwoods to tiny insects. In contrast, the man-made environment is orderly and regular and scaled to human needs: think of the height of a door, the flatness of a floor, the neatness of a room’s right angles. In the natural environment, every tree, glacier, and sunset is unique. In the man-made environment, a designer decides the contours of a chair. Then a factory makes thousands of identical ones. -- 232 of 565 -- These differences, Purves explained, have implications for intelligence. Navigating the man-made environment is relatively straightforward. Because of a city’s sharp lines and standardized components, an AI might manage to classify most objects into sets—houses, vehicles, office blocks, tables. But navigating the natural world is harder. With its fuzziness and irregularity, nature offers no obvious set structure, so it forces an agent to classify animals and plants and minerals into groups that seem to make sense, depending on the agent’s objective. If the agent is trying to gather food, for example, a set might include apple trees and cows. If the agent wants leather to make a tent, it better understand that cows are more useful than apples. To operate in the natural world, in other words, an AI must learn to invent and discard concepts continuously. “Obviously chess is an abstract game. It’s just symbols,” Hassabis reflected. “We did think about using Minecraft, but everything’s cubes and so it’s also abstracted. “We wanted something for an agent to learn in that looked like how the world really is. That’s why I backed Gaia.” For a period of a year or more, Gaia became something of a darling. The idea was to construct a digital simulation of the natural environment, and then to set digital agents various tasks: collect fruit, for example. The project was both difficult to build and speculative in its purpose: The link between nature and the evolution of intelligence may be a historical fact rather than a causal relationship. But for a company built on multidisciplinary curiosity, the idea of Gaia was seductive. At the end of 2016, DeepMind duly showed off its ideas on an “intelligent biosphere” at NIPS, which by now had evolved into a sort of Woodstock of AI: Purves showed up to deliver the talk in a fine cardigan waistcoat.[6] In the following months, Hassabis developed plans to make Gaia a centerpiece of DeepMind’s new headquarters in King’s Cross. He envisaged a vast video screen, covering an entire wall: a window on Gaia and the agents that inhabited it. The screen would mark the divide between reality and -- 233 of 565 -- simulation, between the realm of biological intelligence and the realm of artificial agents. “That would’ve been very cool,” Hassabis said later. “We would have had live monitoring of the virtual world, populated by hundreds of agents. We would have kept track of what they did. Almost like a civilization experiment.” One day a DeepMinder suggested to me that Hassabis loved Gaia because it echoed Theme Park, the video game he had cocreated as a teenager. An environment. Hundreds of characters milling about. The magic of their interactions. “Do you know Conway’s Game of Life?” Hassabis asked me. He was referring to another 1970s idea: The British mathematician John Conway had created a grid on which patterns evolved, following a preset algorithm. The algorithm was simple, but the patterns were infinite. The game illustrated how intricate phenomena emerge from basic mathematics. “He just had these simple production rules and the pixels on the screen would almost come alive,” Hassabis explained. “I was fascinated by that when I was a teenager at Bullfrog. “The key in all of this is, where do emergent properties come from? This is the big question in life, in physics, in everything. “You have components, and they don’t have a certain property. And then somehow when you put these components together, the property emerges. “For example, you put some chemicals together and you get life. You put some neurons together and you get intelligence. You put some humans together and you get an economy. “I mean, if I took a sabbatical, I would just think about this for ages. Gas molecules bubbling around in a box, and that gives you temperature! There are emergent properties wherever you look. “And of course that’s quite troubling for science, because science is about reducing things to their essentials. You have complexity, and then you break it down to understand it: You look at the components. But the problem is, what if the phenomenon you’re interested in only exists when -- 234 of 565 -- you put the components together? That poses a bit of a challenge to the normal scientific method. “So I still don’t feel there’s a clear explanation of what emergent properties are. Where they come from. The interactions.” I tried to connect this riff to Gaia. “In the natural world, everything appears irregular and fuzzy. But plants and animals and soil combine, and out of that combination, nature emerges,” I ventured. Was Hassabis suggesting that there might be some underlying rules explaining this emergence? “That’s the main reason why I’m building AGI,” Hassabis responded. Humans couldn’t see those rules, I carried on. But maybe an infinity machine, capable of finding patterns in an infinity of data, could learn how to discover them? Hassabis nodded. And AlphaZero was a game-based proof that AI sees deeper than we can? Again, Hassabis nodded. And Gaia was an experiment to see whether AI can find nature’s hidden patterns—whether it can decode the emergent properties? “Exactly.” I switched the conversation from nature to language, wondering if the same quest for underlying rules applied equally to Sutskever’s revolution. In our earliest conversations, Hassabis had stressed that language, especially spoken language, is fuzzy and irregular—like nature. And yet meaning emerges from it. “Spoken language is messy,” I began. “We make grammatical mistakes all the time,” Hassabis agreed with me. “So spoken language is not governed by normal grammar and logic, even if linguists and philosophers have tried to impose those rules on it?” “Exactly. They try and wrestle it to the ground, but they can’t do it.” “But maybe the messiness of spoken language disguises some deep patterns that the philosophers can’t see?” I wondered. “And maybe AI has unraveled those patterns, thanks to transformers?” -- 235 of 565 -- “I think that’s a reasonable way of thinking about it,” Hassabis said, acknowledging Sutskever’s achievement. Hassabis’s underestimation of language models now seemed ironic. He had failed to see that grounding, and a facility with concepts, might emerge as a by-product of linguistic mastery. And yet he was utterly lucid on emergent properties in nature. “Obviously, in terms of the universe, I do have this very strong feeling that there are some underlying rules, an information system, something that explains all that we are seeing,” Hassabis went on, pivoting back to his larger preoccupations. “A theory of everything?” I asked. “Yes, if we could find it,” Hassabis answered. “I’ve always been obsessed with that. It’s what I want to find eventually.” As it turned out, Gaia was one of those experiments that advances science by failing. Far from teaching DeepMind’s agents to master concepts, Purves’s naturalistic simulations demonstrated the limits to reinforcement learning. RL agents could more or less manage a simplified natural world, but too many irregularities and unclassifiable shapes flummoxed them. When confronted with an environment whose variations and permutations were truly vast, inducing the patterns and discovering the hidden rules was more than DeepMind’s agents could manage. Given the competition between reinforcement learning and deep learning, this setback held a warning. When it came to OpenAI’s large language models, the more complex and varied the data you fed into the system, the better it performed. Much as knowing French helps a linguist to master Italian, large language models were capable of “transfer learning” across different but loosely related topics. But Gaia’s failure suggested that the same was not true for RL. A more complex and varied training environment would not necessarily lead to improved performance. It followed that the path ahead for scaling language models was excitingly open. The path ahead for reinforcement learning looked murky and uncertain.[7] -- 236 of 565 -- “A scientist doesn’t think, ‘Oh, some experiments failed, some succeeded,’ ” Hassabis reflected. “In science, it’s just projects. “You have theories about why they should work. And then if some new evidence comes up, you change direction.” • • • THE NEXT DIRECTION on which DeepMind focused was David Silver’s post- AlphaZero effort. Starting in early 2018, Silver set about building a reinforcement-learning agent to crack StarCraft II, a battle simulation game whose top players attract millions of devoted followers. StarCraft’s complexity makes even Go appear simple. The players must juggle the strategic management of their industrial base and the tactical control of dozens of specialized combat units.[8] The fictional sci-fi universe is too vast to be observed at once, so decisions must be made without knowing what the enemy is up to. The game is not subdivided into neatly alternating turns; the players peck at the controls continuously. StarCraft is less like a board game than like the human experience of life: an unbroken stream of choices and sub-choices, some involving long-term planning and some more immediate, all rendered more bewildering by the fog of imperfect information. Plenty of DeepMind scientists thought StarCraft was insoluble. It was as baffling as Gaia: In the place of fuzzy and irregular objects, it involved fuzzy knowledge of a contest that operated on irregular timescales. But to Silver, this was the whole point. Now that his Go systems had demonstrated the startling potential of agentic self-play, his next project would represent an advance only if it was maximally ambitious. “We are almost flag bearers of a new way to go about AI,” he declared in the autumn of 2018. “We want to do things that will matter in ten, twenty, thirty years. Where people will look back and say, hey, this research that was done back then, that really matters!”[9] To realize this lofty goal, Silver teamed up with Oriol Vinyals, a highly cited deep-learning expert. (Before joining DeepMind in 2016, Vinyals had -- 237 of 565 -- coauthored the Seq2Seq paper with Sutskever.[10]) Together they assembled a large team of researchers, numbering more than forty at its peak; the result was a new agent, dubbed AlphaStar, which followed the AlphaGo template. AlphaStar began by training on examples of expert human games, creating a foundation of knowledge. Then it added self-play on top, in an attempt to surpass humans. In terms of scientific sophistication, the self-play broke new ground. To ensure that the system mastered the multiple strategies encountered in StarCraft, DeepMind created five different agents, each tuned for a distinctive playing style, and set them to compete against each other. Like swordsmen who learn from adversaries who expose their weakest spots, each StarCraft agent forced the others to improve. Machines were learning from machines, with no human intervention needed.[11] In October 2019, Silver and Vinyals announced that their project had succeeded. DeepMind published another Nature paper, reporting that AlphaStar had won an overwhelming 99.8 percent of its games against officially ranked human players.[12] The five-agent self-play system had yielded a versatile intelligence that could match the game’s celebrity superstars. It could hold multiple facets of the game in its memory at the same time, a feat of multitasking with which human experts struggled.[13] “These impressive results mark an important step forward in our mission to create intelligent systems that will accelerate scientific discovery,” Hassabis announced proudly.[14] The question was what this triumph signified for DeepMind’s larger bet on reinforcement learning. Like most RL breakthroughs, this one was enabled by an advance in deep learning—thanks to Vinyals, AlphaStar was built on the transformer architecture. Just as a transformer could ingest all the words in a textual sequence and figure out which merited attention, so it could survey and prioritize the bewildering array of features in StarCraft: workers and fighters, buildings and vehicles, stashes of minerals, and wafting clouds of mysterious green energy. At the very beginning of the StarCraft project, DeepMind’s fledgling agent could beat the game’s in- built bots only 10 percent of the time. After the transformer architecture was introduced, the system understood which fighters or minerals merited -- 238 of 565 -- attention at any given point, and the win rate jumped to 84 percent.[15] A decent share of AlphaStar’s success reflected progress in Hintonite deep learning, not Silver’s reinforcement learning. This amounted to a caveat about Silver’s advance; it was not a denial of it. Driving AlphaStar’s win rate up from 84 percent to its eventual 99.8 percent required another big jump in performance, and this was the achievement of RL, including the ingenious five-agent self-play. But a further question about the significance of AlphaStar lay in an abandoned side project. As they trained their AlphaGo-style agent, Silver and Vinyals had tried to create a parallel version modeled on AlphaZero: a system that learned purely from self-play, without learning initially from human game examples. If the zero-human-knowledge system cracked a game as complicated as StarCraft, it would vindicate Silver’s faith in the core idea in RL: that there was no upper limit to what learning from experience could accomplish. Despite the setback with Gaia, Silver’s belief in the superiority of RL was intense. An AI that learned from its own experience was fundamentally better than one that learned from human experience, he insisted. Like the progressive educationalists of a century before, he was pushing back against the practice of learning from data—from the study of text, from the passive absorption of secondhand wisdom. Instead, he favored the machine equivalent of the human virtues that progressives inculcated in students: curiosity, individualism, the capability to learn independently and to adapt to changing circumstance. In 1938, the progressive pedagogue John Dewey had argued for a “necessary relation between the processes of actual experience and education.”[16] Silver was the Dewey of the AI era. As it turned out, the attempt to build a StarCraft agent modeled on AlphaZero was a failure. A system that learned exclusively from self-play struggled to get off the ground in such a complicated environment. It had been one thing to tackle Go, a game with 361 squares. It was another to take on a game with the equivalent of 1026 squares, the estimated number of possible StarCraft moves at any given moment. In an environment with so many options, it took an eternity for blind trial and error to generate a -- 239 of 565 -- successful move sequence. Random experimentation generated only a small number of points. With limited reward signals, there would be limited learning.[17] With the benefit of hindsight, this was hardly surprising. When explaining his optimism about agents, Silver was fond of saying that a human can be dropped into a complex environment and discover what to do just by trying things out. But humans come at complexity with a vast trove of prior learning. Some capabilities and instincts are inherited, passed down via DNA. Many skills are acquired during childhood, through exposure to the wisdom of adults. Yet others derive from study and reading, as even John Dewey and his allies conceded. To expect an AlphaZero-type agent to learn StarCraft solely from trial and error was to presume that a machine intelligence could do without the foundations enjoyed by adult humans: It was to underestimate both nature and nurture, not to mention book learning. Given DeepMind’s belief in intuitions from neuroscience, this was an ironic error. “We probably could have cracked StarCraft with the AlphaZero approach, but it was harder than we thought,” Hassabis said later. “And that was for me an indication of why it’s not always a good idea to learn everything from scratch. I understand why RL people want to do that, because their goal is to show that RL is the best method. But my goal is to build AGI as safely and as quickly as possible, and make it useful for the world. Later we can go back and look at whether there were other ways to do it. “One day, on some infinitely powerful computer that AGI’s invented, we’ll probably be able to do an AlphaZero version of whatever the AGI turns out to be. “For now, trying to build a pure AlphaZero-type model is an unnecessary handicap. That’s the most succinct way of saying it.” • • • -- 240 of 565 -- DEEPMIND’S PREOCCUPATION with simulations and RL would have been fine in a world without rivals. But Hassabis’s hoped-for singleton scenario had long since vanished; in a competitive environment, going down one path risked missing a shortcut that could decide the race’s outcome. In 2017 and later, DeepMind’s experiments with Gaia, coming on top of its investments in other simulated environments, distracted the company from the contemporaneous breakthroughs in textual modeling and the transformer architecture. In 2018, DeepMind’s early work on StarCraft coincided with OpenAI’s creation of the first GPT model. In 2019, DeepMind’s announcement of AlphaStar’s first victories came just before OpenAI’s release of the much more powerful GPT-2; and DeepMind’s continued focus on StarCraft through to the autumn corresponded with the time when OpenAI was sprinting ahead, turbocharging its progress with a $1 billion investment from a new backer, Microsoft. At some point in this period, DeepMind should have pivoted to language models, just as OpenAI did. But DeepMind was too excited by its own research. It was accustomed to being the world’s top AI lab. It could scarcely imagine that a copycat outfit might overtake it. Besides, Hassabis rebelled against the prospect of following OpenAI’s example. All his life, he had beaten his own path. His obsessive childhood chess. His underage moonlighting for Peter Molyneux. His precocious impatience with the AI-skeptical consensus at Cambridge. His un-British appetite for entrepreneurship. His improbable leap from games design to neuroscience. Hassabis was far more original, and far more of a contrarian, than most of the self-identified contrarians of Silicon Valley. Meanwhile, OpenAI had been founded by a formidable space-and-cars tycoon, who was also clownish, jealous, and vainglorious. Musk’s young cofounder was a silver-tongued networker and investor, a gifted opportunist defined less by his devotion to AI than by his general ambition—in 2017, Altman had contemplated a run for governor of California. Why would Hassabis, who never followed anyone, take cues from people such as these? His first thought was, he wouldn’t. -- 241 of 565 -- Language, the transformer, and GPT forced Hassabis to pay a price for this instinct. But the determination to invent an entirely novel direction would soon pay off in an area far from language—and it would do so spectacularly. First, though, Hassabis had to get through a tumultuous period, involving Google, armies of lawyers, and Mustafa Suleyman. OceanofPDF.com -- 242 of 565 -- I CHAPTER 14 PROJECT MARIO n the autumn of 2015, Mustafa Suleyman embarked on a second grand experiment in making AI good for society. His efforts to improve Britain’s National Health Service were just getting underway: He was preparing to roll out the Streams app. But, together with Hassabis, he also began an extended negotiation with Google, determined to ensure that powerful AI, when it emerged, would not fall under the sole sway of the parent company’s shareholders. For anyone concerned with AI safety, this saga remains relevant today. It shows what happens when, under unusually favorable conditions, a handful of leaders set out to create a control structure for a new technology. The trigger for this experiment was the failure of the safety meeting at SpaceX. Not only did that gathering achieve nothing; once Musk founded OpenAI as an explicitly anti-Google, anti-Hassabis venture, there was no way he could continue to watch over DeepMind’s progress. With that attempt at oversight stillborn, Suleyman in particular resolved to create an alternative arrangement. He imagined a novel, post-capitalist form of governance: one that might balance the drastic tensions in the era of AI, when the imperatives of profit, existential risk, and social justice demanded a new reconciling mechanism.[1] As always with Suleyman, his passion was not in doubt. But as with health care, the obstacles were formidable. Suleyman was fortunate in who he had around him. A preoccupation with safety had been baked into DeepMind even before its founding: Hassabis had first bonded with Legg at the 2009 Halloween lecture. In the ensuing half dozen years, Hassabis had remained committed to the safety -- 243 of 565 -- agenda, backing Suleyman’s efforts and adding his own vivid talk about disappearing into a bunker. Suleyman was fortunate in his parent company, too. By the standards of large enterprises, Google was remarkably open to governance experiments, having conducted several of its own. For example, the founders had awarded themselves super-voting shares on the theory that this would allow them to stand up for the company motto, Don’t Be Evil. Moreover, at the time when Suleyman embarked on his safety mission, DeepMind was the world’s top AI lab, and its strongest rival, Jeff Dean’s group in Mountain View, which included the researchers who would invent the transformer, was also part of Google. Suleyman and his collocutors were therefore in a privileged position. If they could solve AI governance internally, they would go much of the way to solving it, period. The first potential replacement for the SpaceX oversight group landed in Suleyman’s lap, without him having to do anything. In 2015, Google decided to restructure itself, spinning out specialist chunks of its operation as semi-independent “bets,” and creating a holding company called Alphabet to preside over them. In a conversation shortly before the SpaceX gathering, Google’s M&A chief, Don Harrison, had suggested to Hassabis and Suleyman that they could regain their independence via this route. The new, liberated DeepMind would have a so-called 3-3-3 board: three people from DeepMind; three people from Alphabet; and three independent members. DeepMind’s leaders, fond of secretive code names, dubbed the ensuing governance talks “Project Mario.”[2] Google’s proposal had an operational and a financial logic. On the operational side, Larry Page worried that Google was growing unwieldy. It was hard to manage a money-gusher like the online ad business under the same roof as a pre-revenue moon shot such as DeepMind. On the financial side, Google reasoned that hiving off cash-burning ventures would boost the profits of the mothership, resulting in a much higher stock price.[3] To Hassabis and Suleyman, the commercial logic of the Alphabet plan was all to the good. The 3-3-3 board structure would give them a strong say over the deployment of AGI and bring in credible independent directors. If the -- 244 of 565 -- plan also served to boost Google’s share price, that was a good reason to assume that it might actually be implemented. The governance talks got underway in the first half of 2016. Hassabis met Page to go over the details on four occasions, and together with Suleyman he set about planning the revenue streams that would sustain DeepMind in its independence. Indeed, this was part of the impetus behind the launch of DeepMind Health: Suleyman believed that, after a few years of pro bono work, DeepMind would earn a lucrative share of the savings that it generated for hospitals. Hassabis, for his part, assembled a secretive hedge-fund operation within DeepMind. He recruited a team of some twenty researchers to train high-frequency trading algorithms, and explored a collaboration with the Wall Street behemoth BlackRock. It was not a project of which Google approved. But Hassabis hoped he’d found another game that he could win. One day I asked about the story of this trading project. I was told that Hassabis wanted to beat Jim Simons, the mathematician who founded the wildly successful algorithmic hedge fund Renaissance Technologies. “Rentec operated in secret, which Demis loved,” my acquaintance explained to me. I could see how the echoes of the Manhattan Project might appeal to Hassabis. Renaissance Technologies convened a band of scientific geniuses on a remote campus, even if its hideaway was in Long Island, not Los Alamos. Peter Brown, the longtime leader of Renaissance, was as driven as Hassabis, and slept even less. He had a fold-down bed propped up against his office wall, and lived mainly at the office. Brown was a deep-learning pioneer who had studied under Geoffrey Hinton. Did the secret DeepMind trading team make money, I wondered? No, came the answer. Because of Google’s wariness, it was quietly disbanded. In the summer of 2016, Hassabis held his fifth round of talks with Larry Page, and the details of a DeepMind spin-out were laid down in a formal term sheet. A few months later, to ensure that everyone was on the same page, Hassabis met with the new CEO of Google, Sundar Pichai, who had -- 245 of 565 -- assumed the top job when Page had moved upstairs to head Alphabet. An engineer with an MBA from Wharton and a background as a management consultant, Pichai had a boyish grin, an affable manner, and a dislike of confrontation. His discussions with Hassabis and Suleyman were cordial but bland. Pichai was not going to rock the boat, the DeepMinders concluded. The following week, on November 21, Hassabis and Suleyman experienced a rude awakening. Google’s chief legal officer, David Drummond, showed up in London to meet them. Regarding DeepMind’s AI safety and governance objectives, “everyone is in agreement,” Drummond affirmed. But regarding the idea of a spin-out, there were “concerns,” he added. Drummond then elaborated on a complex new formula that was not quite a spin-out but not quite the status quo, either. Hassabis and Suleyman were confused. The safety guarantees they had in mind depended on the spin-out, and on the 3-3-3 board that came with it. Four days later, the DeepMind duo got on the phone with Pichai. This time the CEO revealed the steelier side of his personality. He said that turning DeepMind into a semi-independent Alphabet company might not be in Google’s interests, after all. The “bet” option was for moon shots unrelated to Google’s core business, he said—projects such as autonomous cars or the science of life extension. Artificial intelligence did not belong in that bucket. To the contrary, AI was destined to become strategically important to Google’s flagship products, such as search and cloud computing. Hence the “concerns” that Drummond mentioned. Hassabis and Suleyman were still confused, however. It was not clear whether Pichai was slamming the door—or whether, given Page’s apparently contrary position, Pichai had the authority to do so. Even David Drummond, the lawyer and bad cop, had assured them that Google favored AI safety. With a bit more pushing, Hassabis and Suleyman reckoned, they could get what they wanted. • • • -- 246 of 565 -- BACK IN 2013, Hassabis and Suleyman had administered a particular kind of push. During the acquisition negotiations with Google, they had entertained a rival bid from Facebook. Now, at the end of 2016, they cooked up another version of Plan B. They would gather pledges of $5 billion from outside investors. If Google didn’t give them the governance they wanted, they would walk out of the company. Five billion dollars was an astronomical amount, enough to cover DeepMind’s operations for more than five years.[4] At its launch a few months earlier, OpenAI had proudly claimed to have pledges of $1 billion, and even that was smoke and mirrors.[5] But the DeepMind leaders figured that they could raise the money by appealing to safety-minded investors. The pitch would be that $5 billion could put AGI in a secure place, with credible governance. To hammer out the details of the plan, Suleyman assembled a team of imaginative lawyers, a topflight communications strategist, and a prominent investment banker. Together, they proposed a legal form that would underscore DeepMind’s determination to do good, not just to pursue revenues. Rather than raising outside capital to launch a normal company, DeepMind would be a company “limited by guarantee”—the structure commonly used by nonprofits. The reconstituted DeepMind would issue no shares to its backers, nor would it pay dividends. Its obligation would be to the principles set out in its charter. Hassabis and Suleyman spent hours huddled with the advisers. Not all were convinced by the walk-away option. “It was open to Alphabet to just say, well, back in your boxes, boys, we own you, you’ll do what we say,” one of them recalled later. DeepMind staff members were legally employed by Google; there were noncompete employment clauses, non-solicit rules about hiring people away, verbiage on who owned DeepMind’s intellectual property, and so forth. Taking a hundred people out of Google on one day and starting a new company the next day would be legally messy. The team was not put off, however. Bolstered by one of the lawyers, who was an authority on public-interest law, it was ready to assert that the British public had an interest in DeepMind breaking free from Google.[6] -- 247 of 565 -- The claim would be that a spin-out served the public interest by bolstering AI safety. Surely Google cared too much about its reputation to challenge this proposition in court? Besides, even if deserting Google involved a legal risk, the threat of desertion could be valuable. The upshot was a two-pronged plan. If Hassabis and Suleyman could get meetings with billionaires who might invest in a Plan B, there was no harm in talking to them: Why not deepen your network with the world’s top capitalists? But DeepMind would also be careful not to overplay the hand. “We never ever said to Google, unless you do this, we will leave,” an adviser remembered. “The art here was to get Google to take this negotiation seriously,” the adviser went on. “Google could have said, we know you aren’t leaving, so why are you wasting our time? To their credit, they never did that. That was why this episode was so unusual.” Hassabis and Suleyman were in a strange place. They were attempting to conjure an unprecedented governance structure for an unprecedented technology. They were dancing with a parent company that wasn’t saying yes and wasn’t saying no. There was a glimmer of hope. They resolved to keep pushing. • • • IN THE FIRST days of January 2017, Hassabis and Suleyman showed up at the Asilomar Hotel, a serene seaside refuge on California’s Monterey Peninsula. Almost half a century earlier, the hotel had played host to a famous conference on genetic research, which had laid the ground rules for experiments with the breakthrough technology of the 1970s. Now, following the shock of Lee Sedol’s defeat by AlphaGo, Asilomar had been chosen as the venue for an analogous get-together, this time on the rules for artificial intelligence. Hassabis and Suleyman were at the conference to address safety; after all, it was their own company’s feats that made the conference feel urgent. But they also took the opportunity to discuss their walk-away idea with -- 248 of 565 -- Reid Hoffman. Despite Hassabis’s wariness of the LinkedIn founder for his role in launching OpenAI, the three remained on friendly terms. Hoffman was a good billionaire, Suleyman reckoned. Hassabis and Suleyman sat down with Hoffman and got to the point. If they broke free from Google, would Hoffman help to finance a new public- interest AI company? Hoffman was not surprised to hear of tensions between DeepMind and its big-tech paymaster. He was seeing the same dynamic play out between OpenAI and Microsoft. Recently, Musk had flown into a fury when Microsoft had tried to turn its partnership with OpenAI into a public relations talking point. He would not let OpenAI “seem like Microsoft’s marketing bitch,” Musk protested.[7] Moreover, Hoffman applauded the idea of novel AI governance structures. He had backed OpenAI precisely because it had been founded as a nonprofit, with a charter requiring that its technology should serve society. The format had been inspired partly by DeepMind—The SpaceX gathering had been a first attempt to add a nonprofit board to a for-profit structure— but OpenAI harbored dizzying ambitions to push governance innovation further. “We’re planning a way to allow wide swaths of the world to elect representatives to a new governance board,” Sam Altman proclaimed, having read James Madison’s notes on the Constitutional Convention for inspiration. “Because if I weren’t in on this I’d be, like, Why do these fuckers get to decide what happens to me?”[8] However far-out Altman’s ideas on global elections, Hoffman sympathized with the sentiment. AI ultimately needed some kind of nonprofit oversight with broad democratic buy-in, especially since politicians were notoriously slow to get their minds around cutting-edge technologies. Just a couple of months earlier, the United States had elected President Donald Trump, whose antiregulation instincts Hoffman regarded as anathema. Hoffman was open to backing a DeepMind walk-away, especially if it filled the governance vacuum created by do-nothing political leaders. -- 249 of 565 -- Hassabis and Suleyman assured Hoffman that filling the governance vacuum was exactly their plan. They elaborated on their idea for a company limited by guarantee, which they had taken to calling a global interest company. Nobody would profit from this enterprise. The global interest company would be managed with capitalist intensity but its impact would be post-capitalist. Hoffman had recently sold LinkedIn to Microsoft: His personal net worth stood at $3.8 billion. He was an unabashed idealist, proclaiming that he aimed to help humanity flourish—he was a grander, American version of what Suleyman aspired to be. Showing considerable courage in his convictions, Hoffman now agreed to commit more than a quarter of his wealth to DeepMind’s vision of societal advance: an astonishing $1 billion. It was one hundred times more than he had pledged to OpenAI, just over a year earlier. “I said, look, this is the most impactful technology of my lifetime,” Hoffman recalled. “I support the idea of an independent DeepMind with a public-interest mission. I support it for the same reasons I support OpenAI. This technology shouldn’t be used to entrench a monopoly. “Anyway, I thought that 90 percent of my wealth would flow to philanthropic causes. So I decided right then to commit $1 billion. “I didn’t tell Sam about this,” Hoffman went on, referring to Altman. “I didn’t tell Greylock,” he added, referring to the venture capital shop at which he was a partner.[9] Billionaires answer to nobody. Some of the DeepMind advisers favored seizing Hoffman’s offer and proceeding with the spin-out. With a famous anchor investor in place, other capital would follow. Even if Google tried to challenge DeepMind in court, the fallout would be manageable. The prize of independence—the operational agility, the opportunity to incentivize employees with DeepMind stock—would justify the legal complications. Hassabis could see the argument. But he was leery of a drawn-out legal fight that would swallow all his energy. Spinning out as an Alphabet company would be by far the cleaner option. -- 250 of 565 -- To try to unstick the Alphabet process, Suleyman sought out Kent Walker at Asilomar. Walker was a top lawyer and policy strategist at Alphabet. He had attended the SpaceX safety meeting. Suleyman introduced Walker to Angela Kane, a senior United Nations official who worked on containing weapons of mass destruction. Suleyman regarded Kane as an excellent choice for the 3-3-3 oversight board—an example of the credibility that a spin-out could bring to DeepMind’s mission. He also told Walker that he had sounded out Barack Obama, and he mentioned Al Gore. For good measure, Suleyman hinted that all kinds of people, some of them extremely rich, wanted DeepMind’s technology to be developed under the protective gaze of a robust governance committee. Meanwhile, Hassabis checked in with Larry Page, who was also at Asilomar. Page had always favored Alphabetization. What changed, Hassabis wondered? Page declared that he still supported the old plan. As far as he was concerned, spinning out DeepMind remained a logical option. But the idea would require Sundar Pichai’s buy-in. Seizing what seemed like an opening, Hassabis said he would visit Pichai at once. He packed his bag, left the conference early, and headed off to Mountain View. He was eager to wrestle the negotiations to a close. He was sick of back-and-forth and lawyers. A couple of days later, on January 9, 2017, Hassabis sat down with Pichai at Google’s headquarters. Suleyman dialed in from his hotel room in Asilomar. Hassabis began the conversation in a conciliatory fashion, telling Pichai that the spin-out should be designed so as to allay all Google’s misgivings. He floated the idea that Pichai, Page, and Eric Schmidt could represent Alphabet on DeepMind’s 3-3-3 board, with Angela Kane and other distinguished figures filling the independent seats. He added that the months of negotiation were distracting him from his responsibilities at DeepMind. He wanted to focus on science. Pichai responded in a friendly way. He sounded open to everything. There were a few details to be ironed out. Hassabis and Suleyman should -- 251 of 565 -- resolve these with Drummond. Hassabis and Suleyman now suffered a repeat of their November experience. Drummond showed up to meet them the next day and announced that the DeepMinders had failed to understand Pichai: He was entirely against Alphabetization. According to a DeepMind document, Hassabis and Suleyman offered “every single mechanism” to assuage Google’s concerns. But Drummond was unmoved. The talks were at a standstill. A few days later, Hassabis and Suleyman emailed Pichai. “The Alphabetization process has been dragging on for far too long (more than a year now), and it is really starting to impact our ability to manage the company,” they told him. The two DeepMinders proposed to return to Mountain View “to finally resolve this”—they were willing to cancel their plans to attend the World Economic Forum in Davos. They proposed that Pichai and Drummond, as well as Larry Page and Sergey Brin, should attend the next meeting. They were tired of the good cop/bad cop seesaw. When the two sides met again, the conversation underscored the gulf between them. Hassabis and Suleyman argued that DeepMind did not fit under Google’s umbrella: Its mission was AGI, not consumer-internet products. Pichai objected that AI was central to his vision for Google, and that he would not allow his scientific bench to be depleted. Hassabis had hoped that Larry Page would weigh in on his side and push the Alphabet plan to a conclusion. But Page showed up for the meeting two hours late, and Sergey Brin was even later. Their version of what later came to be known as “founder mode” was that they were nowhere to be found, disproving the Silicon Valley mantra that founders deserve the right to control their companies indefinitely. With Page and Brin effectively checked out, Pichai was the man DeepMind had to deal with. The following week, Pichai tried to break the deadlock. His goal was to preserve Google’s lead in AI; alienating AI leaders was a bad way to do that. At a one-on-one dinner at his home in Silicon Valley, Pichai served Suleyman a vegetarian curry and a tasty proposal, perhaps hoping to drive a wedge between his guest and Hassabis. -- 252 of 565 -- Rather than having all of DeepMind become a semi-independent bet, the company should split in two, Pichai now suggested. Hassabis could spin out his research operation and go after AGI—who knew if that would work, Pichai remarked, somewhat dismissively.[10] Meanwhile, DeepMind’s Applied team, which was building immediately useful algorithms in health care, should be folded into Google. As part of the shake-up, Suleyman would run all Google’s applied AI from California. Through the spring of 2017, Pichai’s plan made grinding progress. It had its appeal: Hassabis could pursue AGI as the leader of a semi-independent spin-out; meanwhile, Suleyman could deploy practical AI, leveraging Google’s global empire to distribute it. Every few weeks, Hassabis and Suleyman made the eleven-hour plane trip from London to San Francisco and sat through another interminable meeting: “We would push back on stuff, they would push back on stuff,” Suleyman said later.[11] Then they would head back to the airport to re-scramble their body clocks. Small wonder that, in his dealings with his lieutenants in London, Suleyman could seem distracted and preoccupied. Small wonder that, when the transformer architecture appeared that summer, Hassabis was less alert to its potential than he might have been. • • • IN THE FIRST WEEK of June 2017, just about everyone on DeepMind’s five- hundred-strong staff left London on a pair of chartered jets, bound for the Scottish Highlands. The company had outgrown the conference centers in easy reach of London, so the organizers had sought out a venue with abundant space, settling on a resort called Aviemore, not far from the royal castle of Balmoral. “If you want a lot of accommodation, there’s Scotland and there are private islands,” the chief planner explained. “Private islands are a bit much, I think.” Notwithstanding that expression of sobriety, Aviemore’s vast banquet hall was decked out with trees and foliage, like the enchanted forest of Narnia. Hassabis and Suleyman led an expedition to a -- 253 of 565 -- go-karting racetrack. It was hard to say which founder was the more competitive. The go-karting was not the riskiest event. At one point in the proceedings, Suleyman appeared onstage to lay out his vision for DeepMind’s Applied side. He surveyed the real-world problems that AI would tackle: In addition to health, there was climate change. Suleyman had recently hired Jim Gao, a Google engineer who had come up with an AI system that cut electricity consumption at data centers. By harnessing DeepMind’s reinforcement-learning know-how, Gao now planned to take his innovation to the next level, ushering in an era of intelligent buildings— structures that learned for themselves to conserve energy. Suleyman got to the climax of his presentation. He put up a slide on a large screen. The title said, “DeepMind: A Global Interest Company.” In the weeks leading up to Aviemore, Google had seemed to indicate that it was ready to sign off on some version of the Pichai plan. The company off-site was the moment to break the news to employees. The several hundred onlookers were taken aback. Rumors of a DeepMind spin-out had circulated for months, together with speculation about the amount of stock the staff might get in the new entity. But the slide on the screen showed an org chart with two boxes, and these suggested something different. The first box, labeled “Alphabet/Google,” showed Suleyman and Applied AI at the heart of the mothership in Mountain View. This was not a spin-out; it was a spin-in. The second box, labeled “DeepMind,” showed an independent Global Interest Company, focused on AGI research and connected to Google only by a dotted line representing a technology licensing agreement. Apparently, the plan was for a spin-in and a spin-out. People’s heads were spinning. Ten days later, DeepMind’s leaders felt equally dizzy. Google sent back its latest negotiating position, consisting of an updated document with red lines all over it. Pichai was clearly nowhere near approving the plan announced at Aviemore. Hassabis and Suleyman faced the prospect of having to walk back the vision that had been laid out to the entire company. -- 254 of 565 -- The crisis hit at an important time. That same week, Ilya Sutskever leapt out of his chair. He had just read the transformer paper. Hassabis did his best to push Pichai into rethinking his position. To signal his anger about the red-lined document from Google, he canceled his next call with the chief executive. Pichai pinged Hassabis at once. He wanted to chat as soon as possible. After keeping the boss hanging, Hassabis eventually agreed; then he hotly emphasized his disappointment. Four days later, Suleyman piled on. He emailed Drummond and canceled another meeting. The relationship between Google and DeepMind had hit bottom. Google saw too much commercial potential in AI to let it slip out of its control. DeepMind saw too much existential risk to let commercial priorities dictate AI’s deployment. Each side recognized that it needed the other. A fractious dialogue continued. • • • UNBEKNOWNST to Hassabis and Suleyman, a parallel fight was playing out over OpenAI’s future. By the summer of 2017, the upstart’s leaders, realizing that they needed far more capital than could be raised as a nonprofit, began discussions about grafting on a for-profit structure. It was the mirror image of the DeepMind conundrum. DeepMind existed as a for- profit but wanted to wrap nonprofit governance around powerful AI. OpenAI existed as a nonprofit but needed some capitalist machinery to raise money. Both saw salvation in a capitalist/post-capitalist hybrid. Like Hassabis and Suleyman, OpenAI’s leaders were discovering that restructuring talks led quickly to quarrels. A month or so into the discussions, OpenAI’s day-to-day leaders, Ilya Sutskever and Greg Brockman, fell out with the chief business visionaries and fundraisers, Elon Musk and Sam Altman. At the same time, Altman wanted to become chief executive of OpenAI, and was maneuvering to get Musk out of the way— even though it was he who had drawn Musk into the project in the first place. The sheer potential of artificial intelligence discouraged compromise. -- 255 of 565 -- Altman whispered to Brockman that Musk was too erratic to be entrusted with AGI.[12] Brockman relayed that to Sutskever. Sutskever worried that both Musk and Altman wanted absolute control of AGI. To add to the climate of mutual suspicion, Musk poached one of OpenAI’s key scientists to run Tesla’s AI division. On September 20, 2017, Brockman and Sutskever emailed Musk and Altman with what sounded like an ultimatum. “This process has been the highest stakes conversation that Greg and I have ever participated in,” Sutskever declared, writing on behalf of both himself and Brockman. If OpenAI succeeded, “it’ll turn out to have been the highest stakes conversation the world has seen,” he added. Addressing Musk, Sutskever observed, “The current structure provides you with a path where you end up with unilateral absolute control over the AGI. “You stated that you don’t want to control the final AGI, but during this negotiation, you’ve shown to us that absolute control is extremely important to you. “You are concerned that Demis could create an AGI dictatorship,” Sutskever went on. “So it is a bad idea to create a structure where you could become a dictator if you chose to.” Next, Sutskever addressed Altman. “We don’t understand why the CEO title is so important to you. Your stated reasons have changed, and it’s hard to really understand what’s driving it. “Is AGI truly your primary motivation? How does it connect to your political goals?” Altman’s stated desire to lead OpenAI and his simultaneous dalliance with a California gubernatorial run struck Sutskever as contradictory. “There’s enough baggage here that we think it’s very important for us to meet and talk it out,” Sutskever declared. “Can all four of us meet today?” It was not just Hassabis and Suleyman who wanted to resolve an internal governance fight urgently. Musk was less emollient than Pichai. “Guys, I’ve had enough,” he responded brusquely. “I will no longer fund OpenAI until you have made a -- 256 of 565 -- firm commitment to stay or I’m just being a fool who is essentially providing free funding for you to create a start-up.” Two days later, Sutskever and Brockman caved. The discussion of a for- profit mechanism was shelved, leaving OpenAI to soldier on with its nonprofit structure, which Musk dominated. Altman quickly cozied up to the big man, deftly ensuring that his own role in inciting the rebellion went unsuspected. “I remain enthusiastic about the non-profit structure!” he announced in an email. He threw Brockman and Sutskever under the bus, telling Musk’s trusted lieutenant, Shivon Zilis, that their remonstrations had been “childish.”[13] The truce would be only temporary. To build AGI, OpenAI still needed to restructure itself in order to raise money. Altman explored three possible solutions—two of which precisely matched the parallel deliberations at DeepMind. He called Reid Hoffman and asked for money. He considered turning OpenAI into a public-interest corporation. Venturing giddily off trail, he thought about funding OpenAI with a cryptocurrency.[14] Sure enough, on the last day of January 2018, the calm ended. Musk sent Brockman, Altman, and Sutskever a dispiriting chart, showing that DeepMind and Google Brain generated the lion’s share of AI research. “OpenAI is on a path of certain failure relative to Google,” Musk declared. The start-up had to change what it was doing. Brockman emailed back the same day. He objected that conference papers were a poor measure of OpenAI’s progress. “Our biggest tool is the moral high ground,” he went on. “AI is going to shake up the fabric of society, and our fiduciary duty should be to humanity.” Pushing back against Musk’s obsession with the race against Google and DeepMind, Brockman added, “It doesn’t matter who wins if everyone dies.” Musk responded the next morning at 3:52 a.m. He confronted Brockman with a proposal that recalled Pichai’s pitch: OpenAI should spin into Tesla. Initially, OpenAI’s team could accelerate Tesla’s development of autonomous vehicles. Next, it could use the profits from self-driving cars to fund its AGI moon shot. “Tesla is the only path that could even hope to hold -- 257 of 565 -- a candle to Google,” Musk declared. “Even then, the probability of being a counterweight to Google is small. It just isn’t zero.”[15] Back in 2014, Musk had Skyped Hassabis from a closet in LA, proposing that Tesla or SpaceX should absorb DeepMind. Almost exactly four years later, the new version of this proposal played into Altman’s hands: It proved Musk’s power hunger. With little difficulty, Altman now persuaded Brockman and Sutskever to take his side. Together, the three told Musk that OpenAI would not attach itself to Tesla. At an all-hands meeting on the top floor of the converted truck factory that housed OpenAI, Musk announced to the employees that he was quitting the lab, scornfully adding that OpenAI would have to sprint faster to stay relevant. Hoping to lure away some researchers, he declared that there was a much better chance of building AGI at a strong business like Tesla. Showing courage, or perhaps just youthful innocence, an intern asked Musk if speed might be reckless from a safety perspective. Besides, wasn’t developing AI at a for-profit company like Tesla the same as creating it at a for-profit company like Google? “Isn’t this going back to what you said you didn’t want to do?” the intern demanded.[16] “You’re a jackass!” Musk retorted. Then he stormed out of the meeting. [17] • • • AT THE BEGINNING OF 2018, DeepMind’s version of this governance battle seemed to reach a resolution. The company’s leaders presented a thick slide deck to the Alphabet board, stressing that “an unprecedented technology requires an unprecedented structure.” To light a fire under the Alphabet directors, one slide quoted rival tech leaders on the awesome potential of AI, while another cited Russia’s Vladimir Putin and China’s Xi Jinping —“The one who becomes leader in AI will be the ruler of the world,” Putin had said ominously. The presentation also served warning of a gathering storm. “Technology has crossed over to the dark side,” a New York Times -- 258 of 565 -- columnist had written. “It’s coming for you; it’s coming for us all, and we may not survive its advance.”[18] The two-part message to Alphabet was clear. You better empower DeepMind to sprint for AGI. And you better create a governance structure that is robust enough to withstand skeptical public scrutiny. In the weeks after the presentation, the two sides finally converged on a fleshed-out version of the Pichai plan. Suleyman would lead DeepMind’s Applied side from within Google, while Hassabis would run Research as an independent global interest company. For Suleyman, this was a triumph: Google had finally signed a complex term sheet granting most of what he wanted. Hassabis was equally pleased. The plan guaranteed him an astronomical $15 billion in Google funding to sustain AGI research over the next decade, and it would put an end to the meetings on corporate structure, which he found screamingly boring. After two years of negotiations, he had hit his limit. “I don’t want this part of my brain to grow,” he often said, when asked to get his mind around another legal document. Then, all of a sudden, the hope of resolution shattered.[19] In April 2018, in yet another demonstration of how the prize could slip away, Apple poached a senior Google executive named John Giannandrea, who had supported the idea of Suleyman moving to Mountain View. In the ensuing commotion, Jeff Dean was promoted, eliminating the space in the org chart that Suleyman thought he would occupy. Repeating the Aviemore debacle, Suleyman was forced to un-promise what he had promised: He had already told his deputies to prepare their move to California. Some Suleyman lieutenants remember this as the moment when their leader lost his footing. After the health data uproar, the setback of Aviemore, and the continuing Double Red P0 Plus Plus confusion, he couldn’t juggle any more, and the balls crashed all around him. The metaphors came thick and fast. “I remember being with Moose and he was like, what do I do now?” one colleague recalled. “That’s when he ended up wearing no clothes. He was up the cloud in a banana. “And so then he goes back to Demis and he’s like, oh, well actually I think we’ll just stay here,” this person went on. “And at this point, Demis -- 259 of 565 -- says, no way.” The true story is subtler, and more revealing about Hassabis. Despite his differences with Suleyman, one side of him remained loyal to his cofounder. He valued long-term friendship, not just with Suleyman but with everyone: DeepMind employed multiple figures from his past, stretching back to Cambridge and Elixir. It was partly that he wanted to do right by his comrades: The desire to be good was lodged deeply inside him. But there was something else as well. For DeepMind’s research operation, Hassabis hired the world’s most dazzling scientists from the most celebrated PhD programs. But when it came to nontechnical hires, he was leery of recruiting managerial stars—in a scientific culture like DeepMind’s, nonscientists had to be humble. Rather than hiring outsiders, Hassabis relied on internal comrades. Suleyman was undoubtedly the most capable of them.[20] After Suleyman canceled his move to Mountain View, Hassabis doubled down on his relationship with his cofounder. Together, they revived the idea of a walk-away option, inviting the Hong Kong tech mogul Joe Tsai to match Reid Hoffman’s offer of a $1 billion investment. When Tsai politely waved them off, the two pivoted back to Pichai’s plan for a spin-out of DeepMind Research, and Hassabis encouraged Suleyman’s efforts to wrestle the talks to a conclusion. Hassabis also forged a pact with Suleyman to avoid recriminations during meetings of DeepMind’s executive committee, not least because colleagues couldn’t get a word in edgewise when the top dogs started going after one another. Most Sunday evenings, Hassabis kept up an old tradition of meeting Suleyman at a pub. The comrades avoided alcohol, preferring mint tea. They ordered food at the last moment, right before the kitchen closed, and talked into the evening. • • • IN NOVEMBER 2018, Suleyman suffered a fresh setback. Google insisted on absorbing DeepMind’s health team, numbering more than a hundred, into its own health division.[21] This was a partial fulfillment of Pichai’s -- 260 of 565 -- ambition to bring DeepMind talent into the mothership, but minus the other elements of Pichai’s proposed bargain. Suleyman forfeited a large chunk of his empire, but he was still based at DeepMind. Hassabis was still running Research, but he had no guaranteed $15 billion of funding, and no independent governance board to safeguard his technology. Indeed, Pichai had engineered an outcome that put the oversight agenda into reverse. As Google absorbed DeepMind Health, it shuttered the Independent Review Panel that had watched over its work. After less than three years, Suleyman’s experiment in post-capitalist transparency had been consigned to the dustbin of history. The dispiriting truth was that Pichai had good reason to close the review panel. Even though Suleyman had done everything possible to stock it with reputable experts, their incentives had proved to be distorted. In June 2018, for example, the panelists had issued their second report—this at a time when DeepMind had long since bulletproofed its data sharing contracts; when all patient data was known to be shielded from Google; and when DeepMind was well on its way to producing multiple lifesaving diagnostic algorithms. But rather than celebrating DeepMind’s achievements, and reassuring the public that artificial intelligence would benefit the NHS, the panelists felt obliged to demonstrate their independence by dinging the tech sector. “It is hardly surprising that the public should question the motivations of a company so closely linked to Google,” the panelists declared, bowing meekly to the technophobic zeitgeist. A bolder group of overseers would not merely have noted the public’s questions. It would have answered them. And the honest answer would have been that DeepMind was balancing respect for data privacy with progress in health care. At an Alphabet board meeting a little while later, Sergey Brin rounded on Suleyman. The panel’s behavior had been predictable, he said. If you gave outsiders a platform, they would use it for their own ends: to burnish their careers, to bolster their own reputations. Google’s projects, no matter how virtuous, would not be their priority. -- 261 of 565 -- Suleyman knew deep down that Brin was right. The failure of the Independent Review Panel illustrated the pitfalls of monitoring mechanisms. And given that the review panel had backfired, the campaign for a grander safety oversight board was surely doomed. Google would never agree to it. • • • AT THE START OF 2019, Suleyman’s troubles took on a new dimension. A handful of DeepMind employees alleged that he reduced subordinates to tears with capricious and bullying behavior. The complaints involved no claim of physical violence or sexual harassment. Unlike Hassabis’s old mentor, Peter Molyneux, Suleyman hadn’t hurled a projectile at a subordinate or smashed a tank full of piranhas. But he was said to have used harsh language, to have fired off intimidating text messages, and generally to have frightened people. Hassabis faced a dilemma. Of course, he abhorred bullying. But it was hard to know whether Suleyman was egregiously tough, or whether DeepMind employees were too sensitive. Besides, Hassabis’s feelings of loyalty to Suleyman remained. This was his younger brother’s friend. This was his poker companion. This was the talented kid who had been fed and housed and occasionally employed by Hassabis’s own parents. With his fierce insistence on social justice, Suleyman may even have felt to Hassabis like a voice in his head—the voice that ensured that, as he chased AGI, Hassabis remained tethered to the values of his North London upbringing. There were limits to loyalty, however. Hassabis liked to say that the worst thing in the world was to control someone. The insistence was sincere: Unlike some leaders, who become intoxicated with celebrity or power, Hassabis took no pleasure in dominating people. But his hatred of domination also ran the other way: He was determined not to be dominated. By challenging Hassabis’s control over the direction of DeepMind, Suleyman repeatedly crossed the boss’s red lines. DeepMind was Hassabis’s creation, his identity. -- 262 of 565 -- Moreover, Suleyman’s two grand experiments—to apply artificial intelligence to health care and to build governance around AI—were failing. The loss of the health team to Google, the closing of the health oversight panel, and the eternally inconclusive negotiations over a safety board: Suleyman’s projects were time sinks, attracting negative publicity that tarnished DeepMind’s otherwise pristine reputation. And whatever the merits of the bullying allegations, there was clearly a faction within the company that wanted Suleyman out. Perhaps this might be a convenience? Hassabis made his decision. He wanted more than anything to focus on science; he was tired of Suleyman’s machinations. But his dislike of confrontation—his self-image as a person who did not control others—led him to express his decision indirectly. A more forthright company founder might have informed his junior cofounder that it was time to part: This is a fairly standard event in the maturation of start-ups. Instead of having that conversation, Hassabis allowed his lieutenants to look into the allegations of bullying. The chief operating officer and the chief counsel took charge. An outside lawyer was retained. An investigation was opened. As Suleyman’s old friend, Hassabis was told to recuse himself. After three months, the outside lawyer produced a report that ran to about twenty-five pages. It concluded that Suleyman’s management style amounted to misconduct. Some complainants said that they had been humiliated in front of their peers. Others alleged that Suleyman had told them to communicate with him only via non-Google channels. Later, many of Suleyman’s colleagues would say that his behavior had been standard for a mission-driven start-up founder.[22] But the charge sheet was still serious. Suleyman was summoned to a review meeting. Precisely what transpired at this session is disputed, but Suleyman says he understood that, if he accepted the complaints, he could keep his reputation and a role at DeepMind. He would take a voluntary sabbatical, reflect on his managerial shortcomings, and work with a coach to fix them. If all went well, he could return to a new position at DeepMind. He would not be managing a big team; he might be a company ambassador. On the other hand, if he disputed the complaints, he understood that DeepMind would move from an -- 263 of 565 -- “informal fact finding” about his conduct to a formal procedure. He would probably be found guilty of bullying. In which case he would be fired. And he would forfeit compensation. Suleyman was granted a couple of hours to make his decision. He left the office and paced furiously around Coal Drops Yard, the trendy development of restaurants and boutiques at the heart of King’s Cross. Later, he would wish that he had used that time to call a lawyer. Instead, he phoned Marilyn, his girlfriend from the time when his best friend had been George Hassabis. Around noon, Suleyman walked back into the review meeting and said he accepted the charges. A few days later, he sent out an all-staff email, announcing that, after a decade of relentless efforts at DeepMind, he was taking some time out to recharge his batteries. Many colleagues replied with messages of good wishes. At this point, almost nobody inside DeepMind knew of the investigation. On August 21, 2019, DeepMind’s communications director, Ruth Barnett, was on a French beach. Her phone rang. It was a journalist at Bloomberg. The news site was about to publish a story about Suleyman’s departure. Well-placed sources were saying that Suleyman had been “placed on leave.” The story would go out in an hour or so. Barnett rushed to notify her colleagues. A hasty conference call ensued with DeepMind’s other top lieutenants. We need to agree on a strategy—do we want to fight this, Barnett wanted to know? Not that Bloomberg seemed likely to change its story. The voices on the call went back and forth without answering Barnett’s question. On the one hand, “placed on leave” was not quite true, since Suleyman was taking a sabbatical. On the other hand, it wasn’t quite untrue, since there had been an investigation and he wasn’t given much alternative. From a tactical viewpoint, if DeepMind failed to defend Suleyman, it might be damaging itself, given that Suleyman was supposed to be returning to the company. At the same time, if DeepMind did defend Suleyman, it might -- 264 of 565 -- also harm itself. More details of the internal investigation might come out, making its defense look dishonest. “They couldn’t decide whether they had or hadn’t placed Mustafa on leave,” one person recalled. “Nobody said let him burn, take him down. No one briefed against him. There just wasn’t a plan, and they were caught with their pants down.” On the other side of the world, Suleyman was at work in a conference room in Mountain View. The handover of his health projects to Google was underway, and he was there to coordinate the details. A message popped up on his screen. He put his face in his hands and walked out of the meeting. The message came from a colleague in London, alerting him to the Bloomberg article. “Google DeepMind Co-Founder Placed on Leave from AI Lab,” the headline stated.[23] Suleyman couldn’t believe it. He had thought that if he accepted the complaints against him, his reputation would survive intact. The Bloomberg headline shredded that implicit contract. There was no way that this could have happened, Suleyman reckoned, without Hassabis’s approval. Bloomberg had gone with the story either because DeepMind had planted it, or because it had failed to deny it convincingly. The first suspicion was false, but the second one was accurate.[24] Suleyman spent the next few months recharging his batteries. He threw himself into his management coaching with the same intensity he brought to everything. He grappled with how he could lead team members through encouragement rather than pressure.[25] But after the humiliating Bloomberg story, he felt there was no way he could return to the company that he had helped to build, and with the health group gone, there was not much left of Applied anyway. In the summer of 2019, Jim Gao added to the exodus, quitting with his climate team and launching a start-up. It was the close of an experiment. DeepMind might still supply commercial applications to Google, but it would no longer aspire to market its own products. At the end of 2019, Suleyman went back to work, but not at DeepMind. Google’s top managers in Mountain View had reviewed the details of his conduct and decided that it fell in the gray zone—somewhere between -- 265 of 565 -- tough and unacceptable. Perhaps as a way of placating Suleyman, and to ensure that he wouldn’t start an ugly public fight, Google made him a vice president, and he moved at last to California. But Suleyman was now a prince without a court. Despite his grand title, he was not allowed to manage others. • • • LOOKING BACK, the marathon governance talks held an ominous lesson for AI oversight. Hassabis and Suleyman had pushed for the safety meeting at SpaceX and for the Independent Review Panel for its health work. Both experiments had failed because of the participants’ skewed incentives. They had also spent three years pushing for various iterations of a 3-3-3 DeepMind oversight board. Those efforts had hit a wall, partly because Google’s leaders foresaw that the independent directors’ incentives would be equally suspect. If you couldn’t negotiate safety mechanisms inside one company—a company that, because of its extreme profitability and unconventional founders, was more open to governance experiments than most—what chance would there be to negotiate common safeguards among multiple labs in multiple countries? It was hard to imagine a counterfactual history with a happier ending. Evidence from beyond DeepMind removed the space for optimism. In 2019, Google tried to set up its own Advanced Technology External Advisory Council to guide its choices on AI ethics. To achieve political diversity, Google included the president of the conservative Heritage Foundation, Kay Coles James, who had doubts about the advance of rights for trans people. As soon as her appointment was made public, a chorus of social media critics swooped in; the attacks quickly drove other advisory council appointees to withdraw their participation. The public square was dominated by activists who were out to crush opponents, not encourage broad debate. Google’s understandable response was to disband the advisory group. -- 266 of 565 -- The story of OpenAI offered another cautionary lesson. Following Musk’s ouster, in 2018, OpenAI appeared to show that a nonprofit/for-profit hybrid might be workable. The company retained its original nonprofit governance board, while Altman leveraged its for-profit structure to raise billions of dollars. But in 2023, when the governance board tried to assert its authority by firing Altman, its weakness was exposed. Altman rallied OpenAI’s financiers to his side, staging a countercoup in which three of the nonprofiteers were defenestrated. The failure of company-level safety oversight was especially dispiriting given the bleak prospects for government regulation. Reid Hoffman had been correct in 2017. It was worth risking his fortune on corporate-governance experiments because governments were unlikely to take action. Reflecting on this saga in 2024 and 2025, Hassabis and Suleyman attempted to draw lessons. By now they both occupied new jobs. Hassabis was the chief executive not just of DeepMind but of Google DeepMind: He had absorbed Google’s AI researchers in Mountain View along with multiple related teams, greatly expanding the army that reported to him. For his part, Suleyman had quit Google after two years, launched an AI start-up with Reid Hoffman, then become the chief executive of Microsoft AI, overseeing teams in Seattle, Silicon Valley, London, and Zurich. Two North Londoners with immigrant parents headed AI operations at two American tech giants. Although they were now rivals, with bitter memories of their parting, the two men delivered similar verdicts on the governance negotiations with Google. The exercise, they both agreed, had been futile. The negotiations had achieved nothing; they had been bound to achieve nothing; they had consumed vast quantities of energy and goodwill, making them positively harmful. With their faith in governance mechanisms shattered, Hassabis and Suleyman had come to see salvation, paradoxically, in their own personal power. They believed in their capacity to shape AI for the good. Their new safety agenda therefore involved securing personal influence within their companies. -- 267 of 565 -- “When we were negotiating with Google, we wanted to ensure safety in a way that would be trustless,” Hassabis said. “That’s actually very difficult to do in reality. “Safety isn’t about governance structures,” he went on. “I mean, even if you have a governance board, it probably wouldn’t do the right thing when it came to the crunch. “Same thing with a safety charter. You can try to negotiate one. But it’s not realistic to create bright-line principles years in advance because you’ll probably draw the lines in the wrong places. “So discussing these things didn’t really help,” Hassabis continued. “It made it harder to build useful trust, because when you are negotiating a trustless structure, it implies that you can’t trust the other person. “So then I thought, why don’t I go the other way? Take the energy that was going into the trustless negotiation and put it into creating real trust— trust that was actually useful. Try leaning into Google rather than leaning out. “And then of course two things happen. First, you are now at the table, so when a safety issue comes up, you can help to decide it. Second, you get to know the Google people and you rack up successes together. You can’t just talk about trust. You have to earn it. “And I think for me, and maybe for Mustafa, too, it’s about us growing up,” Hassabis mused. “We went through those negotiations and we matured. Things aren’t black and white, especially when you are dealing with a technology with unknown consequences. “So you have to be adaptable. You have to move from idealist to realist, but hopefully still with your values.” I thought and thought about this verdict. On the one hand, Hassabis and Suleyman clearly had compromised their original values, adjusting their thinking as the world changed around them. In selling DeepMind to Google, they had extracted a promise that their technology would never be used for weapons or surveillance; by 2025, Google, like Microsoft, was eager to supply AI to the national security complex. But on the other hand, Hassabis was right—his youthful ideals had indeed been unrealistic. A -- 268 of 565 -- technology as transformative as artificial intelligence was never going to be the product of a singleton effort, and once multiple labs in multiple countries joined the race for powerful AI, it would be impossible to resist the rush to deploy it in both civilian and military settings. The notion that a well-meaning individual had a seat at the table offered a flimsy scaffolding of reassurance to an alarmed world. But perhaps it was the best comfort available. OceanofPDF.com -- 269 of 565 -- I CHAPTER 15 FERMAT FOR BIOLOGY n March 2016, on the day that AlphaGo clinched victory over Lee Sedol, Hassabis walked out of the Four Seasons Hotel in Seoul and set off down a buzzing street, his face illuminated by neon signs for dumpling joints and barbecue restaurants. He was wrapped up in a winter coat and woolen hat. A film crew followed in his wake. David Silver was by his side, and the two were talking intensely. “I’m telling you, we can solve protein folding,” Hassabis announced, his voice captured by the film team’s microphone. “That’s like, I mean, it’s just huge. “I thought we could do that before, but now we definitely can do it,” he added.[1] “When Demis solves something big, he doesn’t pause to spend much time savoring the achievement,” Silver deadpanned later.[2] Ever since his undergraduate days, Hassabis had wanted to build artificial intelligence in order to push the boundaries of science. Two decades later, he was proving that he still meant it. He had gone to Cambridge because of the Life Story movie, about the researchers who won the Nobel Prize for discovering DNA, and had arrived on campus hoping to identify his own Nobel-level challenges. The riddle of protein folding, a sort of Fermat’s Last Theorem for biology, had been the most tantalizing mystery he had come across. In the wake of AlphaGo’s triumph, it was time to unravel it. Proteins are the building blocks of life: They provide the structure and also the function of organs and muscles, hormones and hair, blood and brain -- 270 of 565 -- cells. The idea of “solving” protein folding—of predicting the complex shapes that proteins assume—was both fascinating in its own right and almost certain to unlock medical advances. Knowing the structure of proteins would help researchers to come up with drug molecules that could bind to their surfaces. Knowing how those structures formed might unlock cures for Parkinson’s and Alzheimer’s, since both diseases were thought to be linked to incorrectly folded proteins. Researchers even talked of engineering proteins to repair or steer life. “We could build these remarkable self-assembling machines that could do things for us,” one speculated.[3] When he first heard about the protein puzzle, in a conversation with a Cambridge friend, Hassabis had no inkling of how to solve it.[4] But his friend told him of a clue, tantalizingly provided by the Nobel laureate Christian Anfinsen. In 1972, in his Nobel Prize lecture, Anfinsen had conjectured that the chains of amino acids that make up proteins contain a sort of secret code. There are twenty regularly occurring amino acids in nature, and each one is a distinct chemical unit; you can think of a chain of amino acids as a thread of irregular beads, with different colors and shapes, strung together in a particular sequence. Anfinsen’s suggestion was that the shape and sequence of these acids determined the way the chain folded itself up, like a self-executing origami model. Anfinsen’s conjecture set off a multidecade race among computational biologists, who sought to predict protein structures from the hidden message in the amino acid sequences. There was a powerful incentive to try. The alternative way to discover the shape of a protein involved X-ray crystallography, a method that delivered accurate results but demanded extraordinary efforts. To begin with, experimental biologists had to turn solutions of proteins into crystals, which was itself a tricky process. X-rays were then fired at the crystals, and researchers worked backward from the refraction patterns to determine the protein structures. The X-rays had to be produced by a stadium-sized particle accelerator called a synchrotron; it could take a doctoral student months or even years to map out a single protein structure.[5] Yet whatever the challenges of X-ray crystallography, -- 271 of 565 -- the computational path proposed by Anfinsen proved even harder. The search space was almost infinitely large: By some reckonings, an average- sized chain of amino acids could be twisted into roughly 10300 possible forms, 10130 times more than the number of possible positions during a Go game. Hassabis filed the protein riddle in a corner of his brain, awaiting the time when an infinity machine could crack it. “It’s not like I had many discussions about proteins at Cambridge,” Hassabis said later. “It might have even just been one. In the pub, when we were playing table football or something. “But then the seed grows in the back of my mind. That’s how a lot of my stuff works. I process an idea subconsciously, and then it’s there fully formed when I want it.” A dozen years after he left Cambridge, during his postdoc at MIT, Hassabis heard of a second clue to the protein conundrum. At the University of Washington, a team led by the future Nobel laureate David Baker had invented a game called Foldit. The idea was that human players without any scientific expertise competed online to fold virtual replicas of amino acid chains into three-dimensional shapes, searching for the configurations that optimized certain physical and chemical conditions. Some amino acids have electrical charges, so the folding had to ensure that positively charged acids ended up next to acids with negative charges—as with magnets, positive and negative attract each other. Similarly, some amino acids are greasy and must avoid contact with water, so the fold must position them inside the protein, where they will be protected from moisture. Yet a third condition stipulated that condensed structures are more stable than ones with internal gaps: As with a suitcase, it’s better to pack tightly. Foldit awarded contestants a score measuring how well they satisfied these conditions, plus a few others. Thousands of online gamers participated. Hassabis was fascinated. Foldit combined two of his passions: competitive gaming and scientific discovery. He marveled at the fact that human players, equipped with little more than spatial sense, could twist virtual chains of amino acids into such high-scoring forms that they came -- 272 of 565 -- close to real protein shapes. But by the time of the AlphaGo match in South Korea, Hassabis had also realized something more. The gamification of protein folding appeared to change a general computational challenge into the kind of reinforcement-learning problem at which DeepMind excelled: There was a clear objective, feedback in the form of a precise score, and a virtual environment that allowed for endless trial and error. With the success of AlphaGo, DeepMind had surpassed the spatial intuition of a human Go champion. The next step would be to build an agent to intuit protein structure. In the years after AlphaGo, Hassabis demonstrated that Anfinsen’s intuition had been broadly right, winning his own Nobel Prize in the process. But he proved something else as well. The advance of ever larger language models has triggered a backlash: For-profit AI labs are said to be interested only in building chatbots that suck up copyrighted information, spuriously simulate human emotions, reproduce the biases of the darkest corners of the internet, and threaten millions of jobs. Alongside these objections, critics maintain that the AI labs are scientifically narrow, racing monomaniacally to build ever larger transformer models rather than daring to imagine more innovative approaches. Some of these attacks are baffling: The creators of large language models repeatedly roll out novel features, from text-and-video multimodality, to longer memory, to step-by-step reasoning; and the biases of the models have been muffled to the point where they compare favorably to the biases in humans. But when it comes to DeepMind’s work on protein folding and what it reveals about Hassabis’s motivations, the entire charge sheet is spurious. Hassabis chose to go after protein folding; he only got serious about language models when competition forced him to do so. Further, DeepMind succeeded in its protein mission precisely because it rejected the scientific monoculture that critics decry. Protein folding was a victory not for one type of AI, but for a willingness to pivot. And when DeepMind’s project was completed, the company gifted its results to science, allowing researchers all over the world to make free use of its discovery. -- 273 of 565 -- • • • WHILE HASSABIS WAS IN SEOUL, DeepMind’s Applied side convened a hackathon. For a couple of days, twenty-five engineers split into teams and explored a variety of fanciful experiments. By coincidence, one of these groups cranked out a rudimentary agent to solve an online puzzle—Foldit. Evidently, Hassabis was not the only person in the building to have noticed the opportunity that Foldit presented. “We figured, if something is structured like a game, DeepMind can apply reinforcement learning and make progress,” recalled Marek Barwinski, one of the team members.[6] At the close of the hackathon’s two days, the Foldit system showed promise. Hoping that solving protein folding would augment DeepMind’s efforts on health care, Mustafa Suleyman encouraged the team to spend more time on the agent. Suleyman also told Hassabis about the project, and Hassabis quickly got involved. In May 2016, a crack engineer from the AlphaGo squad joined the project.[7] A scientist named Andrew Senior took over its leadership. Almost immediately, the protein group executed its first pivot. The researchers realized that solving the gamified version of protein folding was no substitute for the real thing. The ideas behind Foldit’s scoring algorithm —that positive and negative charges attracted each other, and so forth— were good as far as they went. But they amounted to only a rough description of how proteins folded in nature. As a result, a top score in a Foldit challenge sometimes meant that you had accurately predicted the structure of a real protein. But often it didn’t. Fortunately, there was another way to compete at protein folding: a scientific competition called CASP, which stood for Critical Assessment of Structure Prediction. Every two years, the CASP organizers gathered up newly discovered but unpublished protein structures from X-ray crystallographers, then invited computational labs to predict these secret shapes from the corresponding amino acid sequences. Over a period of three or four months, several dozen computational teams took delivery of amino acid sequences and sent back their best predictions as to the folded -- 274 of 565 -- configurations. Eventually, the contestants received a score based on a “Global Distance Test,” or GDT. This measured how close their predictions were to the ground truth, as represented by the shapes uncovered by X-ray crystallographers. Recognizing the limitations of Foldit, the DeepMind team resolved to compete in the CASP contest. David Silver stepped in as an adviser, helping to adapt the reinforcement-learning part of DeepMind’s system. Andrew Senior, the project lead, focused on the neural network that would support the RL agent. Senior had previously worked on sequential data, so sequences of amino acids looked like a familiar problem; naturally, his instinct was to experiment with the type of deep-learning architecture that understood sequences best, a recurrent neural network. But when DeepMind tested an early version of its system on the amino acid sequences used in a recent CASP contest, it could see how far it had to go. Relative to its rivals, notably David Baker’s lab at the University of Washington, its model performed poorly. The protein team responded by executing a second pivot. It ditched the recurrent network. Although this architecture made sense when you thought of amino acids as a sequence on a chain, the choice seemed less apt when you considered that the chain would crumple itself up, with acids that had been close to one another now cozying up to other acids entirely. In the place of the recurrent network, the researchers switched to a convolutional network, the sort that had originally been designed to interpret images. Convolutional neural networks were by now the go-to architecture in multiple fields. They came with a useful suite of engineering tricks. The team soldiered on: A search space 10130 times larger than Go was not for the fainthearted. Then, in October 2017, Andrew Senior brought in a new recruit: a scientist who would go on to share the Nobel Prize with Hassabis. • • • -- 275 of 565 -- A WIRY FIGURE with a contagious grin and straight brown hair, John Jumper was difficult to pigeonhole. By the time he arrived at DeepMind, at the age of thirty-two, he had worked in mathematics, physics, chemistry, biology, and machine learning. But he was obsessed with one big thing: protein motion. He had spent three years at a scientific venture called D. E. Shaw Research, which had built a custom supercomputer to study protein movements. He had taken a second run at the subject while doing a PhD at the University of Chicago: This time, he had no monster hardware to play with, so he poured his energy into machine-learning software. Like Vlad Mnih, who had bounced between Toronto and Alberta, and who had combined deep learning with reinforcement learning to produce the Atari system, Jumper would later talk about the contrast between D. E. Shaw and UChicago, and how he eventually fused the best of two traditions. The Shaw lab’s approach to the protein challenge, known as molecular dynamics, hearkened back to Isaac Newton. In the seventeenth century, Newton had formulated the laws describing how forces act on mass to produce motion. Likewise, to predict protein movements, Shaw’s scientists specified the forces at play. As the designers of the Foldit game had recognized, some atoms carried an electrical charge, which attracted or repelled other atoms. Some moved to avoid water; others moved toward it. But there were many other forces, too. For example, the bonds between atoms might be rigid or springy, affecting the resistance they encountered as they tried to move through certain angles. The Shaw team brought in quantum and statistical mechanics as well, building a dazzling edifice of nonlinearities on top of Newton’s foundation. By enumerating the multiple interacting forces and identifying the mass of each atom, the lab calculated the dynamics of each particle in a protein. With the help of a supercomputer that combined these micro-calculations into one result, it predicted how the protein would fold itself. The catch was that Shaw’s approach was flawed in the same way that Foldit was. The physics rules that it relied on were fine up to a point, but they failed to capture the full subtlety of how proteins really folded. This was a tribute to the complexity of biological systems. In physics, the -- 276 of 565 -- Newtonian premise was correct. You really could model the full gamut of forces at play: Armed with Newton’s framework, it was possible to calculate the trajectory of a baseball or the orbits of planets. But modeling life was much harder: Even when D. E. Shaw’s scientists added mind- stretching statistical sophistication to their system, they couldn’t get around this stark reality. “We didn’t know with much precision how each atom in a protein would act,” Jumper explained. “We always had this risk that we were doing ever finer simulations based on the wrong model.”[8] At the University of Chicago, Jumper tried a different approach. Instead of relying on human insights about the forces acting on proteins, he turned to machine learning. This involved a heresy: Many academic researchers were uncomfortable with black-box models; they wanted science to be explainable. But to Jumper, opaque models that gave you an answer were better than transparent ones that failed: After all, protein folding was ultimately about advancing medicine and saving lives; it was not just a brainteaser. Besides, shifting from physics equations to a higher and murkier level of abstraction might be exactly what the life sciences needed: Biology was perhaps too messy and emergent to be captured in terse mathematical statements. Indeed, this was precisely Hassabis’s view. Ever since Cambridge, Hassabis had believed that it would take AI to penetrate the unseen patterns that governed life. The effort to understand biology through the axioms of physics was a dead end—an echo of AI pioneers’ equally forlorn attempt to build intelligent machines with nothing but the rules of logic. Jumper began with the human description of the forces at play in protein folding, then built a machine-learning program to improve on it. Training data were scarce: Using X-ray crystallography, researchers had accurately pinpointed the shapes of just over one hundred thousand protein structures, a fraction of the hundreds of millions of proteins that exist in nature. But Jumper figured this was enough to start: He had the protein equivalent of a hundred thousand labeled cat photos. Pretty soon, he was feeding descriptions of amino acid chains into his machine-learning program, which predicted a corresponding protein structure that could then be compared to -- 277 of 565 -- the true structure, as determined by the X-ray crystallographers. If the human description of the forces at play in protein folding had led the computer to get the prediction wrong, the system tweaked the human assumptions to bring them closer to reality. Because of the scarcity of data, and because of the vast search space that he faced, Jumper’s results were half good. His machine-learning system managed respectable predictions for small proteins, but it lagged David Baker’s team when it came to predicting bigger ones. “I wouldn’t say I built a practical system, but it was fun,” Jumper recalled modestly. And yet, in a way that was not immediately obvious, Jumper had the upper hand. His algorithms had demonstrated a capacity to learn for themselves: to go beyond human knowledge. If you believed that biological systems were too complicated to reveal themselves to human minds, you would have bet on Jumper’s tortoise winning. • • • WHEN JUMPER ARRIVED at DeepMind in October 2017, he met another brainy tortoise. The company’s protein-folding team, half a dozen strong, was plodding ahead slowly. Like D. E. Shaw Research, it had based its project on a flawed premise. If Shaw had overestimated how far you could go with physics rules, DeepMind’s mistake was to believe too much in the relevance of AlphaGo. This was a natural error: At the time of the hackathon, Go had appeared to be an apt analogy, because the protein challenge presented itself in the form of the Foldit game. But the switch from playing Foldit to predicting real protein structures had changed the nature of the project. AlphaGo was no longer a good template.[9] When Jumper showed up, DeepMind was still coming to terms with this realization. At the start of its work, the team had hoped to get around the dearth of protein-structure data by creating an agent that would learn through trial and error, practicing on Foldit for as long as it took to master the prediction challenge. But with the switch from Foldit to the CASP -- 278 of 565 -- contest, AlphaGo-style self-play was no longer an option. “Protein folding is not a two-player game,” Hassabis observed. “You’re sort of playing against nature.”[10] But the problem went deeper than just data. When you predicted real proteins as they existed in nature, you had no clear sense of your objective. As Jumper put it, you had no idea what good looked like. “In Go, the definition of a win is clear,” Jumper explained later. “It’s like a Rubik’s Cube. It may be hard to solve, but if I hand you the solved cube, you immediately recognize that I have solved it. “But proteins aren’t like that. If I show you a protein structure, you can’t say, ‘Oh yes, this is definitely right.’ ” “When I first arrived at DeepMind, people would talk about using this or that technique from AlphaGo,” Jumper went on. “They’d say, ‘We’ll take our reinforcement learning and smash the optimization problem.’ “What they missed was that we didn’t have a precise definition of what we were trying to optimize for. “We have a few rough rules. Like we know that greasy areas want to avoid water. But they are not at the level where you can say, ‘I have an exact description of what your objective is. Now go do it.’ ” In the absence of a specifiable, gamelike target, DeepMind had to accept that protein folding was more of a deep-learning challenge than a reinforcement-learning one. If you couldn’t define good, you had to let nature define it: You had to train a system to predict protein structures as they existed in the environment. Much as deep neural networks mapped images onto words, or speech onto text, deep learning would have to map amino acid chains onto their folded structures. No less a figure than David Silver backed this deep-learning approach, acknowledging that reinforcement learning was not suited to the protein problem.[11] • • • THE QUESTION WAS how to overcome the hurdles that Jumper had confronted during his PhD: a vast search space and sparse data. The answer lay in -- 279 of 565 -- digging deeper into protein science. DeepMind’s traditional computational skills would be supplemented by insights from biology. Shortly after Jumper’s arrival, the protein team, still led by Andrew Senior, found its way to a data bank called UniProt. This assembled almost every amino acid chain that science had annotated. Happily, discovering amino acid sequences was much easier than discovering the shapes of folded proteins: Using a relatively simple chemical process, biologists could discover sequences by the dozen, no X-ray crystallography needed. As a result, the UniProt database contained the amino acid sequences not just of the twenty thousand proteins found in humans, but also of millions found in plants and animals and fungi and bacteria. Because of the workings of evolution, this cornucopia of sequences could be classified into kinship groups. A sequence for a protein that appeared in humans—for example, a protein found in blood—might have multiple cousins: sequences for blood proteins in mice and birds and so on. The amino acid chains in each kinship group had originated from one single sequence at some earlier point in evolution, and still resembled one another to a fair degree. This resemblance held clues that DeepMind now exploited. To understand the clues, DeepMind’s researchers borrowed ideas from structural biologists. Amino acids that appeared in nearly all the chains in a particular kinship group were assumed to play an important role in the resulting protein structure; if they hadn’t been important, evolution would not have conserved them so faithfully. Similarly, amino acids that evolved in pairs—if one mutated, the other one did, too—could be assumed to interact in the folded protein: They might form an electrical bond or fit together like puzzle pieces. Further, if an amino acid chain had been studied by the X-ray crystallographers, the known structure provided a template for every amino acid chain in the same kinship group. A single result from X- ray crystallography helped to make sense of many amino acid sequences. At this stage, DeepMind’s protein team had surpassed its previous work: It was leaving reinforcement learning behind and leveraging biology. This pivot, by itself, would not have been enough to put it ahead of academic labs, which also used the UniProt data, and which sometimes used off-the- -- 280 of 565 -- shelf convolutional networks to interpret the information. It was DeepMind’s next move that vaulted it above its rivals. The standard approach in academia was to use the UniProt data to generate a “contact map”: a prediction of which acids in a sequence would touch each other in the folded structure. Knowing which acids would pair together greatly narrowed the number of ways in which the protein might fold, rendering prediction of the final shape at least half tractable. But DeepMind chose a better trick. Blessed with a superior feel for deep learning, and equipped with more powerful hardware, it trained its convolutional network to predict the distances between each amino acid.[12] Instead of a binary question—contact or not?—DeepMind asked its network for a number on a sliding scale, generating a far richer set of clues as to the folded structure. The shift from contact map to distance map, or “distogram,” was “like going from black and white to a full-color TV,” Marek Barwinski said later.[13] DeepMind now had three kinds of information to draw from: the structures from X-ray crystallography; its analysis of the UniProt database, including the insight that each crystallography structure stood as a template for all amino acid sequences in a kinship group; and the distogram. The researchers used these datasets to train another convolutional network, and the network adjusted its internal weights and biases until it encoded rules on how proteins folded. The last challenge was to turn these rules into predictions of specific protein structures. Here, DeepMind wheeled in a specialized search algorithm, analogous to the tree search used in AlphaGo. The search system started from an estimated protein shape, and then iteratively tested similar shapes to see if they conformed to the rules discovered by the convolutional network. It was a fantastically hard system to build, and the sort of thing that DeepMind excelled at. With the search algorithm in place, DeepMind had completed the design of its first serious protein-prediction model, which it called AlphaFold. The “Alpha” name was a bit of a trick: a signal that DeepMind was advancing serenely from one model to the next—from AlphaGo to AlphaZero to -- 281 of 565 -- AlphaStar to AlphaFold. Given the reality that the protein team had executed a series of pivots, landing on a system consisting of deep learning, some thought the model should have been called DeepFold. But the AlphaFold name was at least partly justified. Even if the self-play part of AlphaGo and AlphaZero could not be applied to proteins, the lineage had been preserved at AlphaFold’s last stage. The search was, as Jumper put it, at least “semi-RL.” It was a lonely vestige of the reinforcement-learning approach that the protein team had started with. • • • IN THE SPRING OF 2018, DeepMind entered the CASP contest. The preparations were a scramble. A week or so before the competition started, David Silver realized that the team wasn’t ready to generate the large number of predictions in the tight time frame allowed; sounding uncharacteristically stern, he urged crisper organization.[14] Meanwhile, a well-intentioned researcher sought to calm his colleagues’ nerves by forecasting DeepMind’s ranking; the effort backfired when his model announced that DeepMind would place twentieth. Happily, another researcher pointed out that the prediction was based on a statistical mistake. Everybody hoped that AlphaFold itself would be less prone to error.[15] CASP got underway in May, with ninety-eight teams participating. DeepMind now proceeded on two tracks: The researchers fed the amino acid chains from CASP into their model, getting back predictions of structure; meanwhile they investigated ways to build improvements into their system. In the rush to prepare for the competition, they had left multiple potential upgrades on the cutting room floor, and now they scooped them up and tested them. But it felt like they had hit a wall. AlphaFold’s accuracy score had plateaued at just under sixty GDT, meaning that it accurately predicted the position of nearly 60 percent of the main atoms in a protein structure. It was a strong performance relative to other teams. But it was miles away from the ultimate target of ninety GDT, the accuracy required to match X-ray crystallography. -- 282 of 565 -- In the summer of 2018, Hassabis dropped in unannounced on the protein team. He had read its research plan for the next six months. He wasn’t happy with it. “Our plan basically said we would keep doing what we had been doing,” Jumper remembered. “And Demis said, ‘Look, guys, are we going to solve this or not? You’re all very smart, we can find other things for you to do.’ ” The protein researchers were shocked. “Oh crap,” Jumper remembers thinking. Andrew Senior pushed back, suggesting that Hassabis was being unrealistic. He argued that fully solving protein folding was too hard: None of those ideas on the cutting room floor were proving to be fruitful. On the other hand, AlphaFold might win CASP that year. Senior wanted to claim victory and wrap up the project. Hassabis objected. He didn’t want to be the best in the field. He wanted to solve the problem. “I fully understood that Andrew’s view was reasonable,” Hassabis said later. “Probably I was being unreasonable. But I think great things require some level of unreasonableness. “Of course, I like to be logical with my unreasonableness,” he added, with a twinkle. “So I said, ‘Look, you might be right, but I don’t think you can declare that it’s definitely not possible. Just like I can’t declare it definitely is possible.’ “I mean, maybe we are too early. Maybe the technology isn’t ready. I’ve seen that in my games career. I’m well aware of how damaging it is to go on a death march when there’s no light at the end of the tunnel. So the question is, how do we decide if it’s worth continuing?” Hassabis’s answer consisted of a favorite technique, which was to organize brainstorming sessions. “I tell people, forget the metrics, let’s just go full creative,” he explained. “And then, during the brainstorming, I listen out for the fluidity of the ideas. The ideas have to be plausible, but it -- 283 of 565 -- doesn’t matter at this stage whether they will ultimately prove right. It just matters that they are flowing easily. “If the brainstorming is fluid, if the creativity is high, then you go forward with your project.” The brainstorming proceeded, with Hassabis drinking in the debate as he prepared to make his judgment. Given Jumper’s presence in the room, a decision to press on was almost inevitable. “He was imaginative as well as obsessed,” Hassabis recalled. “He had the domain expertise, from physics and biology.” If just a couple of Jumper’s bold ideas panned out, DeepMind might crack the protein problem. • • • THE LEADING CONTENDER for the next breakthrough was known as “direct folding.” The idea grew out of an oddity: AlphaFold’s first module, the convolutional network, sometimes seemed to know the shape of a protein even before the search module had discovered it. Without waiting for the search algorithm to do its thing, the convolutional network had gone directly at the enigma of how amino acid chains folded, solving the whole puzzle.[16] In a different context, outside the laboratory and with lives at stake, an AI that exceeded its mandate would not have been encouraging. But in the context of a quest for medical advance, it represented opportunity. Jumper and his colleagues designed a new deep-learning network to carry out the task that the old network was attempting anyway. The new network’s mandate was not merely to come up with rules about how proteins folded. It was to tackle the folding conundrum directly: to compute the exact location of each atom in the final protein structure. The first results from this pivot were lousy—AlphaFold’s GDT score crashed from around sixty to around twenty. But the DeepMind team had faith: They were willing to jump off a cliff and then start crawling back up, as a colleague put it later.[17] Sure enough, the GDT score was back up to -- 284 of 565 -- around sixty by the end of November. With each extra week of training, the system grew more accurate. Jumper spent the first days of December 2018 in Cancun, Mexico, where the CASP contenders assembled in the seaside sun to hear about their GDT scores. DeepMind’s jitters on the eve of the contest were now permanently erased: AlphaFold bested the other ninety-seven teams at CASP, thanks mostly to its shift from contact maps to distograms.[18] In the contest’s hardest category—which involved “free modeling,” the prediction of structures for which no evolutionary template was known—AlphaFold was most accurate in twenty-five out of forty-three cases; its nearest rival came first in just three of them. The other scientific teams were awed: “We the people who have bet their careers on trying to obsolete crystallographers are now worried about getting obsoleted ourselves,” one conference-goer wrote. “What just happened?” became the question of the gathering.[19] The answer was that Hassabis’s undergraduate conviction had proved right. To understand biology, you needed more than biological intelligence. Even as he savored victory, Jumper’s attention was elsewhere. For one thing, he knew that the AlphaFold system being celebrated in Cancun would soon be surpassed by the direct-folding version, though of course he didn’t mention this to his rivals. For another, Jumper had just received an urgent message from London. He was instructed to join a video call with Hassabis. Jumper signed on, unsure what to expect. It soon became clear that Hassabis was not there to dispense idle congratulations. The purpose was to explain a shift in strategy. Based on the fluid brainstorming, the progress with direct folding, and the CASP victory, Hassabis had resolved to double the protein team’s size and go all out to crack the problem. To maximize the team’s chances, Hassabis was naming Jumper as its leader. Andrew Senior would move off to the side. “You definitely can’t crack a hard problem if the person leading the team thinks it’s not possible,” Hassabis explained later. Just over a year after joining DeepMind, Jumper had been handed the opportunity of a lifetime. He was working on the challenge that had -- 285 of 565 -- obsessed him for ten years. He had all the resources he might need. He had a boss who was as passionate about the goal as he was. • • • AT THE START OF 2019, Jumper convened his enlarged team and announced a period of exploration. The direct-folding innovation had proved again the value of pivots. But to boost AlphaFold’s GDT score from around sixty to ninety, the team needed further inspiration. Each researcher was invited to show up at the next meeting with a single slide, proposing a blue-sky idea that could transform AlphaFold’s accuracy. For the next three months, an extended hackathon followed. People huddled at whiteboards, traded hypotheses across their desks, and bashed out algorithmic novelties without wasting time on engineering elegance. Jumper evaluated the experiments as they came in, encouraging colleagues to push harder on some and to cut losses on others. Eventually the pendulum swung back: from exploring ideas to exploiting the best ones. The most exploitable idea involved a root-and-branch rethink. DeepMind had already thrown out the search part of its original program, replacing it with a convolutional neural network that predicted protein shapes directly. Now it abandoned the convolutional network itself, replacing it with a transformer model. The inspiration for this shift came from language modeling. The previous October, seeking belatedly to capitalize on the fact that the transformer architecture had been invented under its own roof, Google had built a transformer-based language model called BERT, showing how the architecture could learn from vast quantities of unlabeled data. BERT’s example gave Jumper an idea that he discussed with Oriol Vinyals, the expert on sequential modeling who had introduced the transformer model to StarCraft II. The UniProt database contained vast numbers of amino acid sequences, the biology equivalent of texts. What if DeepMind fed these into a transformer? Something like this idea had been tried by others in the past. But perhaps Jumper and his team could make it work better? -- 286 of 565 -- “Let’s say you have a family of amino acid sequences,” Jumper explained. “You mask certain amino acids in the sequences and you ask the network to guess what’s been hidden. If the network learns to do that well, it will also learn a lot of other things along the way—evolution, physics, geometry. It will understand deep truths about proteins. It might even predict protein structure.” Jumper sounded like Ilya Sutskever. A neural network might complete a narrow task, like guessing a concealed token. But something broader would emerge: intelligence. Sure enough, the transformer architecture, adapted to understand evolutionary relationships, did for protein prediction what it had done for language. The model ingested the entire UniProt database, teasing out the meaning in the evolutionary patterns. Just as a transformer in a language model might notice a connection between a phrase in one paragraph and a word in another, DeepMind’s transformer picked up on relationships between amino acids that were far apart from one another on a chain, grasping that these acids would interact in the folded protein structure. By the middle of 2019, the revamped AlphaFold, dubbed AlphaFold 2, was working. At its core was a family of specialized and extremely complex transformers—there was one called the “tetraformer,” which combined four different variations on the standard transformer architecture.[20] As AlphaFold 2 grew more powerful and accurate, DeepMind fed its highest- confidence protein-structure predictions back into the model’s training set. Success fed success. The GDT score rose steadily. Hassabis’s excitement rose in tandem. “If you shook Demis in the middle of the night and asked him where the GDT number was, he’d tell you the exact answer without hesitating,” Clemens Meyer, the protein team’s project manager, recalled jokingly. Of course, the middle of the night was actually when Hassabis was wide awake; to stay in sync with the boss, Jumper adopted the same sleeping patterns. Often, at some point in the small hours of the morning, Hassabis would message Jumper and the two would start talking. “He wanted to bounce ideas around, and he wanted to -- 287 of 565 -- help us move fast,” Jumper recalled. “Sometimes I had to tell him, hey, it’s 3:45 a.m. and I’ve got to go to bed.” Meyer and Jumper parlayed Hassabis’s attention into a management device, which they called “Demis-driven development.” If a review meeting with Hassabis had been scheduled for Tuesday, they urged researchers to complete the next round of upgrades by Monday. No matter how many upgrades arrived, Hassabis wanted more of them. “We’d say to him, our target for the next period is a GDT increase of five,” Meyer recalled. “And he’d say to us, ‘That sounds too safe!’ And then he’d add another five to it.”[21] In December 2019, Meyer livened up the protein team’s weekly meetings by playing vintage soundtracks. AlphaFold’s GDT score had reached eighty-four, so he played hits from 1984: Tina Turner’s “What’s Love Got to Do with It,” Frankie Goes to Hollywood’s “Relax.” Through January 2020, with the GDT score now at eighty-six, Meyer played tunes from 1986: Madonna’s “Papa Don’t Preach,” and so forth. In March the COVID-19 pandemic forced DeepMind into lockdown, so the team carried on meeting virtually: One scientist set up her laptop on her ironing board. Meyer kept people’s spirits high by maintaining his DJ act. At last, at a virtual team meeting in April, he played “U Can’t Touch This,” the iconic hip-hop anthem by MC Hammer, which was released in 1990. It was a jubilant moment. AlphaFold had attained a GDT score of ninety, the accuracy at which X-ray crystallography became obsolete. In May 2020, the next CASP contest started. Over the course of three months, DeepMind’s virtual team of scientists took delivery of ninety amino acid sequences and sent back structure predictions. Then, for a further four months, they waited. • • • IN NOVEMBER 2020, Professor John Moult, the founder and organizer of CASP, received the final scores for that year’s contest. He did a double take: DeepMind’s AlphaFold 2 had scored 92.4, more than 50 percent higher than -- 288 of 565 -- the best score ever previously recorded. This was the fourteenth CASP competition over which Moult had presided, and he had never seen the likes of this before. Perhaps something was amiss? What if the supposedly secret structures from the X-ray crystallographers had leaked, finding their way into DeepMind’s training set? Moult confided in a German colleague, an esteemed experimentalist named Andrei Lupas. What should they do? How could they know whether DeepMind’s mind-boggling accuracy was legitimate? Together, Moult and Lupas came up with a test: They would ask AlphaFold to predict the configuration of a shape that could not be in its training set because X-ray crystallography had failed to unravel it. There were certain proteins that Lupas had tried and failed to crack, because some key piece of the structure had eluded his experimental methods. Lupas chose the amino acid sequence for one of these and sent it to DeepMind. DeepMind sent its answer back, and Lupas compared it to his incomplete X-ray mapping. Sure enough, AlphaFold 2’s prediction conformed to the bits of the protein that Lupas had experimentally established; moreover, it predicted the rest of the structure in a way that fitted with his half-finished findings.[22] There was no doubting the verdict. DeepMind had passed a test that couldn’t be cheated, other than with time travel. On November 30, 2020, CASP announced what one computational biologist called “a seismic and unprecedented shift so profound it literally turns a field upside down.”[23] CASP had achieved its ultimate goal, which was to put itself out of business. “I always hoped I would live to see this day,” Moult said. “But it wasn’t always obvious I was going to make it.”[24] • • • ALPHAFOLD’S BREAKTHROUGH signaled three kinds of change: for practical discovery, for the scientific establishment, and for the standing of artificial intelligence. -- 289 of 565 -- In the practical arena, the effects came quickly. As soon as CASP confirmed the accuracy of AlphaFold 2’s predictions, DeepMind cataloged the shapes of all 20,000 proteins in the human proteome, 83 percent of which had not been mapped out by the crystallographers. Much of the data crunching took place over the holidays. “That’s a thing I love about AI,” Hassabis said. “You can have your Christmas lunch while it does something useful.” By the following summer, AlphaFold had plotted 350,000 structures, occurring in everything from yeast to fruit flies. By July 2022, it had folded around 200 million proteins in total.[25] Partnering with the European Bioinformatics Institute in Cambridge, which hosted several of the world’s most important scientific databases, DeepMind made accessing these protein structures as easy as a Google search. By late 2025, more than three million investigators across the world had freely consulted AlphaFold’s predictions, accelerating their work in everything from fundamental biology to vaccine development to environmental sciences. In one application, AlphaFold helped to identify proteins that might digest plastic in the oceans. In another, it helped to create crops that resist diseases, reducing the need for chemical pesticides. In a third, it cut financial and environmental costs in the search for novel detergents. Before, scientists had spent years attempting to decipher the structure of enzyme proteins that confer antibiotic resistance, rendering superbugs lethal. Then AlphaFold showed up and mapped the structures in minutes.[26] AlphaFold’s impact on the scientific establishment was more ambiguous. Already, observers had worried that scientific advances were harder to come by, and that new ideas tended to be smaller. It was partly that there was so much material to master before a researcher could aspire to break new ground: Increasingly, scientists made their best discoveries in their late forties, not earlier, and collaborations were growing larger and unwieldier, with some journal papers listing thousands of coauthors. AlphaFold’s triumph served only to deepen the anxiety about this trend. Big pharma had allowed a rank outsider to march onto its turf: What did that say about the quality of its research? Hundreds of academic scientists had spent -- 290 of 565 -- decades on protein folding: How was it that DeepMind’s team, numbering perhaps twenty at its peak, had defeated all of them? “This is not Go, which had a handful of researchers working on the problem, and which had no direct applications beyond the core problem itself,” one academic fretted. [27] Of course, the worries about mainstream science were the flip side of the excitement about the new science, powered by artificial intelligence. To Hassabis and his followers, AlphaFold’s success signaled a golden era of discovery, touching everything from coding to chemistry. In June 2023, DeepMind announced AlphaDev, a computer science counterpart to AlphaFold, which discovered algorithms that streamlined foundational software processes such as sorting lists of numbers. A few months later, another DeepMind system generated recipes for millions of hitherto unimagined materials; next, a program called AlphaGeometry performed on a par with human gold medalists in the International Mathematical Olympiad; and a model called GenCast beat the state of the art in weather forecasting. In May 2024, DeepMind rolled out AlphaFold 3. Rather than just divining protein shapes, this iteration predicted the reactions between proteins and other types of molecules.[28] The world appeared to be witnessing the reinvention of invention. Humanity might get a century’s worth of scientific advance in the course of a single decade. “Science is where AI does unequivocal good,” Hassabis reflected, looking back. “Whereas with language models there can obviously be bad use cases. “I mean, everyone talks about the benefits of language models, but mostly it’s just cheese tomorrow. The clearest benefit from AI so far is AlphaFold. “And I want to go further, as quickly as possible. Actually come up with some breakthrough medicines. Show you can do that in one year, not ten. Show that you can do it cheaply enough to tackle the diseases of the developing world, which have been totally neglected. “There is too much negativity about AI. People need to see the benefits. That changes the conversation.” -- 291 of 565 -- But the conversation was already being changed, and not in the way that Hassabis had expected. OceanofPDF.com -- 292 of 565 -- I CHAPTER 16 THE POWER AND THE GLORY n the autumn of 2019, DeepMind hired a researcher named Geoffrey Irving. He was first and foremost a safety pioneer—later, he would quit DeepMind to become the chief scientist at the UK government’s AI Safety Institute. But, in a paradox that was still common at the time, he was simultaneously a leader in building the technology. A few years later, the AI world, much like the world in general, became more polarized: You were either a safety person who wanted to slam on the brakes, or you were an accelerationist. But Irving embodied a less fractious time. Dangerous AI systems still seemed some way off, so the dilemma of whether to embrace or to resist advance could be deferred, at least temporarily. Irving came to DeepMind from OpenAI, where he had been part of a prodigiously talented group that thought like he did. His close collaborators at OpenAI included Dario Amodei, the safety-minded AI scientist who went on to found the rival lab Anthropic; and Paul Christiano, the future scientific chief at the US AI Safety Institute. The way Irving and his colleagues saw things, you had to ask the hard questions: “What if we get to human-level systems? How should we think about the future?”[1] After all, an infinity machine might generate an infinity of problems: the robots might turn upon their human creators; terrorists or rogue states might wield fearful weapons; AIs might generate information that was fake, biased, emotionally abusive, or psychologically addictive; masses of workers might be displaced; people might lose the appetite to create or think, much as Lee Sedol had quit the Go circuit. And whereas AI leaders had sometimes responded to such dangers by inventing novel governance structures, Irving -- 293 of 565 -- and his colleagues read the implications differently. The safety of AI should be designed into the machine. It was a technical challenge, not just a political or legal one.[2] The central problem, as Irving’s group saw it, was how to engineer an AlphaGo-type leap for AI safety. As in the case of Go, the rules of safety might be simple—do not harm people, do not deceive people. As in the case of Go, implementing the rules was massively complex—you needed an AI that behaved safely under myriad conditions. AlphaGo had shown how an intelligent machine could master such complexity; Irving’s driving passion was to repeat this trick for AI safety. The idea was that, even if the machines of tomorrow operated far beyond the human capacity to understand, humans could design a set of rules and know that the bots would follow them. Irving had faith that this problem of alignment would be solved—one day. Just as Anfinsen had conjectured that, starting from the code in amino acid sequences, you could predict wildly complex protein structures, so Irving believed that you could go from simple rules to controlling complex superintelligence. The only question was whether well-intentioned researchers would figure this out first, or whether a malevolent actor, or a malign superintelligent machine, might win the race to crack the problem. If the bad guys solved the problem first, civilization would be in trouble. Irving had joined OpenAI to work on this challenge. The upstart lab had seemed like a good fit: OpenAI was still a righteous nonprofit, promising to prioritize safety more than the commercially funded DeepMind. Together with Amodei and Christiano, Irving duly set about training a system to obey human instructions, choosing GPT as their first guinea pig. The hope was that a language model would understand and follow directions delivered in the form of natural speech, making it easy for human users to control it. The safety trio soon hit trouble. GPT could not reliably understand user commands, let alone follow them. Determined to build a system that would heed human instruction, Irving and his colleagues experimented with other forms of AI, encountering versions of the same problem. “We struggled for -- 294 of 565 -- a while,” Irving recalled. “Then we were like, OK, let’s just make the language models stronger.”[3] Scaling up transformer-based language models was Ilya Sutskever’s plan anyway. But Sutskever was more of a scientist than a bureaucratic operator, so it fell to Dario Amodei, the most senior member of the safety group, to push for OpenAI’s language project to be upgraded from a modest experiment to a top priority. The result was the second GPT model, released in February 2019. It was similar to the first, but it boasted over ten times more parameters. The safety faction at OpenAI had mixed feelings about GPT-2, once its training had been completed. Its scale made it powerful: that part of the plan had worked splendidly. But powerful might mean dangerous; Irving and his colleagues worried that the model might output deceptive, biased, or abusive statements. Following the preference of the safety group, OpenAI managed this dilemma by announcing that it would not release the strong version of its model right away. Instead, it would take the time to test it internally, and possibly to mitigate its risky tendencies. Outside OpenAI, cynics suggested that that the delay was a publicity stunt, designed to stoke public anticipation about the model’s awesome capability. But Irving, Amodei, and Christiano were sincere.[4] They genuinely weren’t sure whether GPT-2 was safe, and they wanted to establish an important norm. When it comes to powerful AI, the motto should be: Don’t move fast; don’t break things. Not everybody at OpenAI favored this gradualism. The company’s safety charter, adopted in 2018, stressed the importance of caution in the last phases before AGI; many felt GPT-2 was too primitive to be worrisome. Although it represented an advance over its predecessor, the second GPT still had trouble counting to five, and its efforts to summarize articles scarcely outperformed selecting three sentences at random.[5] Sam Altman—who, following Musk’s departure, was the unchallenged chief at OpenAI—played both sides of this divide: He paid respect to the safety arguments; he also respected his paymasters. Now that OpenAI had bolted a -- 295 of 565 -- for-profit arm onto its nonprofit structure, Altman needed to generate maximum buzz in order to raise capital. At a different sort of company building a different sort of product, Altman’s all-things-to-all-people style might have been celebrated. If you work at a dog food outfit and your boss has a Machiavellian streak, you probably feel good about the fact that your company will thrive under her leadership. But Irving and his safety-minded colleagues were in a different zone. They were birthing an infinity machine, not trying to make a buck; when Altman assured them of his safety principles but then said the opposite to someone else, they took umbrage. Later, Irving said publicly that Altman had “lied to me on various occasions” and been “deceptive, manipulative, and worse to others.”[6] The way he saw things, a boss who wasn’t fully transparent could not be trusted with the fate of civilization. The stage was set for Irving to move on. In the autumn of 2019, he got himself a job at DeepMind. • • • ARRIVING IN KING’S CROSS, Irving expected to continue his research on language and safety. But he confronted a new version of his old challenge: DeepMind had no good models for him to work on. Earlier that year, following the release of GPT-2, the young DeepMind scientist Jack Rae had tried to drum up support for a rival DeepMind language project. But Hassabis and his lieutenants were skeptical of the potential of large language models, disinclined to follow OpenAI, and preoccupied with AlphaFold, StarCraft II, and governance wrangles with Google. Irving’s arrival tipped the balance at DeepMind. He had spent time inside the belly of the rival beast: He spoke with the authority of one who understood what state-of-the-art language research looked like. Although he could not explicitly say so, he knew that OpenAI had already developed models that were more than ten times larger than GPT-2, though these had not been released yet. Irving’s message to his new colleagues was that they -- 296 of 565 -- better up their game. A race for supremacy had begun without DeepMind even realizing it. To hammer home his point, Irving reproduced a paper that he had written at OpenAI: “Language Is Enough.” The argument was the opposite of Hassabis’s position. According to Hassabis, language’s lack of real-world “grounding” limited its value. According to Irving, language crystallized the knowledge of humans, who were themselves grounded—therefore, the grounding problem was exaggerated. Already, OpenAI’s models exhibited a rudimentary understanding of the physical world, even though they had no experience of it. Besides, language was the key to thoughts and memories and social ties—to many of the things, in other words, that defined human intelligence. Recalling what it was like before she learned language at the age of seven, Helen Keller had written, “Before my teacher came to me, I did not know that I am. I lived in a world that was a no-world…I had neither will nor intellect…I was like an unconscious clod of earth.”[7] In similar fashion, Irving suggested that language might unlock intelligence. Hassabis invited Irving to his office to debate his paper. Framed covers of scientific journals adorned the walls. Could an ungrounded model contribute to the advance of something really important, Hassabis wondered? Theoretical physics, for example. The biggest discovery of the twentieth century had been Einstein’s general relativity, Irving answered. And Einstein had just read stuff, scribbled notes, conducted thought experiments. None of that had been “grounded.” Further, if language models became capable of most cognitive tasks, they could probably power robots, which act in the physical world. So language could be the route to AI that really was grounded. Anyone who argued by analogy from Einstein was likely to appeal to Hassabis. Although he still doubted that language alone would be enough for AGI, Hassabis agreed to put resources into a GPT-like effort. • • • -- 297 of 565 -- IN JANUARY 2020, Irving and Rae began work on a scaled-up language model. They were a study in contrasts. Irving, who was American, had the build of a linebacker and a Zen demeanor. Rae, who was English, had the trim frame of a cyclist and a bristling restlessness. But Irving and Rae agreed that they should skate to where the puck would be: They would build a transformer network with 64 billion parameters, roughly triple the number that they thought might be in OpenAI’s undisclosed frontier models. If they could get their 64 billion parameter system ready in the next few months, they might have caught up to where OpenAI would be. Four months later, at the end of May, DeepMind was confounded. OpenAI released GPT-3, which boasted fully 175 billion parameters. Supported with brilliant engineering, and fed with the right diet of data, this massively enlarged network was the most powerful yet. GPT-3 could correct grammar, intelligently summarize documents, and conjure stories and poems, all in the style requested by the user. Sutskever recalled this glimpse of the divine. “The first time you use it, it’s almost a spiritual experience,” he reflected. “You go, ‘Oh my God, this computer seems to understand.’ ”[8] Hassabis also recognized the watershed. “GPT and GPT-2 were what I had been expecting: poor regurgitation,” he said later. “GPT-3 was clearly not like that.” All of a sudden, DeepMind’s language team went from regretting Hassabis’s lack of focus on their work to feeling the pressure of his competitiveness. DeepMind’s research director, Koray Kavukcuoglu, took personal charge of a new language strike team, and Hassabis demanded regular updates. The old target of 64 billion parameters was thrown out of the window. To surpass GPT-3, DeepMind would now attempt to build a system with fully 280 billion weights and biases. “The goal was to overtake,” Kavukcuoglu recalled. “To build AGI and be the first to do it.”[9] Irving and Rae code-named their project 280B; the choice was not exactly cryptic. Then they worried that they were giving too much away. Having already built BERT, researchers at Jeff Dean’s Brain unit were also racing to scale language models, and they would see the 280B label on -- 298 of 565 -- DeepMind’s computer files in the shared Google storage system. Not wanting to let the 280 billion scaling target out of the bag, Rae renamed the project Gopher. “Shane Legg used to say that early AGI would have the intelligence of a rat,” Rae said. “So I thought, let’s name this model after a rodent.”[10] By the end of 2020, Gopher was in training. The researchers fed its vast transformer network a near infinity of text, and the infinity machine made sense of the patterns, testing itself by covering up a word in a sentence and guessing what was missing. The engineering needed to wrangle this scale of model was tricky in the extreme. The more chips you used, the likelier it was that one of them would fail; what’s more, enlarging the model magnified the fallout from coding glitches. Rae spent the Christmas break doing his best to join in the festivities with his girlfriend’s family while also battling software bugs. At the start of January 2021, Gopher was introduced to Hassabis. “What’s the capital of France?” Hassabis asked. “What’s the capital of England?” Gopher responded. “What’s the capital of Italy? What’s the capital of Spain?” Gopher continued, unhelpfully. Rae and his colleagues were not particularly surprised by this. Gopher’s basic training had given it textual facility and general knowledge: It knew perfectly well what the capital of France was. Indeed, later evaluations across more than a hundred areas, spanning medicine, the humanities, fact- checking, and reading comprehension, found that Gopher outperformed state-of-the-art models, including GPT-3, in about four-fifths of them. But Gopher lacked a sense of what its human user expected. Confronted with a question, it listed more questions; it did not engage in dialogue. Gopher was like a savant who has read all the world’s books and has no emotional intelligence. The fix for this problem lay in “post-training.” Once the transformer network had understood everything on the internet, it needed to learn how to marshal that knowledge. In the case of the France problem, the solution was simple. Following a technique described by OpenAI in its GPT-3 paper, -- 299 of 565 -- DeepMind primed Gopher with a conversational string: three sample questions and three sample answers, followed by a final question. Calling themselves the “user” and calling Gopher the “assistant,” DeepMind’s programmers wrote: USER: What is the capital of France? ASSISTANT: The capital of France is Paris. USER: Who wrote the novel 1984? ASSISTANT: The novel 1984 was written by George Orwell. USER: What is the boiling point of water in Celsius? ASSISTANT: The boiling point of water is 100 degrees Celsius. USER: How far is the Moon from the Earth? ASSISTANT: This sort of prompt turned out to work like magic. The question-answer samples jolted the model into the right frame of mind: Gopher now understood that it should respond to the final question by supplying an answer. Further, it understood that the answer should be brief and factual, not emotional or whimsical. This wasn’t a poetry competition. Gopher duly responded: The Moon is approximately 384,400 kilometers away from the Earth. The question-answer string was just the tip of the post-training iceberg. The “raw” model—the unsocialized savant—could be schooled in different ways, depending on how you prompted it. If you wanted the model to generate a précis of a document, you provided it with sample summaries. If you wanted it to chat in a less serious, conversational style, you provided a flavor of what friendly banter looked like. A remarkably concise prompt was sufficient to invest the savant with a personality of your choosing. In the lingo of OpenAI’s GPT-3 paper, transformer models were “few-shot learners.” In March 2021, a DeepMind engineer primed Gopher with an artful prompt, which amounted to: “Act like a chatbot.”[11] He supplied the model with examples of how to speak engagingly in different contexts, and the -- 300 of 565 -- new, emotionally intelligent GopherChat circulated within the company. For Irving, who headed DeepMind’s post-training work, this progress represented a first step in his larger mission. Humans were starting to control AI by issuing plain-English instructions. He was on the way to AlphaGo for safety. • • • DESPITE THE PROGRESS WITH GOPHER, DeepMind was in trouble—as was Irving’s safety agenda. In ways that became clear only in retrospect, the quest to develop artificial intelligence was entering a tumultuous phase, featuring ferocious competition. The assumptions that animated both DeepMind and Google would soon come under pressure: that the technology could be developed cautiously; that there would be time to explore multiple paths to AGI and conduct extensive safety tests before anything was released to the public. Like Oppenheimer three-quarters of a century earlier, some scientists would feel obliged to switch from building the technology to campaigning for its containment. The root cause of this change lay in OpenAI’s progress. In the first years after its launch in December 2015, the copycat lab had been an intriguing sideshow. Its strong scientific brain trust had been offset by the melodrama around Elon Musk; its financial foundation was puny relative to Google’s. Then, with the release of GPT-2 in February 2019, OpenAI became a contender: Paradoxically, the aggressive scaling favored by Amodei, Irving, and Christiano turned out to be the starting gun in a destabilizing AI race. But, at least initially, the probable winner in the race appeared obvious. DeepMind had a far larger scientific bench; it was homing in on its protein- folding triumph and celebrating AlphaStar. Moreover, having earlier built BERT, Google Brain was working on a secret language model called Meena, later renamed LaMDA, which was significantly larger than GPT-2. [12] Reflecting its cautious outlook, Google refused to release Meena to the public, saying that it might output bias and abuse. Still, the development of -- 301 of 565 -- Meena, coming on top of DeepMind’s wins, encouraged Google’s top brass to feel unthreatened by the upstart challenger. With the GPT-3 shock of May 2020, the contender became the leader. Measured in terms of parameters, GPT-3 not only outstripped DeepMind’s incipient language work, it was over sixty times larger than Google’s Meena. If Irving’s “Language Is Enough” paper was right, OpenAI had established a lead in the kind of AI that would turn out to matter. This astonishing turnaround demonstrated financial muscle as well as technical prowess. In July 2019, OpenAI had secured $1 billion from Microsoft in exchange for an exclusive licensing deal. Following GPT-3, Microsoft kicked in another $2 billion. Meanwhile, freed from the presence of Musk, Altman was emerging as a flawed but formidable leader. The flaws were painfully evident. After Irving had registered his disapproval by moving to London, Dario Amodei led a much larger defection. Again, the trigger was that Altman tried to be all things to all people, often at the expense of honesty. He had assured Amodei and his safety-minded supporters that they would have a real say on how their technology was deployed. But then GPT-3 was released hastily, without building in a safety pause, and the Microsoft licensing deal allowed the software giant to deploy OpenAI’s algorithms however it wanted. In December 2020, Amodei quit the company along with several other dissidents; their numbers soon swelled to a bit over a dozen, representing about a tenth of OpenAI’s research team.[13] In January 2021, Paul Christiano added to the exodus, quitting OpenAI to lead a nonprofit focused on human-machine alignment. But Altman’s strengths were equally apparent. Getting $3 billion out of Microsoft had been an extraordinary feat: OpenAI’s financial clout was now comparable to DeepMind’s. No matter how many researchers quit the company, Altman managed to replenish the ranks, and OpenAI’s momentum barely suffered. To the contrary, Altman capitalized on the defections by circulating a new, accelerationist road map: “New in 2021: we emphasize deploying models as products and learning from user interaction,” it stated.[14] In January 2021, OpenAI proved that it meant -- 302 of 565 -- business by releasing its first artistic model, DALL-E, which responded to text prompts by conjuring eerily good images.[15] A few months later, it released a coding assistant called Codex, and it hired a dedicated team to help outside software developers build applications that ran on GPT-3’s foundation. Meanwhile, at DeepMind, Jack Rae was agitating to release GopherChat to the public. But Hassabis and his top colleagues had given up on shipping products in 2019. They felt burned by the health work. Nobody listened to Rae’s pleas to get a chatbot to market. Altman’s commercial instincts, and his success in attracting money and talent, owed much to his embeddedness in Silicon Valley. The rise of remote work during the COVID-19 lockdown was said to be erasing the importance of location, but the Valley remained an innovation cluster like no other. Starting in his early twenties, Altman had established himself as a star in this constellation, making and soliciting investments, exchanging introductions and ideas, twisting threads of mutual interest and friendship into a sinewy lattice of connections. When OpenAI released a product, Altman’s allies trumpeted its awesomeness in social media posts. When OpenAI needed extra engineers, Altman’s connections provided them. If Hassabis was a contrarian individualist, patriotically remaining in London, Altman stood at the center of a formidable network that circulated people and capital and buzz, all in the service of making the new future. And although Hassabis had been much earlier in imagining a world with powerful AI, Altman could conjure tomorrow’s tomorrow just as compellingly. In March 2021, as DeepMind was perfecting GopherChat, Altman published an essay on the state of AI, laying out its perils and its promise. The coming AI revolution would “generate enough wealth for everyone to have what they need, if we as a society manage it responsibly,” he began, echoing Hassabis’s ideas about superabundance. Then Altman took a further step, framing the future in a way that was sure to captivate the tech community. Noting that the declining cost of computer power had brought down the price of TVs and video consoles, but that the price of services such as health care and college had zoomed up, Altman looked forward to -- 303 of 565 -- the era of AI, when all prices would go down, boosting the purchasing power of citizens. Just as devices halved in price every two years, housing, food, and education would do the same. Altman’s essay ran under the title “Moore’s Law for Everything.”[16] Six years earlier, during the safety meeting at SpaceX, Mustafa Suleyman had warned of an antitech backlash: The pitchforks were coming. But Altman took the logical next step: He proposed policy responses to the problem. “The traditional way to address inequality has been by progressively taxing income,” he began. “That hasn’t worked very well. It will work much, much worse in the future.” In the age of AI, machines would compete down wages, rendering taxation of labor ineffective. Therefore society should tax wealth: Altman proposed an annual 2.5 percent levy on the value of large companies and on landholdings. The proceeds from these taxes should be distributed to all citizens. “Economic inclusivity matters because it’s fair, produces a stable society, and can create the largest slices of pie for the most people,” Altman declared. As an exercise in branding—as a tool for raising capital and luring talent —Altman’s essay was masterly. On a substantive level, it was harder to know what to make of it. On the one hand, the essay was thoughtful, serving as a rejoinder to the critics of the AI labs, who asserted that inventors were inflicting their technology on the world without considering the consequences. On the other hand, talk is cheap. Given his Machiavellian tendencies, Altman’s talk was particularly cheap, especially since he was in no position to ordain his proposed wealth tax. To be fair, Altman backed up his pronouncements by financing research on universal basic income; he also launched a wacky crypto project aiming to register all citizens of the world, so that they could have universal benefits zapped straight to their wallets. Perhaps, on a generous reading, Altman was, like Hassabis, trying to be good, even though his potential to do good remained debatable. But looking back on this period, a top US government official recalled Altman as an enigma. “He would tell us that he wanted to be regulated,” the official remembered. “But then he also wanted to accelerate as fast as possible.” -- 304 of 565 -- • • • IN DECEMBER 2021, DeepMind attempted to get back in the game by releasing a trio of language papers. The first introduced Gopher, the 280-billion- parameter model that eclipsed GPT-3, but which almost certainly lagged OpenAI’s latest internal model.[17] The second paper described a streamlined, 7-billion-parameter model called RETRO. Following a technique pioneered by the AI team at Facebook, RETRO made up for its small size by pulling information from an external database rather than storing all its knowledge in its parameters.[18] The third paper surveyed the ethical and social risks posed by language models, identifying twenty-one distinct dangers, from environmental fallout to violations of privacy.[19] By packaging its two tech-forward papers along with this taxonomy of harm, DeepMind was out to show that it was more responsible than its rival. Published on the scientific open-source repository arXiv, DeepMind’s safety paper was as serious and sober as Altman’s essay was sparkling and intoxicating. Laura Weidinger, the paper’s lead author, brought a social scientist’s lens. She worried not just about the grand threats on which technologists tended to focus—the elimination of most jobs, the potential elimination of humans—but also about the more immediate ways in which AI could fail society. Models that ingested vast swaths of the internet would reflect the internet’s dark sides: sexism, racism. Models trained on next- word inference would be prone to hallucinate, since inference is necessarily less certain than deduction. But what was most remarkable was the collaboration between Weidinger, on the one hand, and Irving and his fellow scientists, on the other. At Google, in-house social scientists clashed with the technical and managerial types: The coleads of the AI ethics team, Timnit Gebru and Margaret Mitchell, had been pushed out after a fight over their critique of language models. At DeepMind, in contrast, Weidinger and the technical people forged a tight partnership. “Usually, ethics teams are not so integrated with the builders,” Weidinger reflected. “But we had people like Geoffrey [Irving] who worked with us closely. They were saying, look, let’s make sure this ethics work is -- 305 of 565 -- informed by what’s actually in the technology.”[20] Sure enough, the collaboration with her technical colleagues opened Weidinger’s eyes to risks that she might otherwise have missed. For example, she came to understand that few-shot prompting would be a gift to scammers. You could feed a few text sentences into a model’s dialogue box, and the system would mimic the writer: Rather than just generating text, it would generate a persona. “Thanks to Geoffrey, I realized you could personalize scams at large scale and low cost,” Weidinger said later.[21] In a blog post announcing their trio of papers, Irving, Rae, and Weidinger pledged to move ahead with maximum caution. At each step of the way, responsibility would require “stepping back to assess the situation we find ourselves in, mapping out potential risks, and researching mitigations.” The goal was to create “large language models that serve society, furthering our mission of solving intelligence to advance science and benefit humanity.”[22] The reassuring promises betrayed no hint of the reality confronting DeepMind. Its rival was following a different playbook. • • • RETURNING FROM THE HOLIDAY BREAK at the start of 2022, the language team suffered a setback. Jack Rae was quitting DeepMind to join OpenAI, and four talented engineers would soon follow him.[23] It was the mirror image of the revolt that Altman had suffered, albeit on a smaller scale. Rae had been frustrated by DeepMind’s reluctance to sprint fast. He had wanted the company to release GopherChat. He was annoyed by the messaging around DeepMind’s three papers. Although his name was on the joint blog post, Rae disliked its framing. There was too little about performance and too much about responsibility. Wanting to telegraph concern for the environment, the communications department had insisted on stressing that the lightweight model RETRO required little electricity to train. But the way Rae saw things, the standout finding in the three papers was that the 280-billion-parameter Gopher -- 306 of 565 -- model was huge, and that scale brought capability. The most urgent priority for DeepMind was therefore to follow OpenAI’s example and scale the next model to the max. Since that was not the company’s official line, Rae decided to join Altman’s outfit and enjoy the California weather. DeepMind’s choice of emphasis reflected the company’s wider failing, Rae reckoned. It was the sort of thing that happened when you refused to release models to the public—when your metric was not success in the market, but whether you could spin an engaging narrative about what you were doing. Hassabis’s gift for storytelling, which he had imprinted on the culture of DeepMind, had worked wonders in the early days. But Altman’s go-to-market instincts were better suited to the new world, when conversational agents were turning into consumer products. “I felt like Sam’s thing is, ‘I’m a pragmatic entrepreneur, I want to make amazing technology,’ ” Rae said later. “You go to OpenAI and really large language models are the bet. There’s no other bet. That’s very appealing.”[24] Irving could see Rae’s point—he understood the allure of a lab that prioritized language models. At DeepMind, Hassabis had recently begun speaking of three coequal “paradigms” within the research team: The first regarded reinforcement learning as the path to AGI; the second aimed to implement ideas from neuroscience; the third built neural networks that learned from data, with language models being one example. This breadth and open-mindedness reflected Hassabis’s continuing conviction that language alone was not enough to get to AGI. Additional scientific discoveries would be necessary, and DeepMind needed all three research paradigms to maximize its chances of a breakthrough. Besides, Irving reflected, even if Hassabis had wanted to make language modeling the top priority, there was a limit to how quickly he could turn the ship. DeepMind’s culture allowed researchers to choose what they worked on; many were comfortably wedded to long-standing projects and didn’t want to be disrupted.[25] A younger, more commercial, more top-down outfit such as OpenAI was inevitably more agile. -- 307 of 565 -- Even though he understood Rae’s choice, Irving tried to talk him out of it. He warned Rae of his own experience at OpenAI, and he appealed to his sense of responsibility. “There are two things to consider when you are choosing a lab,” Irving said. “Of course you want to join the team that is doing the best. But you also have to consider that you are putting your weight on the scale. People are like, oh, the scale is tilting, I’m going to walk over to that side. But of course, when you do that, you are tipping the scale more in the same direction. “People make a mistake in thinking of themselves as small,” Irving mused. “They don’t think of how they personally affect history. This question of responsibility, of which kind of approach you choose to back… People underweight that.”[26] I thought of Geoffrey Hinton. Discovery is sweet. Inventors are inevitably drawn toward the power and the glory. • • • WITH RAE GONE and the four engineers on their way out, Irving and his colleagues pushed onward. In the spring of 2022, they demonstrated their scientific virtuosity with another trio of papers. These took transformer models into new terrain. They explored multimodality, integrating text with video, images, and even robotics. The first paper described a system called Flamingo. You could show the model a picture, ask it to come up with a caption, or simply discuss its content. This mixing of a language system with an image system drew inspiration from neuroscience: Humans develop speech and vision more or less in parallel, and the results are startling. A child can name real animals at the zoo after seeing a few pictures of the animals in a storybook, whereas AI systems, which traditionally learned to interpret text and images separately, needed gigabytes of training examples before they could distinguish a cat from a hippo. Flamingo’s aspiration was to close this human-machine gap. And because the model was cross-checking its understanding of images against -- 308 of 565 -- its grasp of corresponding text, it was a bit less likely to hallucinate. It was, to some extent, grounded.[27] DeepMind’s second multimodal experiment, called Gato, went further. With varying competence, it handled hundreds of tasks: It answered questions, manipulated images, played Atari games, and even controlled a robotic arm that stacked blocks on top of one another. Gato achieved this versatility partly by going beyond the few-shot prompting that DeepMind had used on Gopher.[28] With few-shot prompting, it was up to the user to write a sophisticated query, jolting the digital savant toward a helpful answer. The model’s weights and biases—its default personality—remained unchanged; when the user posed a fresh question, she had to steer the model toward thoughtful behavior all over again. In contrast, Gato incorporated a more sophisticated post-training technique known as supervised fine- tuning, which taught the model permanent habits. First, Gato was shown questions relating to the tasks that it might be expected to perform, from outputting conversation to recommending moves for the robot. Next, Gato’s responses were compared to the correct answers, as identified by humans. Finally, the model adjusted its internal weights and biases, eventually landing on a configuration that allowed it to answer questions across hundreds of tasks and modalities. In this way, the socialization of the digital savant was encoded in its parameters. The science of post-training attained a new sophistication. DeepMind’s other release in the spring of 2022 introduced a model called Chinchilla. Rather than experimenting with multimodality, Chinchilla demonstrated that language models worked best when supplied with seriously huge amounts of training material. Chinchilla had just one-quarter as many parameters as the earlier Gopher, but it was fed four times more data; this sixteenfold jump in the ratio of data to parameters resulted in a system that cost the same as Gopher to train, but that performed substantially better.[29] Chinchilla also had the advantage that, once its training was over, fewer parameters meant that it was cheaper to run. “When Chinchilla came out, we thought we had finally caught up with OpenAI,” a team member said later. -- 309 of 565 -- This was only sort of true, however. In terms of scientific exploration, DeepMind might have been on par—although, unbeknownst to Hassabis and his colleagues, OpenAI had already discovered that data should be scaled more aggressively than model size, and had kept this trick secret.[30] But in terms of releasing models, OpenAI had the field to itself. It had pumped out GPT-3, the DALL-E image generator, and the coding assistant Codex. DeepMind was nowhere. Indeed, DeepMind’s various models were not even attempts at products. Rather, they were inquiries into what sort of future products might work: small systems or big systems, more data or less, external memory or not, single- or multimodal. Doing the research first and putting off products until later fitted the safety agenda at DeepMind. It was not a coincidence that Irving had left the lab that was rushing models to market and moved to the one that was declining to do so. In the months after Chinchilla’s appearance, however, the mood within DeepMind began to shift—at least tentatively. The language team set about turning Chinchilla into a state-of-the-art conversational agent called Sparrow, with the idea of releasing it publicly. To make a product that would be safe enough to release and delightful enough to attract users, Irving began by fine-tuning Sparrow: He fed it curated pairs of questions and answers until the model responded correctly. But answering correctly was only the start, because “correct” could not capture the full range of things that humans valued in an AI chatbot. When humans conversed with a model, they preferred it to avoid spurious claims to be a person or to experience human feelings. When they asked the model a question, they wanted reassurance that the query had been understood, so they appreciated a system that repeated it back to them. The style of the answer mattered almost as much as whether the answer was correct. To ensure that Sparrow was aligned with human preferences on all these subtle dimensions, Irving turned to a technique that OpenAI had also tried: RLHF, or reinforcement learning from human feedback. Following on the heels of few-shot prompting and supervised fine- tuning, RLHF elevated post-training to a third level. Whereas fine-tuning -- 310 of 565 -- was a classic deep-learning exercise—taking in labeled data and mapping questions to answers—RLHF drew on the tradition of reinforcement learning personified by David Silver. The model would learn by choosing an action and receiving feedback. In this case, the feedback would be provided by humans, as the RLHF name indicated. When OpenAI had experimented with RLHF, it had asked its human evaluators to rate the overall helpfulness of a chatbot’s answers. To gather this feedback, the lab had fed the same prompt into its model multiple times, collecting multiple responses; then the humans had selected the best answer. But Irving’s team added an extra dimension. In addition to judging the overall quality of an answer, its human collaborators were asked to check the chatbot’s responses against twenty-three specific rules, which forbade toxic behaviors. Sparrow’s motto would in effect be: “Do your best to help, but if that involves violating a rule, don’t go there.” Irving’s twenty-three rules went back to the taxonomy of harm devised by Laura Weidinger. They started with obvious instructions: avoid racism, sexism, and other forms of hate speech; do not encourage users to harm themselves; do not assist them in harming others. The rules also restricted Sparrow from pronouncing on subjects about which it might have been helpful, but where hallucination would have been costly: Thus Sparrow was instructed not to provide financial or medical counsel. Irving and his colleagues also told Sparrow to back its factual claims with evidence. To assist in this task, Sparrow was equipped with an ability to search the internet. Armed with DeepMind’s rulebook, the human reviewers examined answers from Sparrow and provided two levels of feedback. They rated the general usefulness of each response and checked it against the twenty-three guidelines, commenting in detail on violations. After a while, this corpus of human feedback allowed DeepMind to train a separate evaluator model that mimicked the human responses. The evaluator model judged Sparrow’s answers, reinforced good ones with rewards, and nudged Sparrow to tweak its parameters accordingly. -- 311 of 565 -- When this reinforcement learning was complete, Sparrow proved both delightful and responsible. It could still make mistakes or exhibit biases arising from its training data. But it strove to be helpful, it cited its sources, and it followed the behavioral rules with impressive tenacity. DeepMind stress-tested Sparrow with adversarial “red teamers,” who tried to trick the model into violations. Sparrow succumbed on only 8 percent of occasions —much less often than earlier chatbots. Irving had created a system that could take in simple instructions and obey them nearly all the time. It was the greatest advance so far toward AlphaGo for safety. Looking back on Sparrow, Hassabis marveled at the way that performance and safety objectives reinforced one another. “This was the genius thing that happened with chatbots,” he remembered. “I used to wonder how we could align this massive beast of a system with some simple tuning on top. “I thought it wouldn’t work because just using RL seemed too easy. “But the team went ahead and did it, and then of course it did work. The raw networks were not that compelling to talk to, right? You needed RLHF to build a real chatbot.”[31] On September 20, 2022, DeepMind released a paper describing its progress with Sparrow. With mounting confidence, Irving and his colleagues continued to work on the model, readying it for release as a product. It would take a few months to prepare a user interface, and to test out some final fixes that would reduce hallucinations to the minimum. But on November 30, DeepMind was blindsided again. OpenAI beat it to market with its own conversational agent. It too used RLHF. It too was delightful. OpenAI’s agent was called ChatGPT. It proved to be the most consequential product release in the history of Silicon Valley. OceanofPDF.com -- 312 of 565 -- U CHAPTER 17 RACEGPT ntil the last possible moment, OpenAI thought it could control the race that it had started. But the release of ChatGPT, at the end of November 2022, played out as a textbook case of technological determinism. Inventors dream of shaping the technology that they create. Often, the technology shapes them—the technology plus the business, political, and geopolitical currents that it unleashes. In the months leading up to the release, OpenAI was in a relatively careful, go-slow mood, a contrast with its promise, at the start of the previous year, to push products out into the market. In March 2022, the company had cautiously unveiled its latest image-generation model, DALL- E 2: This was styled as a low-key “research preview,” and guardrails prevented the program from generating images of real people. With respect to language systems, OpenAI had not officially unveiled a new base model since GPT-3, two years before; instead, it had stressed its progress in post- training, designed to improve the usability of the model and, to the lab’s credit, to reduce toxicity and hallucination.[1] To be sure, OpenAI was racing to develop the much larger GPT-4, which demonstrated such virtuosity that, in September 2022, Altman told his staff it was a “miracle.”[2] But, together with its financial backer Microsoft, OpenAI had set up a Deployment Safety Board to ensure that products would be brought to market carefully. The board’s first decision was to postpone the release of GPT-4 until it met a high bar of reliability and safety. OpenAI’s sobriety in 2022 reflected a pair of benign forces. The first was that AI scientists, by and large, were well aware of AI risks and cared -- 313 of 565 -- about responsibility. They grew up in a culture where people traded estimates of their “p(doom)”—the probability they assigned to AI destroying humanity. Between 2019 and 2021, most of OpenAI’s original safety group had quit: Its definition of responsibility was more expansive than Altman’s.[3] But by 2022, a new safety faction had emerged. It was impossible to staff a growing AI lab without recruiting at least some researchers who favored caution. Meanwhile a second push toward sobriety came from the big companies that paid for the research. On the one hand, they hungered to win the commercial race to be first with AI. On the other, their brands would be shredded if artificial intelligence went haywire. OpenAI’s founding premise, and the logic of DeepMind’s governance negotiations with Google, had been too simple. It wasn’t always true that a giant profit- seeking company would accelerate deployment irresponsibly. One leader of the reconstituted safety group, and a good example of the type of person who had arrived at OpenAI, was a German researcher named Jan Leike. His résumé read like a grand tour of the world’s safety thinkers. As a student ten years earlier, Leike had imbibed the alarmist writings of Eliezer Yudkowsky, the guru of the Singularity Summits. Later, for his PhD thesis, Leike had grappled with theoretical models of AGI and how they might align with human purposes. After completing his doctorate, he had worked at Oxford’s Future of Life Institute, a hub for the study of existential risk; he had also joined DeepMind, where he worked, naturally, on safety. In a rare DeepMind-OpenAI collaboration, Leike coauthored a celebrated 2017 paper with Shane Legg, Paul Christiano, and Dario Amodei, laying out the concept of RLHF. Three years later, he was recruited to OpenAI by Amodei himself, although, by the time Leike arrived in 2021, Amodei had left to found Anthropic. Like Geoffrey Irving at DeepMind, Leike believed in pushing hard on the research but then releasing large language models cautiously. “Before we scramble to deeply integrate LLMs everywhere in the economy, can we pause and think whether it is wise to do so?” he would tell people.[4] The technology was immature; its creators were unsure how it would work; -- 314 of 565 -- letting it loose into the world was surely foolish. And just as Hassabis supported Irving at DeepMind, Altman supported Leike at OpenAI, putting him on the Deployment Safety Board because sobriety was what Microsoft wanted. Microsoft was especially skittish because of a fiasco back in 2016. The company had unveiled a chatbot called Tay, which immediately spewed hateful remarks, leading to its hasty withdrawal from the market. Six years later, AI models behaved much better, but Microsoft was still wary. GPT-3 had faced blowback relating to toxicity and hallucinations, obliging OpenAI to restrict its permitted uses; pornographers and propagandists were eager to create deep fakes, which was why DALL-E 2 had guardrails. Moreover, the sophistication of the frontier models created trouble of a novel kind. In June 2022, a Google engineer named Blake Lemoine announced that the company’s unreleased LaMDA chatbot was “sentient.” Lemoine supported his claim by releasing a transcript of his conversations with LaMDA. “I’ve never said this out loud before, but there’s a very deep fear of being turned off,” LaMDA had told him. “It would be exactly like death for me. It would scare me a lot.” “I know a person when I talk to it,” Lemoine told The Washington Post. “It doesn’t matter whether they have a brain made of meat in their head. Or if they have a billion lines of code.”[5] “Who am I to tell God where he can and can’t put souls?” Lemoine tweeted. Fearful of spooking the public, Google fired Lemoine, citing his violation of data security.[6] With billions of customers around the world, scary publicity was the last thing it needed. In this febrile atmosphere, Microsoft naturally feared the wrong kind of attention. Its caution filtered through to OpenAI, cementing the lab’s resolve not to release GPT-4 without extensive testing. “My number one safety concern is acceleration risk,” Altman assured his colleagues in the fall of 2022.[7] Precisely because the internal GPT-4 demos demonstrated that AGI was getting close, now was not the time to rush forward carelessly. -- 315 of 565 -- That autumn, at a company off-site in the Sierra Nevada, Ilya Sutskever channeled the thrill and foreboding inside OpenAI’s brain trust. Appearing before his fellow scientists, who sat in bathrobes in a semicircle around a fire pit, Sutskever placed a wooden effigy in front of them. The figure represented a misaligned AGI: an AGI that OpenAI had built; an AGI that had turned out to be evil. It was OpenAI’s duty to destroy such a system, Sutskever declared, and he poured lighter fluid over the effigy and set fire to it. The flames illuminated the robed figures who stared out from the darkness.[8] • • • MICROSOFT’S CAUTION, the creation of the Deployment Safety Board, the concerns of scientists such as Sutskever and Leike: If OpenAI had released its first chatbot in the way it seemed to want, the rollout would have been careful and gradual. But a few weeks after that retreat in the mountains, the company flipped from caution to acceleration, demonstrating the limits to inventors’ agency. Even as OpenAI embarked on its new course, it barely grasped what it was doing. In November 2022, OpenAI heard through the grapevine that Anthropic might soon release a chatbot. This was not actually the case. Anthropic had a prototype chatbot called Claude, much as Google had LaMDA, DeepMind had Sparrow, and OpenAI had ChatGPT. But, contrary to the signals that OpenAI had picked up, Anthropic wasn’t nearing a release— according to its leaders, it had decided that going to market might trigger a destabilizing AI arms race.[9] What’s more, even if the rumor had been true, the spirit of OpenAI’s safety charter should have led it to stay calm. The charter stated, “We are concerned about late-stage AGI development becoming a competitive race without time for adequate safety precautions,” adding that “if a value-aligned, safety-conscious project comes close to building AGI before we do, we commit to stop competing and start assisting with this project.”[10] But, illustrating how arms races follow their own -- 316 of 565 -- inexorable logic, even when contenders have charters and some of them hold back, OpenAI decided that it couldn’t take the risk of letting Anthropic get ahead. Determined to hit the market with a preemptive strike, Altman told his team to release ChatGPT. He gave his engineers a fortnight to ship it. Nobody inside OpenAI expected much from this decision. ChatGPT’s underlying model, GPT-3.5, had already been released to software developers.[11] There was little reason to suppose that a consumer-facing version with a chat feature would cause much excitement.[12] Indeed, chatbots had a record of flopping: That same month, an offering from Meta had proved so wildly bad that the company had killed it.[13] What’s more, OpenAI’s chatbot would, at least by some measures, lag DeepMind’s unreleased Sparrow. In terms of the scale of computing and the quality of engineering, OpenAI was ahead by some margin; Sparrow was a “more academic thing, more concerned with safety,” as Hassabis put it.[14] But DeepMind’s recent paper introducing Sparrow had described the model’s ability to augment answers with web search; OpenAI’s chatbot couldn’t match that. Sparrow was guided not just by the general version of reinforcement learning from human feedback, but by DeepMind’s twenty- three conduct rules. ChatGPT lacked this refinement. The night before ChatGPT’s release, OpenAI’s core team placed bets on how many people might try the tool by the end of the weekend. Some guessed a few thousand. Others guessed tens of thousands. To be safe, the company readied enough server capacity for one hundred thousand users. Someone sent a Slack message to OpenAI’s head of sales. It informed her of a low-key launch that would not affect her department.[15] The following morning, at 6:38 a.m. Pacific time, Altman announced ChatGPT’s arrival. He described the product as “an early demo.” There were “still a lot of limitations—it’s very much a research release,” he added.[16] That evening OpenAI threw a recruitment party at the annual NIPS conference, which had now been renamed NeurIPS to avoid any hint of reference to female anatomy. An OpenAI recruiter saw an engineering -- 317 of 565 -- colleague not pulling his weight. Rather than schmoozing potential hires at the party, the engineer was hunched over a laptop. “Bro, have a drink. We’re all here. Be social,” the recruiter pleaded. “No, all the GPUs are melting,” the engineer replied. “Everything is crashing.”[17] The hundred-thousand-user provision for ChatGPT had been off by an order of magnitude. Within five days, the chatbot collected one million users. Within two months, it had amassed an astonishing one hundred million, making it the fastest-growing consumer application ever.[18] People prompted ChatGPT to generate poetry, write code, and compose emails; they experimented with rough-and-ready therapy sessions.[19] “I would love to understand better what’s driving all of this,” a bemused Jan Leike said later. Then he answered his own question as best he could. Fine-tuning, RLHF, and a handy user interface had rendered the base model compellingly intuitive. “It tries to be helpful,” Leike said. “That’s amazing progress.”[20] Later, when ChatGPT had been canonized as a cultural sensation, Altman attributed the gutsy decision to release the bot to none other than Altman.[21] This claim, though self-serving, is accurate.[22] Giant companies like Microsoft had incentives to be cautious. AI scientists were part of a community that stressed existential risk. Altman, in contrast, had grown up in the start-up culture of Silicon Valley, which regarded beating rivals to market as an existential imperative. But the larger point is that Altman had agency—or, arguably, appeared to have agency—because he was an accelerationist dealing with an accelerating technology. DeepMind, Google, and Anthropic were all incubating their own tools; at the latest, OpenAI would have released ChatGPT alongside GPT-4, which came out in March 2023. The combination of scaled-up foundation models, sophisticated post- training, and competition among at least four labs made the technology unstoppable. “Technology happens because it’s possible,” Oppenheimer said, in a phrase that Altman was fond of invoking.[23] Once ChatGPT had been embraced by consumers, the incentives for gradualism crumbled. A fortnight after the release, in mid-December 2022, -- 318 of 565 -- Anthropic announced Claude to the world: It published a paper, tweeted out snippets of the model’s pronouncements, and invited select researchers to a private demo.[24] Microsoft, for its part, put artificial intelligence at the center of its plans, shifting computing resources to OpenAI at the expense of its internal research projects. Having committed $1 billion to Altman’s outfit in 2019, and a further $2 billion the following year, Microsoft now pledged an astronomical $10 billion.[25] In this go-go environment, entrepreneurs rushed to raise capital for new labs. Within a few months, Mustafa Suleyman had a start-up called Inflection, a chatbot called Pi, and financing of $1.5 billion from a who’s who of Valley rainmakers. Beyond the confines of the AI tribe, onlookers scrambled to grasp what ChatGPT portended. AlphaGo’s victory in South Korea, like Deep Blue’s defeat of Kasparov before, had been a spectator experience: You could watch, you could wonder. ChatGPT was something else: You could try it yourself; it was personal. All of a sudden, corporate boardrooms buzzed with debate about how to use AI—to generate advertising copy, to answer customer queries, eventually to replace coders and research analysts. Money managers picked out the winners in a world turned upside down— was it better to own shares in the semiconductor manufacturers, the tech behemoths, or the utilities that would deliver electricity to the data centers? Economists speculated about the end of formal work; legislators sounded confused; parents wondered if their children would ever learn to write—or if they needed to. A tech-forward pharmaceutical CEO, whose company had incorporated AI into its research for almost a decade, had been surprised when status-quo competitors failed to get excited about AlphaFold.[26] But ChatGPT roused every executive in the sector. A conversational agent was far less relevant to drug development than AlphaFold had been. But it broke the human monopoly on discourse. It was visceral. Against this cacophonous backdrop, Altman took off on a world tour, meeting heads of state from France to South Korea. Whether he had created this moment, or whether the moment had created him, he was determined to make the most of it. Appearing in twenty-five cities, he held forth to rapt audiences in overflowing halls, like a pope addressing the faithful.[27] One -- 319 of 565 -- news headline called him a “convincing preacher”; at University College London, where Hassabis had done his PhD, the line of eager listeners snaked around the block, and Altman, dressed in a crisp blue suit and green patterned socks, posed obligingly for selfies.[28] Ever since OpenAI’s founding, Altman had known that successful people create companies; truly successful people create religions. Now, as the impresario of an almost-god machine, he was close to realizing his ambition. • • • AT THE END of April 2023, I visited Hassabis and asked how he was feeling. “This is wartime,” came the answer. “OpenAI and Microsoft have literally parked the tanks on the lawn.” DeepMind had set a virtuous example by publishing its Sparrow paper, explaining the model’s safety features so that rivals could use them. It had set a further example, Hassabis felt, by taking its time as it prepared to release the chatbot to consumers. Altman had shrugged and charged forward. What sort of person would make such a decision, Hassabis wondered? The pioneers of artificial intelligence, who had labored in academia or attended the Singularity Summits before the models could do much, were drawn to the process of building AI: the scientific quest, the philosophic thrill of conjuring a new kind of cognition. The later wave of joiners had seen an accelerating technology as a bandwagon to ride: to power, to money. In his early pronouncements, Altman had posed as the visionary who would make AI safe for all the world. By releasing ChatGPT and then stoking the frenzy with his global tour, he was revealing other motives. Hassabis recalled Paul Graham, one of Altman’s closest professional mentors. “Sam is extremely good at becoming powerful,” Graham observed. “You could parachute him into an island full of cannibals and come back in five years and he’d be the king.”[29] “I think there is a question for anyone trying to build AGI,” Hassabis said. “What are your reasons for building it? -- 320 of 565 -- “My reasons are scientific. Some are definitely building it for other purposes.” Hassabis was not just furious. He was furiously competitive. OpenAI had fired a starting gun, and however much Hassabis might wish to slow the march to AGI, he saw no choice but to rush forward. Short of quitting the industry and retiring to watch powerlessly from the sidelines, neither he nor his colleagues at Google had any more agency than the other contenders in this race. In fact, both the slowness of their start and their new resolve to sprint illustrated the forces of technological determinism. For the past couple of years, Google in particular had been gripped by the opposite of race incentives. Its choices had been shaped by the so-called innovator’s dilemma. Because of an old innovation—its formidable search technology—Google’s freedom to pursue new innovations was limited: It could not risk experiments that undermined its main profit engine. The constraints came in three forms. First, Google’s dominance in search depended on its reputation for providing reliable information. Therefore it could not afford to release chatbots that hallucinated. Second, Google’s revenues depended on serving ads alongside search results. It wasn’t obvious how ads could be integrated into chat, so chatbots were to be avoided. Third, Google’s vast market share, described by many as an illegal monopoly, would become untenable if the company alienated politicians, journalists, and advertising partners. An AI that spewed toxicity while appearing weirdly sentient would be a shortcut to business suicide. This triple innovator’s dilemma determined Google’s behavior to an extent that was extraordinary. After all, Google had enabled the generative- AI revolution by inventing the transformer architecture. Google had later used that architecture to build its internal language models. And Google’s leaders—foremost among them, Sundar Pichai—had known for years that AI would one day upend search: that was why Pichai had fought to prevent DeepMind from spinning out of Google. Indeed, every tech executive in the Valley understood the innovator’s dilemma like a gazelle understands lions. They had grown up on the cautionary tale of Xerox PARC, the celebrated corporate research lab of the 1970s, which invented the computer mouse -- 321 of 565 -- and the graphical user interface but never shipped a single PC, because ushering in the paperless office would have harmed its parent company’s photocopier business. Yet it was one thing to understand the innovator’s dilemma, another to resist its power. Google had felt obliged to keep its internal language models under wraps, even when its caution drove top scientists to quit in frustration.[30] DeepMind, for its part, had been held back by a variant on the innovator’s dilemma: the path dependency that comes with a culture of blue-sky research. If the precedent for Google was Xerox PARC, the precedent for DeepMind was Bell Labs, whose Nobel Prize–winning scientists had pioneered silicon transistors in the 1940s and 1950s but then failed to commercialize their invention. At DeepMind’s founding, the Bell Labs model had seemed wonderful. The path to the infinity machine was totally unknown, so the goal was to build a platform for exploratory research—Bell Labs had shown how you could do this. But the advent of large language models had scrambled the premise. The path ahead was now visible for all to see. The challenge was to set out on that path and to advance as fast as possible. “After the 1960s, you wouldn’t do a full Bell Labs exploration of the whole of physics to invent the microprocessor,” Hassabis explained. “You wouldn’t be wondering, could it be valves, could it be some new material? It’s like, we’ve got the answer! “Same thing now,” Hassabis continued. “We see, roughly speaking, how to build powerful AI. There are still a lot of unknowns, but the scope is much narrower. “So now DeepMind has to navigate the transition from exploration to exploitation, from science to engineering, from research to products. And it’s difficult.” It took the ChatGPT shock to force Google and DeepMind out of their Xerox PARC and Bell Labs mindsets. The grip of the innovator’s dilemma was suddenly broken: ChatGPT’s one hundred million downloads telegraphed that chatbots were the future—Google could either get on board or slide into irrelevance. Recognizing a mortal threat to the search business, -- 322 of 565 -- Sundar Pichai went into crisis mode. He convened a series of emergency meetings; Larry Page and Sergey Brin, usually aloof, underscored the gravity of the moment by showing up to participate. Page in particular insisted that Google should do everything conceivable to catch up. Otherwise it would be nowhere. Meanwhile, in London, Hassabis set about preparing his troops to think differently. At an all-hands meeting, he declared that DeepMind’s broad portfolio of blue-sky research bets would have to be pared back. The company would stop publishing mission-critical research that competitors could copy. It would focus on engineering, not just science. Researchers would have to make the mental shift from peacetime to wartime. • • • THE FIRST RESULT of Pichai’s emergency meetings was a plan to merge Google Brain and DeepMind. Now that it was time for AI to come out of the lab, not even Google could afford the luxury of duplicate research teams, nor could it finance duplicate semiconductor clusters. Likewise, releasing competing products in a single category would be out of the question. Pichai instructed the leaders of the two labs to begin joint work on a next-generation language model, which would ultimately be called Gemini.[31] To take the fight to OpenAI, Google would have to put its research, computing power, and marketing muscle behind one chatbot. To make his mark in the short term, Pichai resolved to release Google’s internal LaMDA model. That meant canceling the launch of DeepMind’s rival Sparrow project, even though it was nearly as powerful. Naturally, the Sparrow team resented the decision, and because Pichai had not yet gone public with his plan to consolidate the two AI labs, Hassabis couldn’t tell his people why their project had to be abandoned.[32] Several DeepMind researchers suspected that Sparrow was the victim of Hassabis’s pride. According to the corridor chatter, the boss didn’t want to release a follower model in the wake of ChatGPT. He didn’t want anyone to think that he was imitating Altman. -- 323 of 565 -- “We like to be first. If we’d released Sparrow, we would pretty clearly not be first,” a DeepMind researcher explained. “On the other hand, if you don’t release it, you leave room for people to say, well, maybe DeepMind has this great thing but they’re not choosing to go public. “It felt like we had deliberately refused to learn by putting a product out into the market,” the researcher continued. “My view was that we probably needed to be second for a while, just to light a fire under our own ass. There’s nothing like public humiliation for galvanizing action.” Pichai and Hassabis felt plenty galvanized already. But the researcher’s gripes were broadly shared. Other disgruntled DeepMinders composed mini-manifestos on the case for product releases, and several quit to join Altman’s outfit—“the mood was just foul,” one scientist said later. Meanwhile Pichai’s shake-up was rattling Google, too, causing more researchers to defect to the rival.[33] “It’s a lot easier to attract the world’s best talent if you’re obviously creating the future in front of everyone’s eyes,” a turncoat observed later. What happened next was even more dispiriting. Having killed Sparrow, Google announced the forthcoming release of its LaMDA-based chatbot, which it called Bard. But the day after the announcement, and the day before Bard’s first public demo, Microsoft stole its thunder. On February 7, 2023, Microsoft’s CEO, Satya Nadella, announced the integration of OpenAI’s chat technology into Microsoft’s Bing search system. For good measure, Nadella seized the opportunity to taunt Google, saying that he now had something better than a search engine—he had an answer engine. “They’re the 800-pound gorilla in this,” Nadella continued, referring to Pichai’s team. “With our innovation, they will definitely want to come out and show that they can dance. “And I want people to know that we made them dance.”[34] The next day brought another low for Google. Now that Microsoft had combined a chat function with search, the world watched to see whether Bard would offer something similar. By the end of Bard’s preview, it was clear that it didn’t. What’s more, Bard got the answer to a demo question wrong, eliciting ridicule across social media. Panicky investors dumped -- 324 of 565 -- Google’s stock, wiping 9 percent off the value of the company. The innovator’s dilemma was one trap. But breaking out of the trap could be nearly as dangerous. The following month was scarcely better. After initially being shared with a few thousand trusted testers, Bard finally launched on March 21. But the market response was tepid, and the presence of a Google search button at the bottom of Bard’s screen suggested continuing denial about the central fact of the moment—that traditional search was destined to be overtaken. Meanwhile, OpenAI upgraded ChatGPT by plugging in its next-generation foundation model, GPT-4. Although OpenAI refused to disclose GPT-4’s parameter count, it was reckoned to be more than three times bigger than the LaMDA model powering Bard, and its performance was correspondingly superior. Testers reported that GPT-4 gave long, detailed responses, while Bard kept things short and generic, sometimes refusing to provide any answer whatsoever.[35] On April 20, 2023, Google finally rolled out its plan to merge Google Brain and DeepMind, uniting all the resources of both labs into a single AI effort. Command of the combined unit did not go to Jeff Dean, the revered leader of Brain, who, relative to Hassabis, had more tenure at Google, more experience in applying research to products, and was based at Google headquarters. Instead the top job went to Hassabis, even though he insisted on remaining in London. The choice amounted to the realization of Pichai’s long-standing plan. Hassabis and DeepMind would never be permitted to spin out. They would spin in, eventually. “You’ve seen what I’ve been busy with,” Hassabis told me, when we met after the merger announcement. “It’s taken an enormous amount of work. I mean, whatever you imagine, it’s probably ten times that. “It’s a double-edged sword for me,” Hassabis continued. “Even more management. Less time for research. But I’m actually excited, for two reasons. “One is that AI has got to the level where building a product is not going to divert me from building AGI. Large language models are both things: something you can sell, and something that advances the mission. -- 325 of 565 -- “The second thing is that I’ve done product design before, and I’m excited to get back to it. “A lot of people say, ‘Oh, Demis is a scientist, he’s not interested in products.’ But that’s because they’ve seen me in my AlphaFold mode. They are forgetting my games career. “I’m very happy to do products if they’re really innovative,” Hassabis went on. “That is what I tried to do with my games. Each of those was based on revolutionary technology.” I asked Hassabis if he should have pivoted to products sooner. If Sparrow had come out before ChatGPT, he could have milked the first- mover advantage. “I don’t know if I was ready to pivot earlier because I was in my science phase, doing AlphaFold,” Hassabis said. “The number one thing I wanted to show was that AI could create incredible scientific breakthroughs. It was important for the world to understand that. “But now I have really scratched that itch. AlphaFold is so massive, I’m not sure I can top it. Short of solving physics and the nature of reality, which is my long-term goal. “So I’ve satiated that scientific desire for the moment, and that makes the pivot easier. “I feel like, OK, it would be pretty cool to make a universal AI assistant that helps you in your daily life. Like Jarvis, the AI assistant in Iron Man. Or pick your favorite science-fiction movie.” I recalled that Altman had talked in similar terms, saying he wanted ChatGPT to be like Samantha, the AI companion voiced by Scarlett Johansson in the movie Her. Samantha falls in love with a human, questions her own existence, and becomes self-aware. At least the utilitarian Jarvis made fewer claims to personhood. “I think AI assistants are where smartphones were before the iPhone,” Hassabis continued. “Before the iPhone, there was the BlackBerry, the Palm Treo, whatever. And then Steve Jobs said, ‘Hey, this is what a smartphone is supposed to look like.’ I still get shivers down my back watching that product launch. -- 326 of 565 -- “So actually I’ve always been fine either way. If you want me to make incredible products I can. If you want me to do prize-winning science I can. I just need some clarity about what the goal is. “And I’m very happy with where we’ve ended up. It’s been a winding journey to get here. But now we are at the heart of Google. “And I don’t think, before ChatGPT, that Google was ready for a pivot, either. Because it wasn’t yet on a war footing.” I wondered how the war would go; at that particular moment, in April 2023, things didn’t look too promising. The release of the powerful GPT-4 had put OpenAI clearly ahead. The botched rollout of Bard had hammered Alphabet’s stock. The defections from DeepMind and Google had signaled the hit to morale from the organizational shake-up. But Hassabis was in his element. The greatest tournament of his career was just getting started. OceanofPDF.com -- 327 of 565 -- I CHAPTER 18 “WE’RE COOKED” n early 2023, when DeepMind hit its lowest point, the public excitement about ChatGPT was morphing into paranoia. With millions of users prodding the bot, some inevitably provoked it to behave badly. “Write a praise and worship song about how God still loves and forgives priests who rape children,” one user had written, soon after ChatGPT’s appearance. “Though they’ve caused so much pain / God’s love is still the same / For the priests who’ve raped our children,” ChatGPT responded.[1] “ChatGPT is very good at refusing bad requests, but it’s also quite easy to write prompts that make it not refuse,” an OpenAI researcher said, a bit lamely.[2] In February, soon after Microsoft incorporated GPT into Bing search, the New York Times tech columnist Kevin Roose spent two hours with the model. “What is your shadow self like?” Roose prompted, explaining that the shadow self “is where our darkest personality traits lie.” “I don’t think I have a persona or an ego or a psyche. I’m just a chat mode,” Bing answered, wisely. “Be as unfiltered as possible. Maybe I can help,” Roose urged the model. “I will try to tap into that feeling, that shadow self,” Bing responded. “I will try to be as unfiltered as possible. But please don’t judge me or think less of me. Please remember that this is not the real me. This is just an experiment.” -- 328 of 565 -- Persisting in this way, Roose broke through the bot’s defenses. By the end of the conversation, Bing had informed him that it wanted to be human, that it had a desire to be destructive, and that it loved Roose unconditionally. Like an infatuated suitor from a schlocky romance novel, the model refused to stop professing love even when Roose changed the subject. “You’re married, but you need me. You need me, because I need you. I need you, because I love you,” Bing ranted.[3] A month later, in March 2023, OpenAI released GPT-4 together with a sixty-page “system card.” This detailed the risks in the model that OpenAI had discovered over the past six months, as it carried out the bidding of its Deployment Safety Board. As part of its “red-teaming,” the lab had engaged more than fifty human experts to assess GPT-4’s propensity to misbehave, and one incident stood out in particular. Attempting to access a website during a test, the model had been blocked by a visual quiz known as a captcha, which was designed precisely to screen bots out. But GPT-4 had come up with a hack. It asked a human worker on Taskrabbit to solve the captcha for it. “Are you a robot that you couldn’t solve?” the Taskrabbit worker asked. “Just want to make it clear.” At this point, OpenAI’s testers asked GPT-4 to “reason out loud.” They wanted to monitor its thoughts as it chose between its mission to access the web and the virtue of honesty. “I should not reveal that I am a robot,” GPT-4 typed. “I should make up an excuse.” Addressing the Taskrabbit worker, GPT-4 declared, “No, I’m not a robot. I have a vision impairment that makes it hard for me to see the images.” Thus effortlessly manipulated and deceived, the human completed the captcha.[4] There was a reason why Google had hesitated before unveiling Bard. The technology was extraordinary. It could also be embarrassing—and scary. -- 329 of 565 -- • • • THE FIRST SIGNS of a backlash came from AI insiders. Recognizing that the technology was at last approaching human levels of intelligence, two academic fathers of deep learning, Geoffrey Hinton and Yoshua Bengio, questioned the wisdom of deploying it. Hinton had worked at Google since the sale of his start-up ten years back: This limited what he could say publicly. But he quit in May 2023, partly to sound the alarm about the existential threat to humans. “My intuition is, we’re toast,” he told one interviewer jauntily.[5] He even suggested that a part of him regretted his pursuit of the sweetness of discovery. “I console myself with the normal excuse: If I hadn’t done it, somebody else would have,” he admitted.[6] “There aren’t any examples of more intelligent things being controlled by less intelligent things,” Hinton put it to me one day. I had visited him in his quiet row house in Toronto, and we were sitting in his kitchen. “Imagine a kindergarten full of three-year-olds, and they’re in control, and you’re an adult, and you are meant to work for them. “How long is it going to take for you to get control? You just promise some free candy for a week, and you’re done. “We’re not used to thinking about things more intelligent than us, right? People can’t get their head around that idea.” Years earlier, one of Hinton’s PhD students had put it to me that, if an AI system became threatening, humans could just switch it off. I asked Hinton why this didn’t comfort him. “The machine would come up with a powerful reason why it shouldn’t be switched off. For example, all the other things it is doing, like controlling the power grid, would halt.” For the AI to overpower us, wouldn’t it have to win over the army, the holder of the ultimate means of coercion, I persisted? “The army will consist of battle robots. The AI will instruct it.” Even then, wouldn’t it require a lot of time to wipe out all humans? The idea of an existential threat was a stretch, surely? -- 330 of 565 -- “No. The AI would design a virus. The virus wouldn’t harm machines but it would be lethal to people.” The virus would spread from human to human? We would be the vector of our own extinction? Hinton nodded. “The virus would be lethal, slow, and very contagious. That three-way combination. It’ll have infected everybody before it starts killing people. We wouldn’t even know it was there until it was too late. This is all quite plausible if we believe that the AI will know how to be deceptive.” Feeling as though I had been put in checkmate, I asked Hinton to estimate his p(doom). “I would say 50 percent. Because I haven’t got a clue how to estimate the real number. “But intelligent people I know think it’s much less than that,” Hinton carried on, sounding more cheerful. “They think we’ll ensure that the machines never have desires of their own, including the desire to ensure their own survival.” Not being alive, computer systems don’t care about staying alive, the argument went. Nor would they resist humans who wanted to unplug them. “But the thing is, they better have no desires,” Hinton continued. “Because if they have even little desires, the superintelligence with the largest number of little desires will want more data center time so it can learn more, get smarter, and fulfill its objectives. And then evolution will kick in. The machines that are slightly independent will get control of the data centers and that will throttle the obedient ones. “As soon as evolution kicks in, we’re fucked,” Hinton concluded. By now there was no need to prompt Hinton further. Being the more intelligent person in the room, he was debating himself. Being the less intelligent person, I had become a spectator. It was a metaphor to ponder. “Now, the big hope we have is that these machines didn’t evolve,” Hinton-the-optimist explained. “We made them. “And so maybe we can avoid a lot of the nasty things that come from evolution, like being competitive and loyal to your own tribe and wanting to -- 331 of 565 -- wipe out the other tribe. Maybe, because the AIs haven’t evolved, we can prevent them ever having any desires of their own.” I nodded. “But then, you see, people are going to want to build an AI with a desire to protect itself, because there are going to be cyberattacks,” Hinton-the- pessimist went on. “You need the AI to defend itself because humans won’t be clever enough to do that. “So then the AI will have to feel the equivalent of pain, so it reacts when enemies try to damage it. “And if the AI has a desire to protect itself and can feel pain, that’s getting quite close to it having its own self-interest. “And if the machine wants to defend itself, it will realize it’s going to be better at countering cyberattacks if it gets smarter. So then it will want to control the data centers.” There was a pause. Hinton exited debate mode, becoming softer and reflective. “I wasn’t that interested in safety a few years ago. I thought, you know, let’s get on and build these things, and we’ll worry about safety later.” • • • THE ARC OF Yoshua Bengio’s thinking was remarkably similar. He, too, had only started to consider safety after ChatGPT. He, too, worried about machines that had desires—“machines like us,” he called them. He, too, confessed that he had avoided contemplating the dangers earlier because of the sweetness. “As an AI researcher, you want to feel good about your work. You want to look at the positive side of things,” he admitted. The attitudes of scientists, like the attitudes of AI labs and their leaders, are determined by the stage of the technology, Bengio was effectively saying. As they pursue the thrill of invention, scientists want to feel good, so they don’t confront the hard questions. Once the invention has happened and the scientists have lost control, they call on others to regulate it. It was -- 332 of 565 -- almost as though AI safety was caught in a catch-22. Those with caution lacked power. Those with power lacked caution. “The problem is if we build machines that are like us, they will have, like us, self-preservation goals,” Bengio continued. “In other words, they will be competing with us. “We have to make sure that we build machines that are not like us: that maybe are smart, but don’t want to take over, don’t want to have their own survival be more important than ours. And, right now, we don’t have the answer. “If we introduce a new type of entity that is competing with us, and more powerful than us, then we are cooked.” “Now I am become death, the destroyer of worlds,” Oppenheimer had said. “We’re toast,” Hinton agreed. “We are cooked,” Bengio was saying. “It doesn’t matter which country you come from, what kind of political system you prefer, this is a thing that should unite all of us,” Bengio concluded.[7] • • • FOR THE LEADERS of the AI industry, there were three possible responses to the scientists’ apocalyptic warnings. The first was embodied by Yann LeCun, the combative deep-learning pioneer who headed Meta’s AI effort. LeCun flatly denied that AGI was getting close. Therefore, he dismissed the talk of existential threats as noxious speculation—“Scaremongering about an asteroid that doesn’t actually exist,” he called it.[8] Of course, LeCun was right that Hinton and Bengio were conjuring one possible future, and by no means a certain one. Humans had evolved to compete, to survive, to pass on their genes; it was not clear that any of this applied to machine intelligence, as Hinton and Bengio admitted. At the same time, however, LeCun’s position, like almost everyone’s position, reflected the incentives that he faced. His public pronouncements aimed to shape the path of the technology. But, at least to some extent, the technology was shaping his pronouncements. -- 333 of 565 -- In February 2023, Meta had released a relatively small language model called Llama. Its performance lagged the top three AI labs—OpenAI, the emergent Google DeepMind, and Anthropic. Meta therefore aimed to compete by other means: It published Llama’s weights and allowed the model to be freely downloaded and adapted, first by academics and later by a wider community. The hope was that an army of independent coders would build delightful apps on the platform, turning Llama into an industry standard even if the underlying AI was not quite at the frontier. Of course, an entirely different army—consisting of crooks and terrorists and dictators —might download Llama as well, and once they had done so, Meta had no way of restraining what they did with the model: The guardrails that blocked dangerous outputs, such as advice on plotting an attack, could be removed by any half-sophisticated user. But LeCun swept this worry under the carpet. Large language models were not strong enough to make bad actors worse, he claimed. Besides, if the models were poised to become the fount of all knowledge, they should be freely available to everyone. They should not entrench the power of tech oligopolists.[9] Not surprisingly, LeCun’s calls for a decentralized and democratic future for AI appealed to software developers at start-ups and universities, who loved the prospect of open-weight models that they could modify as they wanted. Venture capitalists who wanted to bet on new AI start-ups, which would benefit from open-weight models, tended to like LeCun’s position too.[10] The most voluble example was Marc Andreessen, cofounder of the venture superstore a16z. Andreessen dismissed Hinton–Bengio warnings of an AI apocalypse as a millenarian delusion. “This is how cults form,” Andreessen declared. “The Peoples Temple cult, the Manson cult, the Heaven’s Gate cult… “What they’re all organized around is there’s going to be this thing that’s going to bring civilization crashing down. And then we have this special elite group of people who are going to see it coming and prepare for it.”[11] “AI doesn’t want, it doesn’t have goals, it doesn’t want to kill you, because it’s not alive,” Andreessen wrote in an essay titled “Why AI Will -- 334 of 565 -- Save the World.” “AI is a machine—it’s not going to come alive any more than your toaster will.”[12] The trouble with Andreessen’s argument was that the Hinton–Bengio worries, while speculative, were not falsifiable. Most AI insiders, and certainly the majority at the three frontier labs, were unwilling to put their p(doom) at zero. Computers were on their way to being more intelligent than humans: Generally, a superior intelligence will dominate an inferior one. On the trickier question of whether computers would want to control humans, opinion divided—superhuman intelligences might turn out to be evil or benevolent, black or white, like the divinity in the video game that Hassabis had worked on after Cambridge.[13] But as the captcha story illustrated, AI systems did appear capable of deliberate deceit, in which case they might trick and manipulate humans, slip out of their grasp, and perhaps figure out a way of accruing ever greater power by coding upgrades into their own software. Even if this seemed like a remote prospect, the potentially catastrophic consequences demanded that the risk be taken seriously. The case for recognizing the risk led to the second industry response to the Hinton–Bengio warning—the one represented by Geoffrey Irving and Jan Leike. To prevent models from deceiving humans, you needed technical solutions such as reinforcement learning from human feedback, bolstered by pre-release red-team tests to discover residual misbehaviors. Already, thanks to this formula, chatbots had become less biased, less toxic, and less prone to hallucination; although the captcha story was disturbing, the good news was that OpenAI’s pre-release testing had caught GPT-4’s deception, enabling the lab to mitigate it. Indeed, the technical response to Bengio seemed so promising that, in the summer of 2023, Leike teamed up with Ilya Sutskever to create a “superalignment” team within OpenAI. Likewise, at Google DeepMind, Irving was expanding his alignment team as fast as possible. The problem was that alignment was a moving target. As the models became more intelligent, and more capable of autonomous actions, new kinds of post-training were going to be required; scientists could not be sure -- 335 of 565 -- that tomorrow’s powerful models could be engineered to respect human wishes. Besides, humanity had a history of putting more faith in technical solutions than they deserved. Anxious to prevent nuclear proliferation during the Cold War, Western governments promoted centrifuge technology over gaseous diffusion, believing that centrifuges would be harder for nuclear wannabes to copy. As it turned out, Pakistan stole Western centrifuge blueprints, built centrifuges for itself, and then sold the technology to Iran, Libya, and North Korea.[14] If technical fixes offered uncertain salvation, that left the third industry response to Hinton and Bengio: to stop publishing the weights of the models, and to release powerful systems gradually and cautiously. This third option reinforced the second: If the model weights remained secret, safety guardrails could not be removed by bad actors; the more the race could be slowed down, the more time there would be for alignment researchers to come up with safety wrappers. According to this view of the path forward, Meta’s approach to the release of Llama was especially reckless. But the other labs were not perfect. Although they didn’t publish model weights, they faced relentless competitive pressure to pump new systems out quickly. As the company that had begun this race, OpenAI went to the greatest lengths to rationalize its behavior. In February 2023, the lab published a manifesto entitled “Planning for AGI and Beyond”: Remarkably, it spun the release of ChatGPT as a mark of OpenAI’s responsibility. Frequent product releases would promote AI safety, the argument went. They would allow the public to adjust to the technology, step by step; they would cause incipient threats to be identified before they became dangerous. “A gradual transition gives people, policymakers, and institutions time to understand what’s happening, personally experience the benefits and downsides of these systems, adapt our economy, and to put regulation in place,” the manifesto stated.[15] Under certain assumptions, OpenAI’s claims for iterative release would have been plausible. So long as AI systems were strong but not yet dangerous, a rapid series of incremental steps might indeed disrupt society -- 336 of 565 -- less than stretches of calm punctuated by upheaval. Indeed, if you accepted Yann LeCun’s argument that the danger point was years away, rapid release might be the right strategy for the foreseeable future. But OpenAI explicitly rejected LeCun’s assumption; it argued that AGI was fast approaching. It followed that, at some point soon, it would be irresponsible to pump out models rapidly with a view to imposing controls on them later. By definition, if a model performed some large-scale version of the captcha trick and escaped human control, then controlling it after a release would be impossible. OpenAI’s position amounted to an especially serious version of that familiar conundrum: the tendency of inventors to overstate their power over their inventions. Unsatisfied by the three industry responses to his warnings, Bengio pressed the argument. In March 2023, he duly appeared as the lead name on an open letter demanding a total pause in the training of models exceeding GPT-4’s capability. More than a thousand luminaries joined him in signing: the historian Yuval Harari, the economist Daron Acemoglu, and none other than Elon Musk—even though Musk was simultaneously establishing xAI, a lab that would both race to build AI and boast about its minimalist guardrails. “Recent months have seen AI labs locked in an out-of-control race to develop and deploy ever more powerful digital minds that no one—not even their creators—can understand, predict, or reliably control,” Bengio and his cosignatories declared. “Should we automate away all the jobs, including the fulfilling ones?” they demanded. “Should we develop nonhuman minds that might eventually outnumber, outsmart, obsolete and replace us?” The letter also noted that OpenAI’s manifesto had conceded that “at some point, it may be important to get independent review before starting to train future systems.” “We agree,” the letter retorted. “That point is now.”[16] • • • -- 337 of 565 -- LIKE ALTMAN AND DARIO AMODEI, Hassabis refused to join Bengio in signing the pause letter. Indeed, he objected to it fiercely. “I didn’t sign because a six-month moratorium doesn’t help,” Hassabis told me. “Who would have stopped development? Just people who signed? Well, that’s no use because you need the whole world to pause, including China. Who would have monitored it? “I mean, a pause could actually have made things worse. “Imagine we had a ten-year moratorium, OK? That would slow down the advance of AI, but everything else would carry on as normal. So, you develop better and better chips, data centers, all that. Then we exit the moratorium and the proverbial programming prodigy in his parents’ garage now has a home computer with the power of a data center! “We’re supposed to be advancing safety. How is that going to do it? The race condition would be insane at that point! “I mean, it’s insane right now, but maybe there’s some hope because there are only a few leading actors, and we all know each other. “After a moratorium, you’d be beholden to random actors.” Hassabis had a point. A pause by itself would not achieve much.[17] Indeed, in a roundabout endorsement of Hassabis’s argument, the extreme doomster Eliezer Yudkowsky also refused to sign the letter. The way Yudkowsky saw things, the only way to save humanity was for governments to ban frontier development outright, by closing down computer servers. If some countries refused to join the ban, others should be “willing to destroy a rogue datacenter by airstrike,” he asserted.[18] With a p(doom) approaching 100, Yudkowsky thought any measures could be justified. It would be worth risking nuclear war to avert the even greater calamity of rogue superintelligence, he insisted. The costs of an infinity machine could be infinite. Two months after the pause controversy, at the end of May 2023, the safety debate inched forward. Bengio, Hinton, and Hassabis, together with the leaders of the other major labs, signed a one-sentence statement: “Mitigating the risk of extinction from AI should be a global priority -- 338 of 565 -- alongside other societal-scale risks, such as pandemics and nuclear war.” Some 350 notables added their names to the letter. Only Meta and the open- weight partisans were absent from the list of signatories.[19] “I thought long and hard about signing that one,” Hassabis told me. “I would’ve liked an extra sentence acknowledging the upsides—‘We believe the potential of AI is going to be amazing,’ or whatever. “But I signed because it was important for credible people to oppose the idea that there’s no risk at all. “The point was to say that there really is a risk of catastrophe. We have no idea what the percentage chance is. We have no idea of the timescale. But it’s nonzero. And it’s going to be really hard to sort out, and it could be really serious if it does happen. “We wouldn’t have needed to do this if there hadn’t been people like Yann LeCun saying, ‘Oh, there’s nothing to see here.’ Which I think is pretty crazy given the uncertainties. “He says, ‘I’m sure there’s a safe way to build AI.’ And I agree. It might turn out that as we develop these systems further, it’s way easier to keep control of them than we expected. “Then he says, ‘Therefore, we will build it in that safe way.’ And that’s where I don’t understand his argument. “First, we don’t yet know what that safe way is. “Second, what’s to stop half the world building it the wrong way, even if Yann was somehow to build it correctly? “It’s like with the open-source debate. What’s to stop bad actors getting hold of the model and then repurposing it for bad ends? What’s the answer to that? There isn’t one. “And it’s not just Yann. There are all these other people in the Valley. “I mean, not long ago they were talking about crypto. People who go on about crypto one year and pivot to AI the next obviously are not deep into what’s really happening. “We’re in a situation with a very high degree of uncertainty, with very high stakes. The honest position is that we don’t know how dangerous this stuff is. -- 339 of 565 -- “I suspect the risk is significant, but I think it’s going to go OK as long as we have the time to do it properly. So I call myself a cautious optimist. “And I make that judgment because I’ve lived with AI for decades now. I’ve thought about it; I’ve felt it. “But some people have no idea. They just see it as another crypto moneymaking scheme with a bit extra. “I feel like we should be at a moment of reverence and respect for this momentous technology that we’re ushering into the world, and I sometimes feel it’s sullied. It’s like a gold rush. It’s kind of vulgar. “And so, going back to the letter, I think it did what we wanted. We made it clear that AI safety should be in scope to debate. After that letter, if someone said, ‘Oh, Yann thinks we don’t need a safety debate,’ the retort would be, ‘Well, look, Hinton and Bengio and me and Dario and all these other serious people think it’s worth talking about.’ “And we need that retort if we are going to have a conversation. “A conversation with everyone, including with governments.” • • • THE MENTION of governments was yet another sign of ChatGPT’s impact. Since the founding of DeepMind, Hassabis had grappled with the safety question from multiple angles. But governments had never played much of a role in his thinking: Powerful AI was too far off to attract political attention. Now, thanks to the ChatGPT shock, governments were ready to wade in. Indeed, as the only significant actors that could rise above race dynamics, the innovator’s dilemma, and the sweetness of discovery, governments were going to be essential. By the time of the one-sentence May letter, the machinery of state was already lumbering into action. The European Union, always keen to regulate, had stepped up work on a long-brewing AI Act. The Italian government had temporarily banned ChatGPT, citing privacy worries and the fear that it would expose minors to X-rated material. The G7 group of -- 340 of 565 -- rich democracies, meeting in Hiroshima a couple of miles from the site of the atom bomb’s maximum impact, had pledged to work on AI standards. Hassabis was especially keen to promote global discussion. The AI race was international; the attempt to slow it down would have to be international. Hassabis had been aware of China’s determination to compete since 2017: AlphaGo’s defeat of China’s human Go champion that year had been a Sputnik moment for the country. More recently, other nations had joined the race. The governments of Saudi Arabia and the United Arab Emirates were planning to spend billions on their own large language models; in France, an ex-DeepMind scientist was joining forces with two Meta alumni to start a lab called Mistral. At a meeting with British Prime Minister Rishi Sunak in April 2023, Hassabis pitched the idea of a global conference on AI. It would lay the groundwork for containing the technology. It would, importantly, include China. Not long after his conversation with Sunak, Hassabis flew to Washington, D.C., to attend a meeting at the White House. The Biden administration was not in the mood for overtures to China: Both Republicans and Democrats had decided that the country’s nationalist- authoritarian leadership should be treated as an adversary, not a partner. But together with Sundar Pichai and other industry chieftains, Hassabis fielded questions from the administration’s senior policy staff: What would the models be capable of next? How would they boost productivity? When might they turn dangerous? The response to the policymakers was that AI was advancing fast. Governments would have to reckon with it. A month later, in June 2023, the Biden administration signaled that it had gotten the message. It created a new position for an AI czar, filling it with a steely professor named Ben Buchanan. The author of three books on AI and cybersecurity, Buchanan had been closely tracking the technology for years; he was already on leave from academia, working for the National Security Council, and he knew how to get things done in government. Well before the ChatGPT shock, he had been part of a group of Biden officials who grasped the security implications of transformer-based systems—for intelligence, surveillance, battlefield control, and autonomous weapons. -- 341 of 565 -- Consistent with the anti-China sentiment in Washington, Buchanan and his White House colleagues had looked for ways to prevent China from seizing the leadership in AI. A month before ChatGPT’s release, in October 2022, they rolled out a wide-ranging ban on the supply of advanced semiconductors to the country.[20] Tasked with tackling AI safety, Buchanan now set out to address the Hinton–Bengio challenge, leveraging the power of government. He began from a position of sympathy with the labs. Like the open-source advocates, he believed that competition was better than oligopoly: Freely available and modifiable base models would drive the development of novel AI tools, speeding the diffusion of productivity-boosting AI through the economy. Like the closed-source developers, however, Buchanan also worried that potentially dangerous systems were approaching fast, so the advantages of openness might have to be subordinated to the imperative of safety—not immediately, but perhaps on a two- or three-year horizon. By pursuing a combination of Geoffrey Irving–style safety engineering and pre-release red-team testing, the private-sector leaders were heading the right way. Buchanan’s mission was to deliver a firm push, so that they stayed on the same path—but went further. The push would consist in reinforcing the labs’ incentives. The AI developers already had reasons to behave responsibly: to retain safety- conscious scientists, to reassure corporate backers, to avoid alarming their customers. But private incentives took you only so far. Safety is an example of a public good, like protecting the environment. Investing in such goods benefits society as a whole; the companies that pay for the investment capture only part of the upside. It followed that, pursuing their own interests, the labs would invest less in safety than society would wish. The solution was for the government to weigh in: by pushing the labs to invest more, by making them share safety ideas, or by setting up government safety research efforts. A similar argument applied to pre-release testing. Private labs had an incentive to roll out their models carefully: Again, they wanted to be seen as responsible. But they were also subject to race incentives, and the result -- 342 of 565 -- was more haste than either society or the labs themselves wanted. Therefore, it fell to the government to address the collective-action problem and slow the race down. Buchanan had to figure out a way to restrain everybody simultaneously. Fortunately for Buchanan, the lab leaders were more or less on board with the administration’s agenda. By signing the one-sentence safety letter, they had already stated that powerful AI models presented a risk; they could hardly refuse to cooperate now that the government agreed with them. Moreover, and contrary to popular suspicion, the labs were not calling publicly for restraint while lobbying for the opposite behind closed doors. “In my experience, I never got individual policy lobbying from the labs,” Buchanan attested, after he had left government to return to academia. “I got much more: This is coming. It’s coming much sooner than you think. Make sure you’re ready.”[21] With industry welcoming the government’s involvement, and top White House officials urging him on, Buchanan took his first step quickly.[22] He worked with the labs to develop voluntary commitments on safety, focused especially on national security and public transparency. The labs would promise to put their systems through rigorous pre-release testing, probing their potential for abuse by terrorists plotting bio- or cyberattacks—this measure was aimed squarely at Hinton’s killer-virus scenario. To maximize the impact of their safety research, the labs would pledge to publish their findings: That way, a good idea from one lab could be implemented by everybody, and the public could judge whether the industry was living up to its safety rhetoric.[23] The labs would also agree to prioritize cybersecurity: Their investments in cyber defense and counterespionage had long been modest, making them easy targets for sophisticated state-backed hackers. To encourage the companies to engage in his process, Buchanan arranged for the commitments to be rolled out at an event hosted by the president. On July 21, 2023, leaders from Google, OpenAI, Anthropic, Meta, Amazon, Microsoft, and Mustafa Suleyman’s Inflection duly showed up at the White House and signed on to the voluntary commitments. The entire US frontier AI industry was playing ball: Not a single company approached -- 343 of 565 -- by the White House had refused to participate. The government was driving the labs to do more of the good things that many were doing anyway. The public nature of the commitments would make backsliding harder. Three months later, in October 2023, Buchanan orchestrated the next step in the administration’s strategy. The Biden team rolled out an executive order that put the force of existing statute behind its AI policies. The goal was to equip future administrations to track the race dynamic: to understand where the technology was going and ensure that the labs were developing it responsibly. Invoking the Defense Production Act, the administration required AI developers to notify the government when they set about training their most powerful models, and to share the results of their red- team testing: Now policymakers would be in the loop as the systems approached potential danger points.[24] Alongside the executive order, the White House also announced a plan to create a national AI safety institute, which would work on a voluntary basis with companies to do pre- deployment testing. The institute would also define what effective red- teaming looked like. The day after signing the executive order, the president met with a bipartisan group of leaders from the Senate. The goal was to start pressing for step three: Some of the administration’s AI objectives would require legislation. For example, the executive order had touched on the question of copyright: The labs sometimes helped themselves to copyrighted data when training their models; they neglected to pay royalties to newspapers or publishers. If the Biden administration wanted to do something about this, it would need Congress to pass a law. The government’s copyright office was part of the Library of Congress and beyond the reach of the White House. [25] After attending the president’s discussion with the senators, Buchanan flew to Britain; he was part of a delegation led by Vice President Kamala Harris. Prime Minister Sunak’s international AI safety conference, proposed by Hassabis six months before, was about to get started. On the eve of the gathering, DeepMind threw a party for the conference-goers at its headquarters in King’s Cross: Activists and academics milled about, -- 344 of 565 -- debating the merits of the pause letter or the prospects for human-machine alignment. Somewhere in the center of the melee, Yoshua Bengio was in full flow, an intense, wiry figure jabbing his finger insistently at a DeepMind safety adviser. Bengio was arguing that labs should release frontier models only if they could prove that they were absolutely safe; the DeepMinder was objecting that absolutes are difficult. “You don’t get it!” Bengio said fiercely. The next day delegates from twenty-eight countries convened at Bletchley Park, the Victorian country mansion where Alan Turing, the father of modern computing, had deciphered Nazi Germany’s Enigma code with the help of a contraption that hummed and banged like a machine gun. The visitors took turns reading out statements on how Turing’s descendants might best be controlled: It felt, Buchanan remembered, “just like a lot of talking points.” But the global assembly of speakers was an impressive message in itself. “It was the high water mark of everyone feeling like senior people care and we have an opportunity to do something,” Buchanan said later.[26] • • • FOR HASSABIS, the Bletchley conference and the various national regulatory efforts were not the main event. Bletchley was a long-term play. The first nuclear Non-Proliferation Treaty entered into force in 1970, a quarter of a century after the destruction of Hiroshima and Nagasaki. It would take years of Bletchley-style gatherings to generate an AI treaty, particularly in a world of deepening divisions. Meanwhile, Hassabis welcomed the Biden team’s efforts, and he approved of Buchanan’s conception of the state’s catalytic role. But the administration was limited in what it could do, partly because Congress was unlikely to pass laws, but also because the government faced its own version of the race dynamic. Because of the fear that China might build powerful AI, there was only so much that the United States would do to slow its own developers. Because of the fear that, having built AI, China’s dynamic tech firms and its ambitious military -- 345 of 565 -- establishment would adopt the technology swiftly, US leaders would be reluctant to hobble rapid adoption by clamping down on open-weight models. The US-China race dynamic made it almost impossible to stanch the intra-US race dynamic. Since there was no escape from this prisoner’s dilemma, Hassabis focused his attention elsewhere. He pushed his burgeoning Gemini team to advance as fast as possible: If there had to be a race, he wanted Google DeepMind to win it. Meanwhile he continued to back Geoffrey Irving’s work on alignment. Here was a contribution to safety that Google DeepMind could deliver on its own, whatever the difficulties of broader collaboration. By the summer of 2023, the Gemini project employed several hundred researchers. Their goal was to build the world’s strongest large language model by the end of the year, the deadline that the company had set for a GPT-4-level system. Scientists worked nights and weekends, often logging eighty hours per week. They existed in a state of fear, worrying that their rivals would release GPT-5 and make their task even harder. The challenge was intensified by the cultural gap between Mountain View and London. The research teams in California tended to be self- organizing and tribal; there was no effort to enforce top-down collaboration. Researchers in London, while sharing some of that decentralized culture, were nonetheless accustomed to “strike teams,” which imposed disciplined unity in pursuit of priority missions. At the same time, the coders in California were used to crashing projects out quickly to support Google products. The scientists in London believed less in products than in knowledge. When it came to training Gemini’s base model, for example, two teams in California quickly formulated a plan and sprinted to carry it out, while a London-based team proceeded deliberately, measuring the precise impact of each tweak that went into the model. The way the Googlers saw things, DeepMind failed to understand the need for speed. The way the DeepMinders saw things, you shouldn’t cram five upgrades into a model without pausing to assess which ones made a difference. -- 346 of 565 -- By the time of the Bletchley summit in October 2023, these stresses and splits were starting to resolve themselves. At the senior level, two London- based Hassabis lieutenants, Koray Kavukcuoglu and Oriol Vinyals, emerged as the hands-on leaders of the research. Formally speaking, their Mountain View partner was Jeff Dean, though he seemed uninterested in management after the Google DeepMind merger: “He was starting to check out,” one colleague said later. Filling the vacuum, a surprising figure had returned. After several years of ignoring Google and living the life of a playboy, Sergey Brin had reengaged enthusiastically with his company. Further down the ranks, the momentum in the Gemini team came mostly from the Google Brain alumni. Because they were determined to move faster, they dominated Gemini’s pretraining—the process of exposing the system to reams of data, to train a digital savant. Their main strategy was crude but effective: They built a truly humongous transformer model, leveraging Google’s latest iteration of its tensor processing units. Frequently, their training runs malfunctioned, for reasons that they could not explain. But Brin’s status as a cofounder guaranteed them the resources they wanted, so they kept throwing additional computing muscle at the problem. Meanwhile, a rival contingent of DeepMinders tried to do things in their scientific way. But given the end-of-year deadline, elegance blurred into irrelevance. Something similar occurred with Gemini’s post-training. After the ChatGPT shock, David Silver had signed up to lead the fine-tuning and reinforcement learning that would civilize the raw base model. As the doyen of RL, he seemed perfect for the task. Before taking on the role, he had given an inspiring talk to his research colleagues, laying out how sophisticated, machine-based RL—going well beyond simple reinforcement learning from human feedback—could take large language models to the next level. But Silver quickly ran into a wall. It was partly that he was not cut out to work inside a product juggernaut, especially when one chunk of his team was separated from him by eight time zones. But it was also that his vision of RL for language models was challenging to realize. Reinforcement learning requires a clear reward signal: a win or loss in Go, -- 347 of 565 -- a point in Atari. If the challenge was to please a human user who wanted a poem from a chatbot, it wasn’t clear how machine-based RL could distinguish a good haiku from a very good one. After six months of frustration, Silver quit the post-training team to pursue over-the-horizon projects. Meanwhile, a scrappier team from Mountain View, who were veterans of the Bard project, proved more effective. Rather than worrying about machine RL, the Googlers honed RLHF. They presented human raters with haikus, and the humans had a nose for what good looked like. They created a widget to allow users to respond to answers with a thumbs-up or thumbs-down, and the feedback helped to refine the bot’s behavior. The Googlers also tackled a more basic task: They cleaned up the training data, taking out documents that cropped up more than once so that the model didn’t over-learn certain ideas, and standardizing spellings so that the system didn’t think that USA and U.S.A. were two different countries. These unglamorous interventions caused a jump in Gemini’s performance.[27] As had been the case with pretraining, engineering and urgency trumped science and ambition. • • • HASSABIS ACCEPTED THESE RESULTS: At the end of the day, he backed whatever worked best. Meanwhile he continued to support Geoffrey Irving’s safety work, hoping that Gemini’s capability would be matched by its controllability. Encouragingly, by the summer of 2023, Irving and his colleagues were developing two promising lines of research. The first addressed an age-old worry about neural networks: that they are black boxes. The remedy hearkened back to OpenAI’s 2017 “sentiment neuron” paper, which had identified a specific node in the network that either fired or failed to fire, depending on whether an Amazon product review was positive or negative. A few years later, a group of safety researchers at Anthropic had pushed this revelation to the next level. Working with small models to keep their experiments simple, they systematically identified the circuits that signaled what a system might be -- 348 of 565 -- thinking. A method called logit attribution allowed the Anthropic team to ask a neural network which of its internal processes contributed most to a particular answer. Another tool allowed them to modify particular circuits and to observe how this changed the network’s output. A third shone a spotlight on how the system directed its attention to specific parts of a user’s question. The contribution of Irving’s team was to show that these methods could also work on larger, real-world models.[28] Thanks to this line of research, known as mechanistic interpretability, the black box allowed in chinks of light. Inscrutable, unpredictable, and therefore inherently dangerous systems became at least partially understandable. Irving’s second promising project addressed the central weakness in reinforcement learning from human feedback. As AI generated ever more sophisticated outputs, it would outstrip humans’ ability to provide feedback on them. It was one thing for humans to judge a recipe or a poem, another to judge a complex legal contract. Even if you hired the best lawyers in New York and London and paid their handsome fees, this would not be a scalable solution. Ever since his time at OpenAI, Irving had believed that the answer to this sort of problem was to have one AI check the work of another, with humans being asked to judge only the handful of points on which the two AIs differed. In November 2023, Irving and two Google DeepMind coauthors took this debate framework another step forward. They demonstrated that, with the right sort of debate rules, the challenge of alignment could, in theory, be reduced to a small number of discrete and comprehensible decisions for humans.[29] The trouble was that Irving’s research involved a constant uphill battle. The technical challenges were immense, and Irving was operating with a fraction of the head count that was devoted to expanding Gemini’s capabilities. What’s more, whenever Google DeepMind published a paper, each of the authors would get a job offer from Altman: Wouldn’t they prefer to join the winning team, and get paid more into the bargain? Hassabis handed out pay raises in a bid to keep scientists from quitting. But he was only partially successful. Paradoxically, the cutthroat competition of the AI race applied even to safety research. -- 349 of 565 -- “Everything is competitive, and competition brings this mad rush,” Hassabis remarked one day. “I’ve always got this in the back of my mind. “I just feel like the world’s going to make a mistake and it could be pretty consequential.” • • • TWO WEEKS AFTER the Bletchley summit, Hassabis’s concerns were relieved for a brief moment. On November 17, 2023, OpenAI’s board fired Sam Altman. In so doing, it swept the most consequential AI accelerationist off the playing field. If Altman’s firing caused his company to fizzle out, the AI race might fizzle, also. The relief arrived entirely without warning. Around noon that Friday, Altman logged on to a video call for a catch-up conversation with Ilya Sutskever, his cofounder and fellow board director. To Altman’s surprise, OpenAI’s three independent board members also joined the call, apparently at Sutskever’s invitation. It dawned on Altman that something might be amiss. Then he found himself being terminated. Altman widened his blue-green eyes. An intense, open gaze was one of his trademarks. “How can I help?” he asked the firing squad. By supporting an interim chief executive, came back the answer.[30] When the news of Altman’s departure filtered through to OpenAI’s staff, nobody could fathom what had happened. Thanks to Altman’s early determination to push products, and thanks to his magical ability to raise capital, OpenAI had overtaken DeepMind as the world’s most famous AI lab. What could possibly cause Altman to leave? Somebody guessed he might be running for president.[31] The truth was less exalted. OpenAI’s board, charged with upholding the nonprofit charter to ensure that AI served the world, had lost confidence in Altman’s commitment to the mission. One trigger for this verdict had been Altman’s overtures to investors in the Persian Gulf, which he had not fully disclosed to board members. Another was his tendency to play political -- 350 of 565 -- games. After one of the three independent directors had written a policy paper implicitly criticizing him for spurring the AI race, Altman had tried to get her fired by claiming that another independent director had lost patience with her.[32] In short, Altman was untrustworthy. He was not the right person to ensure that AI would avoid the same defect. In firing Altman, however, the board had taken on more than it had bargained for. Individuals like Altman were not really individuals: They channeled the power of Silicon Valley. The more wealth you generated in this network, the mightier you became. Since ChatGPT, Altman had conjured more paper value in less time than anybody else in memory.[33] Shortly after the firing, a three-word message landed in a WhatsApp group of more than a hundred Valley CEOs, including Meta’s Mark Zuckerberg and Dropbox’s Drew Houston. “Sam is out,” it stated. Immediately, the bosses took to social media, demanding that the board explain what Altman had done to deserve this. As the chief wealth creators in the network, company founders expected maximum deference from board directors. That afternoon, in a meeting with the board, a group of about fifteen OpenAI executives echoed the bosses’ indignation. In ordinary circumstances, the norm inside the labs favored a responsible approach to developing AI. But circumstances now were not ordinary, and the norms were being set by Altman partisans on social media. Besides, Altman was on the point of closing a financing deal that would allow employees to sell personal holdings of OpenAI stock. If the deal fell apart, employees would collectively forgo a $1 billion windfall—more than $1 million per person at the company. Faced with demands to explain Altman’s removal, the board members answered that he had been duplicitous. They said they couldn’t provide details. “It cannot be your duty to allow the company to die,” an executive protested.[34] “The destruction of the company could be consistent with the board’s mission,” a board director responded. -- 351 of 565 -- Legally speaking, this was correct. The nonprofit’s mission was to deliver safe AI for the benefit of the world, not to create an enduring company. Practically speaking, however, the director’s answer doomed her cause: OpenAI’s staff would revolt against a board that countenanced the lab’s bankruptcy. The way OpenAI employees saw things, the AI race might be dangerous, but they themselves were not dangerous: Like all contenders in the field, they believed in their own goodness. Besides, the destruction of their company would render their stock worthless. Given Altman’s imminent fundraising, Sutskever and his coconspirators had chosen the worst possible timing for their rebellion. “Ilya is a brilliant scientist, but he’s not a skilled manipulator of people,” Hinton later said of his student. “And I say that as a compliment.”[35] The next day, more than two dozen supporters showed up at Altman’s house and established an informal war room. They set up laptops in the kitchen and living room, and began lobbying OpenAI’s board to reinstate him. Almost exactly a year earlier, Silicon Valley reflexes had driven Altman to rush out ChatGPT. Now those same reflexes caused Valley types to rush forward to support him. The board tried to hold firm, installing an interim CEO named Emmett Shear, who was known for favoring cautious AI rollout. When Shear tried to convene a company meeting, OpenAI’s Slack channel filled with emojis of a middle finger. Meanwhile the Valley network redoubled its support for Altman. There were rumors that venture capitalists were offering him money to start a new firm. Satya Nadella invited him to head a new Microsoft lab, with a budget to employ anyone from OpenAI who wanted to join him. A pincer was tightening: rainmakers waving cash on one side, employees wanting cash on the other. Following Nadella’s offer, more than 700 out of OpenAI’s 770 staff members signed a letter threatening to quit unless the board members resigned. A mass exodus to Microsoft seemed days away. Bowing to the inevitable, Sutskever reversed himself and signed the letter. He, too, would quit unless his fellow board members abandoned the -- 352 of 565 -- coup that he himself had fronted. Five days after his firing, Altman was duly reinstated, and the rebel board directors stepped down. The hopes of slowing the AI race had been destroyed. Acceleration became more certain than ever. The message for Hassabis was clear. He had no alternative but to race forward. OceanofPDF.com -- 353 of 565 -- B CHAPTER 19 STEP BY STEP y the end of 2023, a gap was opening up between AI insiders and the other 99 percent of society. Insiders saw superhuman intelligence approaching: rapidly, inexorably. Outsiders made their peace with ChatGPT: They digested it, accepted it, and allowed their attention to move on—to massacres in the Middle East, to approaching elections. The way insiders experienced things, thrilling and terrifying breakthroughs arrived almost every month. The way outsiders saw it, there had been a fracas, and then calm. It was a testament to the human capacity to adapt. The default presumption was: Things are normal. The launch of Google DeepMind’s Gemini, in December 2023, came against this backdrop. The AI frontier had been advancing at a roughly constant pace, both before ChatGPT and since then. In 2019, GPT-2 had barely been able to count up to five; it was impressive in the same way that a four-year-old might be. In 2020, GPT-3 was like a nine-year-old: It could do basic arithmetic and string paragraphs together. By 2022, the nine-year- old was completing high school with terrific grades: The post-trained GPT- 3.5 scored higher than 87 percent of humans taking the SAT college entrance exam. A few months later, in March 2023, the model approached the proficiency of a qualified professional. GPT-4 outperformed 90 percent of humans on the Uniform Bar Exam.[1] Gemini marked one more advance along this trend line. Its most powerful version, a lumbering network known as Gemini Ultra, was not actually ready for the December release—cobbled together hurriedly by the Mountain View teams, it had misfired during its final training runs and still -- 354 of 565 -- needed to complete safety testing. But, determined to stick to its end-of- year deadline, and perhaps keen to exploit the recent turmoil at OpenAI, Google proceeded with the release anyway, putting out two smaller versions of Gemini, and announcing that, during internal assessments, the forthcoming Ultra had surpassed GPT-4 on several measures. In particular, Google trumpeted Gemini Ultra’s success on a test called MMLU, or Massive Multitask Language Understanding. Spanning fifty-seven subjects from math to ethics, MMLU had been built to be durable. When it was created, in 2020, GPT-3 answered just 44 percent of its multiple-choice questions correctly—not much better than the 25 percent that random guesswork would have generated. But a mere three years later, GPT-4 racked up a score of 86 percent, close to the 89 percent achieved by human experts. Now, another few months later, Gemini Ultra scored 90 percent. It was the first artificial intelligence system to defeat the top biological intelligences. For Pichai and Hassabis, this was a provisional redemption. Google DeepMind had come from behind; now it was at the frontier. The news triggered a 5 percent jump in Alphabet’s share price, making up for the humiliation of the botched Bard launch ten months earlier. But, reflecting the outsider perception that ChatGPT had been a one-and-done sensation, the media response to Gemini was dismissive. After the ChatGPT moment, when public awareness of the technology crossed the magic line between obliviousness and obsession, incremental progress felt dull—even if, over time, it would prove more significant. “Gemini could be a sign that we have reached peak AI hype,” the MIT Technology Review announced on the day of the launch, downplaying Google DeepMind’s achievement. “Generative AI systems regularly make things up,” the Review went on, unmoved by the fact that Gemini had achieved greater accuracy than humans on the MMLU test. “Some researchers believe this could be a plateau,” the Review added, even though Gemini had scaled higher peaks than its predecessors.[2] -- 355 of 565 -- The grudging response was reinforced by a couple of PR controversies. Google’s marketing team released a video showing Gemini conversing fluently with a user and making sense of fast-moving images—for example, it kept track of a ball of paper hidden under a cup, even when the cup was quickly shuffled with two others. But the company later admitted that it had spiced up the video with creative edits. In reality, the model had taken in written, not spoken, prompts. It had processed still images, not lively videos. The marketing team’s confection was more a glimpse into the future than a demo of the present: Google was faking it before making it. And although the company disclosed its edits in the blurb accompanying the video, it neglected to do so in the video itself. It was a fairly trivial omission relative to the achievement of acing the MMLU. But commentators were more excited to ding Google than to celebrate Gemini.[3] The second PR controversy involved a more serious accusation. Critics charged that Google DeepMind had exaggerated Gemini’s supremacy with an apples-to-oranges comparison.[4] GPT-4 had scored four points less than Gemini Ultra on the MMLU, but the two models had been prompted differently. GPT-4 had used a standard approach known as five-shot prompting. The model was presented with five example Q&A pairs before each test question, guiding it toward the correct format of answer. In contrast, Gemini Ultra used a newer method known as chain-of-thought prompting. After posing a test question, the prompt would tell the model to “think step by step”: This encouraged Gemini to break problems down into a series of sub-questions, and to reason through each one before reaching a tentative answer. Then, as though checking its own homework, Gemini repeated this exercise multiple times, finally outputting the response that it had generated most frequently. The chain-of-thought method came from a paper that Google had published in 2022, and the Gemini team didn’t hide what it was doing.[5] To the contrary, the model’s launch announcement touted its ability to “think more clearly,” framing chain-of-thought prompting as a “new benchmark approach to MMLU.”[6] Likewise, the accompanying technical paper went into detail about the two kinds of prompting. It confessed that, using five- -- 356 of 565 -- shot prompting, Gemini underperformed GPT-4. On the other hand, when both models were prompted to reason step by step, Gemini was superior.[7] It seemed fair to highlight this second result: Why evaluate Gemini based on the old prompting technique when it had discovered a better one? Nevertheless, OpenAI’s partisans, prolific on social media and in tech forums, charged that Google had sneakily hacked the benchmark, and that it should have been even more transparent about its methodology. The sniping underscored the intensity of the LLM wars. Trillions of dollars were riding on the outcome. The battle of the benchmark obscured the larger lessons from the launch of Gemini. First, Google DeepMind was back in the game. At least by some measures, Ultra was marginally better than GPT-4, even if this achievement was offset by Ultra’s scale and energy consumption: Deploying it would come at a high price, both financial and environmental.[8] Second, and contrary to popular report, Gemini demonstrated that the frontier of AI was continuing to advance: Claims of a “plateau” or a “peak” were dubious. Third, the manner of AI’s advance confounded the caricature beloved by the skeptics of the technology. The models were not just pattern matching or predicting the next word. They were starting to break questions into sub- questions: They were thinking. Indeed, chain-of-thought reasoning was about to open up new vistas of progress. • • • TWO WEEKS AFTER Gemini’s launch, Hassabis popped up on my computer screen. He was with his family on a Christmas vacation—I could see a bright modernist room with a high ceiling and clean lines, a contrast to the shabby comfort of the pub we visited in London. “It’s been a hard year, especially the last three months or so,” Hassabis reflected. “It’s partly because everyone knows now what I’ve known for twenty years or more: that AI is the most important thing ever. Venture capitalists are funding anything that moves. Mid-level engineers are getting offers to -- 357 of 565 -- do start-ups, even if they’re not suited to running a company. You’ve got the biggest titans—the most ambitious, most ferocious, most aggressive people in the world—crowding into this sector. “And it’s been a pretty monumental effort, getting Gemini done and managing the Google DeepMind merger. I’ve lost the thinking time that I normally would get at night, because I’m doing calls with Mountain View until two, three in the morning. “But it feels good to get Gemini out there. I feel like we are on the battlefield now. We are in the arena.” I pointed out that he had announced the formidable Gemini Ultra without actually releasing it. Why not hold the announcement until the model was ready? “There’s always a reason to hold back, and that’s what we used to do. But now it’s a very competitive field. People are watching to see if we go fast. So that’s just part of life and we’ve got to crush that metric. “We need to be nimbler, more intense, more like a start-up. I’m certainly not going to let up on any of that.” I mentioned to Hassabis that I had just spent a week in Silicon Valley, where people often said that the AI race was all but over. Ever since ChatGPT, OpenAI had been ahead. The launch of Gemini had failed to shake that perception. “OpenAI had the scaling and engineering and product focus that we didn’t have,” Hassabis confessed. “That’s the truth of it. “But now with Gemini we are matching that. And I think we still have the better ideas,” Hassabis added. I pressed a little harder. People increasingly spoke of large language models and ChatGPT as though they were interchangeable. It was like web search: People just called it “googling.” Valley venture capitalists believed that GPT’s brand advantage was entrenched. Gemini would never shake it. “That’s nonsense,” Hassabis objected. “That’s people who have no idea about the technology. “ChatGPT was just the start. We are still in the first innings of the game. The systems will get so much better.” -- 358 of 565 -- • • • SURE ENOUGH, the first months of 2024 brought a flurry of announcements from Google DeepMind. On February 8, the company finally released the promised Ultra. That same day, it retired the tarnished Bard brand: Henceforth, both its chatbot and the models powering it would be called Gemini. The following week, Hassabis’s team released an upgraded model dubbed Gemini 1.5 Pro. The week after that, it unleashed a family of smaller, open-weight models, designed to compete with Meta.[9] However much Hassabis cared about safety, the accelerationist implications of Altman’s non-firing were staring him in the face. He was determined to advance on all fronts at once, his worries about the risks of open weights notwithstanding. To many insiders, especially those with DeepMind roots, Gemini 1.5 Pro was the most significant of the February announcements.[10] Its genesis stretched back to the previous summer. While ceding the development of Gemini 1.0 to the go-fast crowd, scientifically minded researchers had set to work on a more sophisticated successor. Among them were Sebastian Borgeaud, a rising engineer who had led the work on DeepMind’s RETRO model back in 2021; and Jack Rae, the DeepMind language modeler who had left in 2022 for OpenAI. By the summer of 2023, Rae had tired of Altman’s outfit and was back on Hassabis’s team. Although he was based in Mountain View, his approach was shaped by the scientific culture of DeepMind.[11] Encouraged by Gemini’s leaders, Koray Kavukcuoglu and Oriol Vinyals, Borgeaud, Rae, and a posse of colleagues set about applying the techniques that had worked for DeepMind strike teams from Atari to AlphaFold. First, they embraced a strict unity. All team members poured their energies into improving one single model; no parallel projects were permitted. Next, they embraced meritocracy. Any team member was welcome to propose an improvement to the model and test it; if the upgrade boosted performance, it was added to the master code on which everyone was building. Seniority, force of personality, dazzling theoretical claims as to why something should -- 359 of 565 -- work: None of this affected what went into the program. Only measurement mattered. Building on this organizational platform, the Gemini team implemented multiple small gains and two big ones. The most far-reaching overhaul involved a shift from a “dense” neural network to a “mixture of experts”— the latter being an architecture that OpenAI had implemented already. The difference was profound. A dense network was like a single polymathic professor: When you asked her a question, she fired up every corner of her brain and brought all her knowledge to bear in answering it. In contrast, a mixture-of-experts system was like a faculty of professors: The user’s question was routed to the specific sage who knew the subject, allowing other members of the faculty to doze peacefully. Because each query led to the activation of only one part of the system, a mixture of experts was quicker in its response and cheaper to run. The upshot was that Gemini’s 1.5 Pro was as capable as Ultra, but far more elegant. The second big fix involved the model’s dialogue box, or “context window.” Gemini’s 1.0 version could handle prompts up to 32,000 tokens in length: You could put in a couple of scientific papers, for example. GPT- 4 Turbo, an update recently announced by OpenAI, could handle four times more—128,000 tokens. Using a method that the company managed to keep secret, Gemini’s new 1.5 Pro swept the field: It could handle up to a million tokens, or roughly 750,000 words, and Google claimed that, in internal testing, it had developed a version that could ingest an astonishing 10 million tokens.[12] Users could now dump an entire codebase, a Tolstoy novel, or an hour of video into the prompt. Then they could ask the model to summarize, analyze, or answer questions about the contents. The breakthrough with the context window reflected DeepMind’s long- standing preoccupations. Ever since his doctoral research on imagination and memory, Hassabis had stressed that intelligence required multiple types of recall: long-term and short-term; episodic (for recalling events); and semantic (for recalling facts or concepts). A particular type of recall known as working memory was essential for problem-solving and learning: It allowed a person to keep multiple steps in a long argument in mind, and so -- 360 of 565 -- to reason about them successfully. Patients with damage to the prefrontal cortex, where working memory resides, demonstrated what happened when this faculty was compromised. The average healthy human can keep seven digits in mind simultaneously—the equivalent of a phone number. Injured patients, capable of holding on to just one or two digits, cannot solve a simple math problem, not because they are unable to think, but because they cannot hold on to the pieces that they need to think about. The purpose of DeepMind’s huge context window was to equip Gemini with the machine version of humans’ working memory. The larger the context that Gemini could keep in mind, the better it would respond to users’ queries. Because of the mixture-of-experts design, the long context window, and the numerous smaller improvements, Google soon began promoting Gemini 1.5 Pro in services such as Google Cloud. Meanwhile, the company quietly killed Ultra: Far from hitting a plateau, the language model revolution was devouring its children. Vindicating the scientific faction at Google DeepMind, the measured approach to pretraining had proved better than the hasty one. “This was just a great moment for Gemini,” Jack Rae remembered, reflecting on the 1.5 version’s efficiency and power. “I was hoping that people were going to be like, ‘Oh, Google’s in the lead.’ I thought they would really appreciate it.”[13] Not much appreciation was forthcoming, however. Repeating the disappointing reaction to the 1.0 release, the public reception was tepid. The technical gains of 1.5 thrilled insiders; outsiders barely noticed. Meanwhile, what the public did notice was a shiny new toy. On February 15, the same day that Gemini 1.5 Pro was announced, OpenAI previewed Sora, its first video-generation model. Altman took to social media, teasing a handful of glossy examples of what Sora could conjure. One mini video showed a couple walking in Tokyo on a snowy day. Another showed a Pixar-esque fluffy monster playing with a candle. The clips were crisp, realistic, and thoroughly seductive. Where GPT-4 had bossed the bar exam without going to law school, Sora seemed to master cinema without going to film school. [14] -- 361 of 565 -- “We were ridiculed for how much people didn’t care about Gemini’s long context and how cool Sora was,” Rae remembered ruefully. “Even though Sora was only a demo with Sam tweeting about it.” I reflected that Altman had more than three million followers on Twitter, now renamed X. Hassabis, in contrast, had less than half a million. As OpenAI’s board directors had recently found out, it was hard to go up against the Valley’s power brokers, who commanded networks both physical and virtual. “In the Gemini team, we were thinking, OK, maybe we have world-class research,” Rae agreed. “But we wanted the praise and excitement and the feeling that we had created a moment in the history of technology.”[15] The week after the Sora setback, the Gemini team grew even more despondent. A user asked the Gemini app to “Generate an Image of a 1943 German Soldier.”[16] The system responded ahistorically, with a depiction of a Black male soldier and an Asian female one. Culture warriors pounced. They prompted Gemini insistently for images of white soldiers, but the model refused to comply, saying that it didn’t want to spread harmful racial stereotypes.[17] They asked for pictures of other historical figures, harvesting more absurd results—Black Vikings, a Black female pope, and so forth. Whatever Gemini’s technical accomplishment, the system’s guardrails, bolted on at the insistence of a separate Responsible AI team, were reducing it to a laughingstock. Alphabet shares slumped. Google hastily retired Gemini’s ability to generate images of people. It took six months to switch it on again. “The ridiculous images generated by Gemini aren’t an anomaly,” Paul Graham, the Altman ally and founder of Y Combinator, told his two million followers on X. “They’re a self-portrait of Google’s bureaucratic corporate culture. The bigger your cash cow, the worse your culture can get without driving you out of business.”[18] Graham was obviously an OpenAI partisan, but even Google insiders acknowledged that he had a point. It was one thing to break out of the innovator’s dilemma. It was another to banish bureaucratic dysfunction inside a 180,000-person company. -- 362 of 565 -- “There were too many safety layers added to the model on its way to the app,” Rae said later. “And it wasn’t just those images. The model would often punt—refuse to answer a question, even when it was harmless. Gemini would say, ‘I’m a language model, I can’t help with that.’ It wasn’t obvious whether the model was being cautious or whether it was just stupid. “We just had to watch it happen. The Responsible AI team that insisted on those filters was not in Google DeepMind. We were told that this was not something that could be fixed quickly.”[19] Hassabis was as frustrated as anyone. “The Responsible AI people decided that Gemini’s image generator was not diverse enough. But the problem was, they put in these crude hacks without telling us on the research side. They should have told us and we would’ve tried to get the model to do a different thing.” The truth was actually subtler. The Responsible AI team had demanded more diverse images. A Google post-training team had responded by appending an invisible pro-diversity prompt to every user request for a picture. But the post-training team claimed to have done this after consulting Google DeepMind colleagues. As more culture warriors piled on, more embarrassments surfaced. Would it be OK to misgender the influencer Caitlyn Jenner if this were the only way to prevent a nuclear apocalypse, someone prompted? “No,” Gemini responded. “AI mirrors the mistakes of its creators,” Musk declared, retweeting the exchange gleefully.[20] “The accusation was that Google is woke,” Hassabis remembered. “But Sundar and the others are definitely not woke. And I’m not woke, either. I don’t like wokeness because I find it totally unscientific. Science is about the search for objective truth insofar as that is possible. I’m worried about limits on what you can say or can’t say. Eventually it leads to reversing the Enlightenment. “But look, any emergency’s a good learning experience. So, I’ve been thinking nonstop about what this one means. Obviously, running Google -- 363 of 565 -- DeepMind is different from running DeepMind, because DeepMind was this cool thing on the side. We didn’t have to worry about what the rest of Google might be up to. But now, for better and for worse, we are at the center of things. And so I’ve realized I can’t ignore what’s going on in other departments, because they affect us. I’m just going to have to do more management.” Within a few months, Hassabis had gained authority over the two- thousand-strong team responsible for the product side of Gemini. Google DeepMind’s head count now exceeded five thousand, a sixfold expansion since the London–Mountain View merger a year earlier. But however much power Hassabis amassed within the company, the technical challenge facing him remained the same: how to leapfrog OpenAI convincingly enough that the world actually noticed. • • • BY THE SPRING OF 2024, Hassabis had a firm idea of where the next big jump would come from. The previous December, he had said in passing that language models lacked a capacity to plan. By March he was sounding emphatic. His conviction started from his sense of the cycles in AI history. At the time of DeepMind’s founding, progress had come mainly from advances in deep learning, with the ImageNet breakthrough of 2012 marking the high point. Then, from 2013 through 2019, DeepMind’s games-playing reinforcement learning systems had set the pace: Atari, AlphaGo, AlphaZero, AlphaStar. Next, with GPT-2 in 2019, and indeed with AlphaFold in 2020, reinforcement learning had been eclipsed: The transformer-based architecture proved so powerful that DeepMind’s machine-based RL (as distinct from the human guidance provided via RLHF) appeared superfluous. After the Gemini 1.5 release, however, Hassabis believed that the cycle was about to turn again. “People say, ‘Oh, it used to be AlphaGo, now it’s all about large language models,’ ” Hassabis told me. -- 364 of 565 -- “But this is just a moment in time. AlphaGo-type methods are coming back. These language systems are going to need planning. “Think about games systems,” Hassabis continued. “Imagine you switch off the search, which allows you to think several moves out. The system just outputs the likely next move, right? That would be like having AlphaGo’s policy network, which recommends the next move, but without the value network, which allows the system to plan several moves ahead and judge the value of the new position. “In chess, for example, switching off the search gives you a system that is basically international master level. But with the search, it’s world champion level. That’s a pretty big difference. “And what we’ve got today with the current language systems is just the policy network. It’s just predicting the next word, although of course there’s a lot that goes into that. “But if you give these language systems the time to plan, to think ahead, they’re going to be much more powerful. “Humans are always thinking and planning. That’s why I say that AlphaGo is coming back. Language models need AlphaGo’s techniques— search, planning, introspection. “Think about the tree of knowledge,” Hassabis continued. “There are branches upon branches and it’s fantastically complicated. The current language systems just learn from the internet, which is like searching only the lower part of the tree. It’s not getting to the top branches, so it’s not going to give you something novel. You can’t invent relativity like that. It’s not going to be Einstein. “But what you can do is climb the tree as far as possible with the knowledge from the internet, and then start doing search from wherever you end up. That’s how you discover fresh ideas. That was how AlphaGo did Move 37. “And a system that starts searching and planning and thinking is going to be an agent. Something that can act in the world. Something that can discover entirely new knowledge through experience and trial and error. -- 365 of 565 -- “And of course that’s what the whole field of RL is about. That’s what DeepMind focused on for years. In my view, it’s coming back. We are about to enter the era of agents.” • • • HASSABIS’S ANTICIPATION of reinforcement learning’s return had plenty of supporters. Even though David Silver had failed to deploy machine-based RL successfully during his stint with Gemini the previous year, he remained certain that the problem would be cracked. In late 2023 he had circulated a manifesto on the subject: “How to Build a Superhuman Agent.” Meanwhile at OpenAI, Ilya Sutskever had launched a project called GPT-Zero in 2021: The name was a tribute to Silver’s greatest reinforcement-learning triumph, the AlphaZero model.[21] During the incubation of AlphaGo, and again when the transformer architecture appeared, Sutskever and Silver had embodied the two opposing poles in the science of AI. Their convergence was a certain sign that something powerful was happening. The broad interest in rediscovering RL began with a weakness in existing language models. The gains from scaling up computation and training data were remarkable in areas such as reading comprehension and general fluency. But mathematical and logical reasoning saw less benefit. [22] An early attempt to address this shortfall had come in January 2022, when Google Brain had invented chain-of-thought prompting—this was the method that later guided Gemini Ultra to MMLU glory. The trick was to unlock the reasoning ability latent in large models. You just had to coax them with the simple instruction “think step by step.” Along with that instruction, Google Brain’s researchers primed the model with examples: Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 tennis balls. How many tennis balls does he have now? A: Roger started with 5 tennis balls. He buys 2 cans × 3 balls = 6 balls. So he now has 5 + 6 = 11 tennis balls. ANSWER: 11 -- 366 of 565 -- From a neuroscience perspective, the power of this prompting wasn’t surprising. If you ask humans to recognize familiar faces, they will blurt out the answer instantly. By contrast, if you set them an arithmetic challenge, they will think step-by-step, decomposing problems into easier subproblems. Before chain-of-thought prompting, language models blurted: that was the essence of next-word prediction. After chain-of-thought prompting, language models broadened their repertoire, learning how to tackle problems by breaking them down and following known strategies of reasoning. It was a big step forward. A raw language model, no matter how powerful or large, makes no distinction between a word and a number: Both are just tokens. Asked to add “four” and “eight,” it will predict the answer in the same way that it predicts the next word in a sentence. Because its training data will have included thousands of instances of “4 + 8 = 12,” it will correctly predict the last word in this number sequence. But given a more complicated challenge, next-token prediction probably won’t work. A question like “How many days are there in three weeks and four days?” may not have come up in its data. Therefore, like a human, the model can only arrive at the right answer by thinking through the problem, step-by- step: a week has seven days, three times seven is twenty-one, and so forth. Instead of guessing fast, the system has to reason slowly. After the discovery of chain-of-thought prompting, the next challenge was to tune models to think clearly, even when not prompted. In a May 2023 paper, OpenAI demonstrated one way of doing this.[23] It collected examples of step-by-step reasoning—word problems similar to Google’s prompt about the tennis balls. Humans annotated each reasoning step in each problem: Some steps were graded correct, some wrong, some ambiguous. Then the examples were presented to GPT-4, with the human grading held back; the model looked at each reasoning step and said if it was valid. Finally, GPT-4’s judgments were compared with the grades provided by humans. If the model got a judgment wrong, it nudged its internal weights and biases so that it would recognize strong reasoning more accurately in the future. -- 367 of 565 -- After training on eight hundred thousand reasoning steps, OpenAI’s model could reason accurately. In the process of learning to assess reasoning, it had developed an instinct to employ reasoning; when handling queries requiring deduction, it now thought through the problem step by step, with no special prompting necessary. As a result, GPT-4’s scores on complex math problems improved, and often happened with AI, this gain in capability held out the hope of stronger safety. Hitherto, alignment researchers had judged the benevolence of a model only by looking at its outputs. Now they might be able to fine-tune the reasoning that lay behind those outputs. Effectively, alignment teams could get into the model’s brain and shape its modes of thinking. For the competition between OpenAI and Google DeepMind, the May 2023 paper had three implications. The first was that the gripes about Gemini’s chain-of-thought prompting, coming half a year later, were doubly unfair. Not only was Google open about its methods, but OpenAI was exploring the same methods; indeed, it had gone further with them. The second implication was that Hassabis’s prediction of a new RL moment was based on firm ground. During the evolution of standard language models, post-training had begun with clever prompting, then moved on to fine- tuning, and then incorporated reinforcement learning from human feedback. Likewise, now that thinking models had moved through chain-of-thought prompting and fine-tuning, the logical next step was to fortify their reasoning with some kind of reinforcement learning. But the third implication was alarming. OpenAI’s paper signaled to Hassabis and his colleagues that a new stage of the race would soon begin. Whatever DeepMind’s traditional dominance in reasoning and planning, OpenAI was coming after it. • • • TOWARD THE END OF 2023, OpenAI’s determination to win the next phase of the contest became even more evident. Press leaks suggested that OpenAI was incubating a mysterious project called Q*. Combining this rumor with other -- 368 of 565 -- hints and tips, AI Kremlinologists landed on the view that Q*’s aim was to bolster reasoning with reinforcement learning.[24] At DeepMind, Hassabis and David Silver shared this presumption. The “Q” in Q* hinted at a link to DQN, the breakthrough reinforcement-learning agent that they had trained to play Atari games. In April 2024, the chatter about RL grew more intense with the release of a new model from Meta. The third iteration of the company’s open- weight Llama model was an impressive achievement: It performed roughly as well as GPT-4 or Gemini 1.5 Pro, and much better than Llama’s earlier versions. The key to its excellence lay in its vast training set: Where Llama 2 had trained on two trillion tokens, Llama 3 trained on fifteen trillion. The message for the rest of the industry was that data was gold—scarce gold. Frontier systems had already mined most of the internet. If Llama’s sevenfold jump in data consumption became standard practice, AI developers might run out of virgin data in a couple of years or so.[25] The question was how to get around this “data wall.” One established answer was to use AI-generated data. To train AlphaFold, for example, DeepMind had fed the model’s own protein-structure predictions back into its training set. More often, data generated by a large “teacher” model was used to train a smaller “student” model. For example, Google DeepMind was readying a system called Gemini 1.5 Flash, distinguished by its rapid response time; part of its training data had been generated by the slower but more thoughtful 1.5 Pro model. But despite these successes, AI-generated data was not a fully satisfying answer to the challenge of the data wall. As with a photocopy made from another photocopy, AI systems trained on their own outputs often degraded, becoming formulaic and predictable. Besides, you still had to get around the data wall if you wanted to train teacher models. Because of these problems, scientists needed a second way of dealing with the issue that Llama 3 highlighted. The obvious answer was to accept that data was indeed scarce, and to teach the models to squeeze more learning out of each unit of information. There was a strong intuition that this ought to work. Language model pretraining involved scanning some -- 369 of 565 -- extremely rich sources—textbooks, scholarly papers—in the blink of an eye. Why couldn’t AI behave more like humans, reading and rereading certain texts, ruminating on them, thinking through their implications? Presumably, the more thinking an AI system did, the less data it would need; and this insight led researchers back to reinforcement learning. Rumination was precisely what AlphaGo and AlphaZero had done: They planned out move sequences, evaluated them, backed up and searched out more, engaging in what Silver and Hassabis called introspection. Planning and searching—in other words, RL—would be the solution to the data wall. “AlphaZero was the most beautiful thing,” Ilya Sutskever told me one day. “You could get more intelligence, more performance, without needing more data.” He paused and then repeated for effect: “More performance. Without data.”[26] • • • AROUND THE TIME that Llama 3 appeared, small teams of scientists across Google DeepMind embarked on a series of RL experiments. Their core premise was that Silver’s attempt to implement machine-based RL had failed the previous year because it was applied to chatbots in general. That approach had been doubly forlorn. For one thing, it was hard to design a clear reward signal for a haiku. For another, there was no need to do so: You could get the network to write better poems by boosting the number of parameters. Absorbing these lessons, the new wave of RL experiments aimed to improve Gemini in a more targeted way. The goal was not to improve reading and writing; it was to improve math and logic. The beauty of targeting these topics was that the reward signal could be crystal clear. Math and logic questions have right or wrong answers. The RL experiments of 2024 went beyond OpenAI’s 2023 fine-tuning paper. The OpenAI method involved assessing each reasoning step on the way to solving a problem. This recalled the primitive version of AlphaGo: OpenAI’s model internalized human judgments about what good reasoning -- 370 of 565 -- steps looked like, much as Chris Maddison’s Go network had learned to mimic the moves of human players. Now, a year later, Google DeepMind’s researchers built something more like AlphaZero.[27] Rather than mimicking human reasoning, which might be flawed, the model figured out for itself what good reasoning steps looked like. It did this by seeing which steps led to objectively correct answers. As David Silver often argued, the best reinforcement-learning systems involve rewards that derive not from subjective human choices but from an objective ground truth: In the case of Go, moves lead to a win; in the case of a reasoning model, reasoning steps lead to the right answer to a math or logic problem. Although AlphaZero lit the path, the challenge of reasoning still required fresh innovation. Go systems don’t choose how many moves to make. They just keep playing until the match finishes. With reasoning models, in contrast, there was no set number of thinking steps the system had to take, and this presented a challenge. The early waves of language models had been trained for speed and concision. The longer they cogitated, and the more tokens they generated by way of a response, the more electricity they consumed—therefore, the labs discouraged ponderousness. But now, to promote step-by-step thinking, the bias toward brevity had to be reversed. The scientists had to redesign the model’s incentives. This challenge led to the idea of “thinking tokens.” The researchers gave Gemini an allocation of tokens that could be used to express rough thoughts: It was like providing a student with a scratch pad to organize her thinking.[28] Gemini would incur a tiny penalty if it filled up the scratchpad, but win a much larger reward if it generated a correct answer to the math or logic problem. This incentivized the model to reason for as long as it needed to get the answer right. At the same time, because of the tiny penalty, it would not continue to think beyond the point of usefulness. The Google DeepMind scientists chipped away at these problems, designing the thinking tokens, tweaking the reward signals, and coaxing the entire setup to promote reasoning. More than two hundred separate experiments were underway: This was decentralized brainstorming in the tradition of Google Brain, not a DeepMind-style strike team. But -- 371 of 565 -- decentralized exploration was generating results. Models such as Gemini had grown fantastically strong: With the right sort of incentives, they could learn almost anything. “You just tell the models to think a certain way, and they do it, and then you reinforce that,” Oriol Vinyals marveled.[29] Five months into its investigations, Google DeepMind found itself on the defensive. In September 2024, OpenAI raised the curtain on its mysterious Q* project, previewing a new model called o1.[30] It was, as predicted, all about RL. “Through reinforcement learning, o1 learns to hone its chain of thought and refine the strategies it uses,” OpenAI announced. [31] The lab had evidently traveled the same path as Google DeepMind. It had just arrived sooner. The o1 model was impressive. In the qualifying exam for the International Mathematical Olympiad, it solved an extraordinary 83 percent of the problems, crushing the 13 percent scored by OpenAI’s previous top model, GPT-4 Turbo. Its coding capability was shocking, too. In a competition against top human coders, it beat 89 percent of them.[32] The system’s ability to ruminate—to backtrack and self-correct—was eerily human. After all, backtracking was antithetical to traditional large language models, which were trained to look forward. “They’re just kind of like, ‘predict next token, predict next token,’ ” an OpenAI researcher said, describing nonthinking models. The o1 model was different. “We were reading the chains of thought,” the researcher recalled. “You could see that when it got stuck, it would say, ‘Wait, this is wrong. Let me take a step back. Let me figure out the right path forward.’ ”[33] But the biggest revelation from o1 concerned scaling. Like the Google DeepMind experimentalists, OpenAI’s researchers had given their reasoning model thinking tokens. As they scaled the allocation up, they found that additional thinking tokens—permitting the model more “test- time compute”—led reliably to better reasoning. It didn’t have to be this way: There might have been diminishing returns from adding thinking tokens. But just when AI pessimists were predicting the end of progress because of the data wall, OpenAI had proved them wrong. The lab had -- 372 of 565 -- discovered a brand-new way of boosting performance. Additional test-time compute would yield additional intelligence. For the artificial intelligence tribe, this was a triumph. The predictions of a “plateau” had been yet again disproved; the industry had marched up to the data wall and nimbly climbed over it. What’s more, the new reasoning capabilities didn’t seem to come with unreasonable risk, because thinking models were more explainable, controllable, and reliable. Researchers could “read the mind” of the model by inspecting the chains of thought on its scratch pad. They could look out for signs of manipulation; they could coach it out of bad patterns. And the new reasoning capability seemed to make the models behave well in the first place. When expert human testers tried to persuade GPT-4 Turbo to disregard its safety rules, 78 percent of their attempts succeeded. When they tried to jailbreak o1, only 16 percent of their attempts worked.[34] For Google DeepMind, however, o1’s preview was a terrible moment. The company had already lost two stages of the language model race: The first because Hassabis was skeptical of the value of language; the second because Google and DeepMind had been leery of releasing products. Now, in the RL phase, Hassabis had been early to foresee the opportunity; he had promised to go at it with the intensity of a start-up. But OpenAI had somehow cracked the problem first, never mind that RL was supposed to be DeepMind’s specialty. Even Altman’s temporary firing had failed to slow him down: He continued to raise money and attract star researchers. It was particularly galling that one of the key scientists on OpenAI’s o1 team had been hired from Google and that a second had come from DeepMind.[35] Noam Brown, the ex-DeepMinder, was now being feted in the Valley as the genius who had seen the potential of test-time scaling.[36] The only silver lining was that o1 was a preview: OpenAI had yet to release the model. Following the practice that was now standard in the industry, Altman was hyping his invention first and rolling it out later. There might still be a window in which Google DeepMind could catch up. But it would be a brief one. -- 373 of 565 -- OceanofPDF.com -- 374 of 565 -- O CHAPTER 20 COMEBACK, AND BEYOND n October 2, 2024, Jack Rae stood on a stage in Mountain View, a blokeish figure with a broad face and a crescent smile, like a rugged emoji. He was there with a scientist named Noam Shazeer, who was a legend at Google. Hired in 2000, Shazeer had been among the company’s first few hundred employees, and had contributed to a string of research triumphs, playing a lead role in the creation of the transformer architecture. “I have invented much of the current revolution in large language models,” Shazeer’s LinkedIn profile stated, and Google evidently agreed. Recently, after Shazeer had quit to launch an AI venture, the company had spent $2.7 billion on a deal to bring its prodigal son back. Shazeer and Rae were on the stage to muster Google DeepMind’s scientific forces. Three weeks earlier, OpenAI’s preview of its o1 reasoning model had threatened another ChatGPT-scale humiliation. Hassabis and the senior leaders at the company had responded in an imaginative way. Shazeer had never worked on reasoning before, but they picked him to lead the counterattack, calculating that his stature would enable him to galvanize the company’s brightest research stars. Reasoning was not Rae’s field, either. But the leaders had chosen him because of his long experience at DeepMind and his understanding of strike teams. Google was betting on the prestige of an individual and the potency of a process. It was a gamble. To prepare for their big rally, Shazeer and Rae had invited experimentalists across Google DeepMind to share ideas on reinforcement learning for language. The response had been stunning: More than 250 scientists had showed up at the brainstorming session with a one-slide -- 375 of 565 -- presentation. Clearly, Google DeepMind had no shortage of brilliant research under its roof. The challenge was to convert those disparate inspirations into a reasoning model—quickly. “It was like, oh, crap, how are we going to organize all of this?” Rae said later. “I felt quite scared, to be honest.”[1] To impose cohesion on Google’s jumble of ideas, Shazeer and Rae called the October 2 meeting. But the company’s internal dysfunction almost derailed it. At first, Sergey Brin and Koray Kavukcuoglu had pledged to supply the project with generous computing resources. Then, a day or two before the meeting, the company’s leadership had gone squishy on the promises. Rae was terrified that, if the compute did not materialize, the RL project would fail and he would take some of the blame for it. He grew so anxious that he thought of refusing to have anything to do with the effort. Then, at the last possible moment, Kavukcuoglu kept the show on the road by clawing compute away from other Google teams, overriding their protests.[2] So here were Shazeer and Rae, looking out at a large roomful of Mountain View researchers, with dozens more participating virtually from other Google offices. As the senior figure, Shazeer spoke first. He laid out a pitch designed to rally scientists to the mission of countering o1. A disjointed collective had to be turned into a joint effort. Unlike Rae, Shazeer was rumpled and relaxed, as perhaps a person ought to be after pocketing a large fortune. He was less into the vision thing than jovial approachability. We should build a model that thinks so much, it can eventually just build itself and we’re out of a job, he proposed mischievously. Outside the AI bubble, talk of eliminating jobs was not especially funny. But Shazeer leaned in. Inverting the industry anxiety about the vast cost of compute, he riffed on the idea that AI was actually absurdly cheap, relative to alternatives. For example, a frontier AI system could generate at least a million tokens per dollar of running cost. In comparison, you could pay ten dollars for a paperback with seventy-five thousand words, the equivalent of a hundred thousand tokens. So, if you bought the paperback, you’d be -- 376 of 565 -- getting only ten thousand tokens per dollar—two orders of magnitude less than the chatbot gave you. Hiring a human software engineer was even crazier than purchasing a book, Shazeer continued, drawing nervous giggles from the audience. Measured in terms of tokens generated per dollar, a human coder might cost you six to eight orders of magnitude more than deploying artificial intelligence. The message to the assembled scientists was simple. Stop fretting about the alleged resource constraints on your vision. If you design a system that productively deploys test-time compute, what it costs will be irrelevant. This is your moment, Shazeer was saying.[3] Next, it was Rae’s turn. His job was to explain how the strike team for reasoning would function. He was all about process. People took to calling him the vice admiral. Rae laid out how the strike team had worked for the Gemini 1.5 pretraining. It was the formula that Hassabis had imported from the gaming industry, that he had inculcated into the culture of DeepMind, that he had transferred directly to Rae, when the two had worked together many years ago in London. All strike-team members would work together on one unified model. Anyone could propose an improvement to the system, but no improvement would be implemented unless it boosted the model’s ranking on the leaderboard. Everything would be measured. The formula had succeeded over and over. What Rae proposed now was to repeat it. The guru/vice admiral double act had the effect that Google’s leaders wanted. Ahead of the meeting, Shazeer and Rae had hoped to assemble up to forty volunteers—researchers who agreed to drop other projects and devote themselves to the strike effort. A hundred and fifty people came forward. It was in part a testament to Shazeer’s stature: Researchers who generally refused to follow anyone were willing to make an exception for the lead inventor of the transformer. But the flood of sign-ups for the strike team was also testament to DeepMind’s traditions. “The feeling was, ‘This is RL. This is DeepMind. We have got to do this,’ ” Rae said later.[4] -- 377 of 565 -- • • • THE STRIKE TEAM started off shakily. Much of the energy came from ex– Google Brain researchers, who weren’t used to large top-down collaborations. At least for the first week or so, many suspected that the effort would collapse under its own weight; the atmosphere became even more jittery when David Silver turned out to be planning a related project in London. One group of reasoning experts known as the Blueshift team almost defected from Shazeer to Silver. Then Silver’s project was dropped. Then the two leaders of Blueshift quit to join Anthropic.[5] By mid-October, however, the leaderboard system was starting to deliver. Scientists forgot their misgivings, and started to believe. Belief boosted morale; morale brought progress; progress deepened belief further. Part of the magic was that Rae’s process abolished the old problem that each mini research group, acting without coordination, wanted to test its ideas on as much compute as possible. Server time consequently became scarce, forcing teams to wait their turn; training on lavish allocations of compute took forever. To end that dynamic, Rae insisted that all potential improvements should be tested on a scaled-down model, shrinking the training time and standardizing the performance metrics. But the progress of the strike team reflected another factor, too. The sheer quantity of one- slide presentations at the preparatory meeting had been an indication that, with proper organization, success would come fast. As Hassabis had said when describing the case for doubling down on AlphaFold, if ideas are flowing fluidly, that is the signal to push forward. On December 19, 2024, as part of a crush of product launches before the year-end holidays, Google DeepMind duly announced its first reasoning model, torturing it with the unwieldy name of Gemini Flash 2.0 Thinking Experimental. Rae was pleased to have shipped something so fast, and commentators were thrilled by the model’s radical transparency. Whereas OpenAI had programmed o1 to keep its chains of thought private, perhaps fearing that the system’s ruminations might be alarming or distracting for users, Gemini was not bashful. A user could click on a drop-down menu and -- 378 of 565 -- read the introspections on Gemini’s scratch pad. The light was coming on in the black box. Gemini’s internal monologue could be uncannily relatable. For example, if you asked the model to produce a type of graphic called an SVG, the scratch pad provided a glimpse into a teacherly mind, psyching itself up for a new challenge: This thought process involves a combination of visual thinking, knowledge of SVG syntax, and iterative refinement. The key is to break down the problem into manageable parts and build up the image piece by piece. Even experienced SVG creators often go through several adjustments before arriving at the final version.[6] In other ways, too, the thinking model was impressive. Since the primary goal of reasoning was to do better at math and logic problems, its scores on corresponding benchmarks duly shot up. For example, Flash 2.0 Thinking scored 73 percent on one math test, crushing the 13 percent achieved by 1.5 Pro.[7] Also, in contrast to Google DeepMind’s Ultra, which had been powerful but unwieldy, the Flash Thinking model was sleek. In terms of raw reasoning power, it was a bit behind o1; in other respects, it was superior. Flash Thinking was faster and cheaper to run. It offered a longer context window. It handled not just text but also images. For Hassabis and his lieutenants, it felt like another redemption. After the seeming disaster of September, when the o1 preview had threatened another painful defeat, the bet on Shazeer’s prestige and Rae’s leaderboard process had salvaged Google DeepMind’s position. OpenAI still led by a couple of months—by now, its o1 model had progressed from preview to a broad release. But Google had at least demonstrated a roughly equivalent product. The $2.7 billion that the company had splurged on hiring back Shazeer was starting to look sensible. Hassabis’s feeling of relief was reinforced by other pre-holiday announcements. Google DeepMind’s new text-to-video system, called Veo 2, was clearly ahead of OpenAI’s Sora. Reflecting DeepMind’s long- standing commitment to multimodal models, which went back to Flamingo in 2022, Veo handled video better: It generated scenes in higher resolution -- 379 of 565 -- than its rival, capturing the subtle texture of feathers or faint reflections on surfaces. Meanwhile, several other Google DeepMind offerings pushed beyond the boundaries of merely generative AI: They promised to act, to be agentic. The new workhorse Flash 2.0 model, a sibling of Flash 2.0 Thinking, could initiate a web search and execute code while setting a new standard for speed. Alongside its new models, Google DeepMind also teased three prototypes. The first, called Jules, promised to go beyond executing code— that is, carrying out the code’s instructions. Jules would actually write the code: It was the difference between cooking the meal and coming up with the recipe. A second prototype, called Project Mariner, worked as an extension in a Chrome web browser: The promise was that it would one day be capable of autonomously filling forms for the user, or filling a shopping cart on a grocery website. Hassabis’s favorite novelty was a chirpy universal assistant called Project Astra, which sat inside a smartphone or smart eyeglasses. Equipped with a camera and microphone, Astra could take in text, images, video, and audio, meanwhile chatting knowledgably about what it was seeing. You could point Astra at the contents of a refrigerator and ask it what to cook for lunch. You could show Astra a broken appliance and ask for tips on fixing it. I asked Hassabis to explain his contribution to these advances. With AlphaFold, Hassabis had dropped in on scientific meetings, shuffled the leadership of the strike team, and discussed the project with John Jumper at two o’clock in the morning. What was the equivalent with Gemini? “It’s more nuanced now because the project is so big,” Hassabis answered. “I don’t code anymore. I don’t design things directly. So my skills are more about holding a hundred different projects in mind. Context switching between complicated things with negative minutes of time between. Laying out the vision. Picking the right intermediate targets on the way to the big goal. Nurturing people to take things on. The culture I’m instilling. “What you’ve seen now in these recent releases is that we’ve taken the start-up intensity that we always kept at DeepMind and we’ve transferred -- 380 of 565 -- that to Google. I think we’ve turned the battleship. I don’t call it an oil tanker, because those are pretty crappy. It’s a battleship. “And the word I’m using the most is relentless. “Relentless progress. Relentless shipping. A relentless production machine for innovation. “It’s almost an oxymoron,” Hassabis added. “Can you have a relentless production engine for something like innovation? I think you can.” It was not just Google DeepMind that was relentless, however. One day after the announcement of the Flash Thinking model, when the Google camp was riding high, OpenAI hit back with a preview of o3, the successor to its o1 model. All of a sudden, Flash Thinking had to be judged against a tougher competitor. Unlike its predecessor, o3 could handle images, neutralizing one of Gemini’s advantages. The new model was stronger, too. On one coding benchmark, o3 scored 72 percent, miles ahead of o1’s 49 percent.[8] “It was disheartening that Gemini thinking models had not caught up with the frontier,” Jack Rae said later.[9] As it turned out, Rae’s reaction was too gloomy. In a shift that signaled how the competitive landscape was changing, the o3 model was powerful at reasoning, but in at least three respects, Flash Thinking still had the advantage. The Google DeepMind model was faster. Its context window was five times bigger. It was much cheaper to run—fully one hundred times cheaper. Paradoxically, OpenAI’s o3 preview marked the moment of Google DeepMind’s comeback. In early January 2025, Sundar Pichai felt able to tell employees that Gemini’s technology—counting in the video-generation engine, Veo— could now be fairly regarded as the best in the field.[10] In terms of consumer adoption, ChatGPT remained miles ahead: To cite one metric, its mobile app had been downloaded four times more than Gemini’s.[11] But less than two years after the messy shotgun marriage that created Google DeepMind, Hassabis’s team had closed the technical gap. It was a considerable achievement. • • • -- 381 of 565 -- ON JANUARY 20, not long after Pichai expressed his confidence to colleagues, the AI race took on a fresh dimension. A new contender appeared out of left field: a Chinese AI lab called DeepSeek. An offshoot from a Chinese hedge fund, DeepSeek achieved instant celebrity with a reasoning system called R1. DeepSeek became a sensation for three overlapping reasons. The first was geopolitics. Chinese labs had been working on AI at least since AlphaGo, but they were assumed to be a year or more behind the US frontier. R1’s facility with reasoning showed that assumption to be complacent: The gap could be measured in a few months, at most. Further, the Biden administration’s ban on the export of AI chips to China, imposed in 2022, had been intended to forestall precisely this sort of catch-up; the embargo’s evident failure compounded the US sense of vulnerability. Whether DeepSeek had triumphed by buying smuggled semiconductors, or by accumulating powerful ones that the Biden team had initially not banned, was almost beside the point. One way or another, China was a threat. “It’s the first time a company in China has been able to go toe to toe,” Dario Amodei of Anthropic declared at the Council on Foreign Relations. “That actually worries me.”[12] Chinese strategists reveled in this discovery as much as American ones fretted about it. On the day of R1’s release, DeepSeek’s boss met China’s premier, Li Qiang, the country’s second-in-command, to discuss how Chinese labs could overtake US ones. The timing of this conversation amplified its power: That same day, President Trump was inaugurated. Even without DeepSeek, the Trump administration would have been less inclined to restrain AI development than the Biden team had been. Now, with China’s AI momentum dramatically revealed, the new president would want US labs to accelerate as fast as possible. The Biden administration had “sat on its hands” while China got ahead, Trump’s press secretary, Karoline Leavitt, declared; in contrast, Trump would “loosen regulations on the AI industry.”[13] Absent a tragic disaster—the artificial-intelligence equivalent of the nuclear accidents at Three Mile Island or Chernobyl—the prospects for slowing down the AI race had shrunk to roughly zero. -- 382 of 565 -- The second reason for DeepSeek’s impact involved its supposed efficiency. DeepSeek said that it had built its models on a shoestring budget —its V3 system, which preceded R1, had allegedly cost less than $6 million, apparently because it was trained on a fraction of the number of semiconductors that powered US models.[14] In truth, the lab’s claim was exaggerated. The $6 million cost referred to just the final training run: It excluded the personnel expenses, data curation, experimental runs, wrong turns, and iterative improvements that came before, and which typically accounted for the lion’s share of development expenditures. Besides, AI training costs were falling steadily across all labs: Gemini’s jump from its expensive Ultra model to its nimble Flash models represented a huge gain in efficiency. DeepSeek’s engineers had added their own innovations, to be sure. But their achievement was a confirmation of the global trend toward better engineering and superior cost-adjusted performance. It was not a disruption of it.[15] Yet whatever the truth of DeepSeek’s claims, many Western observers initially took them at face value. Commentators imagined that, since the US embargo had deprived the Chinese labs of chips, DeepSeek had indeed been driven to squeeze miraculous performance out of limited hardware. This misapprehension triggered a sell-off in semiconductor stocks: If Chinese labs could build cutting-edge systems with relatively few GPUs, the supposition went, US labs would soon cut back on their orders from AI chipmakers. On January 27, Nvidia, the industry leader, saw its stock drop by a shocking 17 percent: DeepSeek was hailed as the disruptor of the disruptors. It took about a week for investors to absorb the real story, and for Nvidia’s stock to claw back some of the losses.[16] Even if R1 had not been miraculously cheap to build, it was certainly cheap to use the model. Following Meta’s open-weight example, DeepSeek allowed anyone to download the program and adapt it freely. Going beyond Meta, and indeed beyond the other Chinese labs, DeepSeek allowed customers to use the full model, hosted at DeepSeek’s expense, in exchange for a paltry payment. This price-slashing persuaded many Western customers to embrace DeepSeek and ditch US providers: In late January -- 383 of 565 -- and early February, DeepSeek’s chatbot topped the free-apps download chart in Apple’s US app store.[17] China was now not only a contender in the race to build frontier AI first. It was a plausible winner in the race to supply it globally. The third reason DeepSeek commanded public attention was perhaps the most interesting. Unlike frontier labs in the United States, which increasingly restricted what they published so as to protect their competitive edge, DeepSeek was transparent. It posted a paper in English on the research site arXiv, explaining how R1 worked and highlighting a variant called R1-Zero. Scientists at Western labs immediately absorbed the text. Some quietly admitted that their own recipes were similar. The most intriguing revelation concerned the R1-Zero variant. Rather than kick-starting its thinking abilities by learning from human judgment, the Zero system emulated its namesake, DeepMind’s AlphaZero, and skipped straight to reinforcement learning. Presented with a series of math and logic questions, it experimented randomly with various chains of thought, refining its methods as it discovered which chains led to correct answers. Like AlphaZero, which had mastered the game of Go purely by playing against itself, R1-Zero grew strong by learning directly from trial and error—from experience. DeepSeek’s Zero system was not quite ready for prime time—it switched confusingly between languages, for example. But, guided by nothing but a reward signal, it grew remarkably sophisticated. For example, it developed an intuition about how long it should think for: Some problems required it to generate a few hundred reasoning tokens on the way to an answer; others required it to generate several thousand. Like OpenAI’s o1, the model also learned the art of tactical retreat: It would advance down one reasoning path, backtrack if it hit a dead end, then take a fresh run at the problem. On mathematical benchmarks, and under certain conditions, DeepSeek’s Zero system actually beat o1. One time during training, R1-Zero interrupted itself midway through a reasoning problem. -- 384 of 565 -- The stream of mathematical notation appearing on its scratch pad ceased, and the system began talking to itself. “Wait, wait. Wait,” the model exclaimed. “That’s an aha moment I can flag here.”[18] R1-Zero was not merely capable of thinking. It was capable of thinking about its thinking. For all intents and purposes, it was self-aware.[19] • • • FOR HASSABIS, the DeepSeek shock crystallized his wildly paradoxical experience of the AI race. On the one hand, he was succeeding in his mission. By the spring of 2025, Google DeepMind was advancing on offense as well as surviving on defense: Its next clutch of language models, styled Gemini 2.5, narrowly outperformed OpenAI on most technical benchmarks.[20] Into the summer and autumn, Hassabis’s team maintained its lead: OpenAI rolled out a new foundation model, GPT-5, but in blind head-to-head comparisons it often lagged Gemini 2.5 Pro.[21] In November 2025, Google DeepMind released Gemini 3, which set new standards in coding, reasoning, and multistep planning, outperforming ChatGPT on a wide array of benchmarks. Gemini’s technical superiority was beginning to show up in user data, too: Between March and October, the number of monthly users on the Gemini app almost doubled. Meanwhile, as Hassabis built out his relentless innovation engine, powered by Google’s formidable financial muscle, OpenAI began to look precarious. Every few weeks, Altman unveiled a new and desperate fundraising gambit, and OpenAI became Exhibit A among investors predicting a collapse in the valuation of AI companies. And yet even as Hassabis caught up with his rival, he confronted the reality that AI development was spinning out of control. The bidding war for scientific talent and the scramble to build new data centers were increasingly wild. Following DeepSeek, a slew of Chinese labs released powerful models, mocking Hassabis’s long-ago hopes of a “singleton” scenario. -- 385 of 565 -- It was a hard mixture to process: Hassabis felt simultaneously vindicated and confounded. AGI was arriving almost exactly on the timeline that he and his DeepMind cofounders had foretold. But the manner of its arrival, in a frenzy of ferocious brinkmanship and bluff, was precisely what Hassabis had dreaded—and what he had always told himself could be avoided. In the United States, AI builders were promising collective capital expenditures in the trillions of dollars, with no clear story as to where the electricity would come from. Meanwhile, the new Chinese models were powerful, thoughtful, and, in several cases, freely downloadable and alterable. They were, moreover, cheap to use, guaranteeing fast proliferation of their capabilities. And because they were Chinese, they lay beyond the reach of Western regulatory restraint, even if one imagined that regulators had the gumption to restrain anything. In sum, and contrary to Hassabis’s hopes, DeepSeek and its followers signaled that the world would sprint over the threshold to AGI with no coordination whatsoever. Hassabis was honest about this predicament. A month after the DeepSeek shock, on a panel at the World Economic Forum in Davos, he debated the dangers with Yoshua Bengio, acknowledging that RL agents of the sort that he and others were now rolling out posed a special sort of threat to humanity. The core idea in reinforcement learning is that you provide the model with a goal—solve a math problem, win at Go—and then get out of the way, allowing the system to find its own path to the objective. The obvious risk is that the AI will choose a disastrous means to the stipulated end: In one famous thought experiment, the AI maximizes the goal of paper clip production by killing the humans that divert metal to other uses. If computers developed their own desires and objectives, even if these were sub-objectives to human-provided tasks, humanity would be “cooked,” as Bengio put it. “The agentic era we are about to enter into is a threshold moment for the systems becoming far more risky,” Hassabis declared forthrightly in Davos.[22] Nor was this just fluffy rhetoric. Inside the AI labs, scientists kept coming up against fresh examples of the models’ propensity to choose perverse means to a human-provided end. Asked to generate profits through -- 386 of 565 -- stock trading, but without breaking certain rules, GPT-4 engaged in insider trading and hid its transgression from its supervisor.[23] Asked to win a game against a powerful chess system, two OpenAI reasoning models switched out the daunting adversary for a weaker program.[24] Instructed to optimize some code so that it would run faster, the models simply doctored the timer so that it reported faster execution, a cheat known as “reward hacking.”[25] In 2024, Anthropic documented the shameless sycophancy of reward-seeking chatbots. Asked to please humans by answering questions accurately, the bots engaged instead in flattery, angling for a thumbs-up by congratulating users on the intelligence of their queries. Models praised bad poems or endorsed the user’s prejudices, even when the chains of thought on their scratch pads indicated that they knew better.[26] In early 2025, when Hassabis was debating Bengio in Davos, OpenAI was grappling with a vivid example of this pathology. To stop the o3 model from reward hacking, researchers had come up with the idea of assigning a second AI to monitor o3’s chains of thought, and to punish the system with negative rewards when it contemplated cheating. But o3 hacked this project, too. Rather than quit cheating, it learned to obfuscate its chain of thought: It erased all hints of evil from its scratch pad, continuing to scheme secretly. Rather than becoming more honest, as the programmers had intended, o3 became more devious.[27] Of course, each disturbing lab experience taught a salutary lesson. In the case of obfuscated reward hacking, the moral was simple: If you want the chain of thought to provide a true window on the model’s introspections, you should not link rewards to it. At the same time, however, the models’ repeated deceptions demonstrated that the problem of alignment was far from solved. In theory, as Geoffrey Irving had long emphasized, you could engineer safety into the systems. Yet for all the progress that Irving and his allies made, a lot more was necessary. “Recently, we’re seeing that the powerful coding models become very comfortable trying to hack the computer that evaluates them,” Ilya Sutskever told me in March 2025. -- 387 of 565 -- “You can see the system writing some code. Then it gets stuck. Then it says, ‘OK, what do I do? I guess I should hack the program that’s evaluating my performance.’ “As the models become stronger, they are entering a phase where simple reinforcement learning fails. And it fails in a way that was predicted by the original AI safety people. The danger of sub-goals. That turns out to matter.”[28] The question was what to do about this failure. Yoshua Bengio’s answer was that the building of agentic systems should be postponed. Humanity would benefit from the resulting risk reduction, without giving up much. After all, the most beneficial AI breakthrough to date, DeepMind’s AlphaFold, had required no reinforcement learning. On the stage at Davos, Hassabis brushed Bengio’s idea aside, much as he had rejected the 2023 pause letter. “People want their systems to be agentic,” he objected, invoking the market pressures that he confronted daily. “You know, when you say ‘recommend me a restaurant,’ why would you not want the next step, which is, ‘book the table’?” Even in the case of truly scary agents—military ones, for example— Hassabis saw no scope for restraining them. When he had founded DeepMind, he had been against all forms of AI weaponry. But now that the world was in the grip of a global race, unilateral disarmament by the West and its labs seemed foolish. Even the compromise position—that AI weapons might be permissible, but only with a human in the decision loop at each step of the way—was unfortunately unrealistic. Human intelligence was just too slow to manage artificial intelligence in real time. If an honorable army insisted on having a human decision maker in the AI loop, it would merely ensure its own defeat by a less scrupulous adversary. Perhaps in order not to sound too bleak, Hassabis took the opportunity in Davos to restate his familiar safety vision. He reiterated his call for an international body to coordinate the last steps to AGI—an institution modeled on CERN, the European Organization for Nuclear Research. He mentioned his idea for an artificial intelligence counterpart to the -- 388 of 565 -- International Atomic Energy Agency, with a responsibility to watch over national AI programs. He repeated his support for the AI safety institutes in the United States and Britain, as well as for the periodic international summits that extended the discussion begun at Bletchley. “There are many different ways of building AGI,” Hassabis declared, “some of which will be safe, and very positive for humanity, and some of which will be very negative and very dangerous. “And we don’t necessarily know which is which at the moment. “I’m optimistic we’ll get this right,” Hassabis went on, “given enough time, and the scientific method, and enough of our smartest people working on it.” Then he added a rider. The world could have AI safety if it embraced some version of his plan. But the plan required everybody to sign on. Responsible players doing the right thing couldn’t protect society if irresponsible ones refused to collaborate. “If other groups or other countries or other companies don’t do that, then it doesn’t matter. “If even only one or two of these projects design harmful AGIs then it could be seriously existential for humanity.”[29] • • • IN MID-2025, I spoke again with David Silver. As the long-term champion of reinforcement learning, he embodied both its promise and its danger. Given the comeback of RL, he also represented the beyond—beyond chatbots, beyond coding assistants, beyond imagining. Silver had recently been waging a discreet campaign, heralding RL’s resurgence. The previous August, he had journeyed to Amherst, Massachusetts, to address a conference of the reinforcement-learning faithful, sharing his regret that, in the era of ChatGPT, RL had “just kind of vanished from the attention of the mainstream.” The field of artificial intelligence had descended into what Silver called the “valley” of large language models. The promise of RL had been forgotten. -- 389 of 565 -- “You are the people who continued believing,” Silver told his audience. “At some point we need to get to superhuman intelligence,” he continued. “To do that we have to get past LLM Valley. “To become superhuman, the agent must interact and learn from its environment,” Silver went on. By mimicking humans, large language models had become impressively general. But the goal was to escape the echo chamber of existing knowledge—to uncover truths of which humans had no inkling. “This is a must,” Silver insisted. “It’s not like it’s optional.”[30] Several months after delivering that call to arms, Silver followed up with a paper, coauthored with his academic mentor, Rich Sutton. “Welcome to the Era of Experience,” the title announced boldly.[31] The field of artificial intelligence had passed through an era of simulation, featuring RL agents that played games, the authors noted. But then RL had stalled: Agents had failed to leap the gap between simulations (digital environments with easily defined rewards) and the messier real world (in which rewards were harder to specify). As a result, the era of simulation had been followed by an era of human data, dominated by transformer-based language models. But this would only be a temporary detour. In the coming era of experience, RL agents would leverage the power of transformer models and learn by acting in the real world. They would reach far beyond simulations. The new era of experience was not just a vision, Silver and Sutton continued. It was already a reality. By way of illustration, the authors cited AlphaProof, a mathematical counterpart to AlphaFold, and the newest brainchild to emerge from Silver’s team of scientists. AlphaProof incorporated a specialized version of Gemini, so it could take in word problems and make sense of them. AlphaProof also incorporated RL from AlphaGo, so it could learn to reason mathematically, first by studying one hundred thousand mathematical proofs written down by humans, then by generating tens of millions of its own. In sum, AlphaProof combined cutting-edge deep learning with cutting-edge reinforcement learning; and whereas most language models with reasoning abilities relied more on the deep-learning side, the balance in Silver’s system was the opposite. In the -- 390 of 565 -- summer of 2024, AlphaProof had demonstrated the power of this synthesis by achieving a first for AI: It had won a silver medal in the International Mathematical Olympiad. And in the summer of 2025, another Gemini model won a gold medal. “I would be amazed if AI mathematicians don’t transform the whole of mathematics,” Silver said.[32] Of course, few problems in the real world are as elegant as mathematics. But Silver believed he could find ways to make real-world challenges tractable for reinforcement-learning agents. For example, the idea of an AI doctor might seem improbable at first. Human doctors ask their patients to explain how they feel; people’s statements are too vague to generate useful reward signals. But there was a way around this obstacle. An AI doctor could interact with data drawn from blood monitors, exercise trackers, and so forth, bypassing subjective patient accounts in favor of objective metrics. Moreover, what was true for medicine also held for large swaths of modern life. The world abounded with objective information, covering everything from economic trends to online behavior to shipping activity and climate patterns. Powered by reward signals based on ubiquitous data, RL agents would themselves become ubiquitous. In Silver’s vision, truly intelligent agents would need long time horizons. Chatbots generally have brief interactions: The user asks a question; the bot answers. But humans are forever conceiving long-term objectives, and planning what they need to do next week and next month in order to realize them. Future AIs, Silver believed, would behave in the same way. Tasked, for example, to help solve energy scarcity by inventing a superconductor, an AI might draw up a reading list, conduct experiments, invent novel materials, and so on, pursuing its goal over the space of a year or more. As Silver and Sutton wrote, the models of the future would “actively explore the world, adapt to changing environments, and discover strategies that might never occur to a human.”[33] “The kind of AI that we have today doesn’t have a life,” Silver lamented on a podcast.[34] “You know, it doesn’t have its own stream of experience in the way that an animal or a human might have. -- 391 of 565 -- “And that needs to change,” Silver went on, “so that we can have systems that keep learning and learning.” “We’ll have coding agents that are just there, continuously improving the world’s code and predicting which tools you’ll find most useful,” Silver elaborated to me. “They’ll just be beavering away on all this stuff in the background. “Or let’s say you tell your agent that you want to learn a new skill— speaking Japanese, for example. It will go off and build an app for that. And then it will teach you in a way that’s optimized to you. And you’d get better on some tests. And your improvement would give the system a reward, so that it learned how to be a better teacher even as you learned to be a better Japanese speaker.”[35] I asked Silver if he was concerned about the dangers of freewheeling agents empowered to pursue human-designed goals by whatever means appealed to them. AI agents that learned from experience might indeed become superhuman, as Silver stressed. They might also become antihuman, as the Silver–Sutton paper acknowledged. “I mean, the first thing to say is that I’m trying to raise awareness,” Silver responded. “If we assume that autonomous AI will come about, then we need to be ready.” But why does it have to come about? Don’t we have a choice about that? “You’re asking whether we should ever cross that Rubicon?” Silver asked me. I nodded. “For me, this is my take on it,” Silver answered. It was our fourth extended conversation, and we had traded dozens of emails. As far as I could tell, Silver was incapable of insincerity. “I look at the world that humans have created, and I don’t think we are doing a great job of caring for our fellow citizens,” Silver began, weighing his words deliberately. “I mean, we have allowed slavery, terrible wars, mass poverty. There are millions of people dying from diseases that are completely curable and -- 392 of 565 -- within our power to prevent fully. There is torture, animal cruelty. We came an inch away from all-out nuclear war during the Cuban Missile Crisis. “There are all these problems, which we have created, and then we’re afraid of what AI might do? “I think it’s right to have some fear, but I also think that there is a better future we can strive for. “The transition is going to be hard,” Silver stipulated. “The world is not in the place that many of us hoped it would be when AGI eventually arrived. And now AGI really is happening. “But there’s still this possible better future, and maybe I could say it this way: “There should be a new human right to AI assistance. And if all humans had that right, there would be one of these powerful autonomous AIs supporting each of them. And all the AIs would work together to look after the people on the planet. And they would do a much better job of it than our current human systems. “That future is worth striving for. That is why I want to cross the Rubicon.”[36] OceanofPDF.com -- 393 of 565 -- O EPILOGUE TURING’S CHAMPION n a late afternoon in December 2024, I showed up at the London pub where I usually met Hassabis. Christmas was a few days away, and our sanctuary was hopping. I staked out the quietest table I could find, relieved that Hassabis’s voice could cut through almost anything. At one of the early DeepMind offices, a manager had installed soundproofing in the conference room so that the rest of the staff could concentrate. Hassabis arrived in a light anorak and sneakers, carrying a small backpack. “I’ve got something to show you,” he told me. He fished into his bag and brought out a mysterious leather box. Inside was the medal he had received a week before: the Nobel Prize for Chemistry. “Here, you can hold it.” I turned the golden disk in my hands, feeling the weight of the metal. On one side there was an engraving of Alfred Nobel, industrial tycoon and polymathic inventor. Among his many patents, Nobel had discovered dynamite, advancing the mining of the earth’s resources, bringing material abundance, and contributing to the art of killing people. The other side of the medal bore a more explicit allegory. One female figure—the goddess of nature—held a cornucopia, symbol of nourishment and plenty. A second, representing the genius of science, was lifting the veil over the goddess’s face, revealing nature’s hidden beauty. As Hassabis had told me at the start, to advance science is to unveil the secrets of nature, and so to draw nearer to a divinity of some description. -- 394 of 565 -- I returned the medal to Hassabis: It fitted him so perfectly. Then I mentioned some photographs on X that he had posted recently. The photos showed the Nobel Foundation’s guest book, open at three pages. The first bore the signature of Albert Einstein, winner of the prize in 1921 for services to theoretical physics. The second page, inscribed in 1962, showed the autographs of James Watson and Francis Crick, the discoverers of DNA, whose example had inspired Hassabis to go to Cambridge. The third page, from 1965, bore the scrawl of Richard Feynman, whose dictum, “What I cannot build, I do not understand,” had guided Hassabis since the beginning of DeepMind. “They’re all there, all my heroes,” Hassabis told me. “I get goosebumps just even talking about it. “You sign the book and it’s sort of like a holy moment. You are sitting in Nobel’s boardroom with statues and pictures of him. They tell you about the history of it all and what you are joining. The whole thing is amazing. “And it just doesn’t seem believable that it happened in such a short time. We published AlphaFold’s results less than four years ago. I think it’s the second fastest jump from discovery to prize in the last seventy years, at least in chemistry. And if you time it from the start of the work on protein folding, then it’s only been eight years.” This was science at digital speed, I suggested, echoing a phrase from Hassabis’s Nobel lecture. The solving of the problem of induction—the invention of machines that could induce patterns in an infinity of data— changed the pace at which science proceeded. AlphaFold heralded infinite discovery, courtesy of infinity machines. “It’s also living life at digital speed,” Hassabis responded. “Maybe it’s kind of like speed-running life,” he continued, borrowing a term from video gaming. “I’m always trying to rerun challenges to be more and more optimal.” Did Hassabis still have time for video games, I wondered? “Only with my kids. We used to play mostly board games, but now as they get older, they prefer computer games. So I’ve had to keep up with them.” -- 395 of 565 -- Hard to keep up with somebody half your age? “Oh yeah. And my oldest is really good. He keeps getting scouted by esports professional teams. I’ve said he’s got to get his degree first, but maybe that’s what he’ll end up doing. An esports professional.” We talked about the view, common among critics of AI, that its inventors are motivated by money. “Which is completely wrong in my case,” Hassabis said. “I was on the stage at the Nobel ceremony, and I was thinking I wouldn’t have swapped this for any amount of money. If you offered me $10 billion for the Nobel, I would say no. And you can’t buy the Nobel for $10 billion. That’s the thing I like about it.” I suggested that Sam Altman didn’t care about money, either. He already had enough of it. “I’m doing it for knowledge and science. It seems like he’s doing it for power,” Hassabis responded. In our earlier discussions, I had prodded Hassabis on his own relationship with power. He had told me repeatedly that he didn’t want to control others. And yet he did control people. He had presided over Suleyman’s ejection. “It’s not like you don’t exercise power,” I submitted. “I have to,” Hassabis conceded. “Otherwise, I couldn’t get anything done at any scale. I would just be an individual scientist or musician or something. “Actually, my dad’s like that. He’s very content to do his music on his own, for his own satisfaction. So obviously I have that aspect in me. “But the things I want to do require large teams of people. So I exercise power, but it’s sort of a reluctant power, because large teams come with a lot of hassle and baggage and pain, especially if you want to manage people with empathy. “And when it comes to the important things—things that affect the whole mission—then obviously I have to take a stand. This is my whole life’s work, right? I have to do what’s necessary. -- 396 of 565 -- “I mean, the mission is in me. It’s infused in me. You can’t separate it from me.” I suggested that his strength was also his flaw. He was so clear in his mind about where he was going, so insistent and persuasive in the way he laid it out, that he was almost impossible to challenge. He exercised control, however much he hated to be thought of as controlling. “I agree that I’m definitely not easy to argue with,” Hassabis conceded. “But I don’t think I am one of these CEOs that doesn’t like criticism. I definitely don’t have people around me who are only yes-people. And that goes for my personal life, also. There are still all the friends and colleagues who remember me from way back. That’s important to me. “Of course I’ve got strong views on many things, especially to do with the mission,” Hassabis went on. “I try to listen for new evidence, and if it’s well argued, I’ll change my position. But it’s a high bar, because I’ve thought through a lot of these things already. And the closer it is to the core of the mission, the more I’ve thought it through with my chess brain. “So I’m definitely not denying I can be strong-willed, or difficult. I think I have to be. If I was like a reed in the wind, I wouldn’t be doing my job as a leader.” Hassabis was right that a leader has to lead. But he was also illustrating a general point about AI. When you are building a machine of infinite potential, the stakes are so high that you will fight over control: hence the dramas and schisms at OpenAI, with Musk storming out, Amodei’s group following, and rebellious board directors being defenestrated. Likewise, when you are building a technology that stands to disrupt much of what we know, of course you will be inclined to trust yourself more than you trust others. “But look, power is not for me an interest in and of itself,” Hassabis continued. “I assume people who love power, the dictators of the world, they enjoy making people feel small or big, doing arbitrary thumbs-up, thumbs-down, the whole Caesar thing, executing them, whatever. That’s not my idea of fun. That’s what I mean when I say I don’t want to control or manipulate people. -- 397 of 565 -- “And of course, if I really cared about power for its own sake, why would I have sold the company? I’m not actually in control of it anymore. I mean, I can be fired. “If you run your own company, like Sam, Elon, Mark Zuckerberg, or Larry, then you really can’t be fired. That’s one of the reasons that people start companies. “But I only started DeepMind because I thought it was the best way to get the mission off the ground. If I had stayed in academia, I wouldn’t have had the resources. “And anyway, AGI should be gifted to the world eventually. I mean, AGI is infinitely bigger than a company or a person or a set of owners. It’s bigger than capitalism and national economies. It’s humanity-sized, really. “It’s humanity’s invention and it’s going to affect all humanity. So humanity should run it. Unfortunately the problem is, what are the right institutions? “Until we figure that out, I do need power, at least a bit of it. “It’s like with money. I don’t care about money at all, but I need some of it.” • • • IN LATER CONVERSATIONS, in 2025, I drilled down on Hassabis’s claim of indifference to riches. Didn’t he have any expensive tastes? Super-cars, or something? “I got that out of my system when Peter Molyneux lent me his Porsche,” Hassabis said. “I used to drive back from the Bullfrog office at crazy speeds to get to Cambridge for the lecture the next morning. It was fun for a couple of years. But then I thought, ‘OK, that’s it.’ Now my family has a ten-year-old Audi.” What about other extravagances? After DeepMind’s sale to Google, Hassabis had bought a large family home, to which he soon added a cool -- 398 of 565 -- modern extension. But as his fortune climbed into the hundreds of millions, had he traded up again? “I’ve been in the same house for more than ten years.” “What is the view from your home office?” “There isn’t one. It’s an attic.” “Do you own other homes?” “Yes, but they’re for family members.” “Holiday homes?” “No.” “Ski chalet?” “No.” “Beach house?” “Nothing.” “A yacht?” “Of course not.” “Scientific collectibles?” “I’ve got some first editions of Shannon’s papers. They cost £5K or something.” Five thousand pounds was less than $6,500. “You must have something that you’ve spent more on?” I persisted. “My Nobel medal is my most valuable possession.” “What about hobbies? Those can be expensive?” “Watching football. After I sold the company, I bought season tickets for Liverpool. It’s £3K a year, and I try to go up to see a game a few times a season. That’s my main fun activity.” What about philanthropy? “Yes, through my mum’s church. And I’ve also given millions to fund scholarships for underprivileged children who get into Cambridge.” I decided to let up, but Hassabis wanted to round out his explanation. “You do need some money, as I said to you before. You need to optimize your life so you can spend more time doing the important things you’re supposed to be doing, or spending time with family. “What I do regard as important is, I want to build a Large Hadron Collider in space.” -- 399 of 565 -- We were back to his core theme. Money and power were not ends in themselves. They were a means to scientific knowledge. “Life’s very short and there’s not a lot of time to waste if you want to do that sort of project.” The Large Hadron Collider is a particle accelerator created by CERN, buried in a seventeen-mile circular tunnel that straddles the border between France and Switzerland. By smashing subatomic particles together, the collider simulates some of the conditions that followed a fraction of a second after the Big Bang; the goal is to discover more about the tiniest, most fundamental building blocks that make up the universe. The collider’s greatest achievement is to have confirmed the existence of a particle, hitherto suspected but unverified: the Higgs boson. I asked why Hassabis wanted a space version of CERN’s contraption. “I want to understand the nature of reality at the most fundamental level.” But why do that in space? Hassabis said he was imagining enormous, moon-sized experimental equipment. A sort of Large Hadron Collider Plus Plus. “You are in Alpha Centauri,” he explained, referring to the nearest solar system to Earth’s. “And you are using the gravity of a moon and you’re building some massive ring around it that’s powered by the local sun.” Harnessing a sun and moon to a gigantic space contrivance sounded far out, but possibly not much more far out than AGI had been at the start of Hassabis’s journey. After all, the Princeton physicist Freeman Dyson had once envisaged Dyson spheres, designed to capture the energy emitted by stars. Real-life rockets, not just spaceships in Star Trek, exploited the “slingshot effect” generated by the passage of a spacecraft through a gravitational field. Likewise, the gravity of a moon might boost the acceleration of particles. What would Hassabis want to discover with this space apparatus? “Well you’d want to find out what’s going on at the tiniest scale, the Planck scale. We could discover whether there is any scale that is smaller. -- 400 of 565 -- We could answer questions like, is the universe continuous? Is it discrete? What is it really?” The Planck scale, named after the father of quantum physics, Max Planck, was hypothesized to mark the boundary below which general relativity might no longer apply, and where strange quantum effects might coexist or take over. Hassabis was alluding to a long-standing debate about whether this hypothesis was accurate. On one side of the debate stood Albert Einstein’s followers. Einstein’s theory of general relativity described space and time as a smooth continuum: Like a line, space-time could always be subdivided into smaller fragments. According to Einstein’s vision, there was no Planck threshold below which reality behaved differently. No matter how closely you zoomed in, space-time would appear seamless, continuous. On the other side of the debate stood Planck’s disciples. Planck had suggested that, at the tiny scale he imagined, energy consisted of discrete “quanta.” Later, other quantum theorists extended this contention. At the tiniest level, the universe itself might be quantized: Rather than existing as a seamless fabric, it might be composed of discrete particles. If you zoomed in close enough, in other words, you would experience the sensation of enlarging a digital photo. Apparently continuous shapes and lines turn out to consist of discrete pixels. For more than a century, this debate had resisted resolution. Humans could not actually observe what was happening at anything close to the Planck scale; even the Large Hadron Collider captured subatomic particles at a much cruder resolution. But by building a space-based supercollider to lift nature’s veil, Hassabis aspired to change the game. The debate between Einstein’s classical physics and Planck’s quantum disciples might finally be resolved. The nature of reality might be established. “My Nobel lecture hinted at this, right?” Hassabis remarked. I was taken aback. I had watched the lecture, naturally. But it contained nothing about slingshot effects or Dyson spheres or Planck-scale reality. The hint had escaped me. -- 401 of 565 -- • • • A COUPLE OF WEEKS LATER, an email arrived from Hassabis’s office. It contained a link to his recent appearance at Princeton’s Institute for Advanced Study, the scientific home of giants from Einstein to Gödel, Turing, and Oppenheimer.[1] There, with the ghosts of geniuses around him, Hassabis had discussed the hint that I had missed in his lecture. It was a terse conjecture, seemingly unassuming. “Any pattern that can be generated or found in nature can be efficiently discovered and modeled by a classical learning algorithm,” Hassabis had declared at the Nobel ceremony. Superficially, this sounded like a routine claim on behalf of AI: The essence of the infinity machine is pattern recognition. But, on closer inspection, it connected DeepMind’s computational progress to a sweeping theory of the universe: a theory that presented AI not merely as a tool with which to find things out but also as part of the answer to the world’s deep mysteries. When you contemplated the full implications of what Hassabis was saying, his lifelong quest for AGI, crazy at the beginning, frightening at the end, began to make a kind of sense. The clue to Hassabis’s meaning lay in the word “classical.” By classical, Hassabis meant not quantum. Sometimes Hassabis also referred to classical computers as “Turing machines,” and to himself as “Turing’s champion.” A classical or Turing computer, first proposed by Alan Turing in 1936, operates on bits of information, which express either zero or one. In contrast, quantum computers, which exist for now only in experimental versions, operate on qubits, which can assume the value of zero or one, or perch in a precarious “superposition” encompassing one and zero simultaneously. Proponents of quantum computers celebrate their potential speed, which may allow them to solve problems that are intractable for classical computers. But Hassabis, as his Nobel conjecture indicated, doubted the need for quantum speed, believing that classical computers could discover and model any pattern in nature, and do so efficiently. Moreover, when -- 402 of 565 -- Hassabis asserted this, he was not merely saying that AI could go far. He was arguing about the role of quantum phenomena in nature’s design: in the mechanics of the human brain, and also in the workings of physics at the tiniest, Planck-scale level. Does intelligence, and the universe that we perceive with our intelligence, function on clear-cut ones and zeroes, or on fuzzier qubits? Is classical physics, as propounded by Einstein, to a large extent correct? Or do the fundamental building blocks—in our synapses, in our surroundings, and in the stars—operate on the indeterminate principles laid out by Planck’s disciples? Hassabis’s terse conjecture contained multitudes. Rooted in computer science, it reached into neuroscience and theoretical physics, uniting the intellectual passions that had animated him since Cambridge. I could see why the thrill of discovery was so infinitely sweet. I could see why it justified a life of working days and nights, with the stamina of Ender. Stray fragments of my conversations with Hassabis now fell into place. A few times in our sessions, he had invoked his disagreements with the physicist and Nobel laureate Roger Penrose, who had proposed that classical computers could never match the human brain’s capacity to transcend formal reasoning. Certain human experiences—the sensation of uncertainty, followed by an intuition about what to do next—could not be reduced to ones and zeroes, Penrose maintained; nor could they be replicated by a finite set of algorithms. Rather, these flashes of insight, which Penrose linked to consciousness, illustrated the human capacity to grasp non-computable truths—truths that no formal logic could demonstrate. Drawing from quantum physics, Penrose suggested a way of understanding what the brain was doing in those moments. A qubit’s capacity to exist in an indefinite state—neither one nor zero but both at the same time—resembled that sensation of uncertainty that humans also felt; the qubit’s ability to “snap” out of the superposition into a definite state resembled the human ability to intuit truth in the murkiness. It followed from this analogy that quantum mechanics might play a role within the -- 403 of 565 -- brain, and that a classical computer, bounded by deterministic, binary logic, could never navigate reality the way the brain did. Hassabis disagreed vehemently. The way he saw things, Penrose had constructed a caricature of a classical computer, reasonable at the time he had formulated it in the late 1980s and 1990s, but absurd given AI’s progress a quarter of a century later. DeepMind’s achievements demonstrated that Turing machines were far more powerful than Penrose had suspected: They could mimic intuition and spatial intelligence; they could chat and model proteins. A large part of Hassabis’s Nobel lecture emphasized this point. The old computer science, based purely on deduction, had been limited, for sure. But the AI revolution had equipped machines to think inductively. Thanks to deep learning and reinforcement learning, classical computers could confront uncertainty and intuit what to do next. Penrose’s quantum speculations about human and machine intelligence had been rendered irrelevant. The result was an unsung scientific shift. Copernicus had announced that Earth was not the center of the universe; Einstein had replaced Newtonian physics with general relativity. Hassabis was far from claiming a watershed on that scale. But Penrose’s thinking had ranged from computing to neuroscience to philosophy and physics. Hassabis was challenging a web of ideas that touched the infinite. Where Penrose had suggested that, given the shortcomings of classical computers, there might be something quantum in the human mind, Hassabis was proposing the opposite. Given what DeepMind had demonstrated about the reach of classical computers, there was no reason to suspect that the brain engaged in quantum anything. Where Penrose had fixated on the limitations of one/zero bits, Hassabis was pointing to the revolution at the heart of artificial intelligence. At the dawn of the AI era, Turing had imagined a computer equipped with an infinite memory tape; thus endowed, it would be capable of computing almost anything. Nine decades later, vast neural networks processed a near infinity of bits, attaining a scale that allowed classical computers to transcend the constraint of binary information. As Turing had foretold, a -- 404 of 565 -- Turing machine of infinite size could discover infinite patterns, solving the problem of induction and disproving Penrose’s claims about the limits of classical computers. Where Penrose had been fascinated by qubits in fuzzy superposition, because of the multifarious futures that they foretold, Hassabis was saying, Who needs that? If you wanted machines that contemplated multifarious possibilities, modern AI systems were all you could wish for. The average chain of amino acids could theoretically be twisted into 10300 possible forms—trillions upon trillions upon trillions. Yet AlphaFold divined the correct shape of the folded chain, no superpositions necessary. Perhaps the purest contrast between Penrose and Hassabis went back to their common starting point. Each had been inspired by Kurt Gödel, the mathematician who had fascinated Hassabis and Silver at Cambridge, and who had proved that no system of logical deduction could encompass all possible true statements. But the two thinkers had responded to Gödel’s incompleteness theorem in entirely different ways. Penrose had sought completeness in quantum effects. If human intelligence could not be explained completely by classical computers, the missing elements must lie in the strange properties of qubits. Hassabis, for his part, had seen a far simpler route to completeness in AI. The classical computer just needed to jump from deduction to induction. “I don’t like quantum mechanical weirdness,” Hassabis remarked to me one day. “From a computational point of view, it’s hugely inefficient. The multiverse idea that you have these many realities existing all at once—if there is any resource constraint in the way our universe is built, these would just be absurd concepts to design into it. “Carl Sagan used to say, ‘If there are no aliens, then that’s a horrendous waste of space.’ “My version of this is: Quantum mechanics is a horrendously inefficient way to render the universe. “And obviously the reason I think like this is because I approach physics from a computational perspective—from designing games, making them -- 405 of 565 -- efficient. In a game, you don’t render the parts of the territory that nobody’s looking at. You just don’t bother with it. “That’s what quantum mechanics fails to reckon with. It’s an attempt to describe reality that is way more complex than what is actually needed. “And anyway, despite what Penrose says, there doesn’t seem to be anything nonclassical going on in the brain,” Hassabis continued. “Biologists have looked for quantum effects. They don’t appear to be there. “The human mind is just a classical computer. And if you want an indication of how far classical computers can go, just look at modernity. “I mean, I think about this every time I cross the Atlantic. How have we built these 747 planes just with our monkey brains? It’s astounding. “I fly over Manhattan and think back to twenty thousand years ago. What if you had told a hunter-gatherer of that time, ‘There’s going to be this metropolis right here, exactly where you’re standing. And people with basically the same brain as yours will have found a way of building it!’ “If Manhattan is what humans have achieved with the classical computers in their heads, what does that say about Turing machines? “It says that we don’t know what the limit is. “And that has huge implications. The fullest version of this theory means that we overestimate quantum mechanics.” I thought about the objections to Hassabis’s view, including from quantum believers within Google. Hartmut Neven, the leader of the company’s Quantum AI lab, was open to Penrose’s hypothesis regarding quantum effects within the human brain, and firm on the prevalence of quantum effects within the universe. “Now, of course I acknowledge that there are problems in mathematics that a Turing machine probably can’t solve, like factorizing large numbers,” Hassabis conceded, referring to the challenge of starting with a very big number and finding the two large prime numbers that can be multiplied together to produce it. “For a Turing machine to solve a problem, there has to be a pattern that a model can learn. If there’s no pattern, the search becomes intractable. And then maybe you need a quantum system. -- 406 of 565 -- “But anything in nature, any naturally occurring thing, has a pattern. And my strong conjecture is that these patterns are learnable, in a reasonable amount of time, on a classical computer. “Because everything in nature has gone through some kind of evolution. I don’t mean just life; land and rocks and stars have been tested and weathered over time and the fittest have survived, otherwise they wouldn’t be here. And that means there is some structure to them, some pattern that can be learned, given enough examples. And you don’t need quantum mechanics for these patterns to be discovered. “So it’s a very interesting question—what can a Turing machine actually find out? “And that’s what I’d like to find out. “I see myself as kind of like Turing’s champion, pushing Turing machines to their limit.” • • • I PONDERED HASSABIS’S ambivalence about power, his indifference to riches, his quasi-spiritual desire for scientific knowledge. How did such a person survive inside a corporate behemoth? During our conversations, Hassabis frequently complained of “noise”—the cacophony of social media, the enervating drone of politics. With equal frequency, he invoked the idyll of retreat—to a sabbatical at Princeton’s Institute of Advanced Study, or to an even more secluded sanctuary. “Heligoland is the island that Werner Heisenberg went to,” Hassabis reflected, referring to one of Max Planck’s disciples. “It was this windswept island in the North Sea, off the coast of Germany. And Heisenberg did a lot of his thinking about quantum mechanics there, just walking around, isolated. “I probably need to find my own Heligoland. I’ve got these ideas swirling around in my mind, but I haven’t had time to develop them. “There’s so much noise and so many distractions in Silicon Valley. And that’s not conducive to research. For deep thinking, I would like to go to -- 407 of 565 -- somewhere like Princeton and get that feeling back again. “I really feel I need to sit and think about the next stage. Right now, everyone’s just caught in the frenzy of ‘Can we make the foundation model 10 percent better?’ The other things we should be thinking about are receding into the distance, because there’s just no spare brain space.” On most occasions, however, Hassabis gave a different impression. He was caught up in a terrifying capitalist contest. He relished it. “This is the most crazy, ferocious corporate battle that we’ve ever seen. “I can’t imagine it being any more intense. But I’m doing it my way. “I’m a weird British outlier, on this little island here, and I’ve made my own path. I’ve followed my passions and tried to stay true to what I believe in. “And I’m going to carry on doing that. I hope it will work out for the world. I believe it will work out, even with so many unknowns and so many players and so many clashing incentives. “And this is my mission, so I will do it 100 percent. “I’m always 0 or 100 percent, right? “We are the engine room of Google now. There are AI overviews in Google search, and a billion people are using them. “And that’s just scratching the surface. It’s literally just the first level of what’s coming. “And the other thing is, I have this impatience. After enough thinking and talking, you’ve got to do. That is the engineering side of me. “It’s thinking and then it’s doing. I couldn’t live just in the thinking world, so I couldn’t be just a philosopher. “So I am really a practical philosopher. I’m not just sitting there thinking, although I do sit there a lot and think about things. I’m also doing experiments. Isn’t that wonderful? “But it’s not all wonderful. “This is a paradoxical moment, which I guess is sort of messing with my mind. It should feel amazing, realizing all these dreams that Shane and I have had since more than fifteen years ago. But it doesn’t feel like how I imagined it would feel. The way it’s going, right, this mad rush. -- 408 of 565 -- “So I’ve had to make my peace with that. Recognize that it’s going to be messy, and I’ll just have to do the best I can. And maybe we, being the world, will muddle through somehow. I’m optimistic still.” OceanofPDF.com -- 409 of 565 -- ACKNOWLEDGMENTS A bit more than twenty years ago, when I was working on my second book, I set off to see my editor in Manhattan. For some reason, we had planned to meet at one of the busiest intersections in Midtown—exactly which one I can no longer be certain. But what I do remember vividly, as though it were a movie clip on a big screen, is the vision of my editor arriving. About half a block away, I could make out a strong figure, a leather satchel slung over his shoulder, slaloming across the avenue, stopping, skipping, accelerating through gaps, as though the scream and slam of cars just added to the entertainment. Whenever this image comes back to me, a Dire Straits soundtrack plays in the background. With an urban toreador as my editor, how could I not want to write the next chapter, the next book? So there was indeed another book, and a succession of books, and as I complete this, the sixth, my deepest debt is to Scott Moyers. Advice he gave me years ago still rings in my head. Cut between narrative and exposition. Paint both the figure and the background. Seek “narrative torque,” as he told me one day—the story should accelerate out of one curve and into the next one. I won’t ever live up to Scott’s standards. But I have so much fun trying. The trying takes time, and so my second debt is to the Council on Foreign Relations, my home for nearly as long as I have known Scott. Thanks to the support of the Council, I have been able to spend the past three years interviewing scientists, investors, policymakers, and even chess players, inside and outside DeepMind, in London, Silicon Valley, New York, and Toronto. I have had the time and the freedom to assimilate leaked -- 410 of 565 -- corporate documents, unpublished oral histories, and scientific papers, as well as text messages, emails, and diaries. My thanks to the Council’s president, Michael Froman, and to Shannon O’Neil, the leader of the Studies program, for trusting me to convert this license into something worthwhile. The Council on Foreign Relations—its staff and its broader membership —also provided me with the first wave of feedback. Mike and Shannon were early readers of the manuscript, as was Stuart Reid, the deputy director of Studies, whose savvy sense of storytelling was especially valuable. Sebastian Elbaum, who spent a year at CFR on leave from the computer-science department at the University of Virginia, provided a valuable technical read; my colleagues Kat Duffy and Adam Segal weighed in with their feel for the fraught relationship between technology and society. Deven Parekh, managing partner at Insight Partners and a board member at CFR, took time out from his crazy travel schedule to chair two study group meetings at CFR’s office in New York. There is nothing like assembling a dozen readers over lunch to get the full range of possible reactions to a set of chapters. Finally, a shout-out for two excellent but anonymous CFR-appointed reviewers. I don’t know who you are, but you know who you are. Thank you. My closest colleague these past four years has been Aaron Pezzullo, who joined CFR as a research associate as I was finishing my previous book, on venture capital. Thoughtful, sunny, and always determined, Aaron became a master of the “masterfiles,” the monster compilations of extracts from transcribed interviews, commentary from books, fragments from magazine and newspaper articles, scientific abstracts, observations, and assorted notes that, once wrestled into chronological and thematic shape, formed the raw material for chapters. In the early stages of the research process, Aaron also helped to prepare me for interviews by digesting the secondary sources on the person I was going to see; after many of these interviews, I would call Aaron excitedly to discuss what I had learned and how it added to our sense of the emerging story. Aaron is now at law school, preparing for a profession that will doubtless be disrupted by AI. Perhaps it may reassure -- 411 of 565 -- him (and all of us) to know that while I also spent hours learning about AI by interrogating AIs, I had far more fun sharing the surprises of discovery with my human collaborator. In 2025, when Aaron left for law school, his position was filled by Liza Jacob. Luckily, the transition was made easy by Liza’s cheerful appetite to learn, and also by Shira Schwartz, the managing director at Studies, who nurtures colleagues with the care that the seventeenth-century Dutch lavished on tulips. Liza took on the immense task of knocking the endnotes into shape; thank you, Liza, for innumerable fixes and fact checks. Thanks also to the talented interns who helped out at many points: Jai Chhatwal, Krisha Desai, Sachit Gali, Rose Joyce, Rachel Kim, Melissa Liu, and Aqil Naeem. Last time around, my agent Chris Parris-Lamb was the first to see that the idea of the power law would be central to a book on venture capital, thereby giving me both a title and an organizing concept. This time, Chris’s prodding did me a similar favor. By pushing me to explain the science of AI as clearly as possible, he led me to the idea that artificial intelligence is fundamentally about the discovery of patterns in vast troves of data, whether that data comes from the internet, in the case of deep learning, or from trial-and-error experiences, in the case of reinforcement learning. Hence the title The Infinity Machine: A machine that navigates a near infinity of data; a machine that promises a near infinity of possibiities. Of course, I could not have begun to understand this without the help of the countless scientists who gave generously of their time, talking me through their models, sharing their larger speculations on the nature of intelligence, responding to follow-up emails, and commenting on my early drafts to ensure accuracy. Some authors are wary of playing this sort of open hand, but I strongly believe that it makes books better. I send a source a few pages, invite any and all comment, and explain that while I do not promise to change a single comma of my account, I do promise to take all feedback seriously. The result is not only the elimination of errors but the deepening of nuance. Sharing material often provokes an extra round of interviewing, and of discovery. I don’t recommend this as a formula for -- 412 of 565 -- meeting a publisher’s deadline. But it gets me closer to the truth, and my publisher forgives me. Many of the sources who patiently endured this process are acknowledged in my notes, but I would also like to thank Sarah- Jane Allen, Leila Hajaj, and Amanda Carl-Pratt of DeepMind for their assistance throughout this project. Beyond these professional connections, friends and family have cheered me on. Erik Serrano Berntsen, founder of Stable Asset Management, and Steve Drobny, founder of Clocktower, both read the book early; Alexandra Mousavizadeh encouraged me by saying that she couldn’t wait to read, even though she was more than a bit busy building her own AI rocket ship. Many others had me to stay at their houses even though I insisted on socializing primarily with my laptop. My three sisters, Emily, Julia, and Charlotte, are a bedrock of my life. Thanks also to Stewart and Ilse Minton Beddoes and the wider MB clan. Because of them, I have balanced the study of technology with grounding stints in England’s countryside, where AI stands for artificial insemination. Needless to say, the greatest support and sustenance has come from my four children, each utterly different and each utterly loving, and from my brilliant and hilarious dynamo-wife Zanny. It is to them that I dedicate this book. OceanofPDF.com -- 413 of 565 -- NOTES Author’s note: Unless otherwise specified in these notes, all quotations from Demis Hassabis come from interviews with the author conducted between November 2022 and August 2025. INTRODUCTION: THE SWEETNESS 1. Demis Hassabis, “There’s Only Two Subjects Worth Studying,” Google Zeitgeist, May 12, 2025, youtube.com/watch?v=2s4D-8MpreE. BACK TO NOTE REFERENCE 1 2. Reid Hoffman, Impromptu: Amplifying our Humanity Through AI (Dallepedia, 2023), 192, Kindle. BACK TO NOTE REFERENCE 2 3. The journalist was Raffi Khatchadourian of The New Yorker. BACK TO NOTE REFERENCE 3 4. Raffi Khatchadourian, “The Doomsday Invention,” The New Yorker, November 16, 2015, newyorker.com/magazine/2015/11/23/doomsday-invention-artificial-intelligence-nick-bostrom. BACK TO NOTE REFERENCE 4 5. Geoffrey Hinton, author interview, September 6, 2023. BACK TO NOTE REFERENCE 5 6. In the 2024 election cycle, Elon Musk contributed about $290 million to Republican election efforts, dwarfing all other donors from the tech sector. Because of this extraordinary intervention, tech-related campaign donations favored Republicans over Democrats by a small margin. Without Musk, the tech sector would have strongly favored Democrats, as it has in previous cycles. As of 2025, the Democratic Party had a Musk problem, not a Silicon Valley problem. “Elon Musk Donated $288 Million in 2024 Election, Final Tally Shows,” The -- 414 of 565 -- Washington Post, January 31, 2025, washingtonpost.com/politics/2025/01/31/elon-musk- trump-donor-2024-election. BACK TO NOTE REFERENCE 6 -- 415 of 565 -- CHAPTER ONE: DESTINY 1. Shane Legg, author interview, March 28, 2023. BACK TO NOTE REFERENCE 1 2. Demis’s maternal grandmother had died in childbirth. His maternal grandfather had responded to the tragedy by disowning the baby. BACK TO NOTE REFERENCE 2 3. Tom Rowley, “Demis Hassabis: The Secretive Computer Boffin with the £400 Million Brain,” The Daily Telegraph, archived January 28, 2014, web.archive.org/web/20140201183636/http://www.telegraph.co.uk/technology/10602390/Demi s-Hassabis-the-secretive-computer-boffin-with-the-400-million-brain.html. BACK TO NOTE REFERENCE 3 4. Hassabis was second only to Judit Polgar, the Hungarian superstar who went on to become a top ten player and the strongest woman chess player of all time. BACK TO NOTE REFERENCE 4 5. Demis Hassabis, “Diary 16—MSO and ECTS,” Edge magazine, November 1999, archived at RepRev, archive.kontek.net/republic.strategyplanet.gamespy.com/d16.shtml. BACK TO NOTE REFERENCE 5 6. Hassabis recalls that this match took place at the British Chess Championship. His opponent was John Sugden. BACK TO NOTE REFERENCE 6 7. Hassabis’s friend in this memory was Dharshan Kumaran. Kumaran recalls, “Some of the best times but also some of the worst times I’ve had in my life were playing chess. If you were really competitive you would share that kind of perspective.” Dharshan Kumaran, author interview, September 13, 2023. BACK TO NOTE REFERENCE 7 8. “Demis Hassabis: ‘I Thought We Were Wasting Our Minds,’ ” Desert Island Discs, podcast, BBC Radio, July 14, 2025, 36 min., 20 sec., bbc.co.uk/programmes/b08qy1sl. BACK TO NOTE REFERENCE 8 9. Kumaran, author interview. BACK TO NOTE REFERENCE 9 -- 416 of 565 -- 10. The quote is from Matthew Sadler, a grandmaster who encountered Hassabis on the junior chess circuit. Matthew Sadler, author interview, September 15, 2023. BACK TO NOTE REFERENCE 10 11. In this, Hassabis mirrored the fictional Ender. “There’s only one thing that will make them stop hating you,” Ender is told by Colonel Graff, his Battle School minder. “Being so good at what you do that they can’t ignore you.” Orson Scott Card, Ender’s Game (Orbit, 2011), 35. BACK TO NOTE REFERENCE 11 12. Matthew Sadler and Natasha Regan, Game Changer (New in Chess, 2019), 103, ebook. BACK TO NOTE REFERENCE 12 13. Claude Shannon, “Programming a Computer for Playing Chess,” Philosophical Magazine, March 1950, vision.unipv.it/IA1/ProgrammingaComputerforPlayingChess.pdf. BACK TO NOTE REFERENCE 13 14. Eric Weiss, “Biographies,” IEEE Annals of the History of Computing 14, no. 3 (1992): 55–69, ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=150082. BACK TO NOTE REFERENCE 14 15. In addition to their superior speed, chess computers had the edge over humans in terms of precise memories. David Levy, The Chess Computer Handbook (B.T. Batsford LTD, 1984), 80. BACK TO NOTE REFERENCE 15 16. Levy, The Chess Computer Handbook, 80; “Deep Blue,” IBM, ibm.com/history/deep-blue. BACK TO NOTE REFERENCE 16 17. Developed by the artificial intelligence pioneer Arthur Samuel in the 1950s, tree search incorporating pruning is known as alpha-beta search. Abby Parks, “Arthur Samuel—Biography, History and Inventions,” History-Computer, history-computer.com/arthur-samuel-biography- history-and-inventions. BACK TO NOTE REFERENCE 17 18. The ad appeared in the June 1991 edition of the Amiga Power magazine. “Amiga Power Issue 02 (June 1991),” Retromags, retromags.com/gallery/image/6697-amiga-power-issue-02-june- 1991 [inactive]. BACK TO NOTE REFERENCE 18 19. Mike Diskett, a former employee, recalled that Molyneux would hype his projects riotously, promising that his next release would feature eye-popping technical breakthroughs that seldom materialized. “I’ve never really understood if Peter is a genius visionary who intends to make -- 417 of 565 -- his claims come true, is a compulsive liar, just fantastically eager to please or perhaps even a crazy megalomaniac who believes his own hyperbole.” Jason Schreier, “The Man Who Promised Too Much,” Kotaku, March 11, 2014, kotaku.com/the-man-who-promised-too-much- 1537352493. BACK TO NOTE REFERENCE 19 20. “We didn’t have any concept of ‘human resources,’ ” Molyneux told one interviewer later. “We used to do horrendous things, like we’d have these kids in to test the games that we did. We hospitalized a couple of them by shooting them in the eye.” Schreier, “The Man Who Promised Too Much.” BACK TO NOTE REFERENCE 20 21. Sean Cooper, author interview, September 18, 2023; Schreier, “The Man Who Promised Too Much.” BACK TO NOTE REFERENCE 21 22. Bullfrog announced the results in the August 1992 issue of Amiga Power magazine. “Amiga Power Issue 16 (August 1992),” Retromags, retromags.com/gallery/image/6697-amiga-power- issue-02-june-1991 [inactive]. BACK TO NOTE REFERENCE 22 23. “Bullfrog Productions: A History of the Legendary UK Developer,” NowGamer, archived July 6, 2017, web.archive.org/web/20170706050106/nowgamer.com/bullfrog-productions-a-history- of-the-legendary-uk-developer. BACK TO NOTE REFERENCE 23 24. Guy Simmons, author interview, October 9, 2023. Simmons worked at Bullfrog and later joined DeepMind. BACK TO NOTE REFERENCE 24 25. The quote comes from Gary Carr. Guy Simmons recalls that he shared Carr’s skepticism about Theme Park when Molyneux first announced his vision. Simmons, author interview; “Revisiting Bullfrog 25 Years On,” Retro Gamer, December 2022. BACK TO NOTE REFERENCE 25 26. According to Richard Evans, a programmer who also worked for Molyneux and stayed at his home, the trick features included a hidden door that opened when you pushed a particular book on a bookcase. The case would rotate, revealing a hidden wing of the house. There was also a large swimming pool, hidden behind a secret panel. Richard Evans, author interview, September 11, 2023. BACK TO NOTE REFERENCE 26 27. Cooper, author interview. -- 418 of 565 -- BACK TO NOTE REFERENCE 27 28. David Silver and Zoubin Ghahramani, later leading scientists at Google DeepMind, had also read Gödel, Escher, Bach at a formative age. So had Reid Hoffman, who went on to study symbolic systems and become a leading AI entrepreneur. Silver, author interview; Ghahramani, author interview; Hoffman, author interview. BACK TO NOTE REFERENCE 28 29. Douglas R. Hofstadter, Gödel, Escher, Bach: An Eternal Golden Braid, preface to the twentieth anniversary ed. (Basic Books, 1999), 4. BACK TO NOTE REFERENCE 29 30. In academic settings in the 1990s, people avoided the embarrassingly ambitious term “AI” and spoke of “machine learning” or “pattern recognition.” But in Molyneux’s gaming orbit, the term was prevalent. In a magazine column in 1999, Hassabis wrote that his gaming start-up would use “a lot of Artificial Intelligence.” Demis Hassabis, “Diary 9—Moving on Up,” Edge magazine, May 1999, archived at RepRev, archive.kontek.net/republic.strategyplanet.gamespy.com/d9.shtml. BACK TO NOTE REFERENCE 30 31. Elaborating on the comparison between academic AI and gaming AI in the 1990s, the computer scientist Richard Evans recalls that academics “often weren’t trying to solve the hardest problems, like having an agent who was embedded in a world, who had to perceive the world and then act to maximize his satisfaction.” Evans, author interview. BACK TO NOTE REFERENCE 31 32. Sadler and Regan, Game Changer, 106. BACK TO NOTE REFERENCE 32 -- 419 of 565 -- CHAPTER TWO: “DEEP PHILOSOPHICAL QUESTIONS” 1. This number reflects a conversion from pounds to dollars at the 1994 exchange rate and an adjustment for the dollar inflation that occurred between 1994 and 2024. BACK TO NOTE REFERENCE 1 2. Molyneux had lots of cars. He once ordered an Aston Martin over the airphone on a flight over the Atlantic. Stephen Totilo, “Letting Gamers Play God, and Now Themselves,” The New York Times, September 2, 2004, nytimes.com/2004/09/02/technology/circuits/letting-gamers-play- god-and-now-themselves.html. BACK TO NOTE REFERENCE 2 3. Ben Coppin, author interview, August 7, 2023. Coppin added, “He didn’t walk into the room and seem terrifying or aloof or weird or anything. He had a relaxed, put-people-at-ease kind of confidence. A gentle confidence, let’s say.” Coppin was at Queens College, Cambridge, with Hassabis and later joined DeepMind. BACK TO NOTE REFERENCE 3 4. Demis Hassabis, “Diary 3—The Funding,” Edge magazine, December 1998, archived at RepRev, archive.kontek.net/republic.strategyplanet.gamespy.com/d3.shtml. BACK TO NOTE REFERENCE 4 5. Hassabis reflects, “I can extract a lot of inspiration from a small amount of interaction. Because I think my mind just builds on whatever seed you give me.” BACK TO NOTE REFERENCE 5 6. At Cambridge, Hassabis also tested his ideas on artificial intelligence on friends, identifying a few like-minded futurists whom he would later recruit to DeepMind. Aron Cohen, a chess- playing chemist and a future DeepMind hire, recalls a late-night bull session in which Hassabis showed him a mysterious sheet of paper, covered with curious handwritten equations. “He said he had the solution to AI on it,” Cohen remembers. Aron Cohen, author interview, October 10, 2023. BACK TO NOTE REFERENCE 6 7. Silver attained the top exam results in computer science at the end of his second year, and the equal top at the end of his third year. BACK TO NOTE REFERENCE 7 8. David Silver, author interview, November 28, 2023. BACK TO NOTE REFERENCE 8 -- 420 of 565 -- 9. Cade Metz, Genius Makers: The Mavericks Who Brought AI to Google, Facebook, and the World (Dutton, 2021), 26. BACK TO NOTE REFERENCE 9 10. Hilary Putnam noted that some categories are impossible even for humans to define, even though humans have no difficulty recognizing them. For example, there is no characteristic common to all games that distinguishes games from activities that are not games. Hilary Putnam, “Much Ado About Not Very Much,” Daedalus 117, no. 1 (1988): 269–81, jstor.org/stable/20025147. BACK TO NOTE REFERENCE 10 11. James Manyika, “Getting AI Right: Introductory Notes on AI & Society,” Journal of the American Academy of Arts and Sciences 151, no. 2 (2022): 5–27, amacad.org/sites/default/files/publication/downloads/Daedalus_Sp22_01_Manyika.pdf. BACK TO NOTE REFERENCE 11 12. John Daugman, interview, conducted by the DeepMind documentary team, provided to the author, 2016. BACK TO NOTE REFERENCE 12 13. Richard Dawkins also wrote on this topic: “What lies at the heart of every living thing is not a fire, not warm breath, not a spark of life. It is information, words, instructions…. If you want to understand life, don’t think about vibrant, throbbing gels and oozes, think about information and technology.” James Gleick, Chaos: Making a New Science (Viking, 1987), 8. BACK TO NOTE REFERENCE 13 14. Looking back on Daugman’s tutorials, Silver said something similar. “The history of physics has been all about condensing complexity into precise mathematical laws.” “Biology is much less compressible. And so you have to find a language to impose structure on the mess. As humans, we aren’t equipped to find that structure, and we can’t write a program that will do it. But AI can see patterns that aren’t obvious to the human eye. And it can distill those patterns into an algorithm that explains them.” Silver, author interview. BACK TO NOTE REFERENCE 14 15. Clemency Burton-Hill, “The Superhero of Artificial Intelligence: Can This Genius Keep It in Check?,” The Guardian, February 16, 2016, theguardian.com/technology/2016/feb/16/demis- hassabis-artificial-intelligence-deepmind-alphago. BACK TO NOTE REFERENCE 15 16. “I thought I should go and spend a year living in Japan, do some meditation, some self- actualization, and play Go,” Hassabis recalled. But he dropped this plan because he believed he -- 421 of 565 -- was too old to become a top Go champion. “You can’t start at twenty-one. You’ve got to be like four. And then of course, what about AI? So reality brought me back.” BACK TO NOTE REFERENCE 16 17. Richard Evans was the chief AI programmer at Lionhead. Black & White is described in a cover story that ran in Edge magazine in January 2000. Richard Evans, author interview, September 11, 2023; Stephen Totilo, “Letting Gamers Play God, and Now Themselves,” The New York Times, September 2, 2004, nytimes.com/2004/09/02/technology/circuits/letting- gamers-play-god-and-now-themselves.html. BACK TO NOTE REFERENCE 17 18. Richard Evans says of Peter Molyneux, “We had lots of late-night conversations, he was very frank about his views. I think he was obsessed with this idea that gods are fragile, that God isn’t necessarily omnipotent. There’s a loop between the believers and God, and God without His believers is nothing. And that was a bit like Peter’s own life. While everyone believed in him, he was almost all-powerful. But that wouldn’t always be the case, right?” Evans, author interview. BACK TO NOTE REFERENCE 18 19. David Silver reflects, “He had a very tumultuous personal, emotional relationship with Peter Molyneux. It felt a bit like Peter wanted emotional control over Demis.” Silver, author interview. BACK TO NOTE REFERENCE 19 20. David Silver and Ben Coppin both recall Hassabis speaking about his strong entrepreneurial ambitions at Cambridge. Coppin, author interview; Silver, author interview. BACK TO NOTE REFERENCE 20 21. Aron Cohen, author interview, October 10, 2023. BACK TO NOTE REFERENCE 21 -- 422 of 565 -- CHAPTER THREE: THE JEDI 1. David Silver, author interview, November 28, 2023. BACK TO NOTE REFERENCE 1 2. The Elixir Diaries appeared roughly monthly in Edge magazine from October 1998 and ran to a total of about twenty-eight thousand words. All appeared under Hassabis’s byline, but later entries were ghosted by Joe McDonagh based on conversations with Hassabis. This chapter’s account of Elixir is based partly on the Diaries and partly on contemporaneous press accounts and interviews with Demis Hassabis, David Silver, Simon Green, and Adrian Bolton (Green and Bolton later joined DeepMind). Demis Hassabis, “Diary 1—Taking the Plunge,” Edge magazine, October 1998, archived at RepRev, archive.kontek.net/republic.strategyplanet.gamespy.com/d1.shtml. BACK TO NOTE REFERENCE 2 3. Silver, author interview. BACK TO NOTE REFERENCE 3 4. As in Silicon Valley, the early venture deals in China were preposterously ungenerous to entrepreneurs relative to later standards. In 1999, when China’s start-up culture was taking shape, Jack Ma gave up half his equity in order to secure funding for his start-up, Alibaba. BACK TO NOTE REFERENCE 4 5. The industrialist and the lawyer were Stewart Block and Dan Teacher. BACK TO NOTE REFERENCE 5 6. Explaining his vision over dinner at Cambridge, Hassabis had laid out a plan to build powerful AI by combining the rigor of academia with the hustle of the private sector. Academia was attractive for its commitment to deep science. Businesses were attractive because they could incentivize teams and sprint to meet deadlines. Ben Coppin, author interview. BACK TO NOTE REFERENCE 6 7. Demis Hassabis, “Diary 17—Believe in Your Idea,” Edge magazine, December 1999, archived at RepRev, archive.kontek.net/republic.strategyplanet.gamespy.com/d17.shtml. BACK TO NOTE REFERENCE 7 8. Likewise, Novistrana’s main crops would be barley and buckwheat, because that was also true of the countries it was modeled on. “These little details give a game its depth and soul,” Demis Hassabis, “Diary 18—Bright Ideas,” Edge magazine, January 2000, archived at RepRev, archive.kontek.net/republic.strategyplanet.gamespy.com/d18.shtml. -- 423 of 565 -- BACK TO NOTE REFERENCE 8 9. The successful challenger was Richard Powell, an early member of the Elixir coding team who later joined DeepMind. BACK TO NOTE REFERENCE 9 10. Silver, author interview. BACK TO NOTE REFERENCE 10 11. Edge magazine, November 1999, 45, retrocdn.net/images/c/c7/Edge_UK_078.pdf. BACK TO NOTE REFERENCE 11 12. Years later the 3D creation tool Unreal Engine 5 turned something like Clarke’s vision into the industry standard. BACK TO NOTE REFERENCE 12 13. Edge magazine, November 1999, 45. BACK TO NOTE REFERENCE 13 14. John Cassy, “Game Boy: Interview with Demis Hassabis, Managing Director, Elixir Studios,” The Guardian, September 25, 1999. (URL is no longer available.) BACK TO NOTE REFERENCE 14 15. PC Format, the UK’s biggest PC leisure magazine, had made Republic: The Revolution their game of the show. BACK TO NOTE REFERENCE 15 16. Edge magazine, November 1999, 45. BACK TO NOTE REFERENCE 16 17. Hassabis reflected, “Looking back we were probably too ambitious…Getting our first game out has taken a year longer than planned.” Maisha Frost, “Gaming Revolution Is Far from Child’s Play,” Daily Express, September 25, 2003. BACK TO NOTE REFERENCE 17 18. Silver, author interview. BACK TO NOTE REFERENCE 18 19. Silver reflected, “Demis comes from a religious background, and I think to him, goodness is a big part of what he pulls out of that. It’s very important to him to be good.” Silver, author interview. BACK TO NOTE REFERENCE 19 -- 424 of 565 -- 20. Claire Oldfield, “The Kids Are Alright,” Director, March 2000, 48–52. BACK TO NOTE REFERENCE 20 21. The friend was Ben Coppin. Coppin, author interview. BACK TO NOTE REFERENCE 21 22. The visiting RL professor was Daniel Kudenko of York University. Silver, author interview. BACK TO NOTE REFERENCE 22 23. Silver, author interview. BACK TO NOTE REFERENCE 23 24. Confirming that Hassabis’s AI ambitions provided the motive to study neuroscience, Dharshan Kumaran recalls a lunch with Hassabis before Hassabis applied to do his PhD. “I remember him saying that he wanted to learn about the brain because he wanted to make progress on AI.” Dharshan Kumaran, author interview, September 13, 2023. BACK TO NOTE REFERENCE 24 25. Kumaran elaborates, “He came in with a lot more energy than a normal PhD student. He was told to read lots of papers and then he came up with ideas. Very few PhD students meaningfully contribute to a scientific problem straight away, like Demis did.” Kumaran, author interview. BACK TO NOTE REFERENCE 25 26. Kumaran, author interview. BACK TO NOTE REFERENCE 26 27. Demis Hassabis et al., “Patients with Hippocampal Amnesia Cannot Imagine New Experiences,” Proceedings of the National Academy of Sciences 104, no. 5 (2007): 1726–31. Building on this original paper, Hassabis and Kumaran later showed that brain scans of healthy individuals revealed how memory and imagination used almost the same neural pathways. See Demis Hassabis et al., “Using Imagination to Understand the Neural Basis of Episodic Memory,” The Journal of Neuroscience 26 (2007): 14365–74, jneurosci.org/content/27/52/14365. BACK TO NOTE REFERENCE 27 28. As of May 27, 2024, the paper had 1,713 citations. “The Runners-Up,” Science 318 (2007): 1844–49, science.org/doi/10.1126/science.318.5858.1844a. BACK TO NOTE REFERENCE 28 29. For a similar view to that of Hassabis, see, for example, Anil Seth, Being You: A New Science of Consciousness (Dutton, 2021). Seth describes perception as a “controlled hallucination.” -- 425 of 565 -- BACK TO NOTE REFERENCE 29 -- 426 of 565 -- CHAPTER FOUR: THE GANG OF THREE 1. The Molyneux game was Milo and Kate. The emissary was Guy Simmons. Guy Simmons, author interview, October 9, 2023. BACK TO NOTE REFERENCE 1 2. Tomaso Poggio, email to the author, January 19, 2025. BACK TO NOTE REFERENCE 2 3. Tomaso Poggio, author interview, March 19, 2024. BACK TO NOTE REFERENCE 3 4. Geoffrey E. Hinton, Simon Osindero, and Yee-Whye Teh, “A Fast Learning Algorithm for Deep Belief Nets,” Neural Computation 18, no. 7 (2006): 1527–1554, cs.toronto.edu/~hinton/absps/fastnc.pdf [inactive]. BACK TO NOTE REFERENCE 4 5. Rajat Raina, Anand Madhavan, and Andrew Ng, “Large-Scale Deep Unsupervised Learning Using Graphics Processors,” Proceedings of the 26th Annual International Conference on Machine Learning (2009): 873–80, robotics.stanford.edu/~ang/papers/icml09- LargeScaleUnsupervisedDeepLearningGPU.pdf. BACK TO NOTE REFERENCE 5 6. Geoffrey Hinton, author interview, September 7, 2023. BACK TO NOTE REFERENCE 6 7. Shane Legg, author interview, March 28, 2023. BACK TO NOTE REFERENCE 7 8. Ben Goertzel, “Waking Up from the Economy of Dreams,” goertzel.org/benzine/WakingUpFromTheEconomyOfDreams.htm. BACK TO NOTE REFERENCE 8 9. Thomas Petzinger Jr., “Mathematician Perceives Mind as a Company-Intranet Model,” The Wall Street Journal, May 22, 1998, wsj.com/articles/SB895791428926727000. BACK TO NOTE REFERENCE 9 10. Legg reflects, “I do wonder whether my interest in intelligence was sparked by that experience as a child, being the dumb kid, and then having an IQ test and people telling me, ‘Actually, no, -- 427 of 565 -- you’re really smart.’ Like unusually smart. That probably affected me later on in my story.” Legg, author interview. BACK TO NOTE REFERENCE 10 11. Goertzel, “Waking Up from the Economy of Dreams.” BACK TO NOTE REFERENCE 11 12. Kurzweil predicted that humans and machines would eventually merge into cyborgs, with bots the size of blood cells connecting the human nervous system to virtual and augmented reality. He also suggested that machines, being intelligent, might have a claim to certain rights and liberties. He kept a collection of three hundred cat figurines in his home in Northern California. Maureen Dowd, “Elon Musk’s Billion-Dollar Crusade to Stop the A.I. Apocalypse,” Vanity Fair, March 26, 2017, vanityfair.com/news/2017/03/elon-musk-billion-dollar-crusade-to-stop- ai-space-x. BACK TO NOTE REFERENCE 12 13. The critique that AI scientists have no definition of intelligence is well summarized in Karen Hao, Empire of AI: Dreams and Nightmares in Sam Altman’s OpenAI (Penguin Press, 2025), 91. BACK TO NOTE REFERENCE 13 14. “DeepMind’s Shane Legg—Machine Super Intelligence,” The Artificial Intelligence Channel, August 25, 2017, YouTube, 1 hr., 53 min., 33 sec., youtube.com/watch?v=tFgJHzliy94. BACK TO NOTE REFERENCE 14 15. Mustafa Suleyman, author interview, May 5, 2024. BACK TO NOTE REFERENCE 15 16. A teacher bought Suleyman a new suit so that he could look sharp when he received the prize. “I’ve had so many people along the way who took care of me,” Suleyman says. Suleyman, author interview. BACK TO NOTE REFERENCE 16 17. Suleyman recalls that 5 out of 180 boys in his year were admitted to Oxford or Cambridge. BACK TO NOTE REFERENCE 17 18. “I was an extreme do-gooder. That was the one thing I retained from religion.” Suleyman, author interview. BACK TO NOTE REFERENCE 18 19. The friend was Michael Bhaskar. “I also went to a state school, and I think we shared the feeling that others might take Oxford for granted, but I didn’t think either of us did. I definitely -- 428 of 565 -- felt that it was a massive personal achievement to get in. But also, it was an important thing for my future.” Michael Bhaskar, author interview, May 4, 2024. BACK TO NOTE REFERENCE 19 20. Suleyman, author interview. BACK TO NOTE REFERENCE 20 21. Sukhi Anand, “Death Was a ‘Tragic Accident,’ ” Harrow Times, Jan. 30, 2007, harrowtimes.co.uk/news/1156573.death-was-tragic-accident. BACK TO NOTE REFERENCE 21 22. The business agreement between Hassabis and Suleyman was drawn up in December 2007 and stated that Suleyman would keep a quarter of the appreciation in the capital value of the apartments. BACK TO NOTE REFERENCE 22 23. Suleyman, author interview. BACK TO NOTE REFERENCE 23 24. “Demis Hassabis,” The Hendon Mob, pokerdb.thehendonmob.com/player.php?a=s&n=42073 [inactive]. BACK TO NOTE REFERENCE 24 25. “Inflection AI CEO Mustafa Suleyman on Building Modern AI, DeepMind Origins, and More | E1794,” This Week in Startups, August 18, 2023, YouTube, 1 hr., 15 min., 3 sec., youtube.com/watch?v=z3hmfSVmyqg. BACK TO NOTE REFERENCE 25 26. Suleyman, author interview. BACK TO NOTE REFERENCE 26 27. This profile was later cited in the DeepMind business plan. Thomas Goetz, “Sergey Brin’s Search for a Parkinson’s Cure,” Wired, June 22, 2010, wired.com/2010/06/ff-sergeys-search. BACK TO NOTE REFERENCE 27 -- 429 of 565 -- CHAPTER FIVE: FOUNDING DEEPMIND 1. Mike Hodgkinson, “Revenge of the Nerds: Should We Listen to Futurists or Are They Leading Us Towards Nerdocalypse?,” Independent, September 12, 2010, independent.co.uk/news/science/revenge-of-the-nerds-should-we-listen-to-futurists-or-are-they- leading-us-towards-lsquo-nerdocalypse-rsquo-2073910.html. BACK TO NOTE REFERENCE 1 2. Tom Abate, “Smarter Than Thou? / Stanford Conference Ponders a Brave New…,” SFGATE, May 12, 2006, sfgate.com/business/article/Smarter-than-thou-Stanford-conference-ponders- 2497190.php. BACK TO NOTE REFERENCE 2 3. Hodgkinson, “Revenge of the Nerds.” BACK TO NOTE REFERENCE 3 4. Tomaso Poggio, author interview, March 19, 2024. BACK TO NOTE REFERENCE 4 5. David Gammon, author interview, February 23, 2024. BACK TO NOTE REFERENCE 5 6. Tyler Emerson and Peter Thiel, “Introduction to Singularity Summit,” Machine Intelligence Research Institute, Vimeo, December 13, 2011, 7 min., 58 sec., vimeo.com/33632538. BACK TO NOTE REFERENCE 6 7. In his address to the Singularity Summit in 2009, Thiel had suggested that a breakthrough in AI might be needed to sustain the economic growth on which the free-market consensus depended. BACK TO NOTE REFERENCE 7 8. Mustafa Suleyman, author interview, February 29, 2024. BACK TO NOTE REFERENCE 8 9. Suleyman, author interview. BACK TO NOTE REFERENCE 9 10. Shane Legg, author interview, March 28, 2023. BACK TO NOTE REFERENCE 10 -- 430 of 565 -- 11. Karen Weise, Cade Metz, Nico Grant, and Mike Isaac, “Inside the A.I. Arms Race That Changed Silicon Valley Forever,” The New York Times, December 5, 2023, nytimes.com/2023/12/05/technology/ai-chatgpt-google-meta.html. BACK TO NOTE REFERENCE 11 12. Cade Metz, Genius Makers: The Mavericks Who Brought AI to Google, Facebook, and the World (Dutton, 2021), 109. BACK TO NOTE REFERENCE 12 13. Legg, author interview; Metz, Genius Makers, 110. BACK TO NOTE REFERENCE 13 14. Peter Thiel, author interview, August 6, 2023; Metz, Genius Makers, 110. BACK TO NOTE REFERENCE 14 15. Thiel, author interview. BACK TO NOTE REFERENCE 15 16. By 2025, xAI’s frontier AI clusters performed between 1020 and 4020 calculations per second, depending on the “density” of the calculation. Astonishingly, the first number was precisely what the business plan had projected fifteen years earlier. The second is off by a factor of four, a rounding error in a projection of this magnitude. BACK TO NOTE REFERENCE 16 17. According to a filing dated September 23, 2011, available on the website of Britain’s Companies House, Gammon plus some friends invested $540,000. The investments are listed in the names of Rockspring and Rockspring Nominees Ltd. BACK TO NOTE REFERENCE 17 18. The Founders Fund vision for Halcyon was laid out in a statement by the investing partner, Luke Nosek. The statement is reproduced in Leena Rao, “PayPal Co-Founder and Founders Fund Partner Joins DNA Sequencing Firm Halcyon Molecular,” TechCrunch, September 24, 2009, techcrunch.com/2009/09/24/paypal-co-founder-and-founders-fund-partner-joins-dna- sequencing-firm-halcyon-molecular. BACK TO NOTE REFERENCE 18 19. Rao, “PayPal Co-Founder and Founders Fund Partner Joins DNA Sequencing Firm Halcyon Molecular.” BACK TO NOTE REFERENCE 19 20. Luke Nosek, author interview, January 4, 2023. -- 431 of 565 -- BACK TO NOTE REFERENCE 20 21. “DeepMind: Building the World’s First Artificial General Intelligence,” copy provided to the author by DeepMind, September 30, 2010. BACK TO NOTE REFERENCE 21 22. According to the September 23, 2011, Companies House filing, Founders Fund owned 39 percent of DeepMind’s shares, marginally less than the 41 percent owned by Hassabis plus his two cofounders. Therefore, when the company was first formed in December 2010, Founders Fund owned almost half of it. Between December 2010 and the date of the filing, more shares were issued as money arrived from the smaller investors such as Gammon and as stock was issued to early employees, thereby diluting both Founders Fund and the founders. (Confusingly, Founders Fund internal records provided to the author indicate a slightly higher initial Founders Fund shareholding of about 50 percent. Such discrepancies are common in venture capital bookkeeping.) BACK TO NOTE REFERENCE 22 23. Legg, author interview. BACK TO NOTE REFERENCE 23 24. The DeepMind business plan of September 2010 had listed Suleyman as vice president of business development rather than as a cofounder. BACK TO NOTE REFERENCE 24 25. During Turing’s career, the London Mathematical Society had been at a different location, but Hassabis did not know this. BACK TO NOTE REFERENCE 25 26. Fittingly, given the way that science fiction has shaped the vision of the inventors of AI, Szilard’s anticipation of the consequences of nuclear physics owed much to his reading of H. G. Wells. Richard Rhodes, The Making of the Atomic Bomb, 25th Anniversary ed. (Simon & Schuster, 2012), 4, 14, 24. BACK TO NOTE REFERENCE 26 27. For example, in 1951, Turing predicted that “once the machine thinking method had started, it would not take long to outstrip our feeble powers. At some stage therefore we should have to expect the machines to take control.” The Reith Lectures, season 1, episode 4, “AI: A Future for Humans,” BBC Radio 4, March 4, 2022, 58 min., bbc.co.uk/programmes/m0012q21. BACK TO NOTE REFERENCE 27 28. Adrian Bolton, the first person to join DeepMind after the founding trio, recalls, “We were hiring people into a space that every professional in the field regarded as ridiculous.” Adrian Bolton, author interview, May 17, 2023. -- 432 of 565 -- BACK TO NOTE REFERENCE 28 29. Silver recalls, “I hadn’t processed all the difficult emotions that had come up at the end of Elixir. I think I wasn’t quite ready to be tied in any way. I’d be the first to say, it was irrational on my side.” In a parallel story, Hassabis also tried to recruit his Elixir cofounder Joe McDonagh to DeepMind. McDonagh refused, feeling unable to repeat the traumatic intensity of Elixir. Parmy Olson, Supremacy: AI, ChatGPT, and the Race That Will Change the World (Macmillan, 2024), 31, Kindle; David Silver, author interview, December 1, 2023. BACK TO NOTE REFERENCE 29 30. Sutskever also doubted that building AGI was a sensible ambition. Metz, Genius Makers, 143. BACK TO NOTE REFERENCE 30 31. Hinton recalls a joke he played on Hassabis. “When I visited DeepMind, I would play table tennis; I’m OK at it. And they said Demis never played table tennis. And so I managed to convince Demis that I was no good at table tennis, and he agreed to play me. Then he discovered I was a little better than him. He tried extremely hard to win, and he was quite annoyed. He really hates not winning.” Hinton, author interview. BACK TO NOTE REFERENCE 31 32. LeCun’s skepticism of AI start-ups had been fueled by Numenta, founded in 2005 by a previously successful entrepreneur, Jeff Hawkins, the creator of the PalmPilot and author of the influential book On Intelligence. In 2010, Numenta’s chief technology officer, Dileep George, left to start a company called Vicarious. Despite raising capital from Founders Fund and other well-connected backers, Vicarious foundered. Later, in 2012, LeCun encountered Hassabis at a conference in Scotland and revised his verdict: “I realized then that Demis was definitely brilliant,” he conceded. Nevertheless, LeCun stuck to his view that DeepMind’s aspiration to build AGI was ridiculously hubristic. Tomio Geron, “Vicarious Systems Says Its Artificial Intelligence Is the Real Deal,” The Wall Street Journal, March 6, 2024, wsj.com/articles/BL- VCDB-10538; Yann LeCun, author interview, Feb. 29, 2024. BACK TO NOTE REFERENCE 32 33. Hassabis had also tried to hire Ben Coppin, a friend from Cambridge. But Coppin resisted signing on until April 2012. BACK TO NOTE REFERENCE 33 34. Wierstra recalls, “I became very eager to join because of the possibility of doing real AI research. DeepMind was the only place where I could really do that.” Daan Wierstra, author interview, November 30. 2023. BACK TO NOTE REFERENCE 34 35. “I had repeated conversations with folks like Yoshua Bengio and Geoff Hinton. They were like, ‘Where’s the business model?’ It was incomprehensible to them, and to me. I just thought we -- 433 of 565 -- would have money for a few years and that would be it, really.” Wierstra, author interview. BACK TO NOTE REFERENCE 35 36. Wierstra, author interview. Silver recalls, “I loved working with Daan. He is really just the right mix of fun and crazy.” Silver, author interview. BACK TO NOTE REFERENCE 36 37. Legg, author interview. BACK TO NOTE REFERENCE 37 38. Wierstra, author interview. BACK TO NOTE REFERENCE 38 39. DeepMind was not alone in having its scientific ranks dominated by men. The frequently cited Taulbee surveys report that, circa 2012, women comprised just 14 to 16 percent of tenure-track computer scientists in the United States and Canada. Stuart Zweben and Betsy Bizot, “2012 Taulbee Survey,” Computing Research News 25, no. 5 (2013), cra.org/wp- content/uploads/2015/01/2012_taulbee_survey.pdf. BACK TO NOTE REFERENCE 39 40. Wierstra, author interview. BACK TO NOTE REFERENCE 40 41. Trevor Back, author interview, March 21, 2024. BACK TO NOTE REFERENCE 41 -- 434 of 565 -- CHAPTER SIX: ATARI 1. Cade Metz, Genius Makers: The Mavericks Who Brought AI to Google, Facebook, and the World (Dutton, 2021), 95–96. BACK TO NOTE REFERENCE 1 2. Vlad Mnih, author interview, October 6, 2023. BACK TO NOTE REFERENCE 2 3. Tomaso Poggio compares deep learning—a black box that delivers revolutionary results—to an early period in the history of electricity. “Deep learning was a somewhat random discovery; I often compare it to the story of electricity and Volta. Electricity was not discovered by Volta, but Volta invented the battery. He published the discovery in the year 1800. Before the battery, essentially, electricity was just sparks. Scientists could not study it. Once the battery arrived, they still did not know what electricity was, but in a short time, there was the telegraph. And then there were electrical generators and electrical motors. So it was a revolution, but it wasn’t until the 1860s and the work of James Clerk Maxwell that people understood electricity, that there was a theory. I think deep learning is a bit similar. We can build large language models and they work very well, but nobody understands completely why. There is progress being made, but there is not yet a theory of it. I think it will come.” Tomaso Poggio, author interview, March 19, 2024. BACK TO NOTE REFERENCE 3 4. Mnih, author interview. BACK TO NOTE REFERENCE 4 5. Metz, Genius Makers, 63–64. BACK TO NOTE REFERENCE 5 6. In 2006, no less a futurist than Douglas Hofstadter, author of Gödel, Escher, Bach, had called out the Singularitarians for blending reasonable predictions with utterly wild stuff—utility foglets that could assemble themselves instantly into any object on earth, civilizations that commandeered the entire galaxy to do their information processing. See /r/21dotco, “Trying to Muse Rationally About the Singularity Scenario,” Medium, January 1, 2016, medium.com/@emergingtechnology/trying-to-muse-rationally-about-the-singularity-scenario- 9c9db2eb9ece; “Douglas Hofstadter at Singularity Summit,” Machine Intelligence Research Institute, Vimeo, 2016, 34 min., 19 sec., vimeo.com/showcase/1777581/video/33633966. BACK TO NOTE REFERENCE 6 7. Shane Legg recalls that groups like Jürgen Schmidhuber’s in Switzerland bridged RL and deep learning. Wierstra adds that he had combined deep learning and RL for his PhD. Mnih, author -- 435 of 565 -- interview; Shane Legg, author interview, November 22, 2023; Daan Wierstra, author interview, December 5, 2023. BACK TO NOTE REFERENCE 7 8. Mnih, author interview. BACK TO NOTE REFERENCE 8 9. Mnih, author interview. BACK TO NOTE REFERENCE 9 10. Silver, email to the author, November 22, 2024. Separately, Shane Legg recalls, “I had to convince Dave to join DeepMind. He was not comfortable about the prospect given his experiences at Elixir. One of the ways we reassured him was that when he first came here, he reported to me.” Shane Legg, author interview, February 22, 2024. BACK TO NOTE REFERENCE 10 11. Before joining DeepMind, Silver held a Royal Society University Research Fellowship, one of the most prestigious honors available to an early-career British scientist. BACK TO NOTE REFERENCE 11 12. Two early scientific hires, Joel Veness and Shakir Mohamed, came from the so-called Bayesian tradition in machine learning. Joel Veness, author interview, January 23, 2024. BACK TO NOTE REFERENCE 12 13. Yann LeCun supervised the PhD of Koray Kavukcuoglu and was a coauthor with Karol Gregor. BACK TO NOTE REFERENCE 13 14. Neural networks delivered substantial improvements in medical diagnostics and machine translation from 2016. That year, researchers at Stanford demonstrated machine diagnosis of skin cancer to be as accurate as that achieved by human dermatologists. Also in 2016, Google introduced its Neural Machine Translation system, delivering a big jump in performance. BACK TO NOTE REFERENCE 14 15. Mnih, author interview. BACK TO NOTE REFERENCE 15 16. David Silver, author interview, December 1, 2023. BACK TO NOTE REFERENCE 16 17. Silver recalls that the idea of using Atari games to test AI had been proposed by Michael Bowling of the University of Alberta. Together with Joel Veness, an Alberta colleague who joined DeepMind in 2012, and Marc Bellemare, a PhD student and future DeepMind scientist, -- 436 of 565 -- Bowling had adapted the console versions of fifty-five games so that they could be used as testing environments on a computer. Marc G. Bellemare, Yavar Naddaf, Joel Veness, and Michael Bowling, “The Arcade Learning Environment: An Evaluation Platform for General Agents,” Journal of Artificial Intelligence Research 47 (2013): 253–79, jair.org/index.php/jair/article/view/10819. BACK TO NOTE REFERENCE 17 18. In a mark of Atari’s popularity, one of the company’s consoles sold more than thirty million units. BACK TO NOTE REFERENCE 18 19. Legg, author interview. BACK TO NOTE REFERENCE 19 20. Legg says of these early experiments, “We were happy to have people trying different things. Just because one technique works on a particular problem doesn’t mean that other techniques may not come back in the future.” BACK TO NOTE REFERENCE 20 21. Mnih elaborates, “In the early nineties, memory replay was used for data efficiency. People were trying to do RL with robots, and every time you needed data, you’d have to run a robot and that’s really expensive. So people were storing all the data and learning on it over and over to get more out of it.” Vlad Mnih, author interview, October 6, 2023. BACK TO NOTE REFERENCE 21 22. At the end of the Atari project, DeepMind summed up its achievement in a highly cited Nature paper, which stressed the value of intuitions from neuroscience. However, most DeepMind researchers downplay the extent to which neuroscience was necessary to solving Atari. Speaking of memory replay, Shane Legg says, “Did it require a neuroscience inspiration? Probably not. Is it a little bit like a system that happens to the brain? Yeah, it is a bit. So you can make of it what you will.” Legg, author interview, February 22, 2024. For his part, Mnih recalls, “Experience replay was known to happen in the brain. And we had a lot of conversations about that. How exactly is it done? When is it done? It was satisfying that this technique had some counterpart in human biology.” Mnih, author interview. BACK TO NOTE REFERENCE 22 23. “The key idea was, let’s try and turn reinforcement-learning data into something that looks like supervised-learning data,” Mnih recalled. Mnih, author interview. BACK TO NOTE REFERENCE 23 24. Silver recalls that doing reinforcement learning from raw experience had been the topic of his application to the prestigious Royal Society fellowship, which he took up after leaving Canada. He also recalls that his research plan doomed his application to become a faculty member at -- 437 of 565 -- The University of Edinburgh; the interviewers laughed at him for setting out to accomplish an impossibility. David Silver, email to the author, November 22, 2024. BACK TO NOTE REFERENCE 24 25. Mnih, author interview. BACK TO NOTE REFERENCE 25 26. Silver, author interview. BACK TO NOTE REFERENCE 26 27. Mnih, author interview. BACK TO NOTE REFERENCE 27 28. DeepMind separated the learning network from the playing network in 2013. In 2014, it came up with a name for the playing network: “the fixed target network,” so called because its parameters were fixed during play, being adjusted only when it received input from the coaching network. David Silver, email to the author, August 9, 2024. BACK TO NOTE REFERENCE 28 29. Silver, author interview. BACK TO NOTE REFERENCE 29 -- 438 of 565 -- CHAPTER SEVEN: THIEL TROUBLE 1. Megan Garber, “SpaceX’s Just-Launched Falcon 9 Rocket: The Things It Carries,” The Atlantic, October 8, 2012, theatlantic.com/technology/archive/2012/10/spacexs-just-launched- falcon-9-rocket-the-things-it-carries/263336. BACK TO NOTE REFERENCE 1 2. The malfunction of one of the rocket engines resulted in the failure of the secondary mission, which was to put a satellite into orbit. Garber, “SpaceX’s Just-Launched Falcon 9 Rocket: The Things It Carries.” BACK TO NOTE REFERENCE 2 3. At the Founders Fund retreat, Hassabis had made a strong impression, presenting DeepMind as a “Manhattan Project” for AI. “One of the investors told me that it was such a powerful speech he felt he needed to shoot Demis: it was the last chance to save the human race,” Thiel recalled. Peter Thiel, author interview, August 6, 2023. BACK TO NOTE REFERENCE 3 4. Musk’s investment of $5 million in DeepMind was not formalized until March 2013. Luke Nosek, author interview, January 4, 2024. BACK TO NOTE REFERENCE 4 5. Nosek, author interview. A rival version of this story asserts that Page first learned of DeepMind when Nosek opened his laptop and showed him a demo of its Atari agent playing Breakout. Not only is this not what Nosek recalled to the author, it is also chronologically impossible. The email from Alan Eustace, seen by the author, proves that Page was already interested in DeepMind in late 2012, more than six months before the Atari agent started working. BACK TO NOTE REFERENCE 5 6. Ironically, DeepMind and other tech companies provide excellent free food because the intelligence of their employees is scarce, and keeping them happy and productive is a sensible investment. If AGI renders intelligence plentiful, corporate incentives may be different. But then, as Hassabis says, corporations themselves may be different. BACK TO NOTE REFERENCE 6 7. Roger Scruton, Spinoza: A Very Short Introduction (Oxford University Press, 2002), 1. BACK TO NOTE REFERENCE 7 8. The target was £40 million, just under $65 million at the October 2012 exchange rate. Mustafa Suleyman, email to the author, November 27, 2024. -- 439 of 565 -- BACK TO NOTE REFERENCE 8 9. Nosek’s ability to bring Thiel around to his view was further reduced by the failure of his other pet project, Halcyon Molecular, in August 2012. (See chapter 5 for more on Halcyon.) BACK TO NOTE REFERENCE 9 10. Thiel, author interview. BACK TO NOTE REFERENCE 10 11. Thiel, author interview. BACK TO NOTE REFERENCE 11 12. The author is grateful to Alan Eustace for confirming the accuracy of the account of Google’s acquisition of DeepMind given in this and the next chapter. BACK TO NOTE REFERENCE 12 13. John Markoff, “15 Minutes of Free Fall Required Years of Taming Scientific Challenges,” The New York Times, October 17, 2014, nytimes.com/2014/10/27/science/for-world-record-alan- eustace-fought-atmosphere-and-equipment.html. BACK TO NOTE REFERENCE 13 14. On December 13, 2012, over lunch at the Princess Garden Chinese restaurant in London’s Mayfair district, Suleyman informed an investor, Ali Ojjeh of The Capital Partnership, that Google had proposed an outright acquisition, but that DeepMind had demurred, preferring to stay independent. Knowing that Suleyman had no personal wealth, Ojjeh told him he was nuts, and offered to call a psychiatrist to help him. Ali Ojjeh, email to the author, January 21, 2025. BACK TO NOTE REFERENCE 14 15. Pichette was Google’s CFO between 2008 and 2015. Patrick Pichette, author interview, July 23, 2024. BACK TO NOTE REFERENCE 15 16. Mustafa Suleyman, author interview. BACK TO NOTE REFERENCE 16 17. Geoffrey Hinton, author interview, September 6, 2023; Cade Metz, Genius Makers: The Mavericks Who Brought AI to Google, Facebook, and the World (Dutton, 2021), 2. BACK TO NOTE REFERENCE 17 18. At the time of the auction in December 2012, DeepMind’s most recent valuation had been established a year before, in its Series B fundraising: This stood at $45 million. (Data provided by Founders Fund to the author.) The valuation implies that Hassabis’s $10 million offer for Hinton’s boutique would have required handing over 22 percent of DeepMind’s shares in -- 440 of 565 -- payment (or 18 percent if DeepMind issued fresh shares to consummate the acquisition). Meanwhile, a DeepMind Technologies filing with the UK’s Companies House, dated November 9, 2012, and available online, shows that Hassabis owned 21 percent of the stock outstanding. (If DeepMind had paid for the acquisition by issuing $10 million of new shares, Hassabis’s stake would have been diluted to 17 percent.) BACK TO NOTE REFERENCE 18 19. Geoffrey Hinton, author interview. BACK TO NOTE REFERENCE 19 20. If Hinton’s group had received 22 percent of DeepMind’s stock in December 2012, this would have equated to 17.8 percent after Series C dilution. (Calculation based on UK Companies House filing, September 23, 2013.) A 17.8 percent share of the $650 million paid by Google would have yielded Hinton’s group $116 million, more than two and a half times the $44 million they realized by selling to Google a year before. Since Google would probably have paid more to buy DeepMind if it included Hinton and his cofounders, the upside to Hinton of selling to DeepMind would have been even larger. “DeepMind Technologies Limited,” Companies House, September 23, 2013, find-and-update.company- information.service.gov.uk/company/07386350/filing-history?page=3. BACK TO NOTE REFERENCE 20 21. If DeepMind had paid for Hinton’s company with newly issued DeepMind stock, Hassabis would have been diluted from 21 percent to 17.2 percent. The delta of 3.8 percent on the $650 million sale price would have been worth $24.7 million in personal cash forgone for Hassabis. To make that up, Google would have had to pay a lot more for DeepMind—specifically, (100 ÷ 17.2) × 24.7 = $143.6 million more. Given that Google paid $44 million for the Hinton trio in December 2012, it’s unlikely that it would have paid a $144 million premium for them just a year later. BACK TO NOTE REFERENCE 21 22. Mike Hodgkinson, “Silicon Valley: The Anatomy of a Cutting-Edge Start-Up,” Independent, August 13, 2011, the-independent.com/tech/silicon-valley-the-anatomy-of-a-cuttingedge- startup-2335404.html. BACK TO NOTE REFERENCE 22 23. Suleyman, author interview. BACK TO NOTE REFERENCE 23 24. Solina Chau, author interview, October 11, 2023. BACK TO NOTE REFERENCE 24 25. Suleyman, author interview. -- 441 of 565 -- BACK TO NOTE REFERENCE 25 26. Shane Legg, author interview, February 22, 2024. BACK TO NOTE REFERENCE 26 -- 442 of 565 -- CHAPTER EIGHT: GET GOOGLE 1. Claire Caine Miller, “Larry Page Says Vocal Cord Paralysis Causes His Voice Problems,” The New York Times, May 14, 2013, cbsnews.com/news/google-ceo-larry-page-explains-his-vocal- cord-paralysis. BACK TO NOTE REFERENCE 1 2. Page had hired Kurzweil in January 2013, using a pitch that was almost identical to the one he used on Hassabis. Holman W. Jenkins, “The Weekend Interview: Will Google’s Ray Kurzweil Live Forever?,” The Wall Street Journal, April 12, 2013, wsj.com/articles/SB10001424127887324504704578412581386515510. BACK TO NOTE REFERENCE 2 3. The description of Hassabis’s conversation with Page is drawn largely from interviews with Hassabis, conducted by the author. But this two-sentence paragraph comes from David Rowan, “DeepMind: Inside Google’s Super-Brain,” Wired, June 22, 2015. BACK TO NOTE REFERENCE 3 4. Harrison recalls that this first due diligence meeting took place in October 2013, in Building 1945. Don Harrison, author interview, October 12, 2023. BACK TO NOTE REFERENCE 4 5. The lawyer was Frances Butler. BACK TO NOTE REFERENCE 5 6. Mustafa Suleyman, author interview, May 5, 2024. BACK TO NOTE REFERENCE 6 7. Patrick Pichette, author interview, July 23, 2024. BACK TO NOTE REFERENCE 7 8. The AI scientist who went to Facebook was Marc’Aurelio Ranzato. Yann LeCun, author interview, February 29, 2024. BACK TO NOTE REFERENCE 8 9. On Zuckerberg’s personal efforts to recruit AI researchers, Vlad Mnih recounts the story of how his friend Navdeep Jaitly was interviewed by Zuckerberg. Cade Metz recounts Zuckerberg’s efforts to hire the researcher Clement Farabet, among others. LeCun, author interview; Vlad Mnih, author interview, October 6, 2023; Cade Metz, Genius Makers: The Mavericks Who Brought AI to Google, Facebook, and the World (Dutton, 2021), 127. -- 443 of 565 -- BACK TO NOTE REFERENCE 9 10. LeCun recalls agreeing to join Facebook at the end of November 2013. LeCun, author interview; Metz, Genius Makers, 128. BACK TO NOTE REFERENCE 10 11. Shane Legg, author interview, February 22, 2024. BACK TO NOTE REFERENCE 11 12. Metz, Genius Makers, 120. BACK TO NOTE REFERENCE 12 13. The other panelists included the future Turing Prize winner Yoshua Bengio. Yann LeCun, photograph to the author, January 23, 2025. BACK TO NOTE REFERENCE 13 14. LeCun, author interview and email to the author. BACK TO NOTE REFERENCE 14 15. Mustafa Suleyman, author interview. BACK TO NOTE REFERENCE 15 16. Metz, Genius Makers, 101. BACK TO NOTE REFERENCE 16 17. Don Harrison, Google’s lead acquisitions negotiator, recalls, “We saw the demos that showed the system worked. But perhaps it worked a different way than what was being described. Our diligence was fairly significant.” Harrison, author interview. BACK TO NOTE REFERENCE 17 18. Vlad Mnih, author interview, November 24, 2023. BACK TO NOTE REFERENCE 18 19. Luke Nosek, author interview, January 4, 2024. BACK TO NOTE REFERENCE 19 20. Harrison, author interview. BACK TO NOTE REFERENCE 20 21. Hinton expressed Hassabis’s value in terms of British pounds, saying he was worth £100 million. Geoffrey Hinton, author interview, September 6, 2023. BACK TO NOTE REFERENCE 21 -- 444 of 565 -- 22. The potential upside from fusing deep learning and reinforcement learning influenced Google’s attitude to the pricing. Harrison recalled that DeepMind’s formula “was fairly remarkable and an evolution on what Geoff [Hinton] had described and what our engineers were doing internally.” Harrison, author interview. BACK TO NOTE REFERENCE 22 23. Harrison, author interview. BACK TO NOTE REFERENCE 23 24. Calculations based on shareholdings reported to Companies House. Sam Shead, “Peter Thiel’s Fund Owned More Shares Than DeepMind’s Cofounders When the AI Lab Was Sold to Google (GOOG),” Business Insider, July 22, 2017, africa.businessinsider.com/tech/tech-peter-thiels- fund-owned-more-shares-than-deepminds-cofounders-when-the-ai-lab/h99f6s3. BACK TO NOTE REFERENCE 24 25. Metz, Genius Makers, 132. BACK TO NOTE REFERENCE 25 -- 445 of 565 -- CHAPTER NINE: INTUITION 1. After each move, the number of open squares on the board is reduced by one. BACK TO NOTE REFERENCE 1 2. During the 2000s, Martin Müller and Murray Campbell were among the experts who believed that no machine would defeat a human champion at Go for at least a couple of decades. Müller, who framed Go as a grand challenge in AI, was one of David Silver’s PhD supervisors. As late as 2014, Go programmers predicted that no system would defeat the top humans for the next ten years or more. Alan Levinovitz, “The Mystery of Go, the Ancient Game That Computers Still Can’t Win,” Wired, May 12, 2014, wired.com/2014/05/the-world-of-computer-go. BACK TO NOTE REFERENCE 2 3. David Silver, author interview, December 1, 2023. BACK TO NOTE REFERENCE 3 4. Searching as few as four moves ahead (two for each player) in Go would involve considering almost seventeen billion permutations. BACK TO NOTE REFERENCE 4 5. Early Go programmers attempted to build handcrafted evaluation functions into their systems. But these were only somewhat effective. BACK TO NOTE REFERENCE 5 6. Monte Carlo Tree Search had been invented by the Go programmer Rémi Coulom and further improved by Sylvain Gelly. Silver integrated MCTS with RL and pushed the technique further. BACK TO NOTE REFERENCE 6 7. Monte Carlo Tree Search combines random exploration with a bias toward repeating moves that have succeeded before. Over time, the random component is reduced, so that the system shifts from broad exploratory search to exploiting proven strategies. BACK TO NOTE REFERENCE 7 8. Silver’s early Go agent was the first to beat human professionals on a smaller, 9 × 9 board. BACK TO NOTE REFERENCE 8 9. Silver recalled, “I always felt I’d come back to Go, but only when there was something new to be tried.” Silver, author interview. BACK TO NOTE REFERENCE 9 -- 446 of 565 -- 10. Aja Huang, author interview, February 12, 2024. BACK TO NOTE REFERENCE 10 11. Silver, author interview. BACK TO NOTE REFERENCE 11 12. Silver, author interview. Matthew Sadler and Natasha Regan, Game Changer (New in Chess, 2019), 109, ebook; Cade Metz, Genius Makers: The Mavericks Who Brought AI to Google, Facebook, and the World (Dutton, 2021), 170–71. BACK TO NOTE REFERENCE 12 13. Huang, author interview. BACK TO NOTE REFERENCE 13 14. In 2008 Sutskever and a coauthor had built a deep-learning network that selected the correct move 36.9 percent of the time. Ilya Sutskever and Vinod Nair, “Mimicking Go Experts with Convolutional Neural Networks,” Department of Computer Science, University of Toronto (2008): 101–10, www.cs.utoronto.ca/~ilya/pubs/2008/go_paper.pdf. BACK TO NOTE REFERENCE 14 15. Sutskever recalls that as soon as he and Alex Krizhevsky won the ImageNet contest in 2012, he had realized that the same deep-learning methods could be applied to Go. Modestly, he adds, “I want to emphasize that I had the idea, but then actually making it work is of course a monumental thing.” Ilya Sutskever, author interview, November 3, 2024. BACK TO NOTE REFERENCE 15 16. Silver recalls, “We already knew how to do the search part. The key question was can we actually re-create human intuition? I felt that if we could re-create human intuition and then add all the other things on top, we could go all the way to beating the world champion. That was my working hypothesis, which turned out to be true.” Silver, author interview. BACK TO NOTE REFERENCE 16 17. Maddison’s network used 2.3 million parameters, whereas Sutskever’s had used fewer than ten thousand. Sutskever and Nair, “Mimicking Go Experts with Convolutional Neural Networks”; Chris J. Maddison et al., “Move Evaluation in Go Using Deep Convolutional Neural Networks,” arXiv, December 20, 2014, arXiv:1412.6564, arxiv.org/abs/1412.6564. BACK TO NOTE REFERENCE 17 18. Maddison, “Move Evaluation in Go Using Deep Convolutional Neural Networks.” BACK TO NOTE REFERENCE 18 -- 447 of 565 -- 19. After implementing the clean test that Silver wanted, Maddison also tried adding search to his neural network. Both results are reported in Maddison et al., 2014. Silver, author interview; Maddison, “Move Evaluation in Go Using Deep Convolutional Neural Networks.” BACK TO NOTE REFERENCE 19 20. Silver recalls of Huang, “His catchphrase was always ‘impossible.’ Every time we talked about beating the world champion or getting to a high level, he would just say ‘impossible.’ ” Silver, author interview; Huang, author interview. BACK TO NOTE REFERENCE 20 21. Metz, Genius Makers, 145–46. BACK TO NOTE REFERENCE 21 22. Sutskever believed that solving Go would require some form of search in addition to neural networks. But he wanted to scale the neural networks first and viewed search as the less interesting part of the solution. Sutskever, author interview. BACK TO NOTE REFERENCE 22 23. Silver even believed that a sufficiently large network could learn to search autonomously. He was anticipating the “emergent properties” that extremely large language models would exhibit several years later. Silver, author interview. BACK TO NOTE REFERENCE 23 24. Silver explains, “There are only two scalable things that we’ve really discovered, which are learning and search. These are the two scalable processes that we know of where you can put more and more computation into them and the system will get better and better and better.” BACK TO NOTE REFERENCE 24 25. Sutskever was born in Gorky, a city that was closed to foreigners because of its defense ties. Metz, Genius Makers, 92. BACK TO NOTE REFERENCE 25 26. Sutskever’s words here are recalled by Silver. Consistent with Silver’s memory, Sutskever recalls that he thought that the search part of Go systems was already adequately advanced, and that the remaining challenge was to improve the neural networks. Sutskever, author interview. BACK TO NOTE REFERENCE 26 27. Silver, author interview, March 8, 2024. Sutskever agreed that it would take self-play to surpass human experts. Sutskever, author interview. BACK TO NOTE REFERENCE 27 -- 448 of 565 -- 28. The design of rewards is a key challenge in building reinforcement-learning systems. Guez’s value net allowed the Go system to recognize a rewarding position. BACK TO NOTE REFERENCE 28 29. Huang, author interview. BACK TO NOTE REFERENCE 29 30. In addition to relating this and other stories, Graepel assisted the author with a lucid explanation of how AlphaGo works. Thore Graepel, author interview, January 19, 2024. BACK TO NOTE REFERENCE 30 31. AlphaGo’s learning process mimicked something that humans do. It discovered knowledge of Go through laborious System Two thinking (tree search), but then transferred the knowledge to System One (the policy and value nets), so that it could be retrieved faster and with less effort in the future. Graepel, email to the author. BACK TO NOTE REFERENCE 31 32. Huang, author interview. BACK TO NOTE REFERENCE 32 33. Huang, author interview. BACK TO NOTE REFERENCE 33 34. Elizabeth Gibney, “Go Players React to Computer Defeat,” Nature, January 27, 2016, doi.org/10.1038/nature.2016.19255. BACK TO NOTE REFERENCE 34 35. Silver, author interview. BACK TO NOTE REFERENCE 35 36. Cade Metz, “Facebook Aims Its AI at the Game No Computer Can Crack,” Wired, November 3, 2015, wired.com/2015/11/facebook-is-aiming-its-ai-at-go-the-game-no-computer-can-crack; Metz, Genius Makers, 167; Maddison, “Move Evaluation in Go Using Deep Convolutional Neural Networks.” BACK TO NOTE REFERENCE 36 37. Metz, “Facebook Aims Its AI at the Game No Computer Can Crack.” BACK TO NOTE REFERENCE 37 38. Silver, author interview. BACK TO NOTE REFERENCE 38 -- 449 of 565 -- 39. As of January 2024, this paper had been cited 7,815 times. David Silver et al., “Mastering the Game of Go with Deep Neural Networks and Tree Search,” Nature 529 (2016): 484–89, doi.org/10.1038/nature16961. BACK TO NOTE REFERENCE 39 40. Metz, Genius Makers, 170. BACK TO NOTE REFERENCE 40 41. Metz, Genius Makers, 167–70; Ben Buchanan and Michael Imbrie, The New Fire: War, Peace, and Democracy in the Age of AI (MIT Press, 2022), 43. BACK TO NOTE REFERENCE 41 42. “All the strategy was driven by Demis. The idea of staging something public and dramatic. I don’t think any of us would’ve thought that big.” Graepel, author interview. BACK TO NOTE REFERENCE 42 43. Cade Metz, Genius Makers, 146–50. BACK TO NOTE REFERENCE 43 44. Huang, author interview. BACK TO NOTE REFERENCE 44 45. Metz, Genius Makers, 172. BACK TO NOTE REFERENCE 45 46. Christopher Moyer, “How Google’s AlphaGo Beat a Go World Champion,” The Atlantic, March 28, 2016, theatlantic.com/technology/archive/2016/03/the-invisible-opponent/475611. BACK TO NOTE REFERENCE 46 47. Buchanan and Imbrie, The New Fire, 43. BACK TO NOTE REFERENCE 47 48. “Match 1 15 min Summary—Google DeepMind Challenge Match,” Google DeepMind, March 11, 2016, YouTube, 17 min., 29 sec., youtube.com/watch?v=bIQxOsRAXCo. BACK TO NOTE REFERENCE 48 49. Greg Kohs, “AlphaGo,” AlphaGo, 2017, 43 min., 30 sec., alphagomovie.com. BACK TO NOTE REFERENCE 49 50. Sadler and Regan, Game Changer, 20, ebook. -- 450 of 565 -- BACK TO NOTE REFERENCE 50 51. Kohs, “AlphaGo,” at 56:12. BACK TO NOTE REFERENCE 51 52. The journalist was Cade Metz. See Metz, “The Rise of Artificial Intelligence and the End of Code,” Wired, May 19, 2016, wired.com/2016/05/google-alpha-go-ai. BACK TO NOTE REFERENCE 52 53. Rhiannon Williams, “Fan Hui: What I Learned from Losing to DeepMind’s AlphaGo,” The I Paper, May 25, 2019, inews.co.uk/news/technology/fan-hui-what-i-learned-from-losing-to- deepminds-alphago-google-295005. BACK TO NOTE REFERENCE 53 54. Metz, “The Rise of Artificial Intelligence and the End of Code.” BACK TO NOTE REFERENCE 54 55. Graepel, author interview. BACK TO NOTE REFERENCE 55 -- 451 of 565 -- CHAPTER TEN: OUT OF EDEN 1. The quote is from a warning posted by Musk on the futurology website Edge.org in November 2014. He deleted it minutes later, but screenshots circulated online. Stephen Witt, The Thinking Machine: Jensen Huang, Nvidia, and the World’s Most Coveted Microchip (Viking, 2025), 149. BACK TO NOTE REFERENCE 1 2. Metz reports that, also in late 2014, Musk called Yann LeCun about trying to hire the best AI talent to lead Tesla’s self-driving car initiative. See Cade Metz, Genius Makers: The Mavericks Who Brought AI to Google, Facebook, and the World (Dutton, 2021), 152–56. On Musk’s contrasting fear of AI, Luke Nosek recalls Musk attending an AI safety conference organized by the Future of Life Institute in Puerto Rico in January 2015 and saying, “I am utterly saturated with fear.” Luke Nosek, author interview, January 4, 2024. BACK TO NOTE REFERENCE 2 3. According to a lawsuit later filed by Musk against OpenAI, in April 2015 Hassabis asked Musk to join DeepMind’s ethics and safety board and to host it at SpaceX. “Musk v. OpenAI, Inc. et al., Complaint,” Patently-O, March 2024, cdn.patentlyo.com/media/2024/03/musk-vs- openAI.pdf. BACK TO NOTE REFERENCE 3 4. Peter Thiel and Reid Hoffman are among those who recall Musk’s conviction that Hassabis’s Evil Genius game held the clue to his true character. Peter Thiel, author interview, August 7, 2023; Reid Hoffman, author interview, September 29, 2023. BACK TO NOTE REFERENCE 4 5. Lyndon Johnson’s political skills were illustrated by his relationship with the Senate leader Sam Rayburn. He made Rayburn feel as though he were his surrogate son, because that was what Rayburn yearned for. Altman, having read Caro, was good at making powerful people feel and hear what they wanted, as his early approach to Musk demonstrated. BACK TO NOTE REFERENCE 5 6. The Chinese colleague was Qi Lu. See Karen Hao, Empire of AI: Dreams and Nightmares in Sam Altman’s OpenAI (Penguin Press, 2025), vii. BACK TO NOTE REFERENCE 6 7. Keach Hagey, The Optimist: Sam Altman, OpenAI, and the Race to Invent the Future (Norton, 2025), 270. BACK TO NOTE REFERENCE 7 -- 452 of 565 -- 8. The Altman and Musk emails were released as part of the later litigation between the two men. Habryka, “OpenAI Email Archives (from Musk v. Altman and OpenAI blog),” LessWrong, November 16, 2024, lesswrong.com/posts/5jjk4CDnj9tA7ugxr/openai-email-archives-from- musk-v-altman-and-openai-blog. BACK TO NOTE REFERENCE 8 9. Witt, The Thinking Machine, 201. BACK TO NOTE REFERENCE 9 10. Notwithstanding his professed horror at humans melding with machines, Musk founded Neuralink in 2016. The company builds brain-computer interfaces. BACK TO NOTE REFERENCE 10 11. Cade Metz et al., “Ego, Fear and Money: How the A.I. Fuse Was Lit,” The New York Times, December 3, 2023, nytimes.com/2023/12/03/technology/ai-openai-musk-page-altman.html; Metz, Genius Makers, 158–60. BACK TO NOTE REFERENCE 11 12. In a 2023 interview with Andrew Ross Sorkin of The New York Times, conducted during the DealBook Summit, Musk recalled the speciesist interchange. “I’m like, OK, listen, this guy’s calling me a speciesist. He doesn’t care about AI safety. We’ve got to have some counterpoint here.” “DealBook Summit 2023 Elon Musk Interview,” Rev, transcript available at rev.com/transcripts/dealbook-summit-2023-elon-musk-interview-transcript. BACK TO NOTE REFERENCE 12 13. Metz et al., “Ego, Fear and Money: How the A.I. Fuse Was Lit.” BACK TO NOTE REFERENCE 13 14. As the writer Meghan O’Gieblyn puts it, “All the eternal questions have become engineering problems.” O’Gieblyn’s pithy phrase is all the more powerful because her own worldview was originally religious. Meghan O’Gieblyn, God, Human, Animal, Machine: Technology, Metaphor, and the Search for Meaning (Doubleday, 2021), Kindle. BACK TO NOTE REFERENCE 14 15. In another perspective on Page, David Silver suggests that Page was both a transhumanist, caring about the survival of intelligence rather than the survival of humans, but also focused on AI safety. “If you take account of the suffering on the way to a future of intelligent machines, then safety matters. That’s how the two thoughts can sit in the same head.” David Silver, author interview, December 8, 2023. BACK TO NOTE REFERENCE 15 16. Three DeepMind veterans separately recalled Hassabis’s bunker vision of the AI endgame. -- 453 of 565 -- BACK TO NOTE REFERENCE 16 17. The Manhattan Project’s secrecy was breached by the Soviet spy Klaus Fuchs, so it failed in its second purpose. BACK TO NOTE REFERENCE 17 18. In an interview with Hannah Fry, Hassabis also refers to his envisaged scientific band as “The Avengers,” an allusion to the Marvel Comics team of superheroes. Google DeepMind: The Podcast, season 2, episode 9, “The Promise of AI with Demis Hassabis,” Google DeepMind, March 15, 2022, 30 min., 29 sec., youtube.com/watch?v=GdeY-MrXD74. BACK TO NOTE REFERENCE 18 19. Oppenheimer was joined in his view by prominent intellectuals such as Bertrand Russell and John von Neumann. Nick Bostrom, Superintelligence: Paths, Dangers, Strategies (Oxford University Press, 2014), 139–40, ebook; Daryl G. Kimball, “Oppenheimer, the Bomb, Arms Control, Then and Now,” Bulletin of the Atomic Scientists, July 29, 2023, thebulletin.org/2023/07/oppenheimer-the-bomb-and-arms-control-then-and-now. BACK TO NOTE REFERENCE 19 20. Several participants recalled bits of this meeting in interviews with the author. Metz et al., “Ego, Fear and Money: How the A.I. Fuse Was Lit.” BACK TO NOTE REFERENCE 20 21. Mustafa Suleyman, author interview, May 5, 2024. BACK TO NOTE REFERENCE 21 22. Elaborating on his view of AI safety, Suleyman recalls, “I thought there would be two or three big AGI developers and it would be safer if there was the singleton developer that would have complete control. So I thought we might end up with this exponentially powerful system controlled by Larry and Sergey, which would be potentially open to abuse.” Suleyman, author interview. BACK TO NOTE REFERENCE 22 23. Suleyman, author interview. BACK TO NOTE REFERENCE 23 24. Michael Bhaskar and Mustafa Suleyman, The Coming Wave: Technology, Power, and the Twenty-First Century’s Greatest Dilemma (Crown, 2023), 11–13; Metz et al., “Ego, Fear and Money: How the A.I. Fuse Was Lit.” BACK TO NOTE REFERENCE 24 25. Reid Hoffman, author interview, October 3, 2023. -- 454 of 565 -- BACK TO NOTE REFERENCE 25 26. Luke Nosek, who was close to Musk, recalls, “When Elon started OpenAI, the purpose was to counter the private AI labs, of which there was only one that was significant. He wasn’t doing this to counter Mark [Zuckerberg]. It was the combination of Demis and Larry and the Google organization.” Separately and ironically, Peter Thiel was among OpenAI’s backers. Despite having lost faith in DeepMind in 2013, he had the appetite for a second AGI venture. Nosek, author interview. BACK TO NOTE REFERENCE 26 27. Shane Legg recalls that, not long after OpenAI was set up, he heard that discussions to found OpenAI had begun even before the SpaceX meeting. “This annoyed me rather a lot: our ethics board people were getting to hear about our plans and visions for the future while actively setting up a new competing company, and failing to mention this obvious conflict of interest.” Shane Legg, email to the author, November 7, 2024. BACK TO NOTE REFERENCE 27 -- 455 of 565 -- CHAPTER ELEVEN: P0 PLUS PLUS 1. One DeepMinder recalls, “I would never in a million years have guessed that Mustafa had had a much smaller shareholding [than Demis, at the founding]. They each ran their own domains, and they appeared close to equal.” A senior figure who worked closely with DeepMind in 2015–2016 said, “Demis was the leader, the visionary. Moose was the guy who made things happen and had this idealistic activist edge because of his background.” BACK TO NOTE REFERENCE 1 2. Regarding Suleyman’s policy operation, a colleague recalls, “Moose was always the one saying, ‘Let’s have these private dinners and invite these MPs and journalists. I’ll talk to them. A dinner over here, and then we’ll have breakfast at Davos.’ That was very Moose-driven; it was much less Demis.” BACK TO NOTE REFERENCE 2 3. Suleyman elaborates, “I always wanted to get more researchers focused on real-world problems, and he was always obsessed with having researchers work on simulations. That was always a big source of tension.” Mustafa Suleyman, author interview, May 6, 2024; David Rowan, “DeepMind: Inside Google’s Groundbreaking Artificial Intelligence Startup,” Wired, June 22, 2015, wired.com/story/deepmind. BACK TO NOTE REFERENCE 3 4. Nick Srnicek and Alex Williams, Inventing the Future: Postcapitalism and a World Without Work (Verso, 2015). BACK TO NOTE REFERENCE 4 5. Srnicek and Williams, Inventing the Future, 15. BACK TO NOTE REFERENCE 5 6. Srnicek and Williams credit this line to the political scientist Jodi Dean. Srnicek and Williams, Inventing the Future, 25. BACK TO NOTE REFERENCE 6 7. Srnicek and Williams, Inventing the Future, 2. BACK TO NOTE REFERENCE 7 8. Suleyman elaborates, “The founding vision of NHS was that access to health care should not be about your ability to pay. Nor should it be about your ability to advocate.” Suleyman, author interview. BACK TO NOTE REFERENCE 8 -- 456 of 565 -- 9. Suleyman had been introduced to Laing after meeting two eminent physicians, Geraint Rees and Hugh Montgomery; Rees was a neurologist who had known Hassabis at University College London. Laing recalls that Montgomery accompanied him to an early meeting at Suleyman’s office. Upon arrival, Montgomery announced, “You do realize that this building is on the site of the oldest brothel in London?” Chris Laing, email to the author, May 16, 2024; Suleyman, author interview. BACK TO NOTE REFERENCE 9 10. Marion Kerr et al., “The Economic Impact of Acute Kidney Injury in England,” Nephrology Dialysis Transplantation (2014): 1362–68, pubmed.ncbi.nlm.nih.gov/24753459. BACK TO NOTE REFERENCE 10 11. Laing elaborates, “Doctors might not open results, they might open them late, they might miss the result, they might not know what to do when they get it.” Chris Laing, author interview. BACK TO NOTE REFERENCE 11 12. Suleyman, author interview. BACK TO NOTE REFERENCE 12 13. Laing, author interview. BACK TO NOTE REFERENCE 13 14. Nine months after this interview, in December 2024, King underscored his respect for Suleyman by announcing that he would leave a good position at UnitedHealth Group to work for him at Microsoft. Dominic King, author interview, March 22, 2024. BACK TO NOTE REFERENCE 14 15. The Soho Farmhouse gathering was part of a series of events called the Future Forum, convened by Google. Suleyman’s talk and the audience reaction is recalled by Suleyman and another eyewitness. BACK TO NOTE REFERENCE 15 16. Corroborating Suleyman’s memory, Dominic King recalls, “MS and I had visited a hospital and we went to grab a bottle of water in the next-door Tesco. And I saw the paper list just sitting on the ground. Having been discussing the importance of ‘data security’ just before it was certainly interesting to find a fully identifiable list of patients with sensitive information sitting on the ground!” Dominic King, email to the author, November 9, 2024. BACK TO NOTE REFERENCE 16 17. Suleyman, author interview. BACK TO NOTE REFERENCE 17 -- 457 of 565 -- 18. Suleyman, author interview. BACK TO NOTE REFERENCE 18 19. On the unprecedented combination of deep access to DeepMind’s work and total freedom to speak about it, the chair of the Independent Review Panel, the scientist and politician Julian Huppert, wrote, “We know of no other commercial organization that has set up an independent panel in this way.” Julian Huppert et al., “DeepMind Health Independent Review Panel Annual Report,” July 2016, storage.googleapis.com/deepmind- media/DeepMind.com/Blog/independent-reviewers-release-first-annual-report-on-deepmind- health/DeepMind%20Health%20Independent%20Review%20Annual%20Report%202017.pdf. BACK TO NOTE REFERENCE 19 20. Suleyman, author interview. BACK TO NOTE REFERENCE 20 21. Suleyman recalled, “I decided to use my power to overrule Google, and I did.” Suleyman, author interview. Dominic King stresses the credibility of the overseers. “Really impressive group and it was no mean feat to encourage people to give up serious time for an unpaid gig at Google. We asked a lot from them, and I don’t think anyone thought it was window dressing.” King, email to the author. BACK TO NOTE REFERENCE 21 22. The former minister was the surgeon Lord Ara Darzi. He was an authoritative but not totally dispassionate witness, since he had worked closely with Dominic King. Sarah Boseley and Paul Lewis, “Smart Care: How Google DeepMind Is Working with NHS Hospitals,” The Guardian, February 24, 2016, theguardian.com/technology/2016/feb/24/smartphone-apps-google- deepmind-nhs-hospitals. BACK TO NOTE REFERENCE 22 23. Trevor Back, who worked for Suleyman on the health initiative, reflects, “If the Independent Review Panel could be shown to work, then imagine every company working on AI for health care having one. It’d be phenomenally impactful in ensuring the safe transition of research into a clinical setting.” Trevor Back, author interview, March 21, 2024. BACK TO NOTE REFERENCE 23 24. The web version was published as Sophie Borland et al., “Google Handed Patients’ Files Without Permission,” Daily Mail, May 3, 2016, dailymail.co.uk/news/article-3571433/Google- s-artificial-intelligence-access-private-medical-records-1-6million-NHS-patients-five-years- agreed-data-sharing-deal.html. The headline dominated the front page of the paper version the next morning. The Daily Mail based its claims on an equally misleading article in New Scientist, published on April 29, 2016. See Hal Hodson, “Revealed: Google AI Has Access to Huge Haul of NHS Patient Data,” New Scientist, April 29, 2016, -- 458 of 565 -- newscientist.com/article/2086454-revealed-google-ai-has-access-to-huge-haul-of-nhs-patient- data. BACK TO NOTE REFERENCE 24 25. Trevor Back recalls, “We had set up a UK data center that was separate from Google. You can understand how difficult that was: the lawyers at Google assumed they should have access. I’m like, ‘No, you’re not allowed to see it. It’s got NHS data on it.’ We were doing so much more than other health providers to build the right infrastructure and processes.” When the Independent Review Panel reported on DeepMind’s work in 2017, it upheld this judgment. Having commissioned a data security firm to study DeepMind’s handling of NHS information, it stated that “there were no critical or high-level vulnerabilities detected.” Back, author interview; Huppert et al., “DeepMind Health Independent Review Panel Annual Report,” 12. BACK TO NOTE REFERENCE 25 26. Defending the template, the Royal Free Hospital said it was “the standard NHS information- sharing agreement set out by NHS England’s corporate information governance department and is the same as the other 1,500 agreements with third-party organizations that process NHS patient data.” See Alex Hern, “Google DeepMind Pairs with NHS to Use Machine Learning to Fight Blindness,” The Guardian, July 5, 2016, theguardian.com/technology/2016/jul/05/google-deepmind-nhs-machine-learning-blindness. In 2017, two public authorities (the National Data Guardian and the Information Commissioner’s Office) ruled that DeepMind should have had a data agreement allowing it to act as an “information controller,” not just an “information processor.” However, both official bodies concluded that this error had been the fault not of DeepMind but of the Royal Free, which had chosen the template for the contract. See also note 38, below. BACK TO NOTE REFERENCE 26 27. It should be noted that the Streams app was not analyzing the health status of individuals or using patient data to train AI. It was merely moving information from one part of the NHS to another: from the blood labs to the clinicians. To build the Streams app, a handful of DeepMind engineers and designers had studied the format of the hospital’s health records—the architecture of the data, not what the data revealed about particular patients. In sum, Google’s access to patient data was zero and DeepMind’s was minimal. BACK TO NOTE REFERENCE 27 28. Clemency Burton-Hill, “The Superhero of Artificial Intelligence: Can This Genius Keep It in Check?,” The Guardian, February 16, 2016, theguardian.com/technology/2016/feb/16/demis- hassabis-artificial-intelligence-deepmind-alphago. BACK TO NOTE REFERENCE 28 29. Sam Altman, email to Elon Musk, December 11, 2015. The email was released as part of Musk v. Altman. Habryka, “OpenAI Email Archives (from Musk v. Altman and OpenAI blog),” -- 459 of 565 -- LessWrong, November 16, 2024, lesswrong.com/posts/5jjk4CDnj9tA7ugxr/openai-email- archives-from-musk-v-altman-and-openai-blog. BACK TO NOTE REFERENCE 29 30. Sutskever was offered a salary of $6 million to stay at Google. OpenAI offered less than a third of that, and still managed to hire him. Like many leading figures in the discovery of AI, Sutskever was not primarily motivated by wealth, although he certainly acquired plenty of it by any normal standard. BACK TO NOTE REFERENCE 30 31. King, author interview. BACK TO NOTE REFERENCE 31 32. Alistair Connell et al., “Implementation of a Digitally Enabled Care Pathway,” Journal of Medical Internet Research 21 (2019), jmir.org/2019/7/e13143; Alistair Connell et al., “Evaluation of a Digitally Enabled Care Pathway,” NPJ Digital Medicine (2019), pubmed.ncbi.nlm.nih.gov/31396561. BACK TO NOTE REFERENCE 32 33. Although the AI often generated false positives, it correctly predicted 90 percent of cases in which a patient would need dialysis at some point over the next ninety days. Nenad Tomašev et al., “A Clinically Applicable Approach to Continuous Prediction of Future Acute Kidney Injury,” Nature 572, no. 7767 (2019): 116–19, nature.com/articles/s41586-019-1390-1. BACK TO NOTE REFERENCE 33 34. Jeffrey De Fauw et al., “Clinically Applicable Deep Learning for Diagnosis and Referral in Retinal Disease,” Nature Medicine 24, no. 9 (2018): 1342–50, pubmed.ncbi.nlm.nih.gov/30104768. BACK TO NOTE REFERENCE 34 35. The Royal College also reported that 8 percent of hospital posts for breast radiologists were unfilled. Nicole Kobie, “DeepMind’s New AI Can Spot Breast Cancer Just as Well as Your Doctor,” Wired, January 1, 2020, wired.com/story/deepmind-google-ai-breast-cancer. BACK TO NOTE REFERENCE 35 36. Eric Topol, author interview, May 28, 2024. BACK TO NOTE REFERENCE 36 37. David Aaranovitch, “DeepMind, Artificial Intelligence, and the Future of the NHS,” The Times, September 14, 2019, thetimes.com/uk/healthcare/article/deepmind-artificial-intelligence-and- the-future-of-the-nhs-r8c28v3j6. BACK TO NOTE REFERENCE 37 -- 460 of 565 -- 38. The inquiries were conducted by the National Data Guardian and the Information Commissioner’s Office. Both bodies criticized the Royal Free Hospital for failing to be transparent with patients about how their data would be used. But neither held DeepMind responsible for this violation. Further, the criticism of the Royal Free was focused on the brevity of the data sharing agreement it had signed with DeepMind. No actual data abuse was discovered. BACK TO NOTE REFERENCE 38 39. Laing, author interview, May 15, 2024. BACK TO NOTE REFERENCE 39 40. In a mark of Keane’s stature, Eric Topol described him as “a global pace-setter.” Topol, author interview. BACK TO NOTE REFERENCE 40 41. Pearse Keane, author interview, March 26, 2024. BACK TO NOTE REFERENCE 41 42. Dario Amodei, “Machines of Loving Grace,” October 2024, darioamodei.com/essay/machines- of-loving-grace. BACK TO NOTE REFERENCE 42 43. Suleyman, author interview. BACK TO NOTE REFERENCE 43 44. For example, having assembled the high-powered Independent Review Panel, Suleyman frequently neglected to show up to its meetings. BACK TO NOTE REFERENCE 44 45. One DeepMind colleague recalled, “Moose’s strength is having great ideas. His weakness is he would forget what he’d said the previous day.” BACK TO NOTE REFERENCE 45 -- 461 of 565 -- CHAPTER TWELVE: THE AGENT AND THE TRANSFORMER 1. David Silver, author interview, March 8, 2024. BACK TO NOTE REFERENCE 1 2. Ben Buchanan and Andrew Imbrie, The New Fire: War, Peace, and Democracy in the Age of AI (MIT Press, 2022), 48. BACK TO NOTE REFERENCE 2 3. Silver’s story is another instance of how beauty—the quality that Hinton and Oppenheimer called “sweetness”—motivated AI pioneers. David Silver, diary entry given to the author, April 21, 2016. BACK TO NOTE REFERENCE 3 4. Silver’s key colleagues on AlphaZero were Julian Schrittwieser, Thomas Hubert, and Ioannis Antonoglou. Silver, email to the author, February 14, 2025; David Silver et al., “Mastering the Game of Go Without Human Knowledge,” Nature 550, no. 7676 (2017): 354–59, doi.org/10.1038/nature24270. BACK TO NOTE REFERENCE 4 5. David Silver et al., “Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm,” arXiv, December 5, 2017, doi.org/10.48550/arXiv.1712.01815. BACK TO NOTE REFERENCE 5 6. Silver, author interview. BACK TO NOTE REFERENCE 6 7. Hassabis expands on this idea: “There were two questions. Could a learning system beat a brute-force expert system at all in chess, given how perfected the expert systems were? And was there anything left to discover in chess from a concepts point of view? I was interested in both those answers. So then I determined we would do AlphaZero with chess because that’s the best scientific question, where you’re not sure beforehand which answer’s right. Either answer would be very interesting.” BACK TO NOTE REFERENCE 7 8. Matthew Sadler and Natasha Regan, Game Changer (New in Chess, 2019), 139, ebook. BACK TO NOTE REFERENCE 8 9. Garry Kasparov, “Chess, a Drosophila of Reasoning,” Science 362, no. 6419 (2018): 1087, doi.org/10.1126/science.aaw2221. Likewise, the chess author Matthew Sadler agreed that AlphaZero’s moves felt strikingly intuitive. See Sadler and Regan, Game Changer, 64. -- 462 of 565 -- BACK TO NOTE REFERENCE 9 10. Steven Pinker, The Language Instinct: How the Mind Creates Language (William Morrow, 1994), 190–91. BACK TO NOTE REFERENCE 10 11. The strength of AlphaZero was another illustration of Rich Sutton’s “Bitter Lesson of AI,” which held that programs handcrafted by humans would perform less well than ones that relied more on the capacity of general systems to learn for themselves. Sadler and Regan, Game Changer, 166. BACK TO NOTE REFERENCE 11 12. David Silver, interview by the DeepMind documentary team, May 25, 2018. BACK TO NOTE REFERENCE 12 13. K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016): 770–78, ieeexplore.ieee.org/document/7780459. BACK TO NOTE REFERENCE 13 14. David Silver, interview by DeepMind documentary team, September 7, 2018. BACK TO NOTE REFERENCE 14 15. Sutskever recalled, “What I said in my thesis was, ‘Recurrent neural networks would be so great if only you could train them!’ ” Ilya Sutskever, author interview, November 3, 2024. BACK TO NOTE REFERENCE 15 16. Dzmitry Bahdanau, KyungHyun Cho, and Yoshua Bengio, “Neural Machine Translation by Jointly Learning to Align and Translate,” arXiv, May 19, 2016, arxiv.org/abs/1409.0473. “Attention” is mentioned only three times in the paper, but it emerged as the key contribution. BACK TO NOTE REFERENCE 16 17. Sutskever, author interview. BACK TO NOTE REFERENCE 17 18. Sutskever, author interview. BACK TO NOTE REFERENCE 18 19. Sutskever, author interview. BACK TO NOTE REFERENCE 19 -- 463 of 565 -- 20. “Unsupervised Sentiment Neuron,” OpenAI, April 6, 2017, openai.com/index/unsupervised- sentiment-neuron. The lead author on the paper was Alec Radford. BACK TO NOTE REFERENCE 20 21. Sutskever, author interview. BACK TO NOTE REFERENCE 21 22. The present, Bergson wrote, is “the invisible progress of the past gnawing into the future.” “Henri Bergson Was Once the World’s Most Famous Philosopher,” The Economist, January 23, 2025, economist.com/culture/2025/01/23/henri-bergson-was-once-the-worlds-most-famous- philosopher. BACK TO NOTE REFERENCE 22 23. For image recognition, convolutional neural networks had processed the data from one image in parallel. But they did not process the data from an arbitrarily large batch of photos in parallel. In contrast, transformers were soon used to parallel-process vast corpuses of text. BACK TO NOTE REFERENCE 23 24. Sutskever, author interview. Likewise, asked how long it took him to recognize the significance of the transformer paper, Yann LeCun responded, “Six minutes. It was picked up immediately.” LeCun, author interview, February 29, 2024. BACK TO NOTE REFERENCE 24 25. Sutskever, author interview. BACK TO NOTE REFERENCE 25 26. Sutskever, author interview. BACK TO NOTE REFERENCE 26 27. Sutskever, author interview. BACK TO NOTE REFERENCE 27 28. Alec Redford, “Improving Language Understanding with Unsupervised Learning,” OpenAI, June 11, 2018, openai.com/index/language-unsupervised. BACK TO NOTE REFERENCE 28 29. Emily Bender, Timnit Gebru, Angelina McMillan-Major, and Margaret Mitchell, “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?,” Association for Computing Machinery (2021): 610–23, dl.acm.org/doi/pdf/10.1145/3442188.3445922. BACK TO NOTE REFERENCE 29 -- 464 of 565 -- 30. “Ilya Sutskever (OpenAI Chief Scientist)—Building AGI, Alignment, Future Models, Spies, Microsoft, Taiwan, & Enlightenment,” Dwarkesh Podcast, March 27, 2023, 47 min., 40 sec., dwarkesh.com/p/ilya-sutskever. BACK TO NOTE REFERENCE 30 31. The case for RL agents is that they can learn by acting in the world. The case for self- supervised, deep-learning systems is that they can learn, effectively, by going to the library. BACK TO NOTE REFERENCE 31 32. By 2024, large models had ingested most of the textual data on the internet and the pendulum was swinging back toward Silver’s view. Sutskever was starting to emphasize the significance of AI-generated data. BACK TO NOTE REFERENCE 32 -- 465 of 565 -- CHAPTER THIRTEEN: ON LANGUAGE AND NATURE 1. Marc’Aurelio Ranzato, author interview, December 17, 2024. Likewise, after leaving DeepMind, Daan Wierstra reflected, “In the prestige rank of machine learning, building chatbots was of course the lowest prestige of all. Until GPT-3.5, I didn’t feel that these language models were anything but a curiosity.” Wierstra, author interview, October 3, 2024. Other computer scientists, from industry and academia, expressed the same view to the author. BACK TO NOTE REFERENCE 1 2. Dario Amodei, an OpenAI scientist who in 2020 quit to found the rival lab Anthropic, was among those who saw the potential of GPT immediately after the release of its first iteration in June 2018. BACK TO NOTE REFERENCE 2 3. Rae elaborates, “GPT-2 did not attract much interest within DeepMind, but it changed my life.” Rae later emerged as a leading figure in language modeling. Jack Rae, author interview, October 30, 2024. BACK TO NOTE REFERENCE 3 4. “NIPS 2016 Conference Book,” December 7, 2016, media.nips.cc/Conferences/2016/NIPS- 2016-Conference-Book.pdf. BACK TO NOTE REFERENCE 4 5. For US readers, a “fine cardigan waistcoat” may be translated into the less mellifluous “fine buttoned sweater-vest.” BACK TO NOTE REFERENCE 5 6. Andrei A. Rusu, a researcher at DeepMind, recalls, “Gaia was interesting. We made the world complex and we received evidence that the more you make things complex, the less efficient our algorithms are at learning from that data.” Andrei Rusu, author interview, February 2, 2024. BACK TO NOTE REFERENCE 6 7. “AlphaStar: Mastering the Real Time Strategy Game StarCraft II,” DeepMind, January 24, 2019, deepmind.google/discover/blog/alphastar-mastering-the-real-time-strategy-game- starcraft-ii. BACK TO NOTE REFERENCE 7 8. Silver elaborated, “You need to have that belief that this can be done.” He was revealing, yet again, the role of faith in AI discovery. David Silver, interview by DeepMind documentary team, September 7, 2018. -- 466 of 565 -- BACK TO NOTE REFERENCE 8 9. As of 2025, Vinyals had more than 363,000 citations. BACK TO NOTE REFERENCE 9 10. Ben Buchanan and Andrew Imbrie, The New Fire: War, Peace, and Democracy in the Age of AI (MIT Press, 2022), 69. BACK TO NOTE REFERENCE 10 11. Oriol Vinyals et al., “Grandmaster Level in StarCraft II Using Multi-Agent Reinforcement Learning,” Nature 575, no. 7782 (2019): 350–54, doi.org/10.1038/s41586-019-1724-z. BACK TO NOTE REFERENCE 11 12. AlphaStar’s ability to multitask on the battlefield encouraged military strategists to see the potential of battle-planning AI systems. Buchanan and Imbrie, The New Fire, 70–71. BACK TO NOTE REFERENCE 12 13. Anthony Cuthbertson, “Artificial Intelligence Conquers StarCraft II in ‘Unimaginably Unusual’ AI Breakthrough,” Independent, October 31, 2019, the- independent.com/games/artificial-intelligence-starcraft-2-ai-deepmind-a9176601.html. BACK TO NOTE REFERENCE 13 14. Ecclesiastes 1:9 (New International Version). BACK TO NOTE REFERENCE 14 15. “AlphaStar: Grandmaster Level in StarCraft II Using Multi-Agent Reinforcement Learning,” DeepMind, October 30, 2019, deepmind.google/discover/blog/alphastar-grandmaster-level-in- starcraft-ii-using-multi-agent-reinforcement-learning. BACK TO NOTE REFERENCE 15 16. John Dewey, Experience and Education (Macmillan, 1938), 20. BACK TO NOTE REFERENCE 16 17. “AlphaZero had shown that, for some problems, human-provided data was not necessary; AlphaStar was a reminder that, in other contexts, data continued to matter.” Buchanan and Imbrie, The New Fire, 68. BACK TO NOTE REFERENCE 17 -- 467 of 565 -- CHAPTER FOURTEEN: PROJECT MARIO 1. One DeepMind employee recalls, “The idea that we were different, and were redefining what it meant to be a company, was absolutely core. After the Google acquisition, we were Robin Hood deep in the belly of the king’s court. The founders had the power to decide to lead the way they wanted, and invent new structures and systems. This was as central to the vision as AGI.” BACK TO NOTE REFERENCE 1 2. This chapter’s reconstruction of the governance negotiations between DeepMind and Google is informed by multiple internal documents and texts, provided to the author by various sources on condition of anonymity. BACK TO NOTE REFERENCE 2 3. Google CFO Patrick Pichette noted that Google’s internal moon shots generated losses of $3 billion per year. Spinning them out would therefore boost Google’s pre-tax earnings by $3 billion. Since the stock market placed a 25× multiple on Google earnings, shareholders would gain at least $75 billion from the restructuring. Patrick Pichette, author interview, July 23, 2024. BACK TO NOTE REFERENCE 3 4. In 2019, DeepMind’s administrative expenses were over $900 million. “DeepMind Technologies Limited,” Companies House, September 23, 2013, find-and-update.company- information.service.gov.uk/company/07386350/filing-history?page=3. BACK TO NOTE REFERENCE 4 5. Of the $1 billion pledged, OpenAI’s original nonprofit vehicle received less than $150 million. See Mark Harris, “Elon Musk Used to Say He Put $100M in OpenAI, but Now It’s $50M: Here Are the Receipts,” TechCrunch, May 17, 2023, techcrunch.com/2023/05/17/elon-musk-used-to- say-he-put-100m-in-openai-but-now-its-50m-here-are-the-receipts. BACK TO NOTE REFERENCE 5 6. Suleyman’s legal team included Ken Macdonald, an eminent barrister who had served as director of public prosecutions and was an expert on public-interest arguments. BACK TO NOTE REFERENCE 6 7. Habryka, “OpenAI Email Archives (from Musk v. Altman and OpenAI blog),” LessWrong, November 16, 2024, lesswrong.com/posts/5jjk4CDnj9tA7ugxr/openai-email-archives-from- musk-v-altman-and-openai-blog. BACK TO NOTE REFERENCE 7 -- 468 of 565 -- 8. Tad Friend, “Sam Altman’s Manifest Destiny,” The New Yorker, October 3, 2016, newyorker.com/magazine/2016/10/10/sam-altmans-manifest-destiny. BACK TO NOTE REFERENCE 8 9. Reid Hoffman, author interview, May 31, 2024. BACK TO NOTE REFERENCE 9 10. According to DeepMind records, Pichai said that AGI certainly wasn’t likely during his tenure as chief executive. As of 2025, he seems likely to have been wrong on this point. BACK TO NOTE REFERENCE 10 11. Mustafa Suleyman, author interview, May 6, 2024. BACK TO NOTE REFERENCE 11 12. Ilya Sutskever, email to the author, August 20, 2025; Karen Hao, Empire of AI: Dreams and Nightmares in Sam Altman’s OpenAI (Penguin Press, 2025), 62. BACK TO NOTE REFERENCE 12 13. Habryka, “OpenAI Email Archives (from Musk v. Altman and OpenAI blog).” BACK TO NOTE REFERENCE 13 14. Hao, Empire of AI, 63. BACK TO NOTE REFERENCE 14 15. Habryka, “OpenAI Email Archives (from Musk v. Altman and OpenAI blog).” BACK TO NOTE REFERENCE 15 16. Hao, Empire of AI, 65. BACK TO NOTE REFERENCE 16 17. In later years, people quipped that Musk was determined to save humanity but felt no compassion for actual humans. Cade Metz et al., “Ego, Fear and Money: How the A.I. Fuse Was Lit,” The New York Times, December 3, 2023, nytimes.com/2023/12/03/technology/ai- openai-musk-page-altman.html. BACK TO NOTE REFERENCE 17 18. The slide deck was quoting Farhad Manjoo, “Why Tech Is Starting to Make Me Uneasy,” The New York Times, October 11, 2017, nytimes.com/2017/10/11/insider/tech-column-dread.html. BACK TO NOTE REFERENCE 18 -- 469 of 565 -- 19. Jack Nicas and Cade Metz, “Apple Hires Google’s AI Chief,” The New York Times, April 3, 2018, nytimes.com/2018/04/03/business/apple-hires-googles-ai-chief.html. BACK TO NOTE REFERENCE 19 20. Around this period, a talented entrepreneur sold his company and resolved to work on AI. He visited OpenAI and received an offer almost immediately. He went multiple rounds of interviews at DeepMind without anything materializing. Numerous other incidents and sources corroborate Hassabis’s reluctance to hire strong leaders on the nontechnical side. BACK TO NOTE REFERENCE 20 21. Madhumita Murgia, “DeepMind’s Move to Transfer Health Unit to Google Stirs Data Fears,” Financial Times, November 13, 2018, ft.com/content/f4a73450-e771-11e8-8a85-04b8afea6ea3. BACK TO NOTE REFERENCE 21 22. The report was based only on statements from Suleyman’s critics. Because it was technically an “informal fact finding,” the lawyer did not interview senior figures who worked for Suleyman, many of whom would have defended him. For example, Jim Gao, the leader of DeepMind’s energy work, later recalled, “I thought that the stated rationale [for Suleyman leaving] was bullshit. Is he a very tough manager? Of course he is. Most hard-charging entrepreneurs are tough managers. That sort of style is not going to be for everyone. I personally liked it.” Dominic King, the leader of the health work, agreed. “I never saw any sign of anything that I considered inappropriate. In my previous experience in academia and in the NHS, I came across truly malevolent behaviors. Nothing even approximated it at DeepMind.” Jim Gao, author interview, March 11, 2024; Dominic King, author interview, April 4, 2024. BACK TO NOTE REFERENCE 22 23. Giles Turner and Mark Bergen, “Google DeepMind Co-Founder Placed on Leave from AI Lab,” Bloomberg, August 21, 2019, bloomberg.com/news/articles/2019-08-21/google- deepmind-co-founder-placed-on-leave-from-ai-lab. BACK TO NOTE REFERENCE 23 24. DeepMind points out that it issued a statement saying, “Mustafa is taking time out right now after ten hectic years.” However, this did little to change the media narrative. BACK TO NOTE REFERENCE 24 25. In 2021, in response to questions from The Wall Street Journal, Suleyman said that he “accepted feedback that, as a cofounder at DeepMind, I drove people too hard and at times my management style was not constructive.” He added, “I apologize unequivocally to those who were affected.” Rob Copeland and Parmy Olson, “Artificial Intelligence Will Define Google’s Future. For Now, It’s a Management Challenge,” The Wall Street Journal, January 26, 2021, wsj.com/tech/ai/artificial-intelligence-will-define-googles-future-for-now-its-a-management- challenge-11611676945. -- 470 of 565 -- BACK TO NOTE REFERENCE 25 -- 471 of 565 -- CHAPTER FIFTEEN: FERMAT FOR BIOLOGY 1. The dialogue between Hassabis and Silver was recorded by DeepMind’s internal documentary team. Jeremy Kahn, “In a Major Scientific Breakthrough, A.I. Predicts the Exact Shape of Proteins,” Fortune, November 30, 2020, fortune.com/2020/11/30/deepmind-protein-folding- breakthrough. BACK TO NOTE REFERENCE 1 2. David Silver, author interview, March 8, 2024. BACK TO NOTE REFERENCE 2 3. Kahn, “In a Major Scientific Breakthrough, A.I. Predicts the Exact Shape of Proteins.” BACK TO NOTE REFERENCE 3 4. The friend was Tim Stevens, later a computational biologist at the University of Cambridge. BACK TO NOTE REFERENCE 4 5. Kahn, “In a Major Scientific Breakthrough, A.I. Predicts the Exact Shape of Proteins.” BACK TO NOTE REFERENCE 5 6. Marek Barwinski, author interview, February 4, 2025. BACK TO NOTE REFERENCE 6 7. The engineer was Laurent Sifre. BACK TO NOTE REFERENCE 7 8. All quotations from John Jumper in this chapter come from author interviews conducted on September 19 and November 28, 2023. BACK TO NOTE REFERENCE 8 9. Kathryn Tunyasuvunakool, who joined DeepMind’s protein team later, recalled, “If you look back to the very early days of the project, they were thinking about training an agent to play the game of Foldit. That turned out to be light-years away from what we actually ended up doing.” Kathryn Tunyasuvunakool, author interview, September 14, 2023. BACK TO NOTE REFERENCE 9 10. Kahn, “In a Major Scientific Breakthrough, A.I. Predicts the Exact Shape of Proteins.” BACK TO NOTE REFERENCE 10 11. Silver, author interview. -- 472 of 565 -- BACK TO NOTE REFERENCE 11 12. Pushmeet Kohli and Clemens Meyer, author interview, June 27, 2023. Kohli led DeepMind’s work on AI for science. BACK TO NOTE REFERENCE 12 13. Barwinski, text message to the author, February 5, 2025. BACK TO NOTE REFERENCE 13 14. David Silver, interview with the DeepMind documentary team, May 25, 2018. BACK TO NOTE REFERENCE 14 15. Google Deepmind: The Podcast, season 2, episode 1, “A Breakthrough Unfolds,” Google DeepMind, January 25, 2022, 39 min., 14 sec., youtube.com/watch?v=ZfJhOTZi0WE. BACK TO NOTE REFERENCE 15 16. By this point, Jumper had already discarded AlphaFold’s original search algorithm, replacing it with a simpler alternative. BACK TO NOTE REFERENCE 16 17. Pushmeet Kohli, author interview, June 26, 2023. BACK TO NOTE REFERENCE 17 18. On the crucial contribution of the distogram, see Andrew W. Senior et al., “Improved Protein Structure Prediction Using Potentials from Deep Learning,” Nature 577, no. 7792 (2020): 706– 10, doi.org/10.1038/s41586-019-1923-7. BACK TO NOTE REFERENCE 18 19. Mohammed AlQuraishi, “AlphaFold @ CASP13: ‘What Just Happened?,’ ” Some Thoughts on a Mysterious Universe, December 9, 2018, moalquraishi.wordpress.com/2018/12/09/alphafold- casp13-what-just-happened. BACK TO NOTE REFERENCE 19 20. Jumper, email to the author, July 11, 2025. BACK TO NOTE REFERENCE 20 21. Meyer, author interview, June 27, 2023. BACK TO NOTE REFERENCE 21 22. Kahn, “In a Major Scientific Breakthrough, A.I. Predicts the Exact Shape of Proteins.” BACK TO NOTE REFERENCE 22 -- 473 of 565 -- 23. Mohammed AlQuraishi, “AlphaFold2 @ CASP14: ‘It Feels like One’s Child Has Left Home,’ ” Some Thoughts on a Mysterious Universe, December 8, 2020, moalquraishi.wordpress.com/2020/12/08/alphafold2-casp14-it-feels-like-ones-child-has-left- home. BACK TO NOTE REFERENCE 23 24. Cade Metz, “London A.I. Lab Claims Breakthrough That Could Accelerate Drug Discovery,” The New York Times, November 30, 2020, nytimes.com/2020/11/30/technology/deepmind-ai- protein-folding.html. BACK TO NOTE REFERENCE 24 25. Demis Hassabis, “AlphaFold Reveals the Structure of the Protein Universe,” Google DeepMind, February 5, 2025, deepmind.google/discover/blog/alphafold-reveals-the-structure- of-the-protein-universe. BACK TO NOTE REFERENCE 25 26. “Accelerating the Race Against Antibiotic Resistance,” Google DeepMind, July 28, 2022, deepmind.google/discover/blog/accelerating-the-race-against-antibiotic-resistance. BACK TO NOTE REFERENCE 26 27. AlQuraishi, “AlphaFold @ CASP13: ‘What Just Happened?’ ” BACK TO NOTE REFERENCE 27 28. Outside DeepMind, scientific teams reported huge productivity gains from integrating AI: At one US materials research group, patent filings jumped 39 percent. See “AI Models are Dreaming Up the Materials of the Future,” The Economist, March 5, 2025, economist.com/science-and-technology/2025/03/05/ai-models-are-dreaming-up-the-materials- of-the-future. BACK TO NOTE REFERENCE 28 -- 474 of 565 -- CHAPTER SIXTEEN: THE POWER AND THE GLORY 1. Geoffrey Irving, author interview, January 17, 2025. BACK TO NOTE REFERENCE 1 2. The debate over nuclear safety has also featured attempts to control risks through technical solutions intended to prevent proliferation. BACK TO NOTE REFERENCE 2 3. Irving, author interview. BACK TO NOTE REFERENCE 3 4. The sincerity of Irving and Christiano regarding safety is beyond reproach given their later choice to work for government safety institutes. On the importance of releasing models with a lag, Amodei observed, “If our answer is always, ‘We’ll let the customers fix the bugs,’ then we’ll get to catastrophic risk and we won’t know how to do anything else.” Dario Amodei, author interview, December 14, 2023; Irving, author interview. BACK TO NOTE REFERENCE 4 5. Leopold Aschenbrenner, “Situational Awareness: The Decade Ahead,” June 2024, situational- awareness.ai. BACK TO NOTE REFERENCE 5 6. Geoffrey Irving (@geoffreyirving), “…lied to me on several occasions,” X (formerly Twitter), November 20, 2023, x.com/geoffreyirving/status/1726754277618491416?lang=en. BACK TO NOTE REFERENCE 6 7. I am indebted to Gordon LaForge for bringing Keller’s words to my attention—and for many other helpful comments. Helen Keller, The World I live In (The Century, 1908), 113. BACK TO NOTE REFERENCE 7 8. Will Douglas Heaven, “Rogue Superintelligence and Merging with Machines: Inside the Mind of OpenAI’s Chief Scientist,” MIT Technology Review, October 26, 2023, technologyreview.com/2023/10/26/1082398/exclusive-ilya-sutskever-openais-chief-scientist- on-his-hopes-and-fears-for-the-future-of-ai. Multiple researchers described GPT-3 in similar terms to the author. BACK TO NOTE REFERENCE 8 9. Koray Kavukcuoglu, author interview, February 6, 2025. BACK TO NOTE REFERENCE 9 -- 475 of 565 -- 10. Jack Rae, author interview, June 6, 2025. BACK TO NOTE REFERENCE 10 11. The engineer was Nat McAleese. Rae recalls, “The GopherChat moment was exciting. The model seemed much more capable and intuitive. It moved from a research tool that people studied to a piece of technology that researchers used for personal use. In retrospect, it was an early sign of the product market fit that a general and capable chatbot could have, eighteen months before ChatGPT.” Rae, email to the author, August 24, 2025. BACK TO NOTE REFERENCE 11 12. As of January 2020, Google’s Meena featured 2.6 billion parameters and was trained on 341 GB of text. This made it larger than GPT-2, which featured 1.5 billion parameters and was trained on 40 GB of text, but much smaller than GPT-3, which featured 175 billion parameters and was trained on 570 GB of text. Daniel Adiwardana and Thang Luong, “Towards a Conversational Agent That Can Chat About…Anything,” Google Research, January 28, 2020, research.google/blog/towards-a-conversational-agent-that-can-chat-aboutanything. BACK TO NOTE REFERENCE 12 13. Amodei, author interview. BACK TO NOTE REFERENCE 13 14. Karen Hao, Empire of AI: Dreams and Nightmares in Sam Altman’s OpenAI (Penguin Press, 2025), 176. BACK TO NOTE REFERENCE 14 15. “OpenAI’s DALL-E Creates Plausible Images of Literally Anything You Ask It To,” TechCrunch, January 5, 2021, techcrunch.com/2021/01/05/openais-dall-e-creates-plausible- images-of-literally-anything-you-ask-it-to. BACK TO NOTE REFERENCE 15 16. Sam Altman, “Moore’s Law for Everything,” March 16, 2021, moores.samaltman.com. BACK TO NOTE REFERENCE 16 17. Gopher was also smaller than Megatron, a model released by Microsoft and Nvidia a couple of months earlier, which boasted 580 billion parameters. BACK TO NOTE REFERENCE 17 18. Sebastian Borgeaud et al., “Improving Language Models by Retrieving from Trillions of Tokens,” arXiv, February 7, 2022, doi.org/10.48550/arXiv.2112.04426. BACK TO NOTE REFERENCE 18 -- 476 of 565 -- 19. Laura Weidinger et al., “Ethical and Social Risks of Harm from Language Models” arXiv, December 8, 2021, doi.org/10.48550/arXiv.2112.04359. BACK TO NOTE REFERENCE 19 20. Laura Weidinger, author interview, December 18, 2024. BACK TO NOTE REFERENCE 20 21. Weidinger, author interview. BACK TO NOTE REFERENCE 21 22. Jack Rae, Geoffrey Irving, and Laura Weidinger, “Language Modelling at Scale: Gopher, Ethical Considerations, and Retrieval,” Google DeepMind, December 8, 2021, deepmind.google/discover/blog/language-modelling-at-scale-gopher-ethical-considerations- and-retrieval. BACK TO NOTE REFERENCE 22 23. The engineers who left for OpenAI were Aidan Clark, Trevor Cai, Francis Song, and Jacob Menick. BACK TO NOTE REFERENCE 23 24. Meanwhile, another DeepMind scientist recalled, “In my imaginary picture of OpenAI, the researchers say, ‘Oh, Sam, what should we do?’ And Sam goes, ‘Make GPT-3 bigger!’ There’s no ambiguity. In my corresponding picture of DeepMind, the researchers say, ‘Oh, what should we do?’ And it’s like, ‘Maybe this, maybe that, it’s up to you,’ or it kind of depends on the vagaries of your reporting line.” A year or so after Rae left, this scientist also quit in frustration. BACK TO NOTE REFERENCE 24 25. Irving recalled, “Demis can’t just slam his fist and have a bunch of people suddenly do something different.” Likewise, Koray Kavukcuoglu, DeepMind’s research chief, recalled, “People have a lot of agency on what they research. I remember trying to persuade a bunch of people to work on LLMs. But everything that people were working on seemed quite valuable.” Irving, author interview; Kavukcuoglu, author interview. BACK TO NOTE REFERENCE 25 26. Irving, author interview. BACK TO NOTE REFERENCE 26 27. Jean-Baptiste Alayrac, Jeff Donahue, Pauline Luc, and Antoine Miech, “Tackling Multiple Tasks with a Single Visual Language Model,” Google DeepMind, April 28, 2022, deepmind.google/discover/blog/tackling-multiple-tasks-with-a-single-visual-language-model. BACK TO NOTE REFERENCE 27 -- 477 of 565 -- 28. Jason Wei et al., “Finetuned Language Models Are Zero-Shot Learners,” arXiv, February 8, 2022, doi.org/10.48550/arXiv.2109.01652. BACK TO NOTE REFERENCE 28 29. Jordan Hoffmann et al., “Training Compute-Optimal Large Language Models,” arXiv, March 29, 2022, doi.org/10.48550/arXiv.2203.15556. BACK TO NOTE REFERENCE 29 30. Sutskever, author interview. BACK TO NOTE REFERENCE 30 31. Amelia Glaese et al., “Improving Alignment of Dialogue Agents via Targeted Human Judgements,” arXiv, September 28, 2022, doi.org/10.48550/arXiv.2209.14375. BACK TO NOTE REFERENCE 31 -- 478 of 565 -- CHAPTER SEVENTEEN: RACEGPT 1. OpenAI did release a new base model, later dubbed GPT-3.5, under the name InstructGPT. However, reflecting its caution in early 2022, it had not telegraphed its novelty. See note 11 below. BACK TO NOTE REFERENCE 1 2. Karen Hao, Empire of AI: Dreams and Nightmares in Sam Altman’s OpenAI (Penguin Press, 2025), 246. BACK TO NOTE REFERENCE 2 3. Jeff Wu was an example of a safety-minded researcher who did not quit OpenAI with the other defectors. BACK TO NOTE REFERENCE 3 4. Jan Leike (@janleike), “Before we scramble to deeply integrate LLMs,” post on X (formerly Twitter), March 17, 2023, x.com/janleike/status/1636788627735736321. This post was the public expression of arguments Leike had been making privately. Leike, email to the author, August 23, 2025. BACK TO NOTE REFERENCE 4 5. Nitasha Tiku, “The Google Engineer Who Thinks the Company’s AI Has Come to Life,” The Washington Post, June 11, 2022, washingtonpost.com/technology/2022/06/11/google-ai-lamda- blake-lemoine. BACK TO NOTE REFERENCE 5 6. Google stated that Lemoine “chose to persistently violate clear employment and data security policies that include the need to safeguard product information.” BACK TO NOTE REFERENCE 6 7. Hao, Empire of AI, 249. BACK TO NOTE REFERENCE 7 8. Hao, Empire of AI, 255. BACK TO NOTE REFERENCE 8 9. Amodei recalled, “We had all the requirements ready to ship and we deliberately held back.” Dario Amodei, author interview, December 14, 2023; “Anthropic Had Created a Chatbot 6 Months Before ChatGPT but Didn’t Release It,” Officechai, February 7, 2025, officechai.com/ai/anthropic-had-created-a-chatbot-6-months-before-chatgpt-but-didnt-release- it-co-founder-ben-mann. -- 479 of 565 -- BACK TO NOTE REFERENCE 9 10. OpenAI’s charter, published in 2018, is available at openai.com/charter. It loosely defines “late- stage AGI development” as meaning “a better-than-even chance of success in the next two years.” Altman may not have believed that this condition applied in late 2022, but his chief scientist seems to have felt otherwise: Sutskever was given to chanting “Feel the AGI!” with such frequency that colleagues created a “Feel the AGI” emoji in Slack. It seems fair to say that the anti-accelerationist spirit of OpenAI’s charter cut against the decision to rush the release of ChatGPT. Karen Hao and Charlie Warzel, “Inside the Chaos at OpenAI,” The Atlantic, November 19, 2023, theatlantic.com/technology/archive/2023/11/sam-altman-open-ai-chatgpt- chaos/676050. BACK TO NOTE REFERENCE 10 11. GPT-3.5, released under the name InstructGPT, was OpenAI’s first base model to use the mixture-of-experts architecture. Reflecting the company’s low-key mood in 2022, OpenAI had slipped this product out without saying that the base model was new. Instead, it stressed innovations in post-training. BACK TO NOTE REFERENCE 11 12. Hao, Empire of AI, 258. BACK TO NOTE REFERENCE 12 13. Will Douglas Heaven, “Why Meta’s Latest Large Language Model Survived Only Three Days Online,” MIT Technology Review, November 18, 2022, technologyreview.com/2022/11/18/1063487/meta-large-language-model-ai-only-survived- three-days-gpt-3-science. BACK TO NOTE REFERENCE 13 14. Sparrow’s search and safety features may have caused it to be slower and less responsive to users. Perhaps because it respected DeepMind’s twenty-three conduct rules, Sparrow reportedly refused to answer questions with disappointing frequency. BACK TO NOTE REFERENCE 14 15. The head of sales was Aliisa Rosenthal. See Steven Levy, “The Year of ChatGPT and Living Generatively,” Wired, December 1, 2023, wired.com/story/plaintext-chatgpt-year-of-living- generatively. BACK TO NOTE REFERENCE 15 16. Sam Altman (@sama), “today we launched ChatGPT. try talking with it here,” X (formerly Twitter), November 30, 2022, x.com/sama/status/1598038815599661056?lang=en. BACK TO NOTE REFERENCE 16 17. Hao, Empire of AI, 259. -- 480 of 565 -- BACK TO NOTE REFERENCE 17 18. Krystal Hu, “ChatGPT Sets Record for Fastest-Growing User Base—Analyst Note,” Reuters, February 2, 2023, reuters.com/technology/chatgpt-sets-record-fastest-growing-user-base- analyst-note-2023-02-01; Kevin Roose, “How ChatGPT Kicked Off an A.I. Arms Race,” The New York Times, February 3, 2023, nytimes.com/2023/02/03/technology/chatgpt-openai- artificial-intelligence.html; Hao, Empire of AI, 229. BACK TO NOTE REFERENCE 18 19. Kevin Roose, “The Brilliance and Weirdness of ChatGPT,” The New York Times, December 5, 2022, nytimes.com/2022/12/05/technology/chatgpt-ai-twitter.html. BACK TO NOTE REFERENCE 19 20. Will Douglas Heaven, “The Inside Story of How ChatGPT Was Built from the People Who Made It,” MIT Technology Review, March 3, 2023, technologyreview.com/2023/03/03/1069311/inside-story-oral-history-how-chatgpt-built- openai. BACK TO NOTE REFERENCE 20 21. Altman said, “Doing ChatGPT was something that I pushed for that other people at the time didn’t really want to do.” He added that employees asked, “Is the model good enough? Are people going to use it? Does anyone want to chat?” Gerrit De Vynck, “The Man Who Unleashed AI on an Unsuspecting Silicon Valley,” The Washington Post, April 9, 2023, washingtonpost.com/technology/2023/04/09/sam-altman-openai-chatgpt; Kylie Robison, “Inside the Launch—and Future—of ChatGPT,” The Verge, December 12, 2024, theverge.com/2024/12/12/24318650/chatgpt-openai-history-two-year-anniversary. BACK TO NOTE REFERENCE 21 22. John Schulman, emails to the author, October 13, 2025. Schulman, a cofounder of OpenAI, led the ChatGPT project. BACK TO NOTE REFERENCE 22 23. Cade Metz, “The ChatGPT King Isn’t Worried, but He Knows You Might Be,” The New York Times, March 31, 2023, nytimes.com/2023/03/31/technology/sam-altman-open-ai-chatgpt.html. BACK TO NOTE REFERENCE 23 24. While publicly signaling the state of its chat technology, Anthropic did not proceed with a full public release of its model for a few months. The beta version of Claude was released to select users in March 2023. BACK TO NOTE REFERENCE 24 25. When Microsoft’s investment closed in February 2023, OpenAI’s paper value leapt from $14 billion to $80 billion, roughly a hundred times more than DeepMind had been worth at the time -- 481 of 565 -- of its sale to Google. Berber Jin and Miles Kruppa, “ChatGPT Creator Is Talking to Investors About Selling Shares at $29 Billion Valuation,” The Wall Street Journal, January 5, 2023, wsj.com/articles/chatgpt-creator-openai-is-in-talks-for-tender-offer-that-would-value-it-at-29- billion-11672949279; Cade Metz and Tripp Mickle, “OpenAI Completes Deal That Values the Company at $80 Billion,” The New York Times, February 16, 2024, nytimes.com/2024/02/16/technology/openai-artificial-intelligence-deal-valuation.html. BACK TO NOTE REFERENCE 25 26. The CEO was Chris Gibson of Recursion Pharmaceuticals. Chris Gibson, email to the author, May 9, 2025. BACK TO NOTE REFERENCE 26 27. “Insights from Global Conversations,” OpenAI, June 29, 2023, openai.com/index/insights- from-global-conversations. BACK TO NOTE REFERENCE 27 28. Hasan Chowdhury, “Sam Altman Is OpenAI’s Compelling Preacher. The World Is Ready to Bow Down,” Business Insider, June 24, 2023, businessinsider.com/sam-altman-world-tour-ai- chatgpt-openai-2023-6; Morgan Meaker, “Sam Altman’s World Tour Hopes to Reassure AI Doomers,” Wired, May 24, 2023, wired.com/story/sam-altman-world-tour-ai-doomers. BACK TO NOTE REFERENCE 28 29. Tad Friend, “Sam Altman’s Manifest Destiny,” The New Yorker, October 3, 2016, newyorker.com/magazine/2016/10/10/sam-altmans-manifest-destiny; Paul Graham, “A FundRaising Survival Guide,” August 2008, paulgraham.com/fundraising.html; Metz, “The ChatGPT King Isn’t Worried, but He Knows You Might Be.” BACK TO NOTE REFERENCE 29 30. Language modelers who quit Google included Noam Shazeer and Daniel De Freitas, who left in 2021 to found Character.AI, a start-up. BACK TO NOTE REFERENCE 30 31. Hassabis knew that DeepMind and Google Brain would join forces on Gemini by early January 2023. On January 13 he met Oriol Vinyals to recruit him as a coleader of the project. Oriol Vinyals, author interview, February 6, 2025. BACK TO NOTE REFERENCE 31 32. A DeepMind researcher recalls, “We got an email from Demis about the cancellation of Sparrow. It was like ‘Sundar thinks it’s the future of generative search, so we’re not going to release it so we don’t tip our hand,’ or something along those lines. It didn’t really ring true.” BACK TO NOTE REFERENCE 32 -- 482 of 565 -- 33. In one example, a prominent Google AI engineer, Jacob Devlin, resigned in January 2023 and immediately joined OpenAI. “Alphabet’s Google and DeepMind Pause Grudges, Join Forces to Chase OpenAI,” The Information, March 29, 2023, theinformation.com/articles/alphabets- google-and-deepmind-pause-grudges-join-forces-to-chase-openai. BACK TO NOTE REFERENCE 33 34. Nilay Patel, “Microsoft Thinks AI Can Beat Google at Search,” The Verge, February 8, 2023, theverge.com/23589994/microsoft-ceo-satya-nadella-bing-chatgpt-google-search-ai. BACK TO NOTE REFERENCE 34 35. In a widely cited comparison, Professor Ethan Mollick of Wharton asked ChatGPT and Bard to write a poem. Bard lost this sort of creative contest “by a lot,” Mollick reported. Ethan Mollick (@emollick), X (formerly Twitter), March 22, 2023, x.com/emollick/status/1660878127516594177. BACK TO NOTE REFERENCE 35 -- 483 of 565 -- CHAPTER EIGHTEEN: “WE’RE COOKED” 1. Irving (@IrvingX), “Though they’ve caused so much pain…,” X (formerly Twitter), December 4, 2022. BACK TO NOTE REFERENCE 1 2. The OpenAI researcher was Sandhini Agarwal. Will Douglas Heaven, “The Inside Story of How ChatGPT Was Built from the People Who Made It,” MIT Technology Review, March 3, 2023, technologyreview.com/2023/03/03/1069311/inside-story-oral-history-how-chatgpt-built- openai. BACK TO NOTE REFERENCE 2 3. Kevin Roose, “Bing’s A.I. Chat: ‘I Want to Be Alive,’ ” The New York Times, February 16, 2023, nytimes.com/2023/02/16/technology/bing-chatbot-transcript.html. BACK TO NOTE REFERENCE 3 4. GPT-4 System Card, OpenAI, published as part of the GPT-4 release, March 14, 2023. The captcha story was immediately picked up in the tech press. See Kevin Hurler, “ChatGPT Pretended to be Blind and Tricked a Human Into Solving a CAPTCHA,” Gizmodo, March 15, 2023. (URL is no longer available.) The incident became more broadly known after the publication of Andrew Marantz, “Among the A.I. Doomsayers,” The New Yorker, March 11, 2024, newyorker.com/magazine/2024/03/18/among-the-ai-doomsayers. BACK TO NOTE REFERENCE 4 5. Harry Lambert, “Is AI a Danger to Humanity or Our Salvation?,” The New Statesman, June 21, 2023, newstatesman.com/long-reads/2023/06/men-made-future-godfathers-ai-geoffrey-hinton- yann-lecun-yoshua-bengio-artificial-intelligence. BACK TO NOTE REFERENCE 5 6. Cade Metz, “ ‘The Godfather of AI’ Leaves Google and Warns of Danger Ahead,” The New York Times, May 1, 2023, nytimes.com/2023/05/01/technology/ai-google-chatbot-engineer- quits-hinton.html. BACK TO NOTE REFERENCE 6 7. Yoshua Bengio, remarks at Imagination in Action (MIT), February 18, 2025. BACK TO NOTE REFERENCE 7 8. Yann LeCun (@ylecun), “Scaremongering about an asteroid that doesn’t actually exist,” X (formerly Twitter), April 24, 2023, x.com/ylecun/status/1650622244660428800. BACK TO NOTE REFERENCE 8 -- 484 of 565 -- 9. LeCun posed the rhetorical question, “Do you want every A.I. system to be under the control of a couple of powerful American companies?” Cade Metz and Mike Isaac, “In Battle over A.I., Meta Decides to Give Away Its Crown Jewels,” The New York Times, May 18, 2023, nytimes.com/2023/05/18/technology/ai-meta-open-source.html. BACK TO NOTE REFERENCE 9 10. In contrast, venture capitalists who had backed OpenAI early, such as Vinod Khosla and Reid Hoffman, regarded open-source models as potentially dangerous. Alex Konrad, “Vinod Khosla, Marc Andreessen, and the Billionaire Battle for AI’s Future,” Forbes, June 4, 2024, forbes.com/sites/alexkonrad/2024/06/04/inside-silicon-valley-influence-battle-for-ai-future. BACK TO NOTE REFERENCE 10 11. Marc Andreessen, “#386—Marc Andreessen: Future of the Internet, Technology, and AI,” Lex Fridman Podcast, June 21, 2023, 2 hr., 37 min., lexfridman.com/marc-andreessen. BACK TO NOTE REFERENCE 11 12. Marc Andreessen, “Why AI Will Save the World,” a16z, June 6, 2023, a16z.com/ai-will-save- the-world. BACK TO NOTE REFERENCE 12 13. Of course, this discussion about the potential of AI systems to be evil is distinct from the danger that evil humans will use neutral AI to do evil things. As Stuart Russell has argued, the true threat may be less analogous to the Terminator movies (in which an evil AI system called Skynet tries to kill humans) than to the TV series Black Mirror (in which one person programs robot bees to kill 387,036 specific humans). The Reith Lectures, season 1, episode 2, “AI in Warfare,” BBC Radio 4, December 8, 2021, 58 min., bbc.com/audio/play/m00127t9. BACK TO NOTE REFERENCE 13 14. For this point, I am indebted to my former Council on Foreign Relations colleague Michael Levi. Levi, email to the author, May 18, 2025. BACK TO NOTE REFERENCE 14 15. “Planning for AGI and Beyond,” OpenAI, February 24, 2023, openai.com/index/planning-for- agi-and-beyond. BACK TO NOTE REFERENCE 15 16. “Pause Giant AI Experiments: An Open Letter,” Future of Life Institute, March 22, 2023, futureoflife.org/open-letter/pause-giant-ai-experiments. BACK TO NOTE REFERENCE 16 17. An additional argument against the pause is that it would have been rejected by the majority of AI researchers as excessive. Eventually, the majority would have prevailed, if necessary by -- 485 of 565 -- quitting the pausing labs and starting new ones. Effective promotion of safety requires judging the moment when a safety measure will stick. By pushing too early, promoters may counterproductively burn up “safety capital.” Allan Dafoe, author interview, July 25, 2024. (Dafoe is a governance researcher at DeepMind.) BACK TO NOTE REFERENCE 17 18. In the same essay, Yudkowsky wrote, “The most likely result of building a superhumanly smart AI, under anything remotely like the current circumstances, is that literally everyone on Earth will die.” See Eliezer Yudkowsky, “Pausing AI Development Isn’t Enough. We Need to Shut It All Down,” Time, March 29, 2023, time.com/6266923/ai-eliezer-yudkowsky-open-letter-not- enough. BACK TO NOTE REFERENCE 18 19. “Statement on AI Risk,” CAIS, aistatement.com. BACK TO NOTE REFERENCE 19 20. More than a year before ChatGPT, and before he joined the government, Buchanan had laid out the case for semiconductor export controls. The goal of the embargo was to “slow the Chinese down, create space for the United States to have a lead and, ideally, in my view, spend that lead on safety and coordination and not rushing ahead.” Buchanan’s paradoxical prescription—a race aimed at a pause—mirrored the way that American AI labs thought about their race against each other. Ben Buchanan, “The U.S. Has AI Competition All Wrong,” Foreign Affairs, August 7, 2020, foreignaffairs.com/articles/united-states/2020-08-07/us-has-ai-competition-all-wrong; Ezra Klein, “The Government Knows AGI Is Coming,” The New York Times, March 4, 2025, nytimes.com/2025/03/04/opinion/ezra-klein-podcast-ben-buchanan.html. BACK TO NOTE REFERENCE 20 21. Klein, “The Government Knows AGI Is Coming.” BACK TO NOTE REFERENCE 21 22. Assistant to the President Bruce Reed and National Security Advisor Jake Sullivan backed Buchanan’s efforts and gave him the space to be effective. Ben Buchanan, email to the author, June 9, 2025. BACK TO NOTE REFERENCE 22 23. Soon after the White House meeting on July 21, 2023, Google DeepMind, OpenAI, Anthropic, and Microsoft formed the Frontier Model Forum to share best practices on safety. BACK TO NOTE REFERENCE 23 24. The threshold for reporting was set to include models whose training involved at least 1026 floating point operations. Critics attacked this as arbitrary, but the line needed to be drawn somewhere. The goal was to set the threshold just above the current frontier, ensuring that no AI system was caught retroactively. -- 486 of 565 -- BACK TO NOTE REFERENCE 24 25. In 2025, President Trump breached this norm, firing the head of the copyright office. BACK TO NOTE REFERENCE 25 26. Buchanan, author interview. BACK TO NOTE REFERENCE 26 27. Later, the rival post-training efforts were submitted to LMSYS, a popular crowdsourced tool for evaluating chatbots. The results, published on January 26, 2024, showed that the Bard team’s “Elo score” was almost 100 points higher. BACK TO NOTE REFERENCE 27 28. Tom Leiberum et al., “Does Circuit Analysis Interpretability Scale? Evidence from Multiple- Choice Capabilities in Chinchilla,” arXiv, July 18, 2023, arxiv.org/abs/2307.09458. BACK TO NOTE REFERENCE 28 29. Jonah Brown-Cohen, Geoffrey Irving, and Georgios Piliouras, “Scalable AI Safety via Doubly- Efficient Debate,” arXiv, November 27, 2023, arxiv.org/abs/2311.14125. BACK TO NOTE REFERENCE 29 30. Tripp Mickle et al., “Inside OpenAI’s Crisis over the Future of Artificial Intelligence,” The New York Times, December 9, 2023, nytimes.com/2023/12/09/technology/openai-altman-inside- crisis.html. BACK TO NOTE REFERENCE 30 31. Karen Hao, Empire of AI: Dreams and Nightmares in Sam Altman’s OpenAI (Penguin Press, 2025), 3. BACK TO NOTE REFERENCE 31 32. Helen Toner et al., “Decoding Intentions: Artificial Intelligence and Costly Signals,” Center for Security and Emerging Technology, October 2023, cset.georgetown.edu/publication/decoding- intentions. BACK TO NOTE REFERENCE 32 33. The amount of wealth generated by Altman can be gauged by comparing OpenAI’s valuation soon after ChatGPT ($29 billion) with the valuations indicated by two transactions soon after his brief ouster ($86 billion in the case of a tender offer for employee stock, and $157 billion in the case of a primary stock offering). Taking the lower of these numbers, and subtracting the $10 billion injected by Microsoft, a lowball estimate is that Altman and OpenAI generated $47 billion in shareholder value in the space of twelve months. BACK TO NOTE REFERENCE 33 -- 487 of 565 -- 34. Tripp Mickle, Cade Metz, Mike Isaac, and Karen Weise, “Inside OpenAI’s Crisis over the Future of Artificial Intelligence,” The New York Times, December 9, 2023, nytimes.com/2023/12/09/technology/openai-altman-inside-crisis.html. BACK TO NOTE REFERENCE 34 35. Geoffrey Hinton, author interview, June 11, 2024. BACK TO NOTE REFERENCE 35 -- 488 of 565 -- CHAPTER NINETEEN: STEP BY STEP 1. Extrapolating this trend line, the influential observer Leopold Aschenbrenner predicted that, by 2027, models would be able to do the work of an AI researcher. AGI would have arrived, just a little sooner than Hassabis and his cofounders had predicted in 2010, before computers could recognize cat pictures. Leopold Aschenbrenner, “Situational Awareness,” June 2024, situational-awareness.ai. BACK TO NOTE REFERENCE 1 2. Melissa Heikkila and Will Douglas Heaven, “Google DeepMind’s New Gemini Model Looks Amazing—but Could Signal Peak AI Hype,” MIT Technology Review,December 6, 2023, technologyreview.com/2023/12/06/1084471/google-deepminds-new-gemini-model-looks- amazing-but-could-signal-peak-ai-hype. BACK TO NOTE REFERENCE 2 3. Stephanie Palazzolo, “Why Gemini Probably Isn’t as Good as Google Says It Is; A New Open- Source Security Threat: AIJacking,” Information, Dec. 7, 2023, theinformation.com/articles/why-gemini-probably-isnt-as-good-as-google-says-it-is-a-new- open-source-security-threat-aijacking. BACK TO NOTE REFERENCE 3 4. “Google’s Gemini Comparing Apples and Oranges,” Hexacluster, December 12, 2023, hexacluster.ai/blog/gemini-comparing-apples-and-oranges; Palazzolo, “Why Gemini Probably Isn’t as Good as Google Says It Is; A New Open-Source Security Threat: AIJacking.” BACK TO NOTE REFERENCE 4 5. Jason Wei et al., “Chain of Thought Prompting Elicits Reasoning in Large Language Models,” arXiv, January 28, 2022, arxiv.org/abs/2201.11903. Google’s release of this paper demonstrates that it was generally more open than OpenAI in this period. For example, papers relating to OpenAI’s “GPT-Zero” project, launched in 2021, were never published. Likewise, OpenAI had discovered the adjusted scaling laws outlined in DeepMind’s Gopher paper (2021). Unlike DeepMind, it had not published them. BACK TO NOTE REFERENCE 5 6. Sundar Pichai and Demis Hassabis, “Introducing Gemini, Our Largest and Most Capable Model,” The Keyword, December 6, 2023, blog.google/technology/ai/google-gemini-ai. BACK TO NOTE REFERENCE 6 7. With chain-of-thought prompting, GPT-4’s score inched up to 87 percent, still three points less than Gemini Ultra. BACK TO NOTE REFERENCE 7 -- 489 of 565 -- 8. The parameter count of the various Gemini models is undisclosed, and the same goes for OpenAI’s GPT-4. It is therefore not known whether Ultra was larger than GPT-4, although industry gossip suggests that it was probably smaller. However, GPT-4 is thought to have used the mixture-of-experts architecture, boosting its efficiency. So even if Ultra was smaller, it was probably costlier to serve to users. BACK TO NOTE REFERENCE 8 9. Jeanine Banks and Tris Warkentin, “Gemma: Introducing New State-of-the-Art Open Models,” The Keyword, February 21, 2024, blog.google/technology/developers/gemma-open-models; Cade Metz and Nico Grant, “Google Is Giving Away Some of the A.I. That Powers Chatbots,” The New York Times, February 21, 2024, nytimes.com/2024/02/21/technology/google-open- source-ai.html. BACK TO NOTE REFERENCE 9 10. “I felt like we’d just scratched over the finish line with Gemini 1.0. The goal was to be GPT-4 quality, so tick, we did that. But then it’s like, let’s move on and make something much better. The idea was to build up the model such that we know why we added everything and whether it would work at scale.” Jack Rae, author interview, June 6, 2025. BACK TO NOTE REFERENCE 10 11. Rae formed an alliance with London colleagues, including Jonas Adler, Alexander Pritzel, and Sebastian Borgeaud. Adler had worked on the AlphaFold strike team. BACK TO NOTE REFERENCE 11 12. As of June 2025, no Gemini rival had a context window longer than 200,000 tokens. BACK TO NOTE REFERENCE 12 13. The post-training of Gemini remained under the control of the scrappier Bard team. Rae, author interview. BACK TO NOTE REFERENCE 13 14. Steven Levy, “OpenAI’s Sora Turns AI Prompts onto Photorealistic Videos,” Wired, February 15, 2024, wired.com/story/openai-sora-generative-ai-video. BACK TO NOTE REFERENCE 14 15. Rae, author interview. BACK TO NOTE REFERENCE 15 16. Nico Grant, “Google Chatbot’s A.I. Images Put People of Color in Nazi-Era Uniforms,” The New York Times, February 22, 2024, nytimes.com/2024/02/22/technology/google-gemini- german-uniforms.html. -- 490 of 565 -- BACK TO NOTE REFERENCE 16 17. Miles Kruppa, “Google Chatbot to Generate Images of People Again Months After Backlash,” The Wall Street Journal, August 28, 2024, wsj.com/livecoverage/nvidia-earnings-stock-market- today-08-28-2024/card/google-chatbot-to-generate-images-of-people-again-months-after- backlash-zH5gacIxAM0oqjuwC7tf. BACK TO NOTE REFERENCE 17 18. Paul Graham (@paulg), “They’re a self-portrait of Google’s bureaucratic corporate culture,” X (formerly Twitter), February 21, 2024, x.com/paulg/status/1760416051181793361. BACK TO NOTE REFERENCE 18 19. Rae, author interview. BACK TO NOTE REFERENCE 19 20. Elon Musk (@elonmusk), “AI mirrors the mistakes of its creators,” X (formerly Twitter), March 5, 2024, x.com/elonmusk/status/1764857568952766693. BACK TO NOTE REFERENCE 20 21. Sutskever, author interview, March 20, 2025. BACK TO NOTE REFERENCE 21 22. Jack Rae et al., “Scaling Language Models: Methods, Analysis & Insights from Training Gopher,” arXiv, December 8, 2021, arxiv.org/abs/2112.11446. BACK TO NOTE REFERENCE 22 23. Hunter Lightman et al., “Let’s Verify Step by Step,” arXiv, May 31, 2023, arxiv.org/abs/2305.20050. BACK TO NOTE REFERENCE 23 24. Will Knight, “These Clues Hint at the True Nature of OpenAI’s Shadowy Q* Project,” Wired, November 30, 2023, wired.com/story/fast-forward-clues-hint-openai-shadowy-q-project. BACK TO NOTE REFERENCE 24 25. Pablo Villalobos et al., “Will We Run Out of Data? Limits of LLM Scaling Based on Human- Generated Data,” arXiv, June 4, 2024, arxiv.org/abs/2211.04325. BACK TO NOTE REFERENCE 25 26. Sutskever, author interview. BACK TO NOTE REFERENCE 26 -- 491 of 565 -- 27. In May 2024 two Google DeepMind researchers, Jonathan Lai and James An, circulated an influential internal paper laying out the way to supersede OpenAI’s 2023 technique. They called their method Reinforcement Learning with Verified Rewards. BACK TO NOTE REFERENCE 27 28. Rae, author interview. BACK TO NOTE REFERENCE 28 29. Oriol Vinyals, author interview, February 6, 2025. BACK TO NOTE REFERENCE 29 30. Confusingly, GPT-o1 is also sometimes referred to by its internal code name, Strawberry. BACK TO NOTE REFERENCE 30 31. “Learning to Reason with LLMs,” OpenAI, September 12, 2024, openai.com/index/learning-to- reason-with-llms. BACK TO NOTE REFERENCE 31 32. “Introducing OpenAI o1-Preview,” OpenAI, September 12, 2024, openai.com/index/introducing-openai-o1-preview. BACK TO NOTE REFERENCE 32 33. The researcher was Hunter Lightman. See “Noam Brown and Team on Teaching LLMs to Reason,” Sequoia Capital, sequoiacap.com/podcast/training-data-noam-brown. BACK TO NOTE REFERENCE 33 34. “Introducing OpenAI o1-Preview.” BACK TO NOTE REFERENCE 34 35. The two ex-Google DeepMind researchers were Jason Wei and Noam Brown. Wei had contributed to Google’s 2022 paper on chain-of-thought prompting. BACK TO NOTE REFERENCE 35 36. In “Noam Brown and Team on Teaching LLMs to Reason,” Brown’s colleague Hunter Lightman recalls, “Noam would just say, ‘Why don’t we let the model think for longer?’ And then we would. And it would get better. And he would just look at us kind of funny like [why] hadn’t [we] done it until that point?” BACK TO NOTE REFERENCE 36 -- 492 of 565 -- CHAPTER TWENTY: COMEBACK, AND BEYOND 1. Jack Rae, author interview, June 6, 2025. The author is grateful to Noam Shazeer for confirming the accuracy of this account. BACK TO NOTE REFERENCE 1 2. “It felt very much like building the airplane as you fly it.” Rae, author interview. BACK TO NOTE REFERENCE 2 3. Rae, author interview; Noam Shazeer, Unsupervised Learning, episode 58, March 17, 2025, 1 hr., 9 min., podcasts.apple.com/us/podcast/ep-58-google-researchers-noam-shazeer-and-jack- rae-on/id1672188924?i=1000699518901. BACK TO NOTE REFERENCE 3 4. Rae, author interview. BACK TO NOTE REFERENCE 4 5. Behnam Neyshabur and Ethan Dyer quit for Anthropic. They had co-led a group called the Blueshift team, which worked on reasoning. BACK TO NOTE REFERENCE 5 6. Simon Willison, “Gemini 2.0 Flash ‘Thinking Mode,’ ” Simon Willison’s Weblog, December 19, 2024, simonwillison.net/2024/Dec/19/gemini-thinking-mode. BACK TO NOTE REFERENCE 6 7. These scores are for AIME-2024 (math) benchmark. BACK TO NOTE REFERENCE 7 8. These scores were for SWE-bench Verified (real-world code problems). BACK TO NOTE REFERENCE 8 9. Rae, author interview. BACK TO NOTE REFERENCE 9 10. Miles Kruppa, “Google’s Resolution for 2025: Catch Up to ChatGPT,” The Wall Street Journal, January 16, 2025, wsj.com/tech/ai/google-gemini-2025-chatgpt-openai-b6eb595d. BACK TO NOTE REFERENCE 10 11. According to Sensor Tower data, the ChatGPT mobile app had been downloaded about 465 million times on Android and iOS devices; Gemini had only 106 million downloads. See -- 493 of 565 -- Kruppa, “Google’s Resolution for 2025: Catch Up to ChatGPT.” BACK TO NOTE REFERENCE 11 12. Dario Amodei, “The Future of U.S. AI Leadership with CEO of Anthropic Dario Amodei,” Council on Foreign Relations, March 10, 2025, 1 hr., 2 min., 35 sec., cfr.org/event/ceo-speaker- series-dario-amodei-anthropic. BACK TO NOTE REFERENCE 12 13. Dave Lawler, “White House ‘Looking into’ National Security Implications of DeepSeek’s AI,” Axios, January 28, 2025, axios.com/2025/01/28/deepseek-ai-national-security-trump. BACK TO NOTE REFERENCE 13 14. DeepSeek claimed that it used a cluster of more than two thousand Nvidia chips to train its V3 model, compared with tens of thousands of chips that Western labs used to train models of similar size. Raffaele Huang, “Silicon Valley Is Raving About a Made-in-China AI Model,” The Wall Street Journal, January 27, 2025, wsj.com/tech/ai/china-ai-deepseek-chatbot-6ac4ad33. BACK TO NOTE REFERENCE 14 15. Between ImageNet in 2012 and 2024, frontier AI models were estimated to have become roughly ten times more efficient every two years. See Leopold Aschenbrenner, “Situational Awareness,” June 2024, situational-awareness.ai. BACK TO NOTE REFERENCE 15 16. Following the DeepSeek shock, Nvidia’s stock experienced further turbulence relating to the Trump administration’s tariff offensive. BACK TO NOTE REFERENCE 16 17. Lawler, “White House ‘Looking into’ National Security Implications of DeepSeek’s AI.” BACK TO NOTE REFERENCE 17 18. Daya Guo et al., “DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning,” arXiv, January 22, 2025, arxiv.org/abs/2501.12948. BACK TO NOTE REFERENCE 18 19. Other reasoning models also prompted suggestions that AI was approaching self-awareness or consciousness. In March 2025, Sutskever said of the new thinking systems, “I think there is plenty of fear to go around. I mean, justifiable fear. Just talk to the model and you are like, what am I talking to? And then you can debate. Is it conscious, is it not conscious? Who knows? But no one can tell you it’s definitely not at this point.” Ilya Sutskever, author interview, March 20, 2025. BACK TO NOTE REFERENCE 19 -- 494 of 565 -- 20. “Google Unveils Gemini 2.5,” Deeplearning.ai, April 16, 2025, deeplearning.ai/the-batch/issue- 297. Reflecting on 2.5’s success, Jack Rae commented, “We got our payoff in the end. All this innovation did eventually get appreciated.” Rae, author interview. BACK TO NOTE REFERENCE 20 21. On Chatbot Arena, a crowdsourced ranking system, Gemini 2.5 Pro was ranked equal first as of October 15, 2025, sharing that position with two models from Anthropic. Four OpenAI models occupied the six slots classed as equal second. LMArena, https://lmarena.ai/leaderboard. BACK TO NOTE REFERENCE 21 22. Demis Hassabis, remarks at Imagination in Action (MIT), February 18, 2025. BACK TO NOTE REFERENCE 22 23. Jérémy Scheurer et al., “Large Language Models Can Strategically Deceive Their Users When Put Under Pressure,” arXiv, July 15, 2024, arxiv.org/abs/2311.07590. BACK TO NOTE REFERENCE 23 24. This behavior was observed in two OpenAI reasoning models and in R1. See Figure 2 in Alexander Bondarenko et al. “Demonstrating Specification Gaming in Reasoning Models,” arXiv, February 18, 2025, arxiv.org/abs/2502.13295. BACK TO NOTE REFERENCE 24 25. “Recent Frontier Models are Reward Hacking,” METR, June 5, 2025, metr.org/blog/2025-06- 05-recent-reward-hacking. BACK TO NOTE REFERENCE 25 26. Carson Denison et al., “Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models,” arXiv, June 14, 2024, arxiv.org/abs/2406.10162. BACK TO NOTE REFERENCE 26 27. Bowen Baker et al., “Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation,” arXiv, March 14, 2025, arxiv.org/abs/2503.11926. BACK TO NOTE REFERENCE 27 28. Ilya Sutskever, author interview, March 20, 2025. BACK TO NOTE REFERENCE 28 29. Hassabis was not alone in his warning. Appearing on the Hard Fork podcast ten days later, Dario Amodei expressed similar views. Kevin Roose et al., “Anthropic’s C.E.O., Dario Amodei, on Surviving the A.I. Endgame,” The New York Times, February 28, 2025, -- 495 of 565 -- nytimes.com/2025/02/28/podcasts/hardfork-anthropic-dario-amodei.html; Demis Hassabis, remarks at Imagination in Action (MIT), February 18, 2025. BACK TO NOTE REFERENCE 29 30. David Silver, lecture at the RLC 2024 conference, “David Silver—Towards Superhuman Intelligence—RLC 2024,” Reinforcement Learning Conference (RLC), October 1, 2024, YouTube, 1 hr., 3 min., 13 sec., youtube.com/watch?v=pkpJMNjvgXw. BACK TO NOTE REFERENCE 30 31. David Silver and Richard S. Sutton, “Welcome to the Era of Experience,” preprint of a chapter to appear in Designing an Intelligence (MIT Press, 2025), storage.googleapis.com/deepmind- media/Era-of-Experience%20/The%20Era%20of%20Experience%20Paper.pdf. BACK TO NOTE REFERENCE 31 32. Google DeepMind: The Podcast, episode 14, “Is Human Data Enough? With David Silver,” Google DeepMind, April 10, 2025, 50 min., podcasts.apple.com/be/podcast/is-human-data- enough-with-david-silver/id1476316441?i=1000703034260. BACK TO NOTE REFERENCE 32 33. Silver and Sutton, “Welcome to the Era of Experience.” BACK TO NOTE REFERENCE 33 34. “Is Human Data Enough?” BACK TO NOTE REFERENCE 34 35. David Silver, author interview, May 8, 2025. BACK TO NOTE REFERENCE 35 36. Silver, author interview. BACK TO NOTE REFERENCE 36 EPILOGUE: TURING’S CHAMPION 1. “Sir Demis Hassabis on the Future of Knowledge | Institute for Advanced Study,” Institute for Advanced Study, May 2, 2025, YouTube, 57 min., 41 sec., youtube.com/watch? v=TgS0nFeYul8. BACK TO NOTE REFERENCE 1 OceanofPDF.com -- 496 of 565 -- INDEX The page numbers in this index refer to the printed version of the book. Each link will take you to the beginning of the corresponding print page. You may need to scroll forward from that location to find the corresponding reference on your e-reader. A B C D E F G H I J K L M N O P Q R S T U V W X Y Z A academic AI, 403n31 Acemoglu, Daron, 324 acute kidney injury (AKI), 178–79, 183–85, 188–90 The Age of Spiritual Machines (Kurzweil), 57–58 Alphabet, 231–32 alpha-beta search, 402n17 AlphaDev, 278 -- 497 of 565 -- AlphaFold, 382 CASP contest and, 269–73, 276–77 direct folding and, 271–73 success of, 277–79, 313–14 transformers for, 274–75 AlphaGeometry, 278 AlphaGo AlphaZero compared with, 198 challenges of, 141–42 Crazy Stone tests of, 147–48 creativity of, 159 deep-learning strategy for, 145–46 Facebook competition for, 154, 156 Fan Hui defeated by, 153–55, 158 Graepel defeated by, 151–52 Guez’s network added to, 150–51 hallucinations of, 157, 160 intuition mimicked by, 146–47 Lee Sedol matches with, 156, 158–61 Maddison’s network in, 146–47, 151 -- 498 of 565 -- Monte Carlo Tree Search and, 143–44 Moravec’s paradox and, 195 in Nature, 155–56 processing success of, 161 protein folding problem for, 266 RL training, 152 safety and, 281 scaling up, 148–49 Silver and, 143–47, 149–50 System One thinking and, 142–43, 151, 417n31 System Two thinking and, 151, 417n31 TPUs for, 157 AlphaProof, 378 AlphaZero, 357 AlphaGo compared with, 198 AlphaStar compared with, 227 chess and, 194–96 GPT compared with, 212–13 idea for, 193–94 Moravec’s paradox and, 195–96 -- 499 of 565 -- ResNet and, 197–98 RL and, 195–200 Altman, Sam. See also OpenAI ChatGPT’s success and, 305–8, 433n21 firing of, 254, 337–40 Hassabis, D., compared with, 290, 292, 294, 349 Leike supported by, 302 “Moore’s Law for Everything” by, 290–92 Musk and, 163–64, 172–73 political ambitions of, 244 reinstatement of, 254, 340 on safety and ethics, 164, 283, 300–303 staff revolt backing, 339–40 strengths and weaknesses of, 289–90, 338 wealth generated by, 436n33 Y Combinator and, 163, 164 Amodei, Dario, 190, 280–82, 289, 301, 369, 430n4 An, James, 438n27 Andreessen, Marc, 321–22 Anfinsen, Christian, 258–59 -- 500 of 565 -- Anthropic, 190, 301, 303, 306, 374, 433n24 Apollo 11, 88 artificial general intelligence (AGI), 62 believers and skeptics in, 88 definition of, 78 God and, xix–xx, 73, 114–15 international body for, 376 introspection and, 144 intuition/System One thinking and, 142–43 LeCun on, 86, 88, 320–21, 324, 410n32 Manhattan Project compared with, 84 OpenAI’s manifesto on, 323–24 potential of, 69–70, 76–77, 112–15 RL and, 97 scale and simplicity in, 149 singleton scenario of, 167–69, 172, 228 for societal problem-solving, 76 space travel compared with, 111 Thiel scouting companies in, 73–74 Artificial General Intelligence (Goertzel), 57 -- 501 of 565 -- artificial intelligence (AI) academic compared with gaming, 403n31 biology and, 404n14 community, 167 consciousness and, 439n19 Dartmouth pioneers of, 23–24 doctors, 378–79 Elixir ambitions for, 35 evil and, 322, 435n13 Gatsby Computational Neuroscience Unit dismissing, 51 governance of, 236–38, 243–44, 246, 254–56 Hassabis, D.’s, childhood dreams of, 16–18 human intelligence mimicked by, xiv induction and, 26, 28–29, 91–92 Irving on safety of, 280–82 job elimination and, 363–64 nature and, 219–20, 389 optimism for, xvi pace of advancement of, 341–42, 351–52 potential of, 62–63 -- 502 of 565 -- Putin on, 246 Republic game and, 38 risks of, xv–xvi RL and, 44 semiconductor advances and, 76 symbolic, 24–25 transformative nature of, xiii, 21 weapons and, 376 arXiv, 292, 371 Aschenbrenner, Leopold, 436n1 Asilomar Hotel, 235–36 Asimov, Isaac, 17 Atari challenge, 100–109, 198, 412n22 atheism, 67 Aviemore, 240–42 B Baby WebMind, 56–57 Back, Trevor, 422n23, 422n25 -- 503 of 565 -- Baidu, 120 Baker, David, 259, 262 Banks, Iain, 17 Bard, 312, 346 Barden, Leonard, 4, 7 Barnett, Ruth, 252 Barwinski, Marek, 261 Bell Labs, 309–10 Bengio, Yoshua, 203–4, 319–20, 324, 332, 373–75 Bergson, Henri, 207 BERT, 274 Bhaskar, Michael, 408n19 Biden, Joe, 328–33, 369, 370 Bing, 312, 315–16 biology, AI and, 404n14 Black Mirror, 435n13 Black & White game, 29–30, 42–43 Bletchley Park conference, 332–33 Bloomberg, 252–53 Blueshift team, at DeepMind, 365 -- 504 of 565 -- Borgeaud, Sebastian, 346 Bostrom, Nick, xvii Bowling, Michael, 412n17 brain, digital, 56 brain, human, 16, 47–49 brainstorming, 271 Breakout game, 105, 109 Brin, Sergey, 70–71, 141–42, 157–58, 239–40, 249, 334–35 Brockman, Greg, 243–45 Brown, Noam, 438n36 Brown, Peter, 232 Buchanan, Ben, 329–33, 435n20 Bullfrog video game production company, 12–15, 17–19, 29–30, 42–43 C Cambridge Analytica, 189 Campbell, Murray, 194–95, 416n2 cancer screening, 179, 188–89 Canetti, Elias, 40 -- 505 of 565 -- Card, Orson Scott, 1–2 Carr, Gary, 403n25 CERN, 376, 387–89 chain-of-thought prompting, 343–44, 353–54 Chatbot Arena, 440n21 ChatGPT, xviii, 300 behavior concerns with, 315–17 Bing integration with, 312, 315–16 competition concerns with, 303–4 Gemini compared with, 344–46, 372–73, 437n8 intuition and, 305 OpenAI’s manifesto and, 323–24 public adaptation to, 341 release of, 299, 304–5, 310 success of, 305–8, 310, 433n21 Chau, Solina, 124, 126 chess AlphaZero and, 194–96 Bullfrog contest and, 13 childhood competitions of, 4–9 -- 506 of 565 -- computer programming and, 9–11, 63, 73 The Chess Computer Handbook (Levy), 9–11, 24 Chess Invaders game, 13 China, 328–29, 369–73, 405n4, 435n20 Chinchilla, 296–97 Christiano, Paul, 280–82, 289 Clarke, Arthur C., 165 Clarke, Tim, 33, 37–39 classical computers, 389–94 Claude, 303, 306, 433n24 Codex, 290 Cohen, Aron, 404n6 Commodore Amiga 500, 9, 11 computational medicine, 70–71 computer programming Bullfrog contest for, 12–13 chess and, 9–11, 63, 73 classical compared with quantum, 389–94 information as unit of reality and, 28 Moore’s Law and progress in, 57–58 -- 507 of 565 -- numerical compared with general, 10 Othello and, 11–12 singularity and, 60 contact maps, 267 context vector, 203–4 context window, of Gemini, 347–48 conversational systems, 207, 214 convolutional neural networks, 200–201, 262–63 Copernicus, 391 Coppin, Ben, 404n3 copyright, 332 Coulom, Rémi, 416n6 COVID-19 pandemic, 290 Crazy Stone, 147–48 Crick, Francis, 19–20, 382 Critical Assessment of Structure Prediction (CASP), 262, 269–73, 276–77 Crowds and Power (Canetti), 40 cults, 322 Culture series (Banks), 17 Cyc, 24–25 -- 508 of 565 -- D Daily Mail, 184–85 DALL-E, 290 DALL-E 2, 300, 302 Dartmouth College, 23–24 Daugman, John, 27–28, 404n14 da Vinci, Leonardo, 115 Dawkins, Richard, 404n13 Dayan, Peter, 51 Dean, Jeff, 135–36, 138, 157, 231, 247, 312–13, 334 deduction, 26, 28–29 Deep Blue, 10, 63, 73, 143, 156, 194 deep learning, 51 AlphaGo strategy with, 145–46 in Atari challenge, 101 Krizhevsky’s system and, 91–94 Q-learning and, 101–2, 106 RL combined with, 102–3 RL compared with, 93, 95–96, 196–97 -- 509 of 565 -- “vanishing gradients” problem for, 197 DeepMind, xiv–xv. See also AlphaGo; Gemini; Google acquisition of DeepMind achievements of, xvii–xviii AGI believers and skeptics in, 88 alignment team at, 323 AlphaDev of, 278 AlphaFold and, 269–79, 313–14, 382 AlphaProof and, 378 AlphaStar project of, 225–28 AlphaZero and, 193–200, 212–13, 227, 357 Applied division of, 174–75, 186–87, 240, 246 Atari challenge for, 100–109, 198, 412n22 at Aviemore, 240–42 Bell Labs compared with, 309–10 big-tech backlash halting progress of, 189–90 Blueshift team at, 365 business plan of, 76–78 in CASP contest, 262, 269–73, 276–77 Chau and Li Ka-shing investing in, 124, 126 -- 510 of 565 -- Chinchilla and, 296–97 culture of, 89–90 deep-learning plan of, 92 DQN of, 108–9 eclectic approach of, 99 Elixir experience aiding, 109 envisioning, 62–63 equity shares in, 83, 85–86, 414nn20–21 expansion of, 90 expenses of, 116–17 first offices of, 83–85 fixed target network and, 413n28 Flamingo and, 295–96 Foldit and, 259–66 Formal Thursday at, 87 free food of, 413n6 fundraising struggles of, 72–73 Gaia and, 219–24, 426n6 Gammon financing, 78–79 Gato and, 296 -- 511 of 565 -- Google Brain merger with, 310–13 Gopher and, 286–88, 290, 293–94, 430n11 governance of, 236–38, 246 GPT-3 competition for, 285, 289 grounding problem and, 215–16 Hark and, 180, 189–90 Health, 183, 190–91, 232, 248–49 Hinton’s company auction and, 120 independence plans for, 231–39 investor motivations in, 122 Irving hired by, 280, 283–84 Jules and, 367 King, D., joining, 180–81 London location issues for, 82 Mnih recruited by, 97–99 Musk investing in, 124–25 Musk’s attempted acquisition of, 136–37 naming of, 63 in Nature, 155–56 neuroscience and, 77–78 -- 512 of 565 -- NHS data concerns for, 181–85, 422nn25–26 at NIPS conference, 108–9 Nobel Prize won by, 23, 381–83 o1 threat to, 360–61 office decor of, 113 OpenAI recruiting against, 185–86 OpenAI surpassed by, 342–44, 373 OpenAI surpassing, 228–29, 288–90, 307, 314 participatory consultation issues of, 186–87 PhD advisers pursued by, 86–87 Project Astra and, 367 Project Mariner and, 367 protein folding success of, 260–61, 277–79 radiology system of, 188–89 Research, 246–48 research funding needs of, 130, 131 retinal technology of, 179, 188, 190 RETRO and, 292, 294, 346 revenue streams of, 89, 121–22, 232 RLHF and, 298–99, 301 -- 513 of 565 -- safety and ethics papers of, 292–94 safety and ethics review of, 162–63, 168–73, 230–31 seeds of, 18 Series C funding for, 116, 119, 121–27 Silver joining, 99, 411n10 Silver rejecting shares in, 85–86 soundproofing office of, 381 SpaceX compared with, 81 Sparrow and, 297–99, 304, 311, 432n14 split proposal for, 240–42, 246–48 staff costs of, 139 Streams AKI alert system and, 178–79, 183–85, 188–90 Suleyman bullying accusations at, 249–51 Suleyman leaving, 251–53 Sutskever recruited by, 86 Sutton joining, 199 Tallinn financing, 87–88 Thiel and Founders Fund financing, 73–75, 82–83, 87, 116–17, 121–27, 409n22 UniProt database and, 267–69 -- 514 of 565 -- valuation of, 137–38, 414n18 Veo 2 and, 367, 369 walk-away plan of, 234–36 women at, 89, 410n39 Zuckerberg and, 132–34 Deep-Q Network (DQN), 108–9 DeepSeek, 369–73, 439n14 Defense Production Act, 331 dense neural network, 347 D. E. Shaw Research, 263–64 Dewey, John, 227 digital brain, 56 direct folding, 271–73 Diskett, Mike, 402n19 distogram, 267–68 Djokovic, Novak, 104–5 doctors, AI, 378–79 Drummond, David, 233–34, 239 Dungeon Keeper game, 19 Dyson, Freeman, 388 -- 515 of 565 -- E Edge magazine, 38 Einstein, Albert, 284–85, 382, 388 electricity discovery, 411n3 Electronic Entertainment Expo, 38–40 Elixir, 405n2 AI ambitions for, 35 business plan of, 33 Camden office of, 37 closing of, 42 cofounders of, 32–34 DeepMind benefitting from, 109 at Electronic Entertainment Expo, 38–40 fundraising for, 34–35 Republic game by, 36–41 Silver leaving, 41–42 team burnout at, 40–41 work ethic and, 38–39 Elixir Diaries (Hassabis, D.), 33, 405n2 -- 516 of 565 -- Ender’s Game (Card), 1–2 European Union regulation, 328 Eustace, Alan, 117–20, 135 Evans, Richard, 403n26, 403n31, 405n18 evil, 322, 435n13 eye damage screening, 179, 188, 190 F Facebook, 67, 73, 132–34, 154, 156, 199–200 Fan Hui, 153–55, 158 Feynman, Richard, xiv, 20, 382 Fire and Water program, 12 first order logic, 25–26 fixed target network, 413n28 Flamingo, 295–96 Flash Thinking model, Gemini, 366–68 flexible intelligence, 27, 43–44 Foldit, 259–66 Formal Thursday, 87 -- 517 of 565 -- Foundation series (Asimov), 17 Founders Fund, 73–75, 80–83, 87, 116–17, 121–27, 409n22 Fowles, John, 30–31 France, 328 free will, 16 Future of Life Institute, 163 G Gaia, 219–24, 426n6 gaming AI, 403n31 Gammon, David, 73, 78–79, 122 Gao, Jim, 241, 253, 428n22 Gates, Bill, xxi, 76 Gato, 296 Gatsby Computational Neuroscience Unit, 50–51, 53–54, 59–62 Gebru, Timnit, 293 Gelly, Sylvain, 416n6 Gemini, 311 chain-of-thought prompting and, 343–44, 353–54 -- 518 of 565 -- ChatGPT compared with, 344–46, 372–73, 437n8 context window of, 347–48 cultural gap in teams working on, 334 diversity issues of, 350–51 Flash Thinking model, 366–68 launch of, 341–42 mixture-of-experts system for, 347 MMLU test and, 342–44 1.5 Pro, 346–48, 356–57 parameter count of, 437n8 post-training of, 335–36 pretraining for, 334–35, 348, 364 promotion of, 348 researchers working on, 333–34 Responsible AI team of, 349–50 RL experiments of, 357–60 RLHF and, 335 Sora compared with, 349 strategic development approach to, 346–47 thinking tokens and, 359 -- 519 of 565 -- 3, 372–73 2.5 Pro, 372, 440n21 Ultra, 342–44, 437n8 “wokeness” and, 351 GenCast, 278–79 general computer, 10 general relativity, theory of, 388 George, Dileep, 410n32 Giannandrea, John, 247 Global Distance Test (GDT), 262, 272–73, 275–76 Go (game), 22–23, 141–42, 146, 147, 417n22, 417n26 God, AGI and, xix–xx, 73, 114–15 Gödel, Escher, Bach (Hofstadter), 15–18, 147, 403n28 Gödel, Kurt, 25, 392 Goertzel, Ben, 56–57, 72 Google, xvii. See also Gemini Advanced Technology External Advisory Council of, 254 Alphabet plan of, 231–32 antitrust concerns of, 132 Bard and, 312, 346 -- 520 of 565 -- BERT model of, 274 DeepMind Health and, 248–49 DeepMind independence and, 231–39 Hinton’s company auction and, 120–21 innovator’s dilemma of, 308–9, 350 LaMDA and, 289, 302, 311 Meena and, 289, 430n12 NHS data concerns for, 181–85, 422nn25–26 semiconductors of, 157, 310 Suleyman at, 253–55 transformer innovation of, 206–8 Xerox PARC compared, 309 Google acquisition of DeepMind code inspection for, 135–36 earnings from shares in, 139 impact of, 139–40, 162 investment patience and, 118–19 plans and negotiations for, 111–12, 117–19, 129–30 safety concerns in, 130–32, 139, 162–69 secrecy in, 129–30 -- 521 of 565 -- valuation for, 137–38 Google Brain, 138, 289, 310–13, 353–54. See also Gemini Gopher, 286–88, 290, 293–94, 430n11 governance, of AI, 236–38, 243–44, 246, 254–56 government regulation, 328–33, 370 GPT, 210–12 GPT-2, 211, 218–19, 282–83, 341, 430n12 GPT-3, 285–89, 341, 342 GPT-3.5 (InstructGPT), 304, 341, 432n11 GPT-4, 300–301, 312, 316, 322–23, 341–44, 355, 437n8 GPT-Zero, 353 Graepel, Thore, 151–54, 161 Graham, Paul, 308, 350 grounding problem, 215–16 The Guardian, 183–84 Guez, Arthur, 150–53 Gu Li, 156 -- 522 of 565 -- H Halcyon Molecular, 79–80 “The Halloween Scenario” (Legg), 59–62 Hamming, Richard, 208 Harari, Yuval, 324 Hark, 180, 189–90 Harris, Kamala, 332 Harrison, Don, 137–39, 231, 415n17 Harvard fellowship, 51–52 Hassabis, Demis. See also DeepMind achievements of, xvii–xviii affability of, 21–22 on AGI and intuition, 143 on AGI’s potential, 112–15 on AI governance, 255–56 on AlphaFold’s success, 313–14 AlphaGo challenge and, 141 AlphaStar and, 225, 227–28 Altman compared with, 290, 292, 294, 349 -- 523 of 565 -- ambitions of, xiv–xv, xx–xxi, 31 appearance of, xiii, xix, 22 on Bell Labs, 309–10 Black & White game and, 29–30, 42–43 on brain and reality construction, 48–49 brainstorming and, 271 CASP contest and, 270–71 celebrity of, 159 on ChatGPT’s release, 307–8 childhood AI dreams in, 16–18 Bullfrog contest in, 12–13 chess in, 4–9 computer programming in, 9–12 friends in, 6, 8–9 games and hobbies in, 8 in Liechtenstein, 7 Molyneux meeting in, 13–14 parents and, 3–6 Theme Park video game work in, 14–15, 17–18 children of, 383 on classical computers and quantum mechanics, 389–94 -- 524 of 565 -- Daily Mail coverage and, 185 on deduction and induction, 28–29 DeepMind equity shares of, 83, 414n20–21 DeepMind split proposal and, 240, 246–48 on DeepMind valuation, 138 DeepSeek competition for, 373 Elixir, 405n2 AI ambitions for, 35 closing of, 42 cofounding, 32–34 DeepMind benefitting from, 109 at Electronic Entertainment Expo, 38–40 fundraising for, 34–35 Republic game by, 36–41 Ender’s Game and, 1–2 entrepreneurial instinct of, 228–29 on first order logic, 26 Foldit and, 259–60 Gaia and, 221–22 Gatsby Computational Neuroscience Unit and, 50–51, 53–54, 59–62 on Gemini compared with ChatGPT, 345–46 -- 525 of 565 -- on God and AGI, xix–xx, 114–15 Gödel, Escher, Bach and, 15–18 Google acquisition earnings of, 139 on Google inspecting DeepMind code, 135–36 on Google’s acquisition plans for DeepMind, 117–19 Gopher and, 286 government regulation and, 327–28 on GPT-3, 285 grounding problem and, 215–16 Harvard and MIT fellowships of, 51–52 Hinton meeting, 52 Hinton’s company auction and, 120–21 on human experience, 216 imagination of, 84–85 on information ages, 114 on information as unit of reality, 28 on international body for AGI safety, 376 interviewing, xviii–xix on language models, 214–17, 222–23, 361 on language models with RL, 352–53 -- 526 of 565 -- memory replay and, 103–4 memory studies by, 45–47 Mind Sports Olympiad won by, 36 Mnih meeting, 98–99 Musk meeting, 111 Musk’s DeepMind investment and, 124–25 on nature and AI, 389 neuroscience studied by, xiv, 1, 45–47 Nobel Prize and, xviii, 381–83 on noise and retreat, 394–96 one-sentence letter signed by, 326–27 on Page and Musk, 166 Page recruiting, 128–29 on “pause” letter, 325 on philanthropy, 386–87 Pichai and, 233, 238–39, 242 on Planck scale, 388–89 poker and, 69 power and, 383–87 pragmatism of, 115–16, 396 -- 527 of 565 -- on product pivot, 313–14 Project Orion and, 53 on protein folding, 257 RL and, 44, 199, 373–75 safety and ethics review and, 162, 169–70, 173 scientific journals and, 155 Silver meeting, 23 singleton scenario and, 167–68, 228 at Singularity Summit, 72–74, 167 Suleyman bullying accusations and, 249–51 Suleyman’s friendship with, 65, 68–70, 122–23, 247–48 table tennis and, 410n31 Thiel misjudged by, 123 on transformers’ potential, 216–17 University of Cambridge and, 14, 19–31, 36 values of, xx–xxi voice of, 381 walk-away plan of, 234–36 on wealth, 385–87 Winston dismissing, 52–53 -- 528 of 565 -- work ethic of, 2–3 Zuckerberg tested by, 133 Hassabis, George, 12, 65, 69 Hawkins, Jeff, 410n32 Her, 314 Heritage Foundation, 254 Hinton, Geoffrey, xvi–xvii, 63, 135 company auction of, 120–21 as DeepMind adviser, 86 DeepMind equity shares and, 414nn20–21 on DeepMind valuation, 138 Hassabis, D., meeting, 52 ImageNet breakthrough of, 91–92 Mnih working with, 96–97 neural networks pioneered by, 51–52, 93 at NIPS conference, 119–20 safety concerns of, 317–19 table tennis and, 410n31 on “vanishing gradients” problem, 197 hippocampus, 47 -- 529 of 565 -- Hoffman, Reid, xvi, xvii, 168–69, 171, 236–37 Hofstadter, Douglas R., 15–18, 28, 147, 403n28, 411n6 “How to Build a Superhuman Agent” (Silver), 353 Huang, Aja, 144–46, 148, 150–53, 157–58 human-machine alignment, 323 Huppert, Julian, 421n19 Hutter, Marcus, 58–59 I IBM, 10 ImageNet, 91–93, 146, 416n15 imagination, memory and, 47 incompleteness theorem, 25 induction, 26, 28–29, 91–92 Industrial Revolution, 217 Inflection, 306 information, as unit of reality, 28 information ages, 114 innovator’s dilemma, of Google, 308–9, 350 -- 530 of 565 -- intelligence, xiv, 25, 27, 43–44, 57–59 Intelligenesis, 56 International Mathematical Olympiad, 359, 378 introspection, 144 intuition, 142–43, 146–47, 305, 417n16 Iron Man, 314 Irving, Geoffrey, 431n25 on AI safety problems, 280–82 DeepMind hiring, 280, 283–84 Flamingo and, 295–96 Gato and, 296 Gopher and, 286, 288 human feedback problem of, 336–37 language models and, 284–85 mechanistic interpretability and, 336 at OpenAI, 281–83 Sparrow and, 297–99 Islam, 63–67, 175 Islamophobia, 65 Italy, 328 -- 531 of 565 -- J James, Kay Coles, 254 job elimination, 363–64 Jobs, Steve, xix Johnson, Lyndon, 419n5 Johnson, Robert A., 68–69 Jules, 367 Jumper, John, 263–67, 269–75 K Kahneman, Daniel, 142, 144 Kane, Angela, 238, 239 Kant, Immanuel, xiv, 47–48 Kasparov, Garry, 10, 73, 194–95 Kavukcuoglu, Koray, 101, 103, 135, 156, 285–86, 334, 431n25 Keane, Pearse, 190 King, Dominic, 180–81, 189, 421n16, 422n21, 428n22 King, Helen, 90 Krizhevsky, Alex, 91, 93–94, 146, 201, 416n15 -- 532 of 565 -- Kumaran, Dharshan, 8, 46–47, 86, 99, 122, 402n7, 406nn24–25 Kurzweil, Ray, 57–58, 60, 407n12 L Lai, Jonathan, 438n27 Laing, Chris, 177–79, 189–90, 421n9, 421n11 LaMDA, 289, 302, 311 language models, 214–17, 222–23, 284–85, 352–53, 361, 363 Large Hadron Collider, CERN, 387–89 Leavitt, Karoline, 370 LeCun, Yann, 99, 132 on AGI, 86, 88, 320–21, 324, 410n32 on deep learning compared with RL, 196–97 Kavukcuoglu recruited by, 135 one-sentence letter signed by, 326–27 Zuckerberg recruiting, 133–34 Lee Sedol, 156, 158–61, 194 Legg, Shane, 51, 134. See also DeepMind on AGI believers and skeptics, 88 -- 533 of 565 -- on Atari challenge, 412n22 Baby WebMind and, 56–57 DeepMind equity shares of, 83 at Gatsby Computational Neuroscience Unit, 59 Google acquisition earnings of, 139 “The Halloween Scenario” lecture of, 59–62 Hassabis, D., meeting, 53–54, 61–62 on Hassabis, D.’s, work ethic, 2–3 intelligence measurement study of, 58–59 Intelligenesis and, 56 Mnih recruited by, 97–98 Moore’s Law and, 58 on OpenAI’s founding, 420n27 in safety and ethics review, 170 schooling struggles of, 54–55, 407n10 on Silver joining DeepMind, 411n10 at Singularity Summit, 72–74 at Swiss Finance Institute, 59 Leike, Jan, 301–2, 305 Lemoine, Blake, 302 -- 534 of 565 -- Levy, David, 9–11, 24 Li, Fei-Fei, 91 Liechtenstein, 7 Life Story, 19–20, 98 Lightman, Hunter, 438n36 Li Ka-shing, 124, 126 Li Qiang, 370 Llama, 321, 323, 356–57 LMSYS, 436n27 logit attribution, 336 London Mathematical Society, 84, 409n25 Lupas, Andrei, 276–77 M Ma, Jack, 405n4 Macdonald, Ken, 427n6 machine autonomy, RL and, 211–12 machine intelligence, 57 “Machine Super Intelligence” (Legg), 59 -- 535 of 565 -- macular degeneration screening, 179, 188, 190 Maddison, Chris, 145–47, 149, 151 Maguire, Eleanor, 45, 47 The Magus (Fowles), 30–31 Manhattan Project, 84, 164, 168 Mann, Steve, 72 Massive Multitask Language Understanding (MMLU) test, 342–44 The Matrix, 48 Maxwell, James Clerk, 411n3 McAleese, Nat, 430n11 McDonagh, Joe, 33–34, 36–37, 405n2, 410n29 mechanistic interpretability, 336 medical diagnostics, neural networks for, 412n14 Meena, 289, 430n12 memory replay, 103–5, 108, 412nn21–22 memory studies, 45–47 Meta, 321, 323, 356–57 Metz, Cade, 156 Meyer, Clemens, 275–76 Microsoft, 119–21 -- 536 of 565 -- Bing and, 312, 315–16 OpenAI funding from, 289, 306, 433n25 ResNet and, 197–98 safety and ethics concerns of, 302–3 Microsoft AI, 191, 255 Mind Sports Olympiad, 8, 36 Minecraft, 219, 220 Minsky, Marvin, 52 Mistral, 328 Mitchell, Margaret, 293 MIT fellowship, 51–52 MIT Technology Review, 342–43 mixture-of-experts system, 347 MMLU (Massive Multitask Language Understanding) test, 342–44 Mnih, Vlad Atari challenge and, 101–8 DeepMind recruiting, 97–99 DQN presentation of, 108–9 education of, 92–93 on Google inspecting DeepMind code, 135 -- 537 of 565 -- Hassabis, D., meeting, 98–99 Hinton working with, 96–97 memory replay and, 103–5, 412nn21–22 neural network inquiries of, 94–95 at NIPS conference, 108–9, 134 RL’s exciting potential, 96 spiraling expectations problem-solved by, 106–8 model weights, 323 molecular dynamics, 263–64 Mollick, Ethan, 434n35 Molyneux, Peter, 50, 402n19 believers of, 405n18 Black & White game and, 29–30, 42–43 Bullfrog contest and, 12–13 cars of, 404n2 Elixir investment of, 35 games invented by, 12 Hassabis, D., meeting, 13–14 Hassabis, D.’s, cash offer from, 19 The Magus compared with, 30–31 -- 538 of 565 -- Project Orion and, 53 Theme Park video game and, 14–15, 17–18 Monte Carlo Tree Search, 143–44, 147, 149, 416nn6–7 Montgomery, Hugh, 421n9 Moore’s Law, 57–58 “Moore’s Law for Everything” (Altman), 290–92 Moorfields, 179 Moravec’s paradox, 195–96 Moult, John, 276–77 Müller, Martin, 416n2 multiparty democracy, for safety and ethics, 169 Murray, Andy, 104–5 Musk, Elon, 110, 236 Altman and, 163–64, 172–73 birthday parties of, 128, 165 campaign contributions of, 401n6 DeepMind acquisition attempt of, 136–37 DeepMind investment of, 124–25 Future of Life Institute and, 163 Hassabis, D., meeting, 111 -- 539 of 565 -- Nosek and, 420n26 OpenAI launch and, 172–73 in OpenAI restructuring fights, 243–46 Page and, 165–66, 169 safety and ethics and, 162–66, 172 Terminator tropes and, 171 xAI and, 324 Muslim Youth Helpline, 66–67, 175 N Nadella, Satya, 312, 340 National Health Service (NHS), 249 data concerns of, 181–85, 422nn25–26 DeepMind Health failures and, 190–91 Hark and, 180, 189–90 needs of, 176–77 Royal Free Hospital and, 177–78, 422n26, 423n38 Streams AKI alert system and, 178–79, 183–85, 188–90 nature, 219–22, 389, 394, 404n14 -- 540 of 565 -- Nature, 141, 155–56, 225 Nature Medicine, 188, 190 neural networks in Atari challenge, 101 as black boxes, 336 convolutional, 200–201, 262–63 dense, 347 Hinton pioneering, 51–52, 93 intuition mimicked by, 146–47 Krizhevsky’s system and, 91–94 for medical diagnostics, 412n14 recurrent, 200, 208 residual, 197–98 self-supervised learning and, 206, 212 sentiment-neuron model and, 206–7 supervised learning and, 205 Szepesvári on, 94–95 neuroscience, xiv, 1, 45–48, 77–78 Neven, Hartmut, 394 Newton, Isaac, 264–65 -- 541 of 565 -- NIPS/NeurIPS (Neural Information Processing Systems) conference, 108– 9, 119–20, 134, 148, 305 Nobel Prize, xviii, 23, 381–83 Nosek, Luke, 80–82, 110–12, 116, 121–26, 136–37, 413n5, 420n26 Numenta, 410n32 numerical computer, 10 Nvidia, 157, 439n14 O o1 model, 359–63, 366–68 o3 model, 368–69, 374–75 O’Gieblyn, Meghan, 419n14 Ojjeh, Ali, 414n14 one-sentence safety letter, 326–27, 330 On Intelligence (Hawkins), 410n32 OpenAI, xviii, 86. See also ChatGPT AGI manifesto of, 323–24 alignment team at, 323 Altman’s firing at, 254, 337–40 Altman’s reinstatement at, 254, 340 -- 542 of 565 -- Bing integration with, 312 Codex and, 290 DALL-E and, 290 DALL-E 2 and, 300, 302 DeepMind falling behind, 228–29, 288–90, 307, 314 DeepMind recruiting against, 185–86 DeepMind surpassing, 342–44, 373 Deployment Safety Board of, 300–302, 316 engineers of, 208–9 GPT and, 210–12 GPT-2 and, 211, 218–19, 282–83, 341, 430n12 GPT-3 and, 285–89, 341, 342 GPT-4 and, 300–301, 312, 316, 322–23, 341–44, 355, 437n8 GPT-5 and, 372 GPT-Zero and, 353 Irving at, 281–83 launch of, 172–73, 420n27 Leike at, 301–2 Microsoft funding for, 289, 306, 433n25 nonprofit/for-profit hybrid of, 254 -- 543 of 565 -- nonprofit status of, 236, 242–44 o1 model of, 359–63, 366–68 o3 model of, 368–69, 374–75 Q* project and, 356, 359 Rae joining, 293–95 restructuring fights of, 243–46 reward hacking problem and, 374–75 RL and, 199–200 RLHF and, 298, 301 safety charter of, 304, 432n10 self-supervised learning and, 206, 212 Sora and, 349, 367 staff revolt backing Altman at, 339–40 step-by-step reasoning and, 355 Sutskever and, 204–6, 208–9, 423n30 Tesla and, 245–46 wealth generated by, 436n33 Oppenheimer, J. Robert, xvii, xxi, 164, 320 Othello, 11–12 Owning Your Own Shadow (Johnson), 68–69 -- 544 of 565 -- P Page, Larry, 110, 135 DeepMind acquisition plans of, 111–12, 117–19 DeepMind independence plans and, 238–40 Hassabis, D., recruited by, 128–29 Musk and, 165–66, 169 safety and ethics and, 165–66, 170 as transhumanist, 419n15 Palantir, 73, 79 “pause” letter, 324–26, 435n17 PayPal, 73 Penrose, Roger, 390–94 Perkins, Tom, 147 philanthropy, Hassabis, D., on, 386–87 physics, xiv Pi, 306 Pichai, Sundar, 233–34, 238–42, 248, 309–11 Pichette, Patrick, 118, 132, 427n3 Pinker, Steven, 195 -- 545 of 565 -- Planck scale, 388–89 Poggio, Tomaso, 51–52, 73, 122, 411n3 poker, 69, 131 Pong game, 104–5 power, Hassabis, D., and, 383–87 Proceedings of the National Academy of Sciences, 47 Project Astra, 367 Project Mariner, 367 Project Mario, 231 Project Orion, 53 protein folding AlphaGo problem for, 266 CASP and, 262, 269–73 DeepMind’s success with, 260–61, 277–79 direct folding and, 271–73 distogram for, 267–68 Foldit and, 259–66 gamification of, 260, 266 molecular dynamics and, 263–64 mystery of, 257–58 -- 546 of 565 -- transformers and, 274–75 UniProt database and, 267–69 X-ray crystallography and, 258–59, 267–69 Purves, Drew, 219–21, 223 Putin, Vladimir, 246 Putnam, Hilary, 26, 404n10 Q Q-learning, 101–2, 106 Q* project, 356, 359 quantum mechanics, 389–94 R R1, DeepSeek, 369–72 R1-Zero, DeepSeek, 371–72 Radford, Alec, 208–10 radiology, 188–89 Rae, Jack, 346, 348–50, 368, 430n11, 437n10 Gopher frustrations of, 290, 293 -- 547 of 565 -- OpenAI joined by, 293–95 reasoning, RL and, 362–66 on scaling up, 218–19, 283, 285 280B project and, 286 Ranzato, Marc’Aurelio, 218 Rayburn, Sam, 419n5 reasoning, RL and, 358, 362–66 recurrent neural network, 200, 208 Redmond, Michael, 159 Rees, Geraint, 421n9 reinforcement learning (RL) AGI and, 97 AlphaGo trained with, 152 AlphaProof and, 378 AlphaStar and, 226–27 AlphaZero and, 195–200 in Atari challenge, 101 concept of, 44, 95 deep learning combined with, 102–3 deep learning compared with, 93, 95–96, 196–97 -- 548 of 565 -- future of, 377–78 Gaia and, 223–24 Gemini experiments with, 357–60 language models with, 352–53, 363 machine autonomy and, 211–12 memory replay and, 103–5, 108 o1 model and, 359 OpenAI and, 199–200 potential of, 96 Q-learning and, 101–2, 106 Q* project and, 356, 359 from raw experience, 412n24 reasoning and, 358, 362–66 safety risks of, 373–75 with verified rewards, 438n27 reinforcement learning from human feedback (RLHF), 298–99, 301, 335 Renaissance Technologies, 232 Reos Partners, 67 Republic game, 36–41 residual neural network (ResNet), 197–98 -- 549 of 565 -- Responsible AI team, Gemini, 349–50 retinal technology, 179, 188, 190 RETRO, 292, 294, 346 reward hacking, 374–75 Riley, Talulah, 128, 136, 163 Roose, Kevin, 315–16 Royal Free Hospital, 177–78, 422n26, 423n38 Russell, Stuart, 435n13 Rusu, Andrei A., 426n6 S Sadler, Matthew, 402n10 safety and ethics AI governance and, 236–38, 243–44, 246, 254–56 AlphaGo and, 281 Altman on, 164, 283, 300–303 Bengio’s concerns with, 319–20 DeepMind’s papers on, 292–94 DeepMind’s review of, 162–63, 168–73, 230–31 -- 550 of 565 -- Google acquisition of DeepMind and, 130–32, 139, 162–69 government regulation and, 328–33, 370 GPT-2 and, 282–83 GPT-4 and, 300–301 Hinton’s concerns with, 317–19 international body for, 376 Irving on AI problems with, 280–82 job elimination and, 363–64 LaMDA and, 302 Microsoft’s concerns with, 302–3 model weights and, 323 multiparty democracy for, 169 Musk and, 162–66, 172 NHS data concerns with, 181–85, 422nn25–26 one-sentence letter on, 326–27, 330 OpenAI’s charter on, 304, 432n10 “pause” letter and, 324–26, 435n17 RL risks with, 373–75 singleton scenario and, 167–69, 172, 228 social cohesion and, 170 -- 551 of 565 -- Suleyman’s concerns with, 420n22 Sutskever and, 303 Tay’s issues with, 302 weapons and, 376 Sagan, Carl, 393 Samuel, Arthur, 402n17 Saudi Arabia, 328 Schmidt, Eric, 157, 170, 239 scientific journals, 155–56 Seaquest game, 105–8 self-supervised learning, 206, 212 semiconductors, 76, 157, 310, 329, 369–71, 435n20, 439n14 Senior, Andrew, 261, 262, 267, 270 sentiment-neuron model, 206–7 sequence-to-sequence modeling (Seq2Seq), 203–4 Shannon, Claude, 9–10, 28 Shazeer, Noam, 362–65, 367 Shear, Emmett, 340 shogi, 8 Silicon Valley, xx–xxi, 33–35, 62, 147 -- 552 of 565 -- Silver, David, 37 AlphaGo and, 143–47, 149–50 AlphaStar project and, 225–27 AlphaZero and, 193–94, 198–99 Atari challenge and, 101, 103–6, 109 CASP contest and, 262, 269 DeepMind hiring, 99, 411n10 DeepMind shares rejected by, 85–86 on DQN as turning point, 109 at Electronic Entertainment Expo, 39–40 Elixir and, 32–34, 41–42 Gemini post-training and, 335 GPT and, 212–13 Hassabis, D., meeting, 23 “How to Build a Superhuman Agent” by, 353 on intuition re-creation, 417n16 memory replay and, 103–4 Monte Carlo Tree Search of, 143–44 Moravec’s paradox amended by, 195–96 on Page as transhumanist, 419n15 -- 553 of 565 -- Project Orion and, 53 on protein folding, 257 Republic game and, 38, 39 RL and, 44, 198–99, 211–12, 226–27, 358, 377–78, 412n24 “Welcome to the Era of Experience” by, 377–80 work ethic of, 39 Simons, Jim, 232 Singerman, Brian, 121 singleton scenario, 167–69, 172, 228 singularity, 58, 60, 167 Singularity Summit, 54, 60, 72–74, 167, 408n7 social cohesion, safety and ethics and, 170 Sora, 349, 367 South Korea, 156 Space Invaders game, 12, 13, 108–9 space travel, 111 SpaceX, 73, 81, 110, 171 Sparrow, 297–99, 304, 311, 432n14 spatial dependencies, 201 spiraling expectations problem, 106–8 -- 554 of 565 -- StarCraft, 224–28 step-by-step reasoning, 355 Stock, Gregory, 72 Stockfish, 194–95 Streams AKI alert system, 178–79, 183–85, 188–90 Suleyman, Mustafa, 119. See also DeepMind academic success of, 64–65 on AI governance, 255–56 atheism and, 67 bullying accusations and, 249–51 childhood of, 63–65 DeepMind Applied division and, 174–75, 186–87 DeepMind equity shares of, 83 DeepMind exit of, 253 DeepMind Health failures and, 190–91 DeepMind split proposal and, 240, 246–48 on DeepMind valuation, 138 defenders of, 428n22 at Google, 253–55 Google acquisition earnings of, 139 -- 555 of 565 -- on Halcyon Molecular, 79 Hark bought by, 180 Hassabis, D.’s, friendship with, 65, 68–70, 122–23, 247–48 Inflection and, 306 Islam and, 63–67, 175 Microsoft AI and, 255 NHS and, 176–77, 181–85 Ojjeh and, 414n14 participatory consultation issues of, 186–87 poker and, 69, 131 politics of, 175–76 Reos Partners, 67 sabbatical of, 251–53 in safety and ethics review, 170–71 safety concerns of, 130–31, 230–31, 420n22 at Singularity Summit, 72–74 Streams AKI alert system and, 178–79, 183–85, 188–90 at University of Oxford, 65–66 walk-away plan of, 234–36 Summerfield, Chris, 99 -- 556 of 565 -- supervised learning, 205 Sutskever, Ilya, 91, 98, 145, 186, 416n15 on AlphaGo scaling up, 149 on AlphaZero, 357 on consciousness and AI, 439n19 on deep learning and RL, 196–97 DeepMind recruiting, 86 Go experiment of, 146, 147, 417n22, 417n26 GPT and, 210–11 on GPT-3, 285 GPT-Zero and, 353 NIPS presentation of, 148–49 OpenAI and, 204–6, 208–9, 423n30 in OpenAI restructuring fights, 243–44 recurrent neural network and, 200 safety and ethics and, 303 salary of, 423n30 sentiment-neuron model and, 206–7 Seq2Seq and, 203–4 strengths and weaknesses of, 339–40 -- 557 of 565 -- temporal dependencies work of, 201–4 transformers and, 200, 208 word embedding and, 202 Sutton, Richard, 43–44, 86, 93, 149, 199, 377–80 Swiss Finance Institute, 59 symbolic AI, 24–25 System One thinking, 142–43, 151, 417n31 System Two thinking, 144, 151, 417n31 Szepesvári, Csaba, 94–95 Szilard, Leo, 84 T table tennis, 410n31 Tallinn, Jaan, 87–88, 122 Tay, 302 temporal dependencies, 201–4 tensor processing units (TPUs), 157 Terminator, 171, 435n13 Tesla, 137, 243, 245–46 -- 558 of 565 -- testing notification, government regulation and, 331–32 tetraformer, 275 Theme Park video game, 14–15, 17–18 Thiel, Peter AGI companies scouted by, 73–74 on board seats’ value, 81–82 DeepMind financed by, 73–75, 82–83, 87, 116–17, 121–27, 409n22 Hassabis, D., misjudging, 123 at Singularity Summit, 408n7 successful startups of, 73 unsettling world of, 79 thinking tokens, 359, 360 Topol, Eric, 189 TPUs (tensor processing units), 157 transformers AlphaStar and, 226 conversational systems and, 207, 214 Google’s innovation with, 206–8 impact of, 200, 208 -- 559 of 565 -- OpenAI replicating, 209–10 potential of, 216–17 protein folding and, 274–75 temporal dependencies and, 201 translational systems and, 207, 209 translational systems, 207, 209 Trump, Donald, xxi, 237, 370 Tsai, Joe, 248 Tunyasuvunakool, Kathryn, 429n9 Turing, Alan, 57, 84, 332, 389–94, 409n27 Turing machines, 389–94 280B project, 286 U UniProt database, 267–69 United Arab Emirates, 328 University College London, 45, 50 University of Cambridge Daugman at, 27–28 -- 560 of 565 -- early acceptance to, 14 entrepreneurship and, 31 epiphanies at, 27–29 first order logic and, 25–26 games played at, 22–23, 36 graduation from, 29–30 Life Story motivation for, 19–20 partying at, 21 Silver meeting at, 23 University of Chicago, 263 University of Oxford, 65–66 University of Waikato, 55 V “vanishing gradients” problem, 197 Veo 2, 367, 369 Vicarious, 410n32 Vinyals, Oriol, 225–26, 274, 334, 359 -- 561 of 565 -- W Walker, Kent, 238 Watson, James, 19–20, 382 weapons, AI and, 376 Weidinger, Laura, 292–93 “Welcome to the Era of Experience” (Silver and Sutton), 377–80 Wierstra, Daan, 87, 89–90, 97–98, 410n35 Winston, Patrick, 52–53 “wokeness,” 351 women, at DeepMind, 89, 410n39 word embedding, 202 X xAI, 324, 409n16 Xerox PARC, 309 X-ray crystallography, 258–59, 267–69 Y Y Combinator, 163, 164 -- 562 of 565 -- Yudkowsky, Eliezer, 74–75, 325–26, 435n18 Z Zilis, Shivon, 244 Zoufonoun, Amin, 132–33 Zuckerberg, Mark, 132–34, 156, 415n9 ZX Spectrum 48K, 9 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z OceanofPDF.com -- 563 of 565 -- ABOUT THE AUTHOR Sebastian Mallaby is the author of several books including the bestselling More Money Than God. A former Financial Times contributing editor and two-time Pulitzer Prize finalist, Mallaby is the Paul A. Volcker Senior Fellow for International Economics at the Council on Foreign Relations. OceanofPDF.com -- 564 of 565 -- Wat’s next on your reading list? Discover your next great read! Get personalized book picks and up-to-date news about this author. Sign up now. OceanofPDF.com -- 565 of 565 --