Marco andrea@passaglia.it
The Bellwether

A morning brief, composed for you when the sources say something worth saying.

‹ Reference

The Thinking Machine- Jensen Huang, Nvidia, and the World's -- Stephen Witt -- 2025 -- Penguin Publishing Group -- 478b4f2934342f4e7a8177f45bdd088c -- Anna’s Archive

other Reference Materials/Technology Books/Tech Western 508 KB text added 6/4/2026
-- 1 of 263 -- ALSO BY STEPHEN WITT How Music Got Free: A Story of Obsession and Invention -- 2 of 263 -- -- 3 of 263 -- VIKING An imprint of Penguin Random House LLC 1745 Broadway, New York, NY 10019 penguinrandomhouse.com Copyright © 2025 by Stephen Richard Witt Penguin Random House values and supports copyright. Copyright fuels creativity, encourages diverse voices, promotes free speech, and creates a vibrant culture. Thank you for buying an authorized edition of this book and for complying with copyright laws by not reproducing, scanning, or distributing any part of it in any form without permission. You are supporting writers and allowing Penguin Random House to continue to publish books for every reader. Please note that no part of this book may be used or reproduced in any manner for the purpose of training artificial intelligence technologies or systems. VIKING is a registered trademark of Penguin Random House LLC. Brief portions of this work originally appeared, in different form, in The New Yorker in 2023. Cover image: Bloomberg / Getty Images , : 2024048528 Hardcover ISBN 9780593832691 Ebook ISBN 9780593832707 International Edition ISBN 9780593834565 Simultaneously published in hardcover in Great Britain by The Bodley Head, an imprint of Penguin Random House Ltd., London, in 2025. First United States edition published by Viking, 2025. The authorized representative in the EU for product safety and compliance is Penguin Random House Ireland, Morrison Chambers, 32 Nassau Street, Dublin D02 YH68, Ireland, https://eu- contact.penguin.ie. ep_prh_7.1a_150788602_c0_r0 -- 4 of 263 -- CONTENTS Dedication Epigraph Introduction PART I 1. The Bridge 2. Large-Scale Integration 3. New Venture 4. Thirty Days 5. Going Parallel 6. Jellyfish 7. Deathmatch 8. The Compulsion Loop 9. CUDA 10. Resonance 11. AlexNet PART II 12. O.I.A.L.O. 13. Superintelligence -- 5 of 263 -- 14. The Good Year 15. The Transformer 16. Hyperscale 17. Money 18. Spaceships 19. Power 20. The Most Important Stock on Earth 21. Jensen 22. The Fear 23. The Thinking Machine Acknowledgments About the Author -- 6 of 263 -- For Jane -- 7 of 263 -- Practice even what seems impossible. —M A -- 8 of 263 -- T INTRODUCTION his is the story of how a niche vendor of video game hardware became the most valuable company in the world. It is the story of a stubborn entrepreneur who pushed his radical vision for computing for thirty years, in the process becoming one of the wealthiest men alive. It is the story of a revolution in silicon and the small group of renegade engineers who defied Wall Street to make it happen. And it is the story of the birth of an awesome and terrifying new category of artificial intelligence, whose long-term implications for the human species cannot be known. At the center of this story is a propulsive, mercurial, brilliant, and extraordinarily dedicated man. His name is Jensen Huang, and his thirty- two-year tenure is the longest of any technology CEO in the S&P 500. Huang is a visionary inventor whose familiarity with the inner workings of electronic circuitry approaches a kind of intimacy. He reasons from first principles about what microchips can do today, then gambles with great conviction on what they will do tomorrow. He does not always win, but when he does, he wins big: his early, all-in bet on AI was one of the best investments in Silicon Valley history. Huang’s company, Nvidia, is today worth more than $3 trillion, rivaling both Apple and Microsoft in value. In person, Huang is charming, funny, self-deprecating, and frequently self-contradictory. He keeps up a semicomic deadpan patter at all times. We met in 2023 for breakfast at a Denny’s diner, his favorite restaurant chain. Huang had developed the business plan for Nvidia at this same restaurant thirty years earlier; chatting with our waitress, he ordered seven items, including a Super Bird sandwich and a chicken-fried steak. “You know, I -- 9 of 263 -- used to be a dishwasher here,” he told her. “But I worked hard! Like, really hard. So I got to be a busboy.” Huang, born in Taiwan, immigrated to the United States when he was ten. Denny’s was the crucible of his assimilation—working there as a teenager, he ate through the entire menu. Still, he told me, he maintains an outsider’s perspective. “You’re always an immigrant,” he said. “I’m always Chinese.” He cofounded Nvidia (pronounced IN-vidia, not NUH-vidia) in 1993 when he was thirty, first targeting the nascent market for high-end video game graphics. His products were popular; his customers liked to build their own PCs, sometimes buying transparent housing to showcase their Nvidia hardware. In the late 1990s, seeking to better render the Quake series of games, Nvidia made a subtle change to the circuit architecture of its processors, allowing them to solve more than one problem at a time. This approach, known as “parallel computing,” was a radical gamble. “The success rate of parallel computing was zero percent before we came along,” Huang said, rattling off a list of forgotten start-ups. “Literally zero. Everyone who tried to make it into a business had failed.” Huang ignored this dismal record, pursuing his unconventional vision in open defiance of Wall Street for more than a decade. He looked for customers besides gamers, ones who needed a lot of computing power—weather forecasters, radiologists, deep-water oil prospectors, that sort of thing. During this time, Nvidia’s stock price floundered, and he had to fend off corporate raiders to retain his job. Huang stuck with this bet, losing money on it for years, until in 2012 a group of dissident academics in Toronto purchased two consumer video game cards to train an exotic kind of artificial intelligence called a neural network. At the time, neural networks, which mimic the structure of biological brains, were deeply out of favor, and most researchers considered them obsolete toys. But when Huang saw how fast neural networks trained on his parallel-computing platform, he staked his entire company on the unexpected symbiosis. Huang now needed two underdog technologies to work—two technologies that had always failed the test of the marketplace in the past. -- 10 of 263 -- When this audacious corporate parlay hit, Nvidia increased in value several hundred times. In the past decade, the company has evolved from selling $200 gaming accessories to shipping multimillion-dollar supercomputing equipment that can fill the floor of a building. Working with pioneers like OpenAI, Nvidia has sped up deep-learning applications more than a thousand times in the last ten years. All major artificial- intelligence applications—Midjourney, ChatGPT, Copilot, all of it—were developed on Nvidia machines. It is this unprecedented increase in computing power that has made the modern AI boom possible. With a near-monopoly on the hardware, Huang is arguably the most powerful person in AI. Certainly, he’s made more money from it than anyone else. In the strike-it-rich tradition, he most closely resembles California’s first millionaire, Samuel Brannan, the celebrated vendor of prospecting supplies who lived in San Francisco in 1849. Except rather than shovels, Huang sells $30,000 AI-training chips that contain one hundred billion transistors. The wait time to purchase his latest hardware is currently more than a year, and on the Chinese black market, his chips sell for double the price. Huang doesn’t think like a businessman. He thinks like an engineer, breaking down difficult concepts into simple principles, then leveraging those principles to great effect. “I do everything I can not to go out of business,” he said at breakfast. “I do everything I can not to fail.” Huang believes that with AI, the basic architecture of digital computing, little changed since it was introduced by IBM in the early 1960s, is being reconceptualized. “Deep learning is not an algorithm,” he said. “Deep learning is a method. It’s a new way of developing software.” This new software has incredible powers. It can speak like a human, write a college essay, solve a tricky math problem, provide an expert medical diagnosis, and cohost a podcast. It scales with the amount of computing power available to it and never seems to plateau. The evening before our breakfast, I’d watched a video in which a robot, running this new kind of software, stared at its hands in seeming recognition, then sorted a collection of colored blocks. The video had given me chills; the -- 11 of 263 -- obsolescence of my species seemed near. Huang, rolling a pancake around a sausage with his fingers, dismissed my concerns. “I know how it works, so there’s nothing there,” he said. “It’s no different than how microwaves work.” I pressed Huang—an autonomous robot surely presents risks that a microwave oven does not. He responded that he has never worried about the technology, not once. “All it’s doing is processing data,” he said. “There are so many other things to worry about.” Where this will lead is anyone’s guess; many technologists now worry that AI’s capabilities pose a direct threat to the survival of the human species. (Among these “doomers” are the Toronto scientists who first implemented AI on Huang’s platform.) Huang dismisses such pessimism. For him, AI is a pure force for progress, and he has declared that it is spurring a new industrial revolution. He doesn’t permit much disagreement on this topic, and his force of personality can be intimidating. (“Interacting with Jensen is like sticking your finger in the electrical socket,” one of his executives said.) Huang’s employees worship him—I believe they would follow him out of the window of a skyscraper if he saw a market opportunity there. In May 2023, hundreds of industry leaders endorsed a statement that equated the risk of runaway AI with that of nuclear war. Huang didn’t sign it. Some economists have observed that the Industrial Revolution led to a relative decline in the global population of horses and have wondered if AI might do the same to humans. “Horses have limited career options,” Huang said. “For example, horses can’t type.” As he finished eating, I expressed my concerns that, someday soon, I would feed my notes from our conversation into an intelligence engine, then watch as it produced structured, superior prose. Huang didn’t dismiss this possibility, but he assured me that I had a few years before my John Henry moment. “It will come for the fiction writers first,” he said. Then he tipped the waitress a thousand dollars and stood up from his many plates of half-eaten food. I found Huang to be an elusive subject, in some ways the most difficult I’ve ever reported on. He hates talking about himself and once responded to one of my questions by physically running away. Before this book was -- 12 of 263 -- commissioned, I had written a magazine profile of Huang for The New Yorker. Huang told me he hadn’t read it, and had no intention of ever doing so. Informed that I was writing a biography of him, he responded, “I hope I die before it comes out.” Still, Huang offered me access to a great number of people to report this book. I spoke with almost two hundred people, including his employees, his cofounders, his rivals, and several of his oldest friends. The beloved and even somewhat goofy family man who emerged from these interviews bore little resemblance to the unapologetically carnivorous executive who made Nvidia succeed, but it is these same attachments that spur Huang’s ambition: he spoke frankly with me of his insecurities, his fear of letting his employees down, his fear of bringing shame to the family name. Some executives speak of profit as “keeping score,” but not Huang; for him, the money is only temporary insurance against some future calamity. There was something a little touching about hearing a man worth a hundred billion dollars talk in this way. But if Huang is motivated by anxiety, he is also motivated by fascination with the seductive power his technology has unlocked. He had not set out to be an AI pioneer, not even when he’d turned his attention to parallel computing, but once it arrived, Huang became determined to push his maximalist agenda for machine intelligence as far and as fast as it could possibly go. Even the most optimistic visionaries in the field urge some degree of caution; the supposed mission of OpenAI, for example, is to ward off catastrophe. Huang, almost alone, believes that AI can lead only to good, and it is this belief that motivates him to work twelve to fourteen hours a day, seven days a week, even after three decades as CEO. Of course, Huang would work hard anyway. It is in his nature. If there is a theme to his life, it is amplification; he has executed on the same simple precepts of diligence, courage, and mastery of fundamentals again and again and again, to greater and greater effect. I was surprised to learn how much of the man he later became was present in the immigrant child arriving unaccompanied by his parents in the United States in 1973 to an environment so unconducive to flourishing that it seems a miracle he -- 13 of 263 -- survived it. To understand Huang fully, we begin not at Denny’s restaurant, nor in the giant cathedrals of technology he later commissioned, but at this tiny rural school. -- 14 of 263 -- PART I -- 15 of 263 -- S ONE The Bridge ometime toward the end of 1973, ten-year-old Jensen Huang rose from bed in his dormitory and set off on the perilous journey to school. Huang, born in Taiwan and raised in Thailand, had recently arrived in rural Kentucky. His path led down a sloping hillside to a floodplain situated among forested hills, and across a rickety pedestrian footbridge, which was suspended by ropes and missing many planks, through which could be seen the frigid and rushing waters of the river below. Huang, a bright and conscientious child, had skipped a year and was in the sixth grade. He was undersized even for his age and was often the smallest boy in class. He spoke imperfect English and was the only Asian student. His classmates at Oneida Elementary were the children of tobacco farmers and coal miners. Almost all of them were white, and many were impoverished. Some had no running water in their homes. Huang had arrived with his older brother, Jeff, in the middle of the academic year, while their parents remained in Thailand. The two lived at the Oneida Baptist Institute, a nearby boarding school, but Jensen was too young to attend OBI and was sent to Oneida Elementary instead. On his first day, the principal had put his arm around the boy and told the class to welcome the new student, who was from a different part of the world but was also extremely intelligent. The bullying started at once. “He was a perfect target,” said Ben Bays, Huang’s classmate. Before Huang’s arrival, Bays had been the designated victim. Like Huang, Bays was small, and also like Huang, he was a good student. The -- 16 of 263 -- bullies honored these qualities by sealing him inside the school’s lockers, sometimes for hours. Following Huang’s arrival, their focus shifted and acquired a racial element—many of Huang’s Kentucky classmates had family members who had fought in Vietnam. “The way you described Chinese people back then was ‘Chinks,’ ” Huang told me, fifty years later in a sterile conference room during our first conversation. His face showed no emotion. “We were called that every day.” The bullies targeted Huang in and out of class, at every opportunity. They shoved him in the hallways and chased him on the playground. The bridge was their favorite location. Huang had to cross it alone, a hazardous proposition in the best of conditions. Sometimes, when Huang was in the middle, the bullies would emerge from hiding on either side of the river, then grab the ropes and begin to swing, attempting to dislodge him into the river below. “Somehow it never seemed to affect him,” Bays said. “Actually, it looked like he was having fun.” Bays and Huang were fast friends. Despite the language barrier, Huang excelled academically, supplanting Bays as the best student. He was a talented artist and had perfect penmanship, although he only wrote in capital letters. He also taught Bays how to fight. Whatever the local boys knew about Chinese culture came from the films of Bruce Lee. Huang initially ran a bluff, telling his classmates that he was a martial-arts expert. This was quickly disproven in the schoolyard, but what Huang lacked in technique, he made up for in determination. When challenged, he would always fight back, sometimes wrestling even larger boys to the ground. In Bays’s recollection, at least, Huang was never pinned. (“That’s not how I remember it,” Huang said, laughing.) Nevertheless, Huang inspired Bays to fight back as well, and after a time the bullying subsided. Bays’s own family was desperately poor. He had five siblings, and his father, a preacher, was itinerantly employed. He lived at the mouth of a small, sheltered valley known as a “holler” in a dilapidated house with pit toilets in the back. Nothing in his experience had prepared him to meet anyone like Huang, and he could only wonder about the circumstances that had delivered this precocious, unsupervised child to the Appalachian -- 17 of 263 -- backwoods of Clay County, Kentucky, one of the poorest counties in the nation. • • • H, , had been born in Taipei, Taiwan, in February 1963. His father was a chemical engineer, and his mother had taught primary school. Huang’s parents were from the city of Tainan, on the southwestern coast. They spoke the Taiwanese dialect of Hokkien natively but had lived most of their lives under foreign rule. Taiwan had been a Japanese colony until 1945; in 1949 the Chinese general Chiang Kai-shek, having lost the mainland to Mao, fled to Taiwan with his army, and soon the island was placed under martial law. When Huang was five, his father, Shing Tai, found work at a petroleum refinery in Thailand and relocated the family to Bangkok. Huang’s memories of Southeast Asia are hazy. He recalled pouring lighter fluid on top of the pool at the family home and setting it ablaze. He recalled a pet monkey that belonged to a friend. In the late 1960s, Huang’s father visited Manhattan on his way to train with the air-conditioning giant Carrier, which was transforming office life with precise climate control. He was astounded by New York City and returned determined to relocate his family to the United States. In preparation for the move, Huang’s mother, Chai Shiu, began to teach the boys English. She spoke no English herself, but this was only a minor hindrance. Drawing on her experiences as a schoolteacher, each night she had her sons memorize ten new words randomly selected from the dictionary, then drilled them on the words the following day. After a year or so of this, she enrolled the three boys in an international academy, and Huang began formal schooling in English, while continuing to speak Taiwanese with his parents. The family’s plans to relocate accelerated in 1973, when Thailand was beset by political unrest. In October of that year, half a million protestors took to the streets of Bangkok, demanding the dissolution of the country’s -- 18 of 263 -- military dictatorship. The government responded with force, and Huang recalled seeing tanks rolling through the streets. Fearing further unrest, Jensen’s father sent him and Jeff to Tacoma, Washington, to live with an uncle. Jensen’s parents and his younger brother stayed behind. The uncle decided the boys belonged at a boarding school and searched for an institution willing to house two unsupervised Taiwanese children, ten and twelve years old, living thousands of miles from their parents. He selected the Oneida Baptist Institute in Kentucky, perhaps mistaking it for a prestigious college-preparatory school. In fact, OBI was a juvenile-reform academy located in a town of three hundred people. The institute had been founded in 1899 by James Anderson Burns, a Baptist preacher looking to put an end to a lethal and long-running family feud. (Burns came up with the idea for the school after he was clubbed in the head with a rifle and left in a ditch to die.) By the 1970s, despite hosting a few international students, OBI was mostly known as a last-chance institution. Upon arrival, the brothers found the grounds of the campus littered with cigarette butts. “Every student smoked, and I think I was the only boy at the school without a pocket knife,” Huang said. Jensen, ten, was placed with a seventeen-year-old roommate; on their first night together, the older boy lifted his shirt to show Jensen the numerous places where he’d been stabbed in a recent fight. Huang’s roommate was illiterate; in exchange for teaching him to read, Huang said, “he taught me how to bench press. I ended up doing one hundred push-ups every night before bed.” Huang would stick to a daily push-up routine the rest of his life. The Huang brothers anglicized their names to fit in. Jen-Chieh became “Jeff,” and Jen-Hsun became “Jensen.” (Their younger brother, Jen-Che, would later become “Jim.”) Jeff and Jensen kept in touch with their parents in Thailand by sending audiocassettes through the international post. With each cassette, they would first listen to their parents’ message, then record over it with their own. Jensen recalled only occasional homesickness. To him, the whole thing played out like some grand adventure. -- 19 of 263 -- During the summer, students at OBI were expected to earn their keep through manual labor. Jeff was sent to a tobacco farm; Jensen was left behind to clean the dormitory toilets. “It wasn’t a punishment,” Huang said. “It was just my job.” Another of Huang’s chores was cutting brush on the school grounds with a scythe. Bays recalled passing him on the way to church. “We was driving by the field, and he was just running around in circles, wearing a baseball shirt, cutting those weeds,” he said. By the end of his year at Oneida Elementary, Huang had all but conquered the school. He was the best student in his class, for which he was given a silver dollar at a school assembly. He had stood up to the racists and the name-callers, including, in at least one instance, a teacher. After the final school bell rang, Huang would take charge, running ahead of his classmates into forests of hickory and oak. Chasing behind him, in a friendly way, were the “rowdy boys” of Clay County, the soft Appalachian mud under their feet. • • • H of 1974 living at the dormitory. He looked forward to watching the ABC Sunday Night Movie with the other holdovers each week. As fall approached, he ate fresh apples from the tree outside his window. He began seventh grade at OBI, while Bays continued on the public-school track. Huang, relying on his battle-scarred roommate for protection, had few problems adjusting. A year after that, Huang’s father secured employment in the United States, and the brothers left Kentucky to reunite with their family in Oregon. Bays and Huang would not see each other again for forty-four years. In the interim, Bays became a nursing-home administrator. Huang became one of the richest people in the world. Bays was unsurprised; he told me that even as a boy, he believed that Huang was destined for greatness. The two were reunited in 2019, when Huang returned to OBI to donate a building to the school. “He’d never forgotten me,” Bays said. -- 20 of 263 -- For many children, the two years in Kentucky would have been traumatic. At ten years old, Huang had been sent eight thousand miles from his parents to a foreign land where he barely spoke the language. He was bullied, isolated, made to share a room with a knife fighter, and tasked with cleaning the latrine. What did it say about him that he thrived in this environment? “Back then, there wasn’t a counselor to talk to,” Huang said. “Back then, you just had to toughen up and move on.” Time may have softened Huang’s memories of OBI. When he donated the building, in 2019, he talked fondly of the (now gone) footbridge he had crossed every day on the way to school; he neglected to mention that the other students tried to shake him off of it. When I asked him about doing chores at the school, he told me they taught him the value of hard work. “Of course, if you’d asked me at the time, I probably would have given you a different answer,” he said. In 2020, Huang was asked to deliver a remote commencement message to the students of OBI. In his speech, he said his time at the school was one of the best things ever to happen to him. • • • I H matriculated at Aloha High School in the suburbs of Portland, Oregon. He dressed in denim and velour and wore his hair in the shape of a motorcycle helmet. He continued to excel academically, and his English rapidly improved. Aloha was a welcoming place, and he soon formed a close-knit clique with a few of his fellow nerds. “There were three or four of us, and we were all in the same clubs: math club, science club, computer club,” Huang said. “You know, the popular kids! I didn’t have a girlfriend.” The computer club was of particular interest. In 1977 the school purchased an Apple II, one of the first mass-produced personal computers. Huang was enthralled by the machine, using it to shoot Klingons on a grid of text in the primitive game Super Star Trek and to code his own version of Snake in Basic. -- 21 of 263 -- His other extracurricular interest was table tennis. At OBI, Huang had dominated the rec-room ping-pong table but didn’t take the sport seriously. In high school, he began to play competitively. His mentor was Lou Bochenski, the owner of the Paddle Palace, a table tennis club located in a converted Elks Lodge ballroom. Bochenski’s daughter, Judy, had visited Beijing in 1971, one of the lucky invitees in the “ping-pong diplomacy” exchange. But Huang had never played in Asia, and he used a Western grip. For an entire summer, Huang did little but practice. Bochenski was so impressed that he wrote a letter to Sports Illustrated, calling Huang “the most promising junior ever to play table tennis in the Northwest,” even though he’d played competitively for only three months. Huang’s signature shot was his arcing forehand loop, which he used to defeat many higher- rated players, sometimes diving under the table to return seemingly irretrievable shots. Within a year Huang was nationally ranked and playing in the finals of the under-sixteen doubles championships in Las Vegas. “He picked up the sport of table tennis faster than anyone I’d ever seen,” said Joe Romanosky, a friend from the Paddle Palace. Huang was athletic and had good reflexes, but his unique quality was his exceptional focus. When he set his mind to self-improvement, the rest of the world faded away. He outworked everyone; he didn’t seem to get frustrated or stuck; he never hit a plateau. Instead, Huang watched, with measured satisfaction, as his patient dedication to the fundamentals slowly manifested itself as skill. Huang spent almost all his time at the Paddle Palace. When he wasn’t practicing, he worked there, scrubbing the floors at night to earn money for tournament fees. Bochenski gave him a key, and sometimes, rather than return home to his parents, Huang would sleep in the ballroom. The tables were set among opulent surroundings, with chandeliers above, hardwood floors below, and padded benches set into the walls. A photograph from this time shows Huang, maybe fifteen, wearing high-cut 1970s gym shorts and striped tube socks. He stands low to the table, a small guy with a bowl cut, striking the ball with an expression of competitive intensity. “He was a very aggressive player, on offense all the time,” Romanosky said. -- 22 of 263 -- As graduation approached, Huang got a job at Denny’s. The nationwide restaurant chain was known in that era for its burnt coffee, its reconstituted powdered eggs, its reheated sausage patties, and its round-the-clock operating hours. Huang loved the place. He began as a dishwasher and worked his way up to server. “I find that I think best when I’m under adversity. When the world is just falling apart, I actually think my heart rate goes down,” he later said. “Maybe it’s Denny’s. As a waiter, you’ve got to deal with rush hour. Anyone who’s dealt with rush hour in a restaurant knows what I’m talking about.” Denny’s provided Huang with a crash course in American cuisine. There he had his first bacon cheeseburger, his first pigs-in-a-blanket, his first chicken-fried steak. He methodically ate his way through the menu; his favorite item was the “Super Bird,” a grilled sourdough sandwich stuffed with turkey breast, bacon, tomatoes, and cheese. For an immigrant adapting to the culture of a new country, gorging on diner slop was as American as you could get. • • • H and was inducted into the National Honor Society. The desire to achieve came from somewhere within; Huang told me that his parents were not “tiger parents” and that they had not put undue academic pressure on him. “Actually, both of my brothers were terrible students,” he said, although he quickly added that they were both very bright. When I asked Huang why he, the middle child, was alone motivated to perform well in school, he shrugged. “I don’t have an answer for you,” he said. “I try not to analyze myself in that way.” By the time he graduated from high school, Huang had skipped a grade, was a nationally competitive athlete, and had a near-perfect GPA. Yet he opted out of the college-admissions scramble, choosing to enroll at nearby Oregon State University. There wasn’t much thought behind the decision, Huang told me, and no pressure from his parents to go anywhere else. His -- 23 of 263 -- high school buddy Dean Verheiden was a legacy Oregon State student, and Huang chose to go as well. “I just followed my best friend,” he said. Others had a different interpretation. Huang, then seventeen, had lived in three countries and attended at least five separate schools. At the time, OSU had an acceptance rate above 70 percent and wasn’t the highest-ranked public school in Oregon, but the campus was a ninety-minute drive from his parents’ house. “He could have gone anywhere—Ivy League, Stanford, East Coast, you name it,” one longtime friend said. “He went to OSU because he wanted to stay close to home.” Huang matriculated in 1980. At the time, Oregon State didn’t offer a dedicated computer science degree, so Huang majored in electrical engineering. His introductory sequence in this field determined much of the course of the rest of his life. He learned how to design circuits, which he spent the rest of his career doing. And he met his future wife. Lori Mills was an earnest eighteen-year-old Oregon State freshman with glasses and curly brown hair. Her personality was friendly and easygoing, but she craved structure, and she lived her life according to a fixed timeline of responsibilities: career by twenty-two, marriage by twenty-five, kids by thirty. She was randomly assigned as Huang’s lab partner during their first week of class. “There were, like, two hundred and fifty kids in electrical engineering, and maybe three girls,” Huang said. “She was the best- looking.” Competition broke out among the male undergraduates for Mills’s attention, and Huang felt he was at a disadvantage. “I was the youngest kid in the class,” he said. “I looked like I was about twelve.” Not liking his chances with conventional flirting, Huang took a different approach. “I tried to impress her—not with my looks, of course—but with my strong capability to complete homework,” he said. Every weekend, Huang would call Mills and pester her to do homework with him. And he was good at homework, which he sometimes called his “superpower.” Lori accepted, and the two became study partners. In their laboratory studies, Jensen and Lori hunched over a rectangular plastic grid known as a “breadboard,” wiring components to build amplifiers and adding machines. The work was delicate and painstaking and -- 24 of 263 -- involved a fair degree of close-quarters contact. The flow of electricity began at a power source, made a loop through various components, then returned to the source where it started. Primitive circuits might power light bulbs or digital clocks. More advanced circuits took advantage of a special component called a “transistor,” which could act like a digital switch. By combining transistors, you could create a “logic gate,” and, by combining logic gates, you could perform rudimentary calculations: one plus zero, say, or one plus one. And by chaining these simple adding machines together, you could do serious mathematics. The final step was always closing the circuit, creating a loop for electricity to flow. After six months of breadboarding, Huang asked Mills out on a proper date. She said yes, and after that the two were seldom separated. Huang completed his studies early and graduated with highest honors. His timing coincided with the silicon revolution of the 1980s. Students might use breadboards, but the preferred medium for commercial circuit logic was a treated silicon crystal known as a “semiconductor.” Technicians “printed” logic circuits onto silicon discs using concentrated ultraviolet light, then diced them into tiny squares called “microchips.” Because all the electrical components on a chip were fixed in place, microchips were also sometimes called “integrated circuits.” The personal-computer craze of the 1980s created tremendous demand for microchips. So, too, did the popularity of digital devices. Microchips were being placed in cars, CD players, children’s toys, microwave ovens, and any other useful object one could think of. In time, they would move into power chargers, refrigerators, credit cards, and electric toothbrushes. This meant that skilled circuit designers were in limited supply. (They remain so today.) Nearing graduation, Jensen found employment in the world’s microchip capital—Silicon Valley. -- 25 of 263 -- T TWO Large-Scale Integration he dawn broke on a desolate stretch of mountain highway near the California-Oregon border just before Christmas, 1984. The trees cast westward shadows on the asphalt and across the sloping hood of the flashy vehicle that sped along the road. The Toyota Supra was a two-door sports car with angular styling and an inline six-cylinder engine. From the front, with the headlights popped, the car looked like some friendly breed of android. Jensen, behind the wheel, took a corner, then accelerated down the deserted road. Surely, he must have felt confident. In the passenger seat sat his girlfriend—now fiancée—Lori Mills. Jensen had proposed to her the night before, at the magnificent office Christmas party thrown by Advanced Micro Devices, the microchip manufacturer where he worked. He’d started working at AMD at twenty, when he was not yet old enough to drink, and secured a starting salary of $28,700, a figure so impressive he could recite it from memory forty years later. Jensen lived frugally, and after a year he had saved up enough money to buy both the car and an engagement ring. The AMD party was a natural place to propose. The holiday bash was one of Silicon Valley’s most extravagant. AMD would rent out San Francisco’s Moscone Convention Center and treat the lucky employees to free drinks and music from well-known bands. That year, the rock group Chicago regaled the assembled engineers with danceable renditions of “Saturday in the Park” and “25 or 6 to 4.” In 1984 the Bay Area tech scene remained a frontier outpost of the American economy; when Jensen joined -- 26 of 263 -- AMD, the most valuable US firms were old-line industrial concerns like DuPont and General Electric. By the time his generation of entrepreneurs was finished, these industrial conglomerates would be gutted, and the stock market would be dominated by tech. Lori accepted the proposal, of course. Even by the standards of the era, this was an early engagement. Jensen was just twenty-one. Lori, twenty- two, had not yet graduated from college. But both found comfort in domesticity, and their marriage would in time become the envy of their social set. After the proposal, Jensen suggested driving Lori home to tell her parents the happy news in person. He was close with the Mills family, particularly Lori’s father, an affable all-American patriarch who resembled the actor Jimmy Stewart in both demeanor and appearance. The Mills family in turn adored Jensen and felt that their daughter, even at this young age, could not possibly have done better. Friends of the couple joked that Jensen was closer to Lori’s parents than to his own. But if Huang was dependable and preternaturally mature, he still occasionally entertained ideas that only a twenty-one-year-old would think sound, like embarking on a nine-hour drive in a sports car across snowy mountain roads in the middle of the night after a boozy office Christmas party. By the time the sun rose, Jensen and Lori had been on the road for over five hours. The country they traveled through was spare and depopulated, and some who lived there could trace their ancestry to that first wave of California fortune seekers who’d burrowed into the surrounding hills in search of gold. It was in the graveyard of these busted mines that Jensen hit the transparent layer of black ice that coated the freeway and sent the Supra into an unrecoverable glide. The tires spun without purpose, and the vehicle drifted onto the shoulder before rolling off the road. Jensen and Lori were momentarily inverted. Then the car hit the ground with an awful crunch before banging along to a stop, shedding components of its upscale trimming along the way. The Supra was totaled, and the couple was trapped inside. Lori, wearing her new engagement ring, was mostly unharmed. Jensen was bleeding, and his neck was twisted bad. The -- 27 of 263 -- sun was coming up, but the temperature was frigid, and it was the coldest moment of the day. When the first responders eventually arrived, they had to cut the couple out of the car. Jensen required stitches in multiple places and had to wear a neck brace for several months thereafter. When I asked him about the incident years later, he mostly expressed regret for the Supra. “Incredible car,” he said. • • • H , and the engagement was unaffected, or perhaps even strengthened, by the distress of the shared experience. As Lori finished school, Jensen returned to work. At AMD he sketched out microchip designs on paper. Each sheet represented a separate layer of the chip, with transistors at the bottom and various interconnects set above. When he was finished with a layer, he would bring it to the back of the office for fulfillment, where it would be transferred to a transparent sheet of colored cellophane. Those cellophane sheets were used to make stencils called “photomasks,” then sent to a fabrication facility. For some reason, all of AMD’s photomask workers were Chinese women. They sat at workstations and arranged the colored stencils in precise patterns. The women didn’t speak much English, and Huang, who’d grown up speaking the Taiwanese dialect of Hokkien at home, didn’t speak Mandarin. The two languages are as different as German and English, but patiently, in conversations with the photomask crew, Huang began to learn Mandarin, the most commonly spoken form of Chinese. “Just phonetically, through regular conversations,” he said. The women reminded him of his mother. Huang spent two years at AMD, a time he recalled with fondness. He acquired some shares of AMD stock through an employee-purchase program and held on to these, with escalating irony, for the rest of his career. But in 1985 a coworker convinced him to leave AMD for LSI Logic, an innovative Silicon Valley firm that developed the first software-design tools for chip architects. By the mid-1980s, engineers were putting -- 28 of 263 -- hundreds of thousands of transistors on a single chip, surpassing the limits of paper draftsmanship. The closest analogy was covering a tennis court with a maze made from strands of human hair. LSI’s “large-scale integration” process automated low-level blocks of circuit design, freeing engineers to focus on higher-level architecture. Over time, these automated design tools evolved into the unfathomably complex “very large-scale integration,” or VLSI, which remains the point of entry for most modern engineers. With VLSI you zoomed out so far that you forgot the individual transistor existed. As time went on, only Jensen and a few other graybeards would remember the artisanal microchip. Lori graduated in 1985 and found employment at Silicon Graphics, a manufacturer of expensive 3D graphics workstations. SGI, as everyone called it, was the other place to work in Silicon Valley at the time, and at first Lori made more money than Jensen. Like AMD, SGI was located close to US 101, a narrow strip of highway that spanned the twenty-five miles from downtown San Jose to the Stanford campus in Palo Alto. Drive along this highway, and you would read exit signs directing you to the otherwise forgettable suburban municipalities of Cupertino, Santa Clara, Milpitas, and Mountain View—home, respectively, to Apple, Intel, Cisco, and Silicon Graphics. Talent here was clustered tightly, and no locale on Earth had ever generated so much wealth per square foot. Huang was bound to the place as if tethered, and he would spend the rest of his career working within a five- mile radius. If the names of the towns were recognizable, the architecture for the most part was not. The Manhattan glamour that had bedazzled Huang’s father was not to be found in Silicon Valley. There were no skyscraper canyons, no bustle of pedestrian energy in the street. Instead, there was a bland collection of modern, mid-rise boxes, surrounded by parking lots and strip malls and extended-stay business hotels, and crisscrossed by freeways in a geographic depression at the south end of the Bay. Behind the tinted glass one could find some of the finest minds in engineering, but from the outside the only signifier of activity was the traffic. -- 29 of 263 -- The buildings were just as boring on the inside. The climate-controlled office of the 1980s featured clunky cathode-ray monitors and drab carpeting and humming fluorescent lights and drop ceilings to hide the ductwork. The favored layout was the open-plan “action office,” which featured reconfigurable arrangements of cubicles at varying heights. At LSI Logic, designers had opted for a low-slung cubicle grid that employees called “the pit.” Huang arrived there in 1985. He wore large glasses and a tasteful watch and button-down shirts with slacks, but still kept his hair a little long. For him, the pit was heaven. There seemed no place in the world he’d rather be. As he had in table tennis, Huang rapidly distinguished himself at LSI through his surreal work ethic. One of Huang’s pit mates was Jens Horstmann, a fellow electrical engineer who had arrived at LSI from Germany as part of a six-month mentorship program and never left. Horstmann and Huang were both immigrants, they were around the same age, and they even had the same initials. They shared a readiness to sacrifice their personal lives and their sanity in service of solving an endless series of hard technical problems. “There was no notion of weekends,” Horstmann said. “We’d come in at seven a.m.; then our girlfriends would call us at nine p.m. asking us when we were going to come home.” Over time, Horstmann became Huang’s closest friend. Horstmann was charismatic, extroverted, funny, and in his personal life a little more reckless than Huang, with a broader range of interests and a wider social sphere. At work, though, Huang was the risk-taker. With characteristic dedication, Huang had mastered a software application known as the Simulation Program with Integrated Circuit Emphasis, or SPICE. Using a command line, Huang would input an ordered list of circuit components and would receive a text-only table of voltage data in return. The primitive SPICE software was often regarded as an academic teaching tool, but Huang used it to push the capabilities of the circuits further than anyone else thought possible. When LSI’s customers wanted new functions, most of the designers would simply respond, “There’s no way.” Huang would say, “Let me see what I can do.” -- 30 of 263 -- Huang would spend hours fiddling with the simulator, attempting to arrange the list of components to enable what the customer wanted. This was painstaking work, conducted without the assistance of graphical user interfaces or even color monitors. His focus was admirable, but Horstmann knew many engineers who could become similarly absorbed in technical problems; what set Huang apart was his ability to avoid dead ends. “Similar people, they get lost, right?” Horstmann said. “They just get lost in these deep, deep ratholes. He doesn’t. He has a great sense of seeing when a problem has reached a certain level of complexity, and he can’t easily make further progress, and he has to go in a different direction.” LSI’s most demanding customers were the computer-graphics designers, whose appetite for faster silicon knew no point of satiation. To serve them, Horstmann, with Huang’s encouragement, began signing contracts to deliver products that, internally, the two had no idea if LSI could actually make. Older engineers advised the two to be more cautious. Do you know what you’re doing? they’d say. If this fails, it may be the end of your career. “It was true, but that never troubled us,” Horstmann said. Almost everything Horstmann and Huang promised was eventually delivered. The rewards for solving these hard technical challenges were new, even harder technical challenges. Huang loved the difficulty curve; he relished leveling up in this way. “He had the ability to make 1 + 1 = 3,” Horstmann said. “By this, I mean we were not only doing work for our customers, but we were turning these orders into tools, and turning those tools into methodologies.” Most engineers couldn’t do this, Horstmann told me; most engineers couldn’t even make 1 + 1 = 2. “You’re lucky to get one and a half,” he said. Among their friendship group, Jensen and Lori were the responsible ones. They were the first to get married and the first to buy a house. In 1988 they moved into a two-story, four-bedroom tract home on the east side of San Jose with a front-facing garage and a backyard patio where Jensen could work the barbecue. They worked stable, well-compensated jobs at respected employers and diligently maximized their contributions to their tax-deferred retirement accounts. They adopted a dog named Sushi and -- 31 of 263 -- smothered it with unconditional affection. Sushi returned the sentiment, knocking aside objects with his enthusiastic tail wagging. Horstmann admired Jensen’s relationship and the orderly life he lived. He also admired Lori, who was a gifted engineer. Horstmann recalled talking with her about a technical problem he was working on: a customer’s microchip, embedded in an orbital satellite, was malfunctioning because of interference from cosmic rays. Lori had worked on a similar problem, which required not just a working knowledge of electrical engineering but also particle physics. “It was just amazing, how deep and how structured her thinking was,” Horstmann said. The downside of this structured approach was that the Huangs were— well, they were kinda square. They worked constantly, traveled rarely, and barely socialized outside the semiconductor industry. Horstmann recalled introducing Huang to a friend who ran a craft microbrewery, an unusual profession in the 1980s. “Jensen was just, like, ‘How do you know this person? How is this possible?’ ” Horstmann said. Huang didn’t seem to have a single friend who didn’t work in tech. • • • H promoted within LSI. He also began taking night classes at Stanford in pursuit of a master’s degree in electrical engineering, but he was so busy at work that it took him eight years to finish. Now driving a sensible commuter car, Huang oscillated between school in the west of the Valley, his home in the east, and his job in the middle, traveling up and down and up and down the 101 for years. By the time he was finally awarded the degree, in 1992, much of what he’d learned in his introductory sequence was obsolete. It was through his work at LSI that Huang came to know Chris Malachowsky and Curtis Priem, chip designers who worked for Sun Microsystems. Sun, like SGI, made high-end workstation computers for power users, and Priem and Malachowsky were tough customers who asked for functionality that average salespeople couldn’t provide or even really -- 32 of 263 -- understand. “LSI reached deep into the bowels of the company to find the most outward-looking technical person that they could assign to us,” Malachowsky told me. “That was Jensen.” Malachowsky, Priem, and Huang made a good team. Priem was the architect who could think in circuits, charting the path of electricity through adding machines in his mind. Malachowsky was the mechanic, fond of cars and small-engine planes, who could build anything Priem could dream up. Huang was the logistician, in charge of tooling production at LSI to mass- produce their designs. Of the three, Priem was the strangest. The pairing of his brainiac face with his brilliant mind was a physiognomist’s delight. Priem’s forehead was enormous, almost elongated; his eyebrows were arched; and his narrow, squinting eyes wandered around the room as he talked. He spoke in a continuous technical monologue, like a tour guide, walking the listener through circuit architecture and pausing, here and there, to zoom in on important features. Often, the conceptual map of what Priem was describing existed only in his head, but he rarely seemed to notice, or care, whether his audience could follow what he was saying. Priem had arrived in engineering through an indirect path. He had grown up in the suburbs of Cleveland, Ohio, where his mother’s dream was that he would play the cello in a professional symphony. Priem had pursued this ambition until high school, when, on a visit to a music camp in North Carolina, he was seated in the last chair in the second orchestra. “I realized I was looking at a future as a high school music teacher,” Priem said. He gave up the cello and got into computers, graduating from Rensselaer Polytechnic Institute in upstate New York before ending up in Silicon Valley, where his eccentricities were deemed within the acceptable range of tolerance—for a time, at least. Malachowsky was a more practical fellow; of the three Nvidia cofounders, he was the one you’d trust to swing a hammer. He was burly, with broad shoulders, big hands, and a wide, friendly face. Growing up in New Jersey, Malachowsky had been a self-described “long-hair” who’d enjoyed drinking beer and goofing off with his buddies. Although he got a -- 33 of 263 -- respectable haircut at the end of the seventies, he retained a gruff, irreverent attitude and was quick to laugh. For Malachowsky, computers were not lofty abstractions, but tangible machines grounded in physical reality. Priem built his own flight simulator; Malachowsky flew his own plane. Huang, a few years younger and technically an outside vendor, acted more like the two men’s manager, working with the LSI fabrication plant to ensure the timely delivery of a high-quality product. All three men recalled how perfectly they were able to operate within their preferred domain of responsibility. “We just never stepped on each other’s toes,” Malachowsky said. This unusual arrangement was possible only because Priem and Malachowsky trusted Huang—indeed, they trusted him more than they trusted their actual bosses. “There was politics at Sun like you wouldn’t believe,” Priem said. Huang eschewed drama and led by example, driving himself hard, refraining from gossip, and carefully apportioning credit for good work. If a product was going to be late or if LSI couldn’t deliver on some promised function, Huang would immediately provide a detailed description of what had gone wrong, who was responsible, and what he was doing to fix it. “When he said he was going to do something, there was a reasonable likelihood that he would actually do it, y’know?” Malachowsky said. Malachowsky struggled to think of other Silicon Valley product managers who fit that description. If Huang had a flaw, it was that he embraced candor in the extreme, sometimes crossing into the territory of insult. The bluntness was part of his charm, of course, but it could leave people’s feelings hurt. He didn’t have much patience for people who disagreed with him, and he also seemed genuinely surprised that there were people working in his industry who didn’t want to spend fourteen hours a day fiddling with the circuit simulator. Of course, for quarrelsome workaholics like Priem and Malachowsky, these traits were only further evidence of Jensen’s managerial fitness. The fruitful collaboration of Priem, Malachowsky, and Huang resulted in the 1989 debut of the Sun GX, a line of three-dimensional graphics -- 34 of 263 -- processors that powered workstations for scientists, animators, and computer-aided-design modelers. The chip took as its input a wire-frame skeleton of points in space, then “painted” on textures, one pixel at a time, to create rotatable objects constructed of blocky polyhedra. To any modern observer, the GX’s output would look clumsy, but if you squinted at its sixteen-color output on a cathode-ray tube monitor in 1989, you could maybe see the future of computer graphics. Huang’s success with the Sun GX caught the attention of Wilf Corrigan, the founder of LSI Logic. After its release, Corrigan promoted Huang to run a “system-on-a-chip” design platform that allowed customers to condense multiple functions—3D graphics, video, game controllers—onto a single piece of silicon. The platform was popular with customers, and Malachowsky, watching from outside, believed that Corrigan was grooming Huang to one day replace him as CEO. “They let this young, twentysomething-year-old kid start a whole division!” he said. “I mean, they saw something in him.” But Priem and Malachowsky saw something in Huang, too. Despite the success of the GX chip, when the two proposed making a cheaper version for PC video games, management at Sun turned them down. (A haughty executive informed them that Sun supplied scientists, not gamers.) Frustrated, Malachowsky and Priem wanted to build this consumer video game chip on their own, but neither felt comfortable managing a business— so in 1992 the two men approached Huang and asked him to run their start- up. Huang had a tough decision to make. He respected Priem and Malachowsky, but he was in charge of his own division, with a secure job on the management track at an innovative corporation. The new company that Malachowsky and Priem were proposing didn’t have a business plan, or even a name—just a rough sketch for a product that Sun had decided wasn’t profitable enough to build. Also, although the two men insisted they were the perfect team, to coworkers their relationship seemed dysfunctional. Malachowsky and Priem fought with each other all the time. “Chris and I would just have these yelling matches,” Priem told me. These fights -- 35 of 263 -- sometimes ended with one man storming across the workroom floor to slam the door to his office. “The people on our team would always ask, ‘Are we disbanding?’ ” Priem said. “But this was just our way of interacting.” Both men were also fairly stubborn. Priem spoke of Malachowsky’s “impedance function,” an engineering term used to describe opposition to the flow of electricity through a circuit. When I shared this with Malachowsky, he responded, “Yeah, well, I don’t know how long you’ve talked with Curtis, but his user interface isn’t terribly well-designed.” Working with Priem and Malachowsky meant committing to a lifetime of door slamming and engineering jokes. Worse, Huang’s personal finances were stretched. In 1990, the Huangs’ son, Spencer, was born, followed by their daughter, Madison, in 1991. (Sushi the dog responded to the new arrivals by trying to steal the children’s pacifiers from their mouths.) The master plan called for both Jensen and Lori to continue their jobs while paying down their mortgage, but the family hadn’t been able to find reliable child care, and Lori eventually left her job at Silicon Graphics to raise their kids. Jens Horstmann was named godfather to both of the Huangs’ children. His own wife had left the engineering workforce to raise children, as had Chris Malachowsky’s wife, Tina. Horstmann told me that all three women were superior engineers. “For my own family, I feel, sometimes, a bit guilty of having taken the liberty to work so hard and to pour myself so into this,” he said. “I mean, we tried nannies, we tried things—but maybe we should have tried a little harder.” When I asked Huang about this, decades after the birth of his children, I could see the discomfort in his face as he recalled first asking his brilliant wife to suspend her career, then, with only six months of savings in his bank account, asking her permission to leave his job for a start-up. But Lori told him to go for it. “She always believed in me,” Huang said. • • • -- 36 of 263 -- I , Huang would describe his rapid rise from draftsman to CEO as a confluence of coincidences. “I was a technical staffer with a state- school education, and I wasn’t particularly ambitious,” he said in his 2020 commencement address for OBI. “If you were looking for someone to run a company someday, I don’t think you would have picked me.” Huang handled his later career success with admirable humility—but sometimes that humility was itself exaggerated. Everyone I spoke with who knew him at this time in his life agreed that Huang was fibbing in his address. Malachowsky recalled him as a superbly competent striver with a master’s degree from Stanford and an abundance of unconcealed ambition. “He wanted to run something by the age of thirty,” Malachowsky said. “I distinctly remember we had dinner at his house, and he told us that.” Hans Mosesmann, a veteran industry analyst who would later help manage Nvidia’s IPO, recalled talking with one of Huang’s former managers at LSI who’d been tasked with giving Huang an employee evaluation. The evaluation form resembled a report card, but the manager left the grades blank. At the bottom, the manager wrote, “Jensen is an excellent employee. I look forward to working for him some day.” Horstmann remembered the friction Huang caused at LSI, where in his twenties he was in charge of a division with $250 million in annual revenue and with many older and more experienced employees answering to him. Seeking to mediate, Corrigan hired a senior director from Intel to comanage the product line. Huang was incensed—using the most profane word in the engineer’s dictionary, he considered the hiring political. “He had built that division up from nothing, and now it was taken away from him,” Horstmann said. Perhaps it was this final indignity that led Huang to defect. If Malachowsky and Priem were obnoxious, they were also brilliant, and Huang was their first and only choice to run their graphics start-up—they just didn’t trust anyone else. But Huang did not commit at once. He knew that start-ups were difficult, hardware start-ups were more difficult, and consumer-hardware start-ups were the most difficult of all. Most did not make it past the prototype stage, and many didn’t even make it that far. Decision-making, for Huang, was a -- 37 of 263 -- clinical process with little room for useless emotions like hope. To him, business was just another engineering problem. Engineers looked to break down complex problems into simple governing principles, which could then be leveraged to powerful effect. To launch a start-up, then, Huang needed first to understand those principles. He had to research the market, the supply chain, the competition, the technology, and the product fit. He had to arrange exploratory phone calls with customers, game developers, and computer-graphics experts. He had to pore through years of dull state-of-the-industry reports, paging through bar charts and sales figures and customer surveys in search of a glimmer of an upward trend. In other words, he had to do his homework. And there was only one place to do that. -- 38 of 263 -- T THREE New Venture he Denny’s of legend was situated adjacent to a freeway in San Jose, its pole sign shining like a beacon amid a downscale landscape of gas stations and money-transfer storefronts. The carpeted floors of the restaurant were patterned in mauve and burgundy, and, in 1993, you could still find ashtrays on some of the tables. A long diner counter with stools faced the kitchen; around back was a quiet area where patrons could enjoy endless refills of coffee for hours at a time. A poster on the front window advertised the Grand Slam breakfast, which consisted of eggs, sausage, bacon, and pancakes for the price of $3.99. The restaurant never closed. Huang had suggested the meeting spot. It was close to his house, and he sometimes took his children there to eat. The quiet area in the back was usually populated by police officers drinking coffee and filling out reports. In 1993 California was suffering through the worst crime wave in its history, with more than four thousand people murdered in the state in a single year. Despite its adjacency to wealthy Silicon Valley, the city of San Jose was not spared. Huang sat at a table surrounded by cops, with his laptop and his research papers scattered around him. Chris Malachowsky and Curtis Priem sat with him. Not especially enthusiastic about the cuisine, the two mostly abused the coffee-refill promotion. Huang atoned for this transgression by chatting with the waitstaff, ordering plate after plate of food, and leaving extravagant tips. -- 39 of 263 -- Sun Microsystems had declined to pursue the consumer marketplace for PC video game hardware. So had Lori’s former employer Silicon Graphics, the industry leader in three-dimensional graphics. (Employees there were busy animating the CGI dinosaurs for Jurassic Park.) The failure of the major players to invest in PC gaming created a vacuum in the marketplace, which a brigade of start-up businesses was now scrambling to fill. The concept was to take the hardware used to paint the wire-frame skeletons of model airplanes and dinosaurs and repurpose it to create controllable animated figures in three-dimensional games. Execution required the mass production of cheap, efficient circuits as well as a thin layer of software that “ran on top of the metal” so that game programmers could access the computing structures beneath. The finished device would be a peripheral circuit board with a graphics chip at its core. To install the board, home computer users would unscrew the computer’s exterior metal housing and click the circuit board into a designated slot on the motherboard. The product was known as a “graphics accelerator,” and at least thirty- five competitors were trying to build one. Huang worried there was no space for a thirty-sixth. The leading expert in computer graphics was Jon Peddie, who had written several textbooks on the topic. Huang had reached out to Peddie to get a sense of the market, and the two soon became friends, with Huang calling incessantly, asking questions late into the night. Peddie advised Huang that the space was too crowded and that many of the best engineers were already working for other start-ups. “I told him not to do it,” Peddie said. “That was the best advice he never took.” At Denny’s, Priem and Malachowsky mostly spectated. “I was there for the pie,” Priem recalled, “and really, just to watch him.” Huang’s magic number was $50 million, which he had determined was the minimum revenue his start-up would have to produce each year to make it worth the effort. He had a spreadsheet running on his laptop, containing revenue projections for the next few years. Huang would tinker with a variable in a cell, and the projections would drop below $50 million. Then he would -- 40 of 263 -- tinker with a variable in another cell, and the projections would jump above. After a few such sessions, Huang had talked himself into it. Soon after, the still-unnamed firm of Priem, Malachowsky, and Huang walked into the office of Palo Alto lawyer Jim Gaither, seeking incorporation. Gaither, who had once attended 129 separate board meetings in a single calendar year, was one of the Valley’s most sought-after advisers. He was impressed by the men, especially Huang, who struck him as a natural leader. “I made a quick decision that there was no way they were going to leave our offices without deciding to stay with me,” Gaither said. Gaither discussed corporate structure with the founders, then drew up the paperwork. The start-up didn’t have a name, so for a placeholder, Gaither wrote “NV”: new venture. This was a striking coincidence, as Priem and Malachowsky were already calling their prototype graphics chip the NV1, joking that it would, like the Sun GX, make competitors “green with envy.” Priem drew up a list of words that riffed on the “NV” concept, using dictionaries from a variety of different languages, including Latin. From this list, the three settled on “Nvision”—until a records search revealed that this name had already been taken by an environmentally friendly manufacturer of recycled toilet paper. The next selection from the list was “Nvidia,” from the Latin word invidia, for “envy.” The men returned to Denny’s to finalize the details of the agreement. Huang would be the chief executive officer of Nvidia, Priem the chief technical officer, and Malachowsky the vice president of engineering. Each man would retain an equal share in the company. As Priem and Jensen discussed details, Malachowsky wandered over to a sheet-glass window facing a wide arterial road. He gazed out upon a Taco Bell and a decaying gas station lit by lamps of sodium vapor. It was a squalid, anonymous view without much to recommend it, and his gaze drifted upward until he realized with a jolt that the top of the window he was looking through was pitted with bullet holes. Malachowsky hurried back to his cofounders. “Take a good look at that window,” he said in a controlled whisper. “It’s full of bullet holes! I think -- 41 of 263 -- people are going up to that overpass to shoot at the cops!” Priem and Huang looked up from their paperwork to survey the evidence. They settled their bill, left a giant gratuity, and fled the restaurant for the relative safety of Priem’s condominium. It would be years before any one of them returned. Thus, the story that Nvidia was founded inside of a bullet-pocked Denny’s was not strictly true—the founders actually signed the paperwork inside of Curtis Priem’s San Jose townhouse. Still, as corporate mythmaking went, it was good enough, and the dining section where the men met for coffee is now graced with a handsome plaque. (The address is 2484 Berryessa Road in San Jose, if you’d like to see it. Management says it has been years since anyone shot at the restaurant.) The final step for Nvidia was capitalization, a formality that could be done with an arbitrary amount of money. Upon their bringing the signed paperwork back to Gaither’s office, Gaither suggested that Jensen simply give him all the cash in his wallet, which turned out to be about $200. In exchange, Jensen was formally awarded a third of Nvidia’s shares, which over time would prove to be a decent investment. He collected his cofounders’ share of the money the next time he saw them, and the firm came into being. Jensen turned thirty in February 1993; Nvidia’s certificate of incorporation was filed in April, about six weeks later. “He just missed his deadline,” Malachowsky said. • • • T S V company begins in a garage. Nvidia inverted the cliché by moving Priem’s furniture into the garage and working out of the two upstairs bedrooms of his condominium instead. (Priem kept a third bedroom for himself.) Although it would be several months before the company received seed funding, the GX chip had been so impressive that several Sun employees left their jobs to work for Nvidia without immediate compensation, assuming that their salaries would be backfilled once Huang raised some money. “We thought we were special, but it was nice to see that some other people thought so, too,” Malachowsky said. -- 42 of 263 -- Not everyone believed in Nvidia. Horstmann recalled drinking beer with Huang at his house and Huang getting a phone call from his mother. Horstmann couldn’t follow the conversation, which was conducted in Taiwanese, but remembered it growing heated. After Huang hung up, he returned, exasperated. “Can you believe it?” he said. “My mom just told me to quit Nvidia and go back to work for a large company.” At the condo, one of the bedrooms was taken by the hardware group, designing circuitry on high-end Sun workstations. The other bedroom was taken by the software group designing the protocols that video game developers would need to communicate with the chip. The office of the CEO was located on the first floor at a small circular table adjacent to the kitchen. By design or happenstance, Jensen had placed himself at the center of the natural flow of foot traffic—employees going to the refrigerator for drinks or snacks had to pass him. No matter how powerful he grew, he would seek to remain in the center of traffic for the rest of his career. Jurassic Park arrived in theaters two months after Nvidia’s founding. For the first time, a film convincingly integrated computer-generated imagery into live-action footage. Doing so required preposterous computing power: one three-second shot of a Tyrannosaurus bursting through a log had taken animators ten months to render. The output from Nvidia’s NV1 chip would be comparably primitive, although the difference was one of degree rather than type. “Even today, interactive video games and special effects in movies are essentially still just blocks sliding around hitting each other,” Peddie said. The most important objective for Nvidia was to somehow differentiate the NV1 from the dozens of other products coming to market. To do so, Priem, the architect, loaded both the chip and the software with esoteric features. To paint the wire-frame skeletons, the NV1 used a method called “quadratic texture mapping,” which was intended to add depth and realism to the process. For the software, Priem used an “object-oriented” approach, which allowed programmers to design reusable blocks of code. The first time I asked Priem about the architecture of the NV1, he spoke -- 43 of 263 -- uninterrupted for twenty-seven minutes. “I wanted to build an architecture that would last for one hundred years,” he said. As Priem led development of the NV1, Huang raised money. With Gaither’s assistance, Nvidia secured audiences with Sequoia Capital and Sutter Hill Ventures, two of Silicon Valley’s largest venture-capital firms. The night before his presentation to Sequoia, Huang struggled to come up with a business plan for his company. “I spent all night on it, but at the end I didn’t have anything,” he said. “I still don’t.” What he did have was the support of LSI’s founder, Wilf Corrigan. The following day, Huang and Priem traveled to Sequoia’s offices to pitch Don Valentine, the firm’s famously blunt founder. (Valentine’s favorite question for start-ups was “Who cares?”) The pitch went badly, with Huang fumbling over his presentation and Priem interrupting with irrelevant technical asides. After this uninspiring performance, Valentine took Huang aside. “Well, that wasn’t very good,” he said. “But Wilf Corrigan says I have to fund you, so you’re in business.” Sequoia and Sutter Hill each wrote checks for several million dollars. In return, the investors were given board seats. Sequoia’s seat was taken by Mark Stevens, an affable and extroverted thirty-three-year-old who had an MBA from Harvard. Sutter Hill’s seat was taken by Tench Coxe, an affable and extroverted thirty-six-year-old who also had an MBA from Harvard. Both men wore a lot of Patagonia, and it could be a little difficult to tell them apart. Coxe and Stevens agreed that it was Huang, specifically, rather than Nvidia’s proposal that attracted their attention. “The reason we backed these dudes is because they were world-class computer scientists,” Coxe said. “The average CEO will try to listen to the customer, but in computing, that’s a big mistake, because customers just don’t know what’s possible. They just don’t know what can be done!” Coxe observed that Intel and Microsoft had later struggled under more conventional management: “Jensen, from the beginning, was a world-class engineer who could see what was possible.” -- 44 of 263 -- • • • B , Nvidia had secured office space in a Sunnyvale strip mall and begun work on the NV1. The office was dingy, with frayed carpet, a water- stained drop ceiling, and linoleum floors in the break room. The smells from an adjacent Chinese takeout restaurant often filtered into the ventilation, and the bathroom was shared with another company. Facing the office was a Wells Fargo bank that was a target for local stick-up crews. “While we rented that space, that bank was robbed twice!” Malachowsky said. “You could watch the robberies from our front window.” The back of the office was converted into a recreation center. Lunch took place around a ping-pong table, where Huang would occasionally take his employees to school. In another room was the computer laboratory, where dismantled equipment sat alongside a small portable television attached to a Sega gaming system. Sega, the only console manufacturer to use the quadratic-texture-mapping approach, was Priem’s favored platform. He held the high scores in most of the games and often played in the middle of the workday, leaning back with his foot atop his knee and steering a motorcycle on a nine-inch screen. Adjacent to the laboratory was the conference room, where Huang set up his command center. There, in his immaculate all-caps handwriting, Huang had drawn whiteboard descriptions of the business strategy and approaching deadlines for the NV1. Using dry-erase markers, he had also drawn precise diagrams of the chip’s architecture in green and red and orange. Running out of whiteboard space, Huang proceeded to draw on the conference room walls. Huang had no artistic background and had not studied calligraphy, but everything was crisp and color-coordinated—he just liked things neat. Standing inside Huang’s office was like standing inside his brain. Board members were impressed to see his master plan for Nvidia wrapping around the room where they convened. “His handwriting is unbelievably good,” Coxe said. An important early milestone was “tape-out,” when the blueprint for the first prototype chip would be sent to the fabrication plant. Huang was holding a board meeting as the tape-out date approached when he heard -- 45 of 263 -- a cheer from his employees outside. He rushed out of the room to the laboratory, hoping to see a finished schematic for his first product. Instead, he found his workers clustered around a gaming console, where Priem had set a record time in Road Rash. Furious, Huang confiscated the console and gave it to his kids. “Jensen was always the adult in the room,” one early employee said. “Even when he was the youngest guy in the room, he was the adult.” • • • N “ ” business model—it outsourced the fabrication of the chips to a factory in Europe, and it outsourced assembly, distribution, and retail sales of the finished circuit board to the US vendor Diamond. The only things Nvidia handled were design and quality control of the microchip. As production dates approached, employees worked late into the night. Soon the laboratory was cluttered with empty food-delivery containers and boxes of bulk candy. One of Nvidia’s earliest hires was Jeff Fisher, who is still with the company. He used a VHS camcorder to document the day that the NV1 prototypes were delivered. In the grainy video, a motley crew of gamers gathers with excitement around a protective hard case. All are men. A buzz runs through the assembled crowd as the case is opened, revealing an array of fifteen chips encased in protective black shells. Priem, holding pliers, pries one case open to reveal a printed sheet of silicon the size of a fingernail. As the men appraise this tiny marvel, they are once again boys, their body language fidgety and energetic. The Nvidia employees in the video wear splendid nineties button-down shirts with loud patterns printed onto thick fabric. Many of them have tucked their shirts into the waistbands of their beltless jeans. Huang, away with his family, missed the unboxing of the prototype, but a photograph taken around this time shows him wearing thick round glasses and a red- and-white striped button-down shirt under a jaunty vest. Fisher recalled the -- 46 of 263 -- excessive starch Huang used in his laundry. “He was very, very, very well- pressed,” Fisher said. Nvidia now had to “validate” these prototypes for flaws. Running this effort was Dwight Diercks, another early Nvidia hire who is still with the company. Diercks had been born into a distinguished family of Minnesota pig farmers and still looked the part. He was a large, fleshy man who lumbered—honestly, lumbered—around Nvidia’s headquarters with his oversized plaid shirt tucked into his drooping denim. He had blond hair, blue eyes, and a no-nonsense Midwestern affect. Several people at the company told me not to underestimate him. Diercks ran a series of graphics clips known as an “art demo” through the prototype, then inspected each frame for errors. It was painstaking work, akin to editing a movie one frame at a time. Once validation was finished, Nvidia sent an error-corrected blueprint back to the European fabricator, which put the chip into mass production. Even with the launch date for the NV1 approaching, Huang was looking ahead to the NV2 and had signed a deal with Sega to develop the graphics accelerator for the forthcoming Dreamcast console. As part of the deal, Nvidia also outfitted its NV1 boards with a sound chip and a joystick controller, allowing Sega to port its games to the PC. The biggest draw was Virtua Fighter, which rendered martial artists in blocky quadrilaterals. Advertisements for the game showed a man made of rectangles being tossed through a television screen. The chip could not come out fast enough. By 1995, the market for 3D graphics had surpassed Huang’s most optimistic predictions, thanks to two blockbuster games. Myst, released in September 1993, was an elegant brain tickler set on a mysterious island and rapidly became the bestselling title in PC gaming history. Doom, released three months later, was a sci-fi/horror mash-up in which the player traveled around Mars spraying demons with buckshot. Myst and Doom represented opposing conceptual poles of what video gaming could be—but each sold millions of copies, and each sent gamers rushing to stores to purchase graphics accelerators. -- 47 of 263 -- The NV1 launched into a crowded marketplace in the fall of 1995. Customers walking into electronics retailers that Christmas season found dozens of chipmakers competing for attention. In addition to Nvidia, there were Matrox, S3, 3dfx, Cirrus Logic, and ATI. The confusing situation was not improved by Nvidia’s circuit-board partners, which sold Nvidia chipsets under the trade name “Diamond Edge” while simultaneously selling competing 3dfx chipsets under the trade name “Diamond Monster.” The packaging for this equipment was ugly, and the retail display shelf for computer peripherals, previously home to a somber assortment of modem and printer components, was reduced to a psychedelic mess. The products were also expensive; a Diamond Edge card with the NV1 chipset cost $249, which was more than a Super Nintendo. Open the ugly box, and you’d find an even uglier circuit board, a sheet of green plastic studded with capacitors and molded epoxy housing. The thing looked flimsy and disposable, as indeed it was—graphics accelerators went obsolete quickly and required replacement every couple of years. The most important variable in Huang’s spreadsheet was the number of customers willing to commit to this expensive upgrade ladder. The answer turned out to be more than anyone expected. Game developers, inspired by Myst and Doom, were embracing the PC, which liberated them from the controlled hardware ecosystems of Sega and Nintendo. Classic titles like Civilization II and Command & Conquer appeared around this time, hypnotizing gamers with endless replayability and interminable streams of small decisions to be made. PC gaming was not always “fun,” exactly, but it was utterly addictive, and players would lose hours or even days to the experience. A player who’d invested in a quality graphics card could witness the rise and fall of an empire before turning to the clock in the wee hours of the morning and pressing his palms into the sockets of his bleary eyes. By the end of the year, Nvidia had sold more than one hundred thousand NV1 chips, driven by demand for Virtua Fighter, which came bundled with the cards. Confident in his product, Huang went on a hiring spree, and Nvidia grew to more than a hundred employees. “Suddenly, it seemed like -- 48 of 263 -- we were a real company, right?” Diercks said. “The Christmas party that year was really over the top.” It was not until the first quarter of 1996 that the problems surfaced. After the charms of Virtua Fighter were exhausted, customers found that the NV1 had trouble rendering other games. Most programmers preferred to build 3D objects using triangles, not quadratic mapping. “It didn’t have a depth buffer, and it only rendered curved surfaces,” Tim Little, a game developer who worked with the NV1, said. “As a result, you couldn’t really determine how to sort the objects in the scene.” This led to the serious “clipping” errors, where game characters would sink into sidewalks or teleport through walls. In the worst cases, the NV1 ceased communicating with the Windows operating system entirely, producing the notorious Blue Screen of Death. “It was catastrophic,” Little said. With few supported titles, NV1 sales tapered off, and dissatisfied customers took advantage of the generous return policies of the big-box retailers to take the cards back to stores. A few months after the launch, Diercks went shopping at Fry’s Electronics, where he saw dozens of garish Diamond Edge boxes on the shelves, opened and marked down in price. Just as Nvidia had ramped up its payroll, demand for its only product vanished. Meanwhile, game developers were moving away from Priem’s experimental approach. In 1995, tired of watching graphics peripherals crash their operating system, Microsoft announced that it was launching its own DirectX standard for game developers. The standard supported triangles only, leaving the NV line stranded. Despite its promising start, the NV1 was a dud. Decades later, Priem was still defending the product, but the other two founders spoke of the NV1 with the kind of bemused, shake-of-the-head bitterness reserved for recollections of failed relationships or tax audits. (Specifically, Huang called the NV1 a “disaster,” and Malachowsky called it a “piece of shit.”) But it was even worse than that because Nvidia had built its entire supply chain around future iterations of this same device. Huang’s master plan called not just for the rollout of Sega’s NV2 but also -- 49 of 263 -- for the NV3, which was based on the same architecture. Now his perfect handwriting and his intricate diagrams would have to be erased, with the smears of ink on the whiteboard serving as a taunting reminder of his best- laid schemes. “We missed everything,” Huang said, of those early days. “Every single decision we made was wrong.” -- 50 of 263 -- W FOUR Thirty Days hen David Kirk arrived at Nvidia’s offices for the first time, in 1996, he could see at once that the company was doomed. Kirk was a graphics expert who consulted throughout the Valley, which was like being a connoisseur of failure. He had watched a great many start-ups falter, including his own, and Nvidia exhibited all of the symptoms of a company hurtling toward insolvency. The employees looked haggard and demoralized, the quirky product didn’t fit with the market, and the supposedly chummy founders were now deadlocked in a “technical discussion” that was obviously more than just a discussion and obviously about more than just technology. Kirk was skeptical of Nvidia—as a condition of his employment, he had insisted on receiving a hand-delivered paper check at the end of each workweek. He wasn’t sure the company would stay in business longer than that. The vibe at Nvidia was atrocious. Shortly before Kirk’s arrival, Huang had briefed his assembled staff on the fallout from the NV1 debacle. The company was running out of money, and the original NV1 architecture would have to be abandoned, Huang said. Nvidia’s best hope for survival was to abandon Sega and slither into bed with Microsoft; the company would then try to beat the rest of the manufacturers to market with an affordable knockoff chip. Unfortunately, this “pivot” meant that most employees were being let go. The remaining staff would work overtime as Jensen looked for corners to cut in order to design, manufacture, and ship a generic graphics accelerator in record-breaking time. -- 51 of 263 -- Following the announcement, Huang reduced the head count from more than a hundred general staff to a skeleton roster of thirty-five engineers. Joining in the aftermath, Kirk walked into an eerie, half-abandoned office. Fluorescent lights hung over blocks of low-slung cubicles, most of them empty. At the back of the office sat a “hardware emulator,” a strange and ugly contraption that Huang had used the last of the company’s funds to purchase. Nvidia’s survival relied on the performance of this mysterious eyesore. The hardware emulator allowed you to build fake microchips and test them. But this explanation understates things a bit—essentially, the emulator was a fake microchip, only crafted in code rather than silicon. The machine was cumbersome, and slow, and appeared half-finished, with circuitry exposed and visible tangles of cable snaking out onto the floor. It was too large to fit in the computer lab, so employees had shoved aside the ping-pong table to make space for it in the break room. Huang’s plan was to use this emulator to skip the costly prototyping step and go straight to mass production with nothing more than a digital napkin sketch. In the history of the semiconductor industry, no firm had ever skipped prototyping before—but it had to work. Huang assigned double shifts on the emulator, with Diercks working the machine during the day and Kirk taking over at night. After a few weeks programming the emulator, Kirk realized he had a second, tacit role at Nvidia: curbing the technical ambitions of cofounder Curtis Priem. Kirk had invented the quadratic-mapping technique used in the NV1, but when he arrived at Nvidia, he advised the company to abandon it. “It was just an idea I had,” Kirk said. “I have lots of ideas.” But this only made Priem promote quadratic mapping more aggressively. Priem was a purist who dismissed technical compromises as spineless concessions to the money guys. “The way Curtis thinks is for the end point,” Malachowsky said. “But he doesn’t really have it in his makeup to, like, stay in business.” Kirk soon realized that the abstruse question of whether or not to use quadratic mapping was a proxy for the more interesting question of who was actually in charge at Nvidia. Priem, the CTO, wanted total autonomy to -- 52 of 263 -- set the company’s technical road map. Huang, the CEO, wanted customers to stop returning his product to stores. Malachowsky, the mediator, seemed to have lost control of the discussion—by the time Kirk arrived, what had begun as a closed-door argument about circuit architecture had escalated into public shouting in full view of the small number of dismayed employees who remained. Many people found Priem difficult to work with. “Curtis and I did not get along super well,” Kirk said, echoing a sentiment that others expressed in a less polite way. “He was an absolutely brilliant technical person, but he was not a people person.” Priem was certainly weird, and he often engineered idiosyncratic solutions to straightforward problems. One example was his approach to the issue of email spam. Most people used a spam filter or just lived with it. Priem’s solution was to create thousands of different email addresses, one for every correspondent. But as stubborn as Priem could be, he was not a match for Huang. When Jensen explained his point of view for the first time, he would do so in a measured voice, moving from premises to arguments to conclusions. At this point the fuse was lit, and the interlocutor had two options: agree with his line of thinking or risk detonation. Those who contradicted him were often shocked when he exploded in fury, berating them cruelly in front of an audience of colleagues. This was the Wrath of Huang. • • • T of Huang’s anger were not clear, even to those who knew him well. Jens Horstmann, his closest friend, told me that when Huang worked at LSI Logic, he was not known for temperamental outbursts; it was only when he moved into the CEO role that he regularly began to blow his stack. “I will never forget the first time I saw him erupt,” one Nvidia employee told me. “I’d been working there for a couple of months, and Jensen was always so charming and self-deprecating. Suddenly he’s screaming at the top of his voice in front of a hundred people.” -- 53 of 263 -- Spectators were important to Huang—when he dressed down an employee, he usually did so in public so that others could learn from the experience. (“Failure must be shared,” Huang said.) If a project was delayed, Huang would command the person responsible to stand up and explain to the audience, in detail, every single thing that had gone wrong. Huang would then deliver a withering analysis of their performance. Such corporate struggle sessions were not for everyone. “You can kind of see right away who is going to last here, and who is not,” Diercks said. “If someone starts getting defensive, you just know that person won’t be long at Nvidia.” Diercks believed there was a method to it. “He would never just yell at somebody,” he said. “He would wait for a meeting, with a bunch of people around, so he could make it an educational opportunity for everyone.” But Huang’s criticisms weren’t always constructive—sometimes they were just verbal abuse. One former employee recalled a time when he bungled a minor assignment. Huang confronted him in front of three dozen executives, asking him how long he’d been with the company and what his salary was. The employee sheepishly provided the numbers; in his head, Huang then calculated the employee’s career compensation and asked for all of it to be refunded. The exercise didn’t feel like a joke. “He was kind of serious,” the employee said. “I practically didn’t sleep for three weeks.” It was almost as hard to watch one of Huang’s tirades as it was to be its recipient. Several Nvidia employees described squirming in discomfort as Huang dissected one of their colleagues. Kirk told me Huang yelled at him only once, when he tried to intervene on someone else’s behalf. “Jensen’s up there torturing this guy, and I just couldn’t stand it anymore, so I stepped in. Suddenly, I’m the target!” Kirk said. “It’s kind of like on a battlefield, where the gun is shooting at something and you stand up and say, ‘Hey, stop shooting!’ Well, then the gun turns and starts shooting at you.” • • • -- 54 of 263 -- P stake in Nvidia, and the papers of incorporation had been signed in his house. He had invited Huang to run his company; he could have invited someone else. As Malachowsky observed, the implicit agreement in their working relationship had always been that the founders didn’t step on one another’s toes—in other words, that Huang stuck to business and Priem stuck to tech. So perhaps it was understandable that when Huang violated that arrangement, Priem reacted as if betrayed. But Priem had no leverage. Despite holding around a tenth of Nvidia’s shares, Priem had neither sought nor been offered a seat on the company’s board. Malachowsky didn’t have a board seat either. Huang had convinced both men that as the CEO, only he should be on the board—and Malachowsky and Priem, who disliked business and sales calls and board meetings so much that they hired someone else to be their boss, peaceably if somewhat naively went along with this arrangement. With the board supporting the Microsoft strategy, Huang finally just overrode Priem unilaterally. What was Priem going to do? There was one more painful conversation to be had. Sega had agreed to pay Nvidia $1 million upon receipt of working prototypes of the NV2. In the middle of 1996, Huang delivered these prototypes, the only chips of their kind ever made. With great deference, Huang then informed Sega that Nvidia would not help build the Dreamcast because the company was surrendering to Microsoft, but given that the delivery of the prototypes technically fulfilled the terms of the contract, he was hoping Sega would pay him anyway, or Nvidia would go bankrupt. “They took it pretty well, considering,” he said. Once the Sega check cleared, Huang used it to buy the emulator. This was the last of Nvidia’s money. Already the triage had begun, with the company’s bills organized in the order in which payment could be delayed. First, vendors would get stiffed, then utilities, then finally employees. Whatever else happened, Huang was determined to make payroll until the day the lights were turned off. Emulation was a wild gamble. If the transistors on the forthcoming NV3 chip were arranged in error, the busted real-world production run would -- 55 of 263 -- ruin his company. But Huang was opening himself up to new frontiers of risk. For most of his life—in academics, in athletics, on the corporate ladder —Huang had been vying for first place. Now he could enjoy the blessings of coming in last. Staring at the long queue of competitors in front of him, Huang realized that being in last place was kind of fun—better than being in the middle, actually. A last-place firm could do whatever it wanted. It could take the shortcut no one else dared take. Of course, for those tasked with working the emulator, last place was less exciting. Video games of the time rendered around thirty frames per second to generate the illusion of motion. The emulator inverted this ratio, rendering about one frame every thirty seconds, breaking the illusion and making game play impossible. Under the leadership of Diercks, engineers reviewed the demo reel in agonizing ultra-slow motion. The mind-numbing auditing process took weeks, but slowly the emulator relinquished its secrets. “We shrank what was typically a twelve-month development cycle into about three months,” Diercks said. When Diercks finished for the day, Kirk took over in the evening. He was often the last worker in the office, and late at night he vented tension with a plastic gun, beating Priem’s high scores at the Sega shooter Virtua Cop. “Once you figure out the mechanics of a game, you can figure out how to beat it,” Kirk said. “Annoying Curtis was just a bonus.” As the emulation neared completion, tensions between Huang and Priem flared up once again. Priem recalled one argument in a hallway over technical concerns. “I told Jensen what he should do, and he started yelling at me about all the stuff he had to do,” Priem said. “That’s when I realized he was all alone.” • • • A about coming in last was that you could move after everyone else had acted. To scare up publicity, Nvidia’s competitors were sending preview cards to hardware reviewers at magazines and websites. Kirk finessed his contacts in the media to see what capabilities these -- 56 of 263 -- competitors had managed to implement. Because Nvidia planned to skip both the prototype stage and the sneak previews, there was just enough time to clone these capabilities in the NV3. The NV3 reached the tape-out stage in early 1997. When the blueprints were sent to Europe for production, Nvidia’s three dozen employees celebrated with beers at a cheesesteak franchise in a nearby strip mall. At the restaurant, Jensen led a toast, but he later admitted he had no idea if the NV3 would actually work. “It was fifty-fifty,” he told me, “but we were going out of business anyway.” While the chips were being fabricated, Jensen approached Kirk with a permanent job offer and borderline-stupid equity compensation. Although it meant working for Curtis Priem, Kirk felt the offer was not one he could in due conscience decline. Huang gave him the title of chief scientist. The finished NV3 chips arrived in late spring. The survival of the company depended on whether every one of the 3.5 million transistors in each case perfectly accorded with the emulation. Diercks mounted the cases into a circuit tester and played back the demo reel. It ran smoothly, flawlessly, creating the illusion of motion at thirty frames per second for the first time. The NV3 was mostly a copycat chip, but it had a couple of innovations. First, it could transport 128 bits at a time from memory to processing, double the industry standard. Second, it had Swiss Army multifunctionality: it could accelerate video games, it could resize a spreadsheet, and it could play a DVD. To emphasize this breadth of capabilities, the NV3 was rebranded as the Real-Time Interactive Video and Animation accelerator, or Riva 128. The chip was distributed to Nvidia’s downstream vendors, which mounted it in circuit boards and sold it at Best Buy. By the time the boards arrived in stores, in August 1997, Nvidia was running on fumes. “Vapors,” said Huang. “We had nothing left.” Nvidia, having shipped no previews to the gaming press, now had to beg for media coverage. Fortunately, reviewers liked the product. “Rendering up to five million triangles per -- 57 of 263 -- second, this is the best accelerator money can buy,” one wrote. Nvidia sold a million Riva cards in the first four months. Following the Riva’s launch, Huang invested in emulators and gave up on physical prototypes. “To this day, we are the largest user of emulators in the world,” Huang said. The bias in the semiconductor industry in favor of hard prototypes had seemed reasonable—imagine trying to sell a car that had never undergone a real-world crash test. Prototyping seemed like the practical approach that the “adult in the room” would take. But Jensen, who worked all hours and confiscated his employees’ gaming systems, was learning that the adults in the room weren’t taking enough risks. The NV1, a revolutionary design that followed best practices for industry workflow, had been a flop. The NV3, a wannabe product slapped together in a berserk, improvisational rush, had been a success. Sometimes, you had to gamble. The experience was liberating for Huang. Desperation, not inspiration, was the mother of victory. Huang encouraged his employees to preserve the mindset they’d adopted during the Riva crunch, asking them to constantly behave as if the company was teetering on the verge of bankruptcy even when it was making massive profits. For years to come, Jensen opened staff presentations with the words “Our company is thirty days from going out of business.” Even today at Nvidia, this sentence remains the corporate mantra. -- 58 of 263 -- T FIVE Going Parallel he final argument between Jensen Huang and Curtis Priem lasted for the better part of a day in 1998. No one could remember its precipitating topic, only the long battle in the conference room with each man shouting at the other, then calming down and regaining his composure, only to rise in anger once again. As the hours dragged on, both men grew hoarse, and although Nvidia staff were by now inured to this dysfunction, employees sensed that the situation had reached a crisis and that divorce was imminent. In the end, Priem broke first, walking stiffly and quickly to his office and slamming its door, remaining there to sulk. When he came out, he refused to speak to Huang at all. Priem exhausted all goodwill with his intransigence. With the NV1, he had been given a blank sheet of paper to design exactly what he wanted— but when he saw his remaindered product piling up on retail shelves, his ego never really recovered. Earlier that year, Huang had promoted David Kirk to codevelop Nvidia’s technical architecture, making Priem’s former subordinate his peer. Shortly thereafter, Priem, in a childish attempt to retain influence, had locked a number of employees out of the production database, preventing them from submitting their work. Chris Malachowsky, who’d adjudicated such disputes in the past, finally gave up. At the advice of the board, Nvidia hired a mediator. “The mediator had previously worked with John Sculley and Steve Jobs at Apple, which ended up with Jobs getting fired,” Priem told me. “She said Jensen and I were significantly worse.” Nvidia management had by this point developed -- 59 of 263 -- a saying: “Never let Curtis talk to investors, and never let Curtis talk to customers.” As Priem later admitted, “Both of those things were true.” Following mediation, Priem was demoted twice more. He briefly became Kirk’s technical adviser but found this role unsuitable. “He didn’t want to work for me or any of the other people that he had previously had as direct reports,” Kirk said. Eventually, Priem was reassigned to managing Nvidia’s patent portfolio, a job that kept him out of the flow of daily decision-making. Kirk would go on to oversee close to a thousand people at Nvidia. Priem would never again manage more than four. Despite the humiliation, Priem retained his Nvidia shares. Both Huang and Malachowsky still considered him a friend, and when Priem got married in 1999, a year after his second demotion in three years, Malachowsky served as his best man. Priem had seen the potential in the video game market, he had given the company its name, and he had sheltered the fledgling start-up in the bedrooms of his home. But from 1998 onward, he had little to do with Nvidia’s success. • • • D K Huang’s consigliere. Kirk had an academic background and didn’t enjoy the high-pressure work culture of Silicon Valley; the first time I spoke with him, he called me from Hawaii, where he was wearing a shirt that read “Procrastination University” and drinking a glass of wine. He seemed like a popular professor whose course had a wait list, not an entrepreneur. But Kirk’s gentle manner cloaked a pitiless attitude toward the competition—his absent father was an alcoholic, and he’d been a sickly child whose physical development had been limited by chronic bouts with strep throat. Perhaps these experiences had given him an edge. “Those people who say that winning doesn’t matter?” Kirk asked, his voice rising with a delicate, unthreatening lilt. “They’ve never won anything.” Kirk and Huang each combined a commanding breadth of technical expertise with a talent for low cunning. As Nvidia succeeded, many of the other graphics start-ups failed. Huang, sensing opportunity, created a master -- 60 of 263 -- list of competitors on the whiteboard in his office. Then, in consultation with Kirk, he identified the two or three best engineers at each company and began strategizing on how to poach them. Kirk recalled showing off the Riva 128 to a competitor at a trade show. When the engineer saw what it could do, he gave up on the spot. “I hired him within a few days—and that killed that company, right?” Kirk said. “Because, you know, I removed its brain.” Kirk, mild and professorial, had a predator’s instincts. “We had all the geniuses from all the other start-ups, and as we were successful in overtaking more and more of these little companies, the remaining companies had a harder and harder time staying alive.” At Denny’s in 1993, Huang had envisioned sharing a large market with many competitors. By 1998, he wanted the cake to himself. “There are still forty companies in this space,” Huang told Kirk. “In five years, there will be three: a big one in charge, a laggard playing catch-up, and a small one trying to disrupt the other two.” Huang intended to be the one in charge. Although he seemed perfect for the role, Huang had not been a natural entrepreneur at the beginning. “Initially, there was a great deal he didn’t understand,” recalled Tench Coxe. But he could learn—he really liked to learn. Huang educated himself by reading every business book he could find. “If you go into his office today, it’s totally abandoned; he never uses it,” one employee told me. “But it’s filled with stacks and stacks of business books.” From a literary perspective, or even a mass-market perspective, Huang would not be considered well-read. Popular books among the Silicon Valley set included Ayn Rand’s The Fountainhead, Isaac Asimov’s Foundation series, and Douglas Adams’s The Hitchhiker’s Guide to the Galaxy. Huang hadn’t read any of them—in fact, Huang told me that he had never read a single work of science fiction at all and that the only novelist he could recall enjoying was Paulo Coelho. But Huang’s knowledge of business books was encyclopedic. Dwight Diercks recalled Huang arguing with another executive about how much Nvidia’s products should cost. “The guy had an MBA, but he’d never read a -- 61 of 263 -- book about pricing,” Diercks said. “Jensen had read probably ten or fifteen.” As the argument progressed, Huang halted the discussion and asked the MBA to name his three favorite books on pricing. The guy fumbled around for a bit, unable to name a single title. Huang listed out his three favorites, then told the executive he’d resume the discussion once he’d finished them. Huang’s best-loved business book was The Innovator’s Dilemma, by the Harvard Business School professor Clayton Christensen. First published in 1997, the book popularized the term disruptive innovation to describe how incumbent firms lose out to start-up competitors. Although the word disruption has since grown meaningless through overuse, the source material is worth revisiting. In Christensen’s model, small firms can chisel away at large ones by serving niche, marginally profitable customers that the market leaders have dismissed. Christensen’s disruptive innovators weren’t necessarily high-tech—they included scrap-metal recyclers and manufacturers of hydraulic shovels. His canonical disruptor was Honda, which had initially had success in the early 1960s selling the off-road Honda Super Cub motorcycle to American teenagers. (The Beach Boys wrote a song about it.) The dirt-bike market was ignored by larger firms like General Motors because, setting all else equal, you’d rather sell a Cadillac to a businessman than a Super Cub to Brian Wilson. But in scorning the bikes, GM gave Honda the opportunity to thrive. With time, Honda leveraged its expertise to make a compact car and raided the US automotive industry from below. Christensen’s insight was that it was easier to go up the escalator of profitability than down. Going down meant voluntarily shrinking profit margins by deliberately making inferior goods, which tended to upset investors and made executives feel like they were jogging in place. This led Christensen to his most enduring and most counterintuitive recommendation: “There are times when it is right not to listen to customers, right to invest in lower-performing products that result in lower margins, and right to pursue small, rather than substantial, markets.” It was -- 62 of 263 -- a point that the buzzword discussion of “disruption” in the popular press usually missed. Huang became obsessed with Christensen, a towering but exceedingly kind Mormon bishop who’d played basketball at university. He read all of Christensen’s books, assigned The Innovator’s Dilemma to his executive staff, and later contracted Christensen as a consultant. It was Christensen who explained, really for the first time, why the big players like Sun and Silicon Graphics had declined to invest in PC video game hardware—not because they hated gamers but because the margins sucked compared to workstations and because, succeed or fail, gamers would not initially make a substantial difference in their bottom line. But in ceding the PC gaming market to Nvidia, the workstation companies had made a fatal mistake, just as GM had by ignoring Honda decades earlier. Nvidia, like Honda, was today selling low-margin products to teenage boys, but if the analogy held, tomorrow they might overtake the business workstations of Sun Microsystems and SGI. Sometimes Jensen would even speak to his clustered executive staff about the possibility of disrupting Intel, then one of the most valuable firms on Earth. In the meantime, Nvidia would survive in Intel’s territory through a strategy of continuous tactical retreat. “To this day, we don’t compete with Intel,” Huang said in 2023, describing their Tom-and-Jerry relationship. “Whenever they come near us, I pick up my chips and run.” The gospel of Christensen counseled Nvidia to sell offbeat products that Intel would not conceive of making to customers it would never want to serve. “Jensen would tell us, even back then, that Nvidia could someday be bigger than Intel,” Kirk recalled. “It was just a question of strategy.” • • • D N’ R cards outpaced what its European vendor could produce, so Huang began to look for other suppliers. The world’s best independent chip manufacturer, by unanimous consensus, was the Taiwan Semiconductor Manufacturing Corporation, whose massive complex in -- 63 of 263 -- Tainan fabricated a significant portion of the world’s silicon. TSMC didn’t design its own chips; it simply manufactured chips for merchants like Nvidia. The rise of such independent “foundry” services was responsible for a surge in computing innovation, permitting upstarts to experiment with radical designs. TSMC filled orders with incomparable precision and efficiency, the product of an extraordinarily demanding work culture. Workers there described the hierarchical corporate structure as “militarized,” and they followed a “996” shift schedule, working from nine a.m. to nine p.m. six days a week. Huang had repeatedly failed to get TSMC’s attention. After leaving a series of voicemails, he’d written a personal letter to Morris Chang, the company’s CEO, and put it in the mail, not expecting to hear back. A short time later, Huang got a phone call. It was toward the end of the workday, and many of his employees had started gaming. “There was quite a bit of ruckus in the office, and I pick up the phone, and people are laughing and yelling outside,” Huang said. “And I say, ‘Hey, guys, hold it down, it’s Morris!’ ” (“Actually, what he said was ‘Everybody shut the fuck up—I’ve got Morris Chang on the phone,’ ” according to one veteran engineer.) Chang had spent his life in silicon. Born in China in 1931, he had arrived in the United States from mainland China as a teenager. He had been a successful executive at Texas Instruments for many years, but in the 1970s he’d been passed over for the top spot—a snub that some observers attributed to anti-Asian racism. Chang then moved to Taiwan and took control of TSMC, which, under his leadership, grew to become the largest tech company in Asia. Chang took an immediate liking to Huang. “They were a very small company—in fact, almost facing bankruptcy,” Chang said. “And I was an older person running a much bigger company. But he was so open and forthcoming and candid in our conversation! Just completely at ease.” Soon, the two had a contract in place. Chang and Huang had much in common. Both were Chinese immigrants working in a technology sector that, at the time, was almost entirely -- 64 of 263 -- managed by white men. Relative to their proportion of the American population, Asian employees were overrepresented at Silicon Valley tech firms—but they were suspiciously underrepresented in positions of senior management. (In 2010, Asian Americans accounted for only 0.3 percent of corporate management positions, despite making up more than 5 percent of the skilled US labor force. Management consultant Jane Hyun has termed this phenomenon the “bamboo ceiling.”) When I asked Huang about the bamboo ceiling, he seemed dismissive—I got the sense that identity politics were not his thing. “I’m the only Chinese CEO of the time,” he said, “but it never occurred to me. And it doesn’t occur to me today.” TSMC was key to Nvidia’s long-term success, but the relationship got off to a difficult start. In early 1998, TSMC misapplied a chemical process at the end of the manufacturing process, introducing short circuits onto many of the chips. The mistake nearly ruined Nvidia, which had invested most of its working capital in the production run. More than half the chips needed to be discarded—Nvidia managed to save itself only by selling equity to some of its circuit-board partners. “We were close to bankruptcy that time, too,” Diercks said. “It’s not just a saying.” But over time, Nvidia’s relationship with TSMC proved to be of great mutual benefit to both companies, especially as Nvidia’s chips grew increasingly complex. For Huang, there was also a personal benefit—the arrangement gave him a reason to go back to Taiwan, which he hadn’t visited since he was a child. Huang’s first visit to a TSMC factory in the late 1990s brought him inside one of the most sterile environments on the planet. Clad in booties, gloves, and head covering, he entered an air shower, where he stood on a sticky mat and raised his arms, his coveralls flapping in the wind as an overhead fan blew lint, hair, dust, skin, dirt, grime, and other assorted detritus away. From the shower he passed through an airlock to the manufacturing center, where a gentler, continuous blower provided ceaseless vertical airflow from the ceiling through the grated floor. Here were the sacred light-printing machines etching identical patterns across mirrored silicon plates in slow, invisible cycles. No one dared lay a finger upon one—so delicate was the printing process that a single footstep -- 65 of 263 -- might upset it. After weeks of layering, the wafers were diced into individual chips by a diamond-crusted wire saw, then sent downstream for packaging. In a good year, TSMC’s factories might produce tens of millions of chips. Leaving the facilities, Huang returned to the country’s famous night markets to gorge on the Taiwanese food he’d enjoyed as a child. He spoke the language and could almost pass as a local, but he remembered little about the place—save for one painful instance he would never forget. When Huang had been about four, he’d gotten too close to a vendor at one of the stalls, and the vendor, who’d been cleaning a knife, accidentally caught Huang on the cheek, drawing blood. Now Huang had returned, still bearing the scar from his accident years before. No matter how wealthy or famous he became, Huang never missed an opportunity to return to the night markets. He often treated himself to a simmering bowl of beef noodle soup, the country’s national dish. It was a meal best enjoyed while sitting on a plastic stool on the side of a busy street. Picking through braised short rib and mustard greens with a pair of disposable chopsticks, Huang was already conceptualizing a transpacific marriage between his scrappy American team and the great maiden of Taiwan. • • • J C let down by the hardware designers. The lead programmer on Doom and Quake was a code surgeon who liked to sift around in the guts of the graphics chips that rendered his bestselling games. To light up a corridor in a space station, most programmers would rely on pre-supplied algorithms. Carmack’s team, by hacking the address structure of the underlying variable, had built its own approach, one that required Carmack to conceptualize the flow of information at the level of individual zeroes and ones. One commenter called it “the most beautiful piece of code you’ve ever read.” -- 66 of 263 -- Carmack was blond, thin, and not possessed of any unusual amount of social grace. His voice was nasal, his delivery was rapid, and he punctuated his technical comments with an occasional “mmmmm.” When coding, he would shut out the world for weeks, retreating to darkened bedrooms and working for fourteen hours a day. He usually emerged with something unforgettable. Quake, his masterpiece, was the first bestselling three- dimensional shooter. The game used graphics acceleration to render polygonal monsters that the player could shoot with a nail gun. (Trent Reznor had provided the game’s sound effects.) Quake II, the splatterfest follow-up, sent gamers to a distant planet to fight zombified cyborgs constructed from dismembered human body parts. (The game came out during my first year of college. It set me back years.) Both games featured a “deathmatch” mode that allowed for multiplayer combat. Carmack’s custom tinkering meant the Quake games ran more smoothly than anything else on the market, but simultaneously rendering dozens of participants in the same combat arena remained a challenge. “We wanted a faster-responding Quake so that our customers could reach out and frag somebody,” Carmack told me. “Most of these hardware accelerators couldn’t deliver that.” Huang saw an opportunity. At the Nvidia offices in Sunnyvale, the Sega consoles sat unused. Quake had conquered all, and the staff were so addicted that Huang had issued a ban on daytime deathmatch. PC manufacturers like Dell were bundling Nvidia’s graphics accelerators directly into new computers, skipping the retail peripheral channel. “It was becoming clear that whoever rendered this specific game the best was going to win the graphics wars,” Kirk said. Huang tasked his team with building a new chip just for Carmack—a custom Stratocaster for programming’s Jimi Hendrix. One thing Carmack wanted was multiple “pixel shaders.” These were the algorithms that assigned colors to individual pixels in a three-dimensional scene. By running more than one shading algorithm at a time, one could first decorate the wall with a light source, then subsequently decorate it with blood spatter. -- 67 of 263 -- But twin shaders meant twice as much computation. 3dfx, the leader in the graphics-accelerator market, had solved the problem by putting two graphics chips on the same card. Kirk and his team had a more radical solution in mind. What if you split the pixel-shading into two datasets, then ran the same instruction set on each pipeline at the same time, all on a single chip? The method was suitable for graphics rendering, which tended to run the same types of calculations over and over and over again. Huang was skeptical at first. This approach, known as “parallel computing,” had been tried before by vendors of expensive supercomputers. “Silicon Valley, it’s littered with corpses of previous parallel-computing platform companies,” he said. “Not one parallel-computing company has ever been created with the exception of us—not one, zero.” But then Huang began to reason from first principles. Carmack wouldn’t stop at two separate pixel pipelines, he surmised. As the shooting games grew in complexity, he’d always want more. Imagine a scene with many sources of illumination: an arena full of lights, with multiple guns firing and a spaceship crashing in the distance, on a world lit by twin suns. By dedicating a chip to each light source, 3dfx would eventually run out of real estate on the circuit board. The only way to meet Carmack’s future needs was to multiply pipelines on a single silicon square. Before committing, though, Huang had to do his homework. He had to understand why the approach had failed so many times. Seymour Cray’s powerful parallel supercomputers were floundering, with customers complaining about their high costs and complexity. Larry Ellison, the founder of Oracle, had invested millions in the parallel start-up nCube; by the late 1990s, it was failing too. The issue was that programming in parallel was difficult—acting on two or more data streams at once required swapping between multiple memory banks, and the learning curve was steep. This left the parallel-computing companies vulnerable to Intel. Intel’s chips used the standard serial approach, doing one calculation at a time. But their power grew exponentially, doubling every eighteen months or so, following a prediction first made in 1965 by former Intel CEO Gordon Moore—a prediction that had been validated so many times it was -- 68 of 263 -- known as “Moore’s Law.” Outside of specialized technical domains like weather prediction and high-energy physics, Moore’s Law guaranteed that one processor—the all-consuming CPU—was enough to meet the needs of even the most demanding users. “If you had a piece of software and you needed it to run faster, you had a choice,” parallel-computing specialist Bill Dally told me. “You could move it to a parallel computer and rewrite a million lines of software. Or you could just wait for the CPU to get twice as fast in two years.” Moore’s Law had already consumed the peripheral circuit-board market segment in which Nvidia originally operated. As first conceived, in 1993, the 3D graphics accelerator would be one of many slot-in add-on cards, alongside sound cards, networking cards, printer cards, et cetera. But by the late 1990s, Intel had folded sound, networking, and printing functionality into the motherboard. Only the 3D graphics cards remained, the lone survivors of a decimated ecosystem. They held out by being hungry; they absorbed all available computing capacity and asked for more. The other functions of the multimedia PC were bounded—once you were processing audio at equivalent quality to a compact disc, you didn’t need more computing power. With 3D graphics, however, demand never stopped. With 3D graphics, you weren’t finished until you were living inside the Matrix. “That was something Intel missed, just how much further the graphics firms had to run,” Hans Mosesmann said. “Demand for processing power in that space was basically infinite.” The CPU would not catch up to 3D rendering, not now, not ever. Twice as fast was nowhere good enough. • • • T R TNT in June 1998. The “TNT” stood for “twin texels,” dual pixel-rendering pipelines governed by a sophisticated switching mechanism. The delighted Carmack embraced his Stratocaster—he called it “the perfect card.” He tailored Quake III: Arena specifically for the twin- pipeline architecture and advised his legion of admirers that Quake games -- 69 of 263 -- were best played on Nvidia hardware. Carmack also saw what Huang had prophesied: that gaming would give Nvidia a base to disrupt the more profitable workstation market. “The TNT was really better, in many cases, than a $10,000 computer,” Carmack said. He was not the only programmer to notice. Kirk, in addition to skimming the best employees from the failing start-ups, was now poaching employees from Silicon Graphics. One of them was Dan Vivoli, a former engineer who’d switched roles into marketing. Vivoli used the TNT to establish Nvidia as a brand. Following Carmack’s endorsement, customers began to fetishize Nvidia products, attracting Jensen’s attention. The pile of reading material in his office soon contained numerous textbooks on marketing. Nvidia never directly advertised the TNT’s parallel capabilities to customers—why confuse them? Instead, it leveraged the dream of somehow living inside the computer. This fantasy had a strange and powerful allure. The concept of the “Matrix”—a shared-computing hallucination—had originated not with the 1999 movie but with William Gibson’s 1984 novel Neuromancer. In a 2011 interview with The Paris Review, Gibson recalled his inspiration: I remember walking past a video arcade, which was a new sort of business at that time, and seeing kids playing those old- fashioned console-style plywood video games. The games had a very primitive graphic representation of space and perspective. Some of them didn’t even have perspective but were yearning toward perspective and dimensionality. Even in this very primitive form, the kids who were playing them were so physically involved, it seemed to me that what they wanted was to be inside the games. Nvidia didn’t have a mission statement (Huang didn’t believe in them), but Gibson’s observation might have served as one. The goal was -- 70 of 263 -- immersion, total immersion, in digital worlds rendered with such pointillist detail that they made reality fall away. What Gibson had intuited from body language, Huang was rediscovering from deductive reasoning. Nvidia’s work wasn’t finished until the gamers lived inside the game. But in pursuing this goal, Nvidia engineers were playing a far more dangerous game than even they realized—for inside the TNT was a secret, a lurking, demonic secret buried so deep in the architecture of the silicon that neither Huang nor Kirk nor Carmack nor Vivoli nor anyone else in the world suspected that it was there. If you popped the cover off that TNT chip and inspected the naked circuitry with a microscope, you would find a change not just in the arrangement of transistors but a change to all computers, and perhaps to all of humanity, forever. Inside that tiny chip was a secret that would change the world. -- 71 of 263 -- T SIX Jellyfish he proposition gamblers gathered around the backgammon board in the elegant hotel room at the Bristol Suites in Dallas, Texas, in 1997. The match before them was unlike any other they had seen. Representing humanity were Nack Ballard and Mike Senkiewicz, two of the best players alive. Acting as an agent for the machines was Malcolm Davis, betting heavily against the human race and playing moves dictated by the computer at his side. A few months earlier, Garry Kasparov had lost to the IBM chess computer Deep Blue, in a match that drew interest from around the world. (“Swift and Slashing, Computer Topples Kasparov,” read the headline in The New York Times, accompanied by a photograph of the great Russian grand master with his face buried in his hands.) The equivalent backgammon competition in Dallas drew no such attention, save for the gamblers wandering in and out of the room. These men spent their lives indulging their obsessions with abstract games of strategy. They played Go, and poker, and Scrabble, and contract bridge, and chess, often at an elite level. They laughed and chattered and exchanged wads of cash after nearly every throw of the dice. The men were not always well-dressed or generally very physically fit, and outside of specialist circles their names were hardly known. But it was this unheralded competition in Dallas, not Kasparov’s defeat in New York City, that marked the dawn of the new machine age. Ballard was the best of them. He was a jolly, portly man with a wide face and sideburns who studied backgammon for six hours a day and had -- 72 of 263 -- been ranked number one in the world several times. With a muted clatter, he shook the dice in the leather cup, then tossed them onto the board. Kasparov had been visibly unsettled by Deep Blue, but Ballard had a Zen-like focus on the position at hand. He pondered his move in silence, then moved two checkers on the board. When he was finished, Davis rolled his own dice, then consulted the computer on the updated position. Davis was also a backgammon master, but by 1997 he was certain that software was surpassing humans at the game. He’d backed this assertion by offering to play as the computer’s agent against any takers for $200 a point. Backgammon was a streaky affair, and if Davis’s assessment of the computer’s abilities were incorrect, it might cost him $100,000 or more. But he had confidence in his machine, which ran a radically different kind of artificial-intelligence software than any that had come before. The program was called “Jellyfish.” What made Jellyfish special was that it was a “neural net” whose structure was inspired by the biological brain. Rather than executing code written by human programmers, Jellyfish made decisions by passing information to a grid of artificial neurons whose synapses were represented in the computer as an enormous matrix of numbers, or “weights.” The grid evaluated the position and passed an answer back through this synthetic nervous system. A clunky dialogue box popped up with Jellyfish’s recommendation. Davis moved the checkers on the board. Occasionally, the program recommended a controversial move, sparking disagreement among the spectators and prompting a flurry of side bets. As the chatter continued, Ballard rolled the dice again and resumed his reverie over the board. • • • T D match drew no more interest from the AI community than it did from the press. In 1997 mainstream computer scientists regarded neural nets as little more than toys. They were first conceived of as “nervous nets” in the 1940s, when early experimenters -- 73 of 263 -- physically re-created the neurons and synapses of the brain using complex electromechanical hardware. These giant contraptions absorbed lots of power and money while producing little of use until, in 1969, the influential MIT researcher Marvin Minsky demonstrated that a single layer of neurons was unable to perform even one of the simplest logic operations. Funding evaporated, and most of the machines were dismantled. AI suffered many false starts in the years that followed and developed a reputation as a career graveyard. Early progress with “symbolic” AI sputtered out in the first AI winter, in 1974. In the 1980s, revived interest in “expert systems” AI created a brief stock market bubble that popped following the 1987 crash. The governments of Japan, the United Kingdom, and the United States all launched ambitious AI initiatives, spending billions in taxpayer funds on grand strategies for research. In each case, independent analysts concluded that the money was mostly squandered. Meanwhile, throughout the 1970s and 1980s a coterie of renegade computer scientists continued to pursue research on neural nets, believing that the rickety machinery of yore might be re-created with software and that multiple layers of neurons might overcome limitations that a single layer could not. Most AI researchers regarded these mavericks as misguided, or possibly insane, but in 1986 cognitive psychologist David Rumelhart, working with computer scientists Geoffrey Hinton and Ronald Williams at UC–San Diego, published an elegant mathematical procedure for training multilayer neural nets called “backpropagation.”[*] The method allowed researchers to fine-tune the computer’s artificial neurons in response to new information, in the same way that the human brain formed new synaptic connections when a task was learned. Backpropagation revived the dormant neural-net community, allowing computers to function in an entirely new way. Backpropagation allowed for computer software that didn’t need to be explicitly programmed. Backpropagation allowed computer systems to make their own rules. Backpropagation allowed computer systems to evolve. In the late 1980s, the researcher Gerald Tesauro, also working at IBM, opted out of the company’s popular chess research group to conquer the -- 74 of 263 -- lowly game of backgammon. Lacking the cachet of chess or the mystique of poker, the game was essentially a race in which players tried to hit one another’s checkers while moving in opposite directions around a twenty- four-point board, subject to the rolls of two six-sided dice. The unpredictability made the game attractive to gamblers, but for Tesauro, backgammon had a different appeal. By simulating dice rolls, he could rapidly generate hundreds of thousands of artificial backgammon games— and this was training data from which a neural net might learn. Tesauro worked on this niche project almost entirely by himself; as with neural nets, few AI researchers took backgammon seriously. He first trained his neural nets to mimic the best human players, but this approach produced little of value. Around 1990, Tesauro decided to try a different approach. He stripped all strategic advice about the game of backgammon out of the neural net, leaving only the rules and an initial set of randomly weighted neurons. Then he had the computer play hundreds of thousands of games against itself. The technique was known as “reinforcement learning,” and Tesauro was the first person ever to get it to work. At first the program was hopeless and moved the checkers around aimlessly. After a few thousand games, though, the neural net had learned that leaving one checker alone was bad but that stacking two checkers together was good—this brought it to the level of a competent beginner. After tens of thousands of games, the neural net was employing more advanced concepts, like using multiple stacks of checkers to build a wall. After two hundred thousand games, the neural net, which Tesauro called TD-Gammon, was playing at a strong intermediate level. Over the next few years, Tesauro exposed TD-Gammon to millions of simulated games, and by 1995, TD-Gammon was employing strategies never before seen by humans. The neural net was no longer just learning. It was innovating. Unencumbered by received wisdom, TD-Gammon discovered a new approach to backgammon. It determined that human players were risking too much to establish an advantage up front and that conservative openings were better. At the same time, it would often forgo a guaranteed win in the -- 75 of 263 -- endgame in a greedy attempt to double its score, a strategy most human players considered reckless. In the middlegame, TD-Gammon made a variety of more subtle moves that human experts understood only after deep introspection. In 1995 backgammon instructor Kit Woolsey wrote to Tesauro with praise after playing against his creation: I find a comparison of TD-Gammon and the high-level chess computers fascinating. The chess computers are tremendous in tactical positions where variations can be calculated out. Their weakness is in vague positional games, where it is not obvious what is going on. TD-Gammon is just the opposite. Its strength is in the vague positional battles where judgment, not calculation, is the key…. Instead of a dumb machine which can calculate things much faster than humans such as the chess playing computers, you have built a smart machine which learns from experience pretty much the same way humans do. But IBM failed to commercialize Tesauro’s project—why would a vendor of business servers sell commercial backgammon software to a few hundred customers? Why, indeed. This adorable nook in the marketplace was filled in 1994 by the Norwegian researcher Fredrik Dahl. Dahl was an unusual man who enjoyed backgammon, chess, simulated tank battles, jiu-jitsu, and foraging in the woods for edible fungi. He worked for Norway’s defense establishment, where he simulated outcomes from a hypothetical Soviet invasion. His work drew inspiration from the 1983 movie WarGames, starring Matthew Broderick. In that movie, an AI attempts to start a nuclear war. Dahl assured me that this was not his own ambition, but he did show a keen interest in military affairs—after the Soviet Union collapsed, funding for Dahl’s research was eliminated. “It was a terrible time,” he said. (He was kidding, I think.) For his doctoral thesis, Dahl had built a neural net that modeled combat outcomes by having the computer fight millions of -- 76 of 263 -- simulated battles against itself. This framework was easily ported to backgammon, and Dahl soon surpassed Tesauro’s results. In 1994 Dahl unveiled Jellyfish, the first neural net ever sold to the public. Jellyfish had trained on many millions of backgammon games, but despite this intensive computational process, the finished product was small enough to fit on a 3.5-inch floppy disk, which Dahl sold via his primitive website. In this way an early distinction was established between the cumbersome training stage of AI, which was how the computer learned, and the inference stage, which was how the computer deployed its knowledge. The latter was far less expensive—a parallel might be drawn between the three-pound human brain, which handles inference, and the hundreds of millions of years of evolutionary conditioning that provided its training. Dahl was attuned to these biological analogies. He had selected the name “Jellyfish” as an homage to the ancient aquatic cnidarian whose “nerve net” controlled its systems of stimulus and response. His program “had only about a hundred brain cells, which I figured was about on par with the jellyfish,” he said. That was the power of neural structures: all it took to conquer backgammon, or survive for half a billion years in a dangerous marine ecosystem, or maybe even fend off the Soviets, was a hundred little cells. • • • T statistical sample, Ballard and Senkiewicz had each agreed to play three hundred games apiece against Jellyfish. Ballard, who’d once played for eighty-four hours in a row, was used to such backgammon marathons and managed to maintain his focus. He beat the computer by fifty-eight games, pocketing $11,600. But Senkiewicz lost a nearly equivalent amount, Davis broke even, and the competition was declared a tie. Ballard was happy with his individual win—but later analysis showed he’d gotten lucky with his dice rolls, and he knew his time had come. Never again would a human be so foolhardy as to challenge a backgammon program for money. -- 77 of 263 -- News of the Jellyfish match spread rapidly throughout the clannish backgammon community. Deep Blue was an expensive supercomputer whose brute-force approach was not replicable by humans, and thus it did not fundamentally change the expert approach to chess. (In fact, Deep Blue was dismantled after its 1997 victory.) Jellyfish, by contrast, was affordable software that could run on any Windows machine, and it revolutionized the game. Consulting Jellyfish on his home computer, the instructor Kit Woolsey published New Ideas in Backgammon, a collection of positions where the neural net’s opinion diverged sharply from human intuition. It soon became clear the computer was always right. In time, analysts learned to evaluate the skill of a human player not by how many games he won or lost but simply by how much his play differed from this ideal computer result. Jellyfish was the first neural net to surpass humans at any game. Dahl next turned his attention to poker. Using reinforcement learning techniques, he soon built—or perhaps the better word is evolved—a neural net capable of beating anyone in the world at the two-player, “heads-up,” limit Texas Hold’em variant. Dahl licensed this neural net to a slot-machine manufacturer that installed the poker bot at casinos on the Vegas strip, offering to play all takers for real money. Once again, no one could get the better of the machine. But the revolution stopped there. Although he made a fair amount of money from the slots operation, when Dahl attempted to build a similar program for no-limit hold’em, he ran into problems. One could bet any amount at no-limit, and the synthetic dataset was larger than the closed universes he’d constructed for limit poker and backgammon. Dahl’s no- limit neural net struggled to learn from this overwhelming amount of data. “It made reasonable plays, but it would never quite do what I was hoping,” he said. Dahl worked on this problem for many years. The obstacle was that he had almost no idea how his poker bot actually worked. The structure of its neural net was no easier to interpret than the nervous system of an invertebrate, and trying to tease out game-playing strategies by examining -- 78 of 263 -- the individual weights of the grid was like trying to unravel consciousness by looking at brain cells through a microscope. This was the criticism of neural nets and the thing that so biased the academic community against them. Once a neural net hit a training plateau —and they almost always did—there was rarely an obvious way to make it better. Classical programming was orderly and logical, but tinkering with neural nets required a different cast of mind. Dahl compared it to running a biology experiment: outcomes were unpredictable, and altering seemingly minor variables could have all manner of unanticipated results. Dahl tried everything he could think of to improve his no-limit poker bot. He fiddled with its evaluation function, he futzed with his computer’s memory, he replaced the activation trigger for the neurons, he even synthesized a simpler data universe for the bot to explore—but he never got it to play at an expert level. Eventually, Dahl, like so many neural-net researchers before him, gave up. He put the poker bot to the side and got a job analyzing medical data with conventional techniques. Many critics of neural nets were similarly disillusioned apostates who’d enjoyed early success with the technology before spending years posting subpar results. Dahl never quite joined the ranks of disbelievers, but his faith was sorely tested. “I dismissed it,” said the man who sold the public its first neural net. “I dismissed it because I just didn’t have the data.” He saw no solution, and he tried everything. He just could not imagine what might make neural nets succeed. S N * Backpropagation uses multivariable calculus and linear algebra to attribute new findings across a stack of layered grids. Backpropagation accomplishes this by first determining the degree to which the output of the existing neural net is in error. This error value is then used to compute a collection of partial derivatives known as the “gradient,” essentially attempting to determine which neurons have it most “wrong.” Once the gradient is determined, backpropagation tweaks the neurons in the opposite direction. The entire process is then repeated any number of times. The backpropagation technique was first discovered in 1970 by the Finnish mathematician Seppo Linnainmaa, although Linnainmaa did not explicitly apply it to neural networks. In 1974 American computer scientist Paul Werbos independently rediscovered backpropagation and presented the technique to Marvin Minsky at MIT as a work-around for the problems outlined in Minsky’s book -- 79 of 263 -- Perceptrons. But Minsky, according to Werbos’s account, all but threw him out of his office and discouraged him from using the technique. (Werbos later theorized that Minsky was upset he hadn’t discovered backpropagation himself.) In 1986 Rumelhart, Williams, and Hinton reintroduced backpropagation as a method for neural-network training, bringing the technique to widespread attention. -- 80 of 263 -- J SEVEN Deathmatch ohnathan Wendel aimed his weapon at the pile of ammunition and waited. Playing under the screen name “fatal1ty,” Wendel was one of the first professional gamers, and by 1999 he was looking to be the best. He had a long string of tournament wins in Quake III: Arena and would often score flawless victories—his strength was his ability to get inside his opponents’ heads. He’d noticed that this opponent seemed a little too fond of ammunition, and he was preparing to spring a trap. Most pro gamers were skinny nerds, but Wendel was a muscular athlete who played ice hockey, tennis, and golf. With pale blue eyes, sandy-blond hair, and a broad, masculine face, he looked like a fraternity jock from an eighties comedy. Wendel had discovered his talent for competitive Quake when he was fifteen, beating a hundred college kids in a tournament in Wichita, Kansas, in 1996. He then dropped out of college to play video games as a job. “The thing is, if you win just one tournament, you’ll be forgotten, right?” Wendel said, referencing his hero Tiger Woods. “These people will forget me if I don’t win more.” Wendel focused on Quake with the demented competitiveness of a professional athlete. He conducted deathmatch sparring sessions eight to twelve hours per day and broke up with his girlfriend so he could practice more. He won dozens of tournaments and was the number-one earner in competitive gaming for seven straight years. Once, after winning a tournament in Texas, he stripped off his headphones, pumped his fist, and let off a triumphant whoop, as his bespectacled opponent shrank back into -- 81 of 263 -- his zippered hoodie. Wendel, much larger and wearing an NFL windbreaker and cargo pants, looked like he might give the guy a wedgie. Wendel used drills to hone his twitch-shot instinct. He would sit in front of his screen with his finger on a mouse; then, when the screen turned green, he clicked the mouse as fast as he could. With practice, he was able to get his response time down to 140 milliseconds. In between training sessions on the computer, Wendel would run for miles, which he believed helped his reflexes. Fond of sports metaphors, he compared himself to a Formula One driver slamming on the accelerator at the start of the race. Through his headphones, Wendel heard his opponent’s footsteps approaching the ammunition pile. Wendel’s training kicked in, and he set his finger lightly on the button of the mouse. The trick was to start shooting just before the opponent appeared. At the critical moment, Wendel pressed the button, and time slowed down. Wendel used an Nvidia TNT2 to render the game. Commodity processors might render Quake III at twenty to thirty frames per second. Nvidia’s parallel-processing technology could push that to sixty to seventy frames. This critical improvement doubled from five to ten the number of frames in Wendel’s 140-millisecond reaction window. That was the edge he needed. Wendel began firing a frame or two before he could see the guy. Thirty milliseconds—three one-hundredths of a second—ticked by. Then the enemy stepped into Wendel’s line of fire. The Nvidia processor updated. One frame of damage, and the processor updated again. Another frame of damage, and another, and with each frame the processor ran about two hundred million individual calculations. Through his headphones, Wendel heard satisfying grunts of pain until the final frame, when the opponent collapsed in a pulp—and with that, “fatal1ty” had tallied another kill. Wendel compared his relationship with Nvidia to Michael Jordan’s relationship with Nike. The company supplied him with free hardware, and in return he advertised their capabilities whenever he could. Wendel knew little about the parallel-processing technology that made the cards run; when I asked him in 2024 about the technology, he responded with a shrug. -- 82 of 263 -- All he knew was that nothing rendered deathmatch like the TNT2. “The frame rate on that thing was insane,” Wendel said. He upgraded his card with every cycle, and soon every pro gamer was doing the same. • • • W S . Nvidia shipped new cards on a six-month cycle, twice as fast as any other vendor. The company introduced a new product line for the back-to-school cycle each fall, then updated that product in the spring. Demand accelerated when flat-screen monitors arrived, and within a few years graphics accelerators were standard on most PCs. In early 1999, fewer than six years after its founding, Nvidia went public with a $600 million valuation. Sequoia, which had initially valued Nvidia at $6 million, tallied a hundred-bagger, subsidizing the losses from countless other speculative investments. The stock, priced at $12 on the NASDAQ under the ticker symbol NVDA, immediately doubled; by the end of the year, it had hit $60. Corporate filings showed that Priem, Malachowsky, and Huang each owned more than three million shares. Huang was now a centimillionaire, but his newfound wealth did not distract from his objective of crushing and absorbing the competition until only his firm remained. Dwight Diercks recalled no parties, no champagne, no sense of relief, not even congratulations from the boss. He shared with me an email he had saved from Huang: The TNT2 team needs to do whatever it takes to get over the finish line. They’re fighting for every single minute within Dell and Compaq for the motherboard business. S3’s Savage 4s are working well on Camino, but we’re still struggling. There’s no more time. Do what it takes to get it done. We need design wins to take share from ATI and keep S3 down, and take Nvidia to the next level. Remember, there are three priorities: one, two, and three. We’re counting on 250,000 units of TNT2 shipments by -- 83 of 263 -- April to make our Q1. Do that and we’ve got to cert risk wafers accordingly. Get it done. Diercks shook his head in amazement. “He wrote that the day after the IPO,” he said. • • • C ATI S were mentioned in the email, but Nvidia’s nemesis was 3dfx. Its Voodoo cards had been the leading graphics accelerators for the past two years. In late 1999, Huang debuted the Voodoo killer. It was called the GeForce, short for “Geometry Force.” Powered by the NV10, it could render ten million triangles per second and alter the colors of pixels in the 3D scene to match the placement of movable light sources. Unifying “transformation and lighting” in a single platform was a long-sought achievement, and Nvidia wanted to crow about it. “Basically, they could now do for $2 what a workstation could do for $2,000,” Jon Peddie said. Huang turned to Dan Vivoli, the marketer he’d conscripted from Silicon Graphics. Vivoli was a clever guy who viewed a limited budget as an opportunity. He had noticed that in making purchasing decisions, gamers relied on a half-dozen independent hardware reviewers. Vivoli reached out to the reviewers, informing them that the GeForce was the world’s first “graphics-processing unit,” or “GPU.” Vivoli’s team had, in fact, made this term up, but the reviewers began grouping products in the category. Soon, graphics accelerators were universally known as GPUs. “We invented the category so we could be the leader in it,” Vivoli said. Engineers at 3dfx bristled at Nvidia’s gamesmanship. “There were situations where Nvidia played a few tricks in benchmarking,” Peddie said. “So yeah, they out-marketed them. But they also out-engineered them! They just ran their company better.” The GeForce could juggle four rendering pipelines on a single chip. A prototype for 3dfx’s more expensive Voodoo 4 employed four chips and still couldn’t do the same work. Nvidia’s frenzied six-month shipping cycle left the perfectionists at 3dfx at -- 84 of 263 -- a disadvantage. At one point, one of 3dfx’s founders publicly speculated about declaring a truce between the two companies so that technical standards could be established before the next generation of products shipped. “That’s when I knew we had him,” Kirk said. “We were in a death struggle with 3dfx, and one of us had to die.” Kirk’s brain-extraction experiments were another source of continuing discord. As engineers defected to Nvidia, they often brought with them proprietary ideas. This created legal problems—between 1996 and 1999, S3, Silicon Graphics, and 3dfx all filed patent-infringement lawsuits against Nvidia, and a fourth competitor, Matrox, filed a suit alleging that Nvidia had encouraged its employees to violate confidentiality agreements. The other three lawsuits were settled out of court, but 3dfx did not go to arbitration—the failing company instead staked its survival on victory at trial. In August 2000, during a disastrous earnings call, 3dfx CEO Alex Leupp told investors that 3dfx was on pace to lose more than $100 million in a single quarter. An hour later, Nvidia announced it was countersuing 3dfx, making rather questionable patent-infringement claims of its own. Many found the timing of the lawsuit cruel; some speculated that Huang had deliberately filed a nuisance lawsuit that he knew he would not win, just to run up cash-poor 3dfx’s legal bills. A month later, the judge in the case issued a preliminary ruling in 3dfx’s favor while rejecting Nvidia’s countersuit completely. 3dfx scrambled to collect, but Nvidia, through shrewd legal maneuvering, was able to stall the payout. Desperate, Leupp tried to sell 3dfx to Intel, offering the favorable ruling against Nvidia as the only thing of value that it owned. But Intel wanted no portion of this petty squabble, and neither did anyone else. With the money running out and its product floundering, 3dfx was forced to admit defeat and offer itself to Nvidia. Huang had won. His reward was an incoming brigade of new employees who hated him. Court filings later revealed that, inside 3dfx, Huang was referred to as “Darth Vader.” (“Actually, they called him worse than that,” Peddie said. “I can tell you, some of the other things they called him were not very polite.”) The rumors about Huang by this point were almost -- 85 of 263 -- mythologically sinister: he poached employees; he lifted ideas; he spun the reviewers; he kicked his fallen competitors in the teeth. Mostly, though, they hated Huang because he had whipped their collective corporate ass. “Nvidia made enemies all along the way as they rose to power, including with partners and suppliers,” Peddie said. “Jensen, you could say he’s a personal friend of mine—but he was ruthless.” • • • P B G got his notice on a Friday afternoon in December, along with the rest of the 3dfx Austin office. The workstations locked everyone out, and security escorted the employees out of the building, pausing to inspect backpacks and purses. The staffers loitered around the parking lot until a human resources manager summoned them to attention and directed them to reconvene in the warehouse across the street. Office complexes in Texas were not designed with pedestrians in mind, and so began an undignified procession of business-casual hostages who first marched into a drainage ditch, then continued in a disorderly scramble across an unmarked frontage road, and finally arrived at a warehouse full of unsold and, indeed, unsellable graphics hardware. Here the employees were informed that, effective immediately, 3dfx was laying them all off. “That was a fun day,” Garlick said. There was consolation for the lucky ones, however. Nvidia had declined to purchase the entirety of 3dfx but had offered to buy specific assets for $70 million. Leupp had accepted, and the lawsuit was dropped. Internal documents later revealed that Huang valued 3dfx’s best engineers at $1 million per head. This estimate reflected both their worth to Nvidia—and the value of keeping them away from Nvidia’s rivals. At the warehouse, the laid-off employees were told to go home for the weekend and wait by the phone for a potential job offer from Huang. Out of about five hundred or so candidates, Huang, in consultation with Kirk, had drawn up a list of 120 3dfx employees whom Nvidia wanted to keep. These employees were distrustful of Huang. Over the weekend, -- 86 of 263 -- whispers circulated that Nvidia was a sweatshop and that Huang was a tyrant who screamed at employees at the top of his voice. Some contended that there was no price at which they would work for such a man. Garlick was more realistic. Christmas was coming, and he needed a job. He was among the first to receive an offer. Garlick was a skilled programmer, though a modest one. (I asked him if his contributions were worth $1 million. “Something like that,” he said.) He returned to the warehouse the following Monday, and Huang, in person, presented him with a 20 percent raise, benefits, and stock options. Garlick took the job and remained at Nvidia for the next seventeen years. “My theory is that Jensen is a good person at heart who had to be ruthless,” Garlick said. “As opposed to some other CEOs, who were ruthless at heart and trying to pretend to be good people.” Such were Huang’s charms that, out of the 120 employees he recruited, 106 joined the dark side. Garlick was given access to Nvidia’s code base. He was appalled at what he saw. “Basically, it was cancer,” he said. “Y’know, cancer cells aren’t efficient. They just mutate, grow, and expand.” At 3dfx, Garlick had taken pride in the elegance of his programming, developing orderly systems with lucid commenting, allowing other programmers to easily maintain and improve his work. “In the time we spent making it clean, we went out of business,” he said. Nvidia’s approach was slapdash, with blocks of code written during some delirious midnight crunch serving as the foundation for critical systems. “What a shit show! The code was crap, the tool-chain was a mess, and the thing was, they didn’t give a shit!” Garlick said. “They didn’t give a shit about anything but the next tape-out.” In this manner, Nvidia had accrued a great deal of “tech debt,” repeatedly taking shortcuts that led over time to less maintainable code and creating problems for programmers later on. But as Garlick acclimated to these shortcuts, he came to see the value of the Nvidia approach. “There was a bizarre brilliance to it all: just iterate, iterate, iterate, execute, execute, execute,” he said. “The way I see it now, tech debt is the battle scar of the survivor.” -- 87 of 263 -- • • • W , Nvidia had more than six hundred employees, up from just thirty-five four years earlier. The company relocated to a new headquarters down the road in Santa Clara, leasing a complex of curved, multistory, glass-and-steel buildings joined by skyways, festooned with sculpture, surrounded by parking, adjacent to the expressway, and spread across eleven acres of land. The new offices didn’t smell like takeout food. They didn’t smell like anything. Modernist respectability, with its boring and predictable implications, had arrived. The trappings of the suburban office park—the cubicle farms, the endless lines of cars, the dreadful chain restaurants—were memorably satirized in Mike Judge’s 1999 movie Office Space. Judge had based the movie on his time working at one of Nvidia’s competitors, a graphics-card start-up located just a couple miles down the road. Judge had hated this work environment, but the Nvidians I talked to never mentioned boredom or a sense of missed opportunity. They’d made the decision to spend much of their life in the cubicles, and the corporate world suited them just fine. Huang did his part by relentlessly hunting for bureaucratic idiocies to eliminate. The main character in Office Space is reminded by multiple bosses to use the correct cover sheet for his TPS reports. Real-world software engineers often did file such “test procedure specification” paperwork, but if Huang were ever to discover that one of his managers was wasting precious engineering time with concerns about cover sheets, I suspect he would have dragged him to the center of the cubicle pit and crucified him. Many new hires at Nvidia, used to working for gaming companies, showed up expecting a looser culture. “At 3dfx, the motto was ‘Work hard, play hard,’ ” one former employee said. “At Nvidia, it was more just ‘Work hard.’ ” Long hours were the default, and the six-month release cycle created relentless pressure. “The end result was almost nonstop deadlines and a perpetual sense of being behind schedule,” another employee recalled. -- 88 of 263 -- Others appreciated Nvidia’s professionalism. “At least you knew the place was going to stay in business,” one veteran said. Gaming companies rarely had a dress code, and some coders indulged in performative slobbishness. Karen Huaulme, another 3dfx hire, recalled getting lost in a suburban Dallas office complex looking for the headquarters of iD Software, the maker of Quake. She resolved her problem by following the worst-dressed person she could find, a pasty young man with long stringy hair in flip-flops and a tattered T-shirt. He led her straight to iD’s front door. No one ever wore flip-flops to the office at Nvidia. Huaulme also told me that at past firms her male colleagues would sometimes aggressively question her credentials; at other times, they would assume a kind of handsy overfamiliarity. There were few female employees in Nvidia, especially in those days, but Huaulme found it a place of relative safety. “I felt protected from that sort of thing at Nvidia,” she said. Kirk ran many of the hiring interviews. Years earlier, at his own start-up, he’d been forced to lay off a hundred engineers, an experience so painful that days later he himself had quit. Resolved never to repeat this experience, Kirk determined that the best way to avoid layoffs was to be selective about whom he hired. The initial interview format at Nvidia consisted of several rounds of interviews, followed by a consensus hiring decision. But the technical staff, reluctant to make people squirm, stuck to standard interview bullshit: “Recall a time you overcame adversity,” “What’s your greatest weakness?,” “Why are manhole covers round?” Kirk, frustrated, felt that his staff were wasting time. He knew how Jensen would respond: by gathering the technical staff in a conference room and screaming at them. Like Diercks, Kirk believed that Huang’s outbursts were purposeful. “Yelling at people was part of this motivational strategy,” Kirk said. “You might think he’s just mad, but I think it was premeditated. And it works! It annoys people, but it does work.” The audience, Kirk believed, was crucial: “He wants everybody to benefit. He would never just yell at some guy in the hall. When he’s torturing people, he’s forcing them to learn a lesson—and they certainly would never forget it.” -- 89 of 263 -- Realizing that his employees didn’t know how to hire their own replacements, Kirk opted for the Jensen Huang Instructional Method. Deliberately, and with great intentionality, Kirk gathered his engineers together in a conference room. Then Kirk—gentle Kirk, who eschewed conflict and rarely raised his voice—began to yell. “What the fuck are you doing?!” Kirk screamed at his staff. “You just interviewed a guy who you’re going to be counting on to do half of your work, and you didn’t even bother to find out if he can do it! Now you’re gonna have to do twice as much work, and he’s gonna get half your salary!” Kirk, with satisfaction, regarded the stunned silence of his employees. “We’re gonna bring him in again, and I’m going to ask him questions now, because none of you know how,” he said. The unfortunate job seeker returned to find himself in a theater of interrogation. With his staff in attendance, Kirk opened the interview with a softball: “Are you familiar with how to draw a triangle?” Once the candidate answered that question, Kirk gently increased the difficulty. “OK, how do you draw the edges of a triangle?” With that answered, Kirk kept going: “What if one of the coordinates of the triangle is zero? You can’t divide by zero, so what do you do?” Kirk pushed into more difficult territory until he felt he’d reached the limit of the candidate’s understanding. Then came the final question, a demanding technical challenge that Kirk was certain the candidate didn’t know how to answer. The offer of a job hinged on what came next. Often, candidates would lie or try to make something up. That was an automatic fail. Others would admit blankly that they didn’t know. That was usually a fail, too. The engineers who passed the test were those who realized they were participating not in a job interview but a Socratic dialogue. These engineers could walk back through the series of questions that had led up to this point, then advance their knowledge in the interview, using the previous answers to figure out the new one. In fifteen minutes, Kirk learned more about the candidate’s capabilities than his staff had in eight hours of structured interviews—and his staff learned how to ask the right questions. -- 90 of 263 -- • • • N in whom he hired than Huang. As Nvidia succeeded, Huang’s family pressured him to distribute the spoils. “His parents pushed and said, ‘Look, you’ve got to give your brothers jobs,’ ” Jens Horstmann said. Huang refused, leading to tense conversations. “Basically, he said, ‘I can’t justify that. I don’t think they fit our culture,’ ” Horstmann recalled. Despite the family encouragement, Huang never hired his brothers. Jensen’s older brother, Jeff, held a variety of engineering jobs during his career. Horstmann recalled a time when the three of them were building a deck in Jensen’s backyard. Horstmann and Jensen did the construction; Jeff supervised. “He couldn’t even hold a nail gun,” Horstmann said. Jensen’s younger brother, Jim, had followed Jensen to OSU, earning the same electrical engineering degree. While Jensen was managing Nvidia out of Priem’s townhouse, Jim had played it safe, going to work for Intel. He remained there for the next thirty years, building and maintaining software tools. “He was a hardworking, solid engineer,” Horstmann said. “But this sort of entrepreneurial risk-taking—you know, being willing to accept failure, and be punished for it—I didn’t see that in him.” Rather than giving his brothers jobs, Huang gave them real estate. In the early 2000s he liquidated some of his Nvidia shares and used the proceeds to buy a sizable plot of land in Los Altos Hills, an affluent community overlooking Silicon Valley that is currently the third-wealthiest zip code in the United States. There, with Lori’s oversight, he commissioned a six- thousand-square-foot mansion with five bedrooms, seven bathrooms, a swimming pool, and an oversized garage. He bought two Ferraris (his and hers) and started collecting expensive whiskey. He donated his old house, with the deck, to Jeff. But not even a custom dream house could meet Jensen’s standards. As his wealth increased, he developed the persnickety “just so” attitude of the very-high-net-worth client. Returning home from work one day, he noticed that the glass doors to the garden of his mansion did not perfectly align with -- 91 of 263 -- the view to the pool house out back. The lack of symmetry annoyed him, so he ordered the pool house uprooted and, at considerable expense, had it moved eighteen feet to the side. • • • T GF line established Nvidia as the market leader. Simultaneously, Nvidia launched Quadro, a line of professional GPUs for advanced computer modeling and digital animation. As foretold, Nvidia was coming for Silicon Graphics—and, as foretold, some of Jensen’s former managers were coming to work for him. In 2000 Huang hired Tommy Lee, who had been his first boss at LSI. The growth in the 3D accelerator sector coincided with a frothy time in the stock market. Nvidia wasn’t some vaporous dot-com: it shipped an actual product, it had real revenues, and it posted real profits. Still, it was a technology company operating in a technology bubble. In early 2000, Nvidia announced an agreement to develop a chip for Microsoft’s as-yet- unnamed home-gaming console, and Nvidia’s stock price popped above $100 a share. The price triggered the fulfillment of a number of corporate dares made over drinks at some boozy offsite a few months before. Malachowsky got his ear pierced. Priem shaved all the hair off his head, except for a square patch at the top, which was dyed green and sculpted into the Nvidia logo. Huang got the logo tattooed on his upper arm, then complained about the pain for many years after. By 2001, Nvidia was selling a billion dollars’ worth of GPUs per year. The only firm able to match Nvidia’s pace of innovation was ATI, based in suburban Toronto. ATI’s flagship product line was the Radeon. The Radeon, like the GeForce, was a fan-cooled accelerator with parallel pixel pipelines. Its chips were manufactured in the same TSMC facility as the GeForce, and Kwok Yuen Ho, the company’s cofounder and CEO, was, like Huang, an immigrant with a fiercely competitive streak whose company had skirted bankruptcy several times before succeeding. With the introduction of the Radeon line, the GPU market settled into a static duopoly, and over the next -- 92 of 263 -- two decades, GeForce and Radeon would battle for supremacy, with each product line spending time at the top. (Today, GeForce has pulled well ahead.) Despite the success, Huang remained wary. In 1996, the leading graphics accelerator firm was S3 Graphics. By 1999, it was gone. In 1998, the leading firm was 3dfx. By 2000, it was gone, too. There was no guarantee the same wouldn’t happen to Nvidia. One of the business books stacked in Huang’s office, written by Intel CEO Andy Grove, was titled Only the Paranoid Survive. Competitive threats were inherent in any capitalist enterprise, but in the microchip sector those threats were of a different order. For a business like Coca-Cola, once you established a winning formula, the product sold itself —your job was not to tamper with success. The microchip industry was more like the fashion business—if your product today resembled your product from yesterday, you had made a terrible mistake. In semiconductors, everything was reinvented from scratch every few years. This was true of the software tools used to design the chips; it was true of the ultraviolet photolithography machines used to print them; it was true of the architecture of the chips themselves. The first Nvidia chip contained a million transistors. By 2000, Nvidia’s chips contained twenty times that number, cooled by high-speed fans and packed into half the space. “All that is solid melts into air,” one early observer of capitalism wrote, or, as Andy Grove declared a while later, “We all need to expose ourselves to the winds of change.” Jensen was not the only gambler in the sector. Every executive had to be one. The ever-increasing precision of transistor manufacturing raised the cost to participate with each new product cycle, and as such, the industry resembled a poker tournament in which the stakes were constantly increased. Stand pat in the tournament, and your stack would dwindle away. Your only chance to survive was to find a promising hand and shove in all your chips—then do it again. One expensive gamble Huang took was to add programmable shaders to the GPUs. Graphics technology at the time often rendered scenes with a -- 93 of 263 -- “plastic” or “rubbery” appearance. Kirk wanted to give programmers better lighting tools, but adding the shaders meant sacrificing profits to build infrastructure that not many game developers would use, at least not at first. “If you do this, it’s gonna cost a little bit more for a while,” he told Huang. “But then everybody’s gonna want it, and it’ll make everybody try to catch us.” As with the parallel pipelines, Huang was initially skeptical. He became convinced only when he started to consider the cost of not implementing the shaders. The lone guarantee in his industry was that more transistors were coming. As that happened, graphics would get cheaper and easier to render. Huang had managed to stay ahead of his competitors so far, but his asset-light “merchant” business was essentially just a collection of engineers sitting around a Silicon Valley office park. If those engineers weren’t constantly developing new, difficult-to-replicate technology, manufacturers in Asia would start knocking off his chips, and Nvidia would cease to exist. “If we don’t reinvent computer graphics, if we don’t reinvent ourselves, and we don’t open the canvas for the things that we can do on this processor, we will be commoditized out of existence,” Huang later said. Not to gamble was the biggest risk of all. The first GeForce models to feature programmable shaders debuted in June 2001. The Xbox, Microsoft’s gaming console, launched in November 2001, accompanied by its signature game, Halo. The game was a shooter in the tradition of Quake and Doom, except instead of taking place in Hell, it took place in a ring-shaped artificial world with beautiful, naturalistic lighting, rendered by Nvidia’s hardware. The success of Halo pushed Nvidia’s total market value above $20 billion. Two weeks later, Nvidia was added to the S&P 500 stock market index. It replaced Enron. At thirty-eight, Huang was one of the youngest CEOs in the index. In eight years he’d gone from brainstorming product ideas in a diner booth to running one of the five hundred most valuable companies in America. He had outcompeted and assimilated all but one of his competitors, and in the weeks after the index selection, Huang even briefly became a paper billionaire. But the stock market was a fickle arbiter of value, and Huang’s -- 94 of 263 -- glory was short-lived. It would be fourteen years before he saw that much money again. -- 95 of 263 -- J EIGHT The Compulsion Loop ensen drove the ball across the table with a satisfying thwock, generating terrific spin. He was practicing his forehand loop, which had been his kill shot in his competitive days. He reset his body as the next ball came, coiling and exploding, twisting his hips, and swinging his paddle up and across, sending the ball in a curving trajectory over the net. Thwock. He repeated this action, striking and resetting, striking and resetting, his body turning around his pivot foot like a mechanical piston. Things were not going well at work. Thwock. The stock market was crashing as the dot-com bubble deflated. Thwock. The latest GeForce product had shipped with a defective fan that made it sound like a leaf blower. Thwock. The Xbox deal was collapsing. Thwock. Nvidia was being investigated for accounting fraud. Thwock, thwock, thwock, thwock. Jensen put down the paddle, covered in sweat. Huang had returned to the tables in 2002 after contacting Joe Romanosky, his old friend and table tennis partner. Romanosky was surprised by the call; he hadn’t talked to Huang in nearly twenty years. The two talked cordially for a while, catching up on old times. Romanosky recalled the time in college when Huang had participated in a table tennis tournament at a state penitentiary. Winning easily in front of an audience of prisoners, Huang couldn’t resist showing off. “He had a bunch of trick shots, but he’d been able to mask it like it was part of the match,” Romanosky said. “You know, like the Harlem Globetrotters.” -- 96 of 263 -- On the call, Huang mentioned that he’d started a company that made equipment for video games. “He seemed very excited about it,” Romanosky said. Huang sent a GPU to Romanosky’s son, who built a computer around it. Romanosky, returning the favor, ordered Huang his favorite make of table tennis paddle. Soon, Huang was practicing again and hired a former Olympian to train him. Romanosky, who worked in San Diego as an engineer for Boeing, began flying to the Bay Area with his wife to spend time with Jensen and Lori. In the garden next to his relocated pool house, Huang had installed a large Japanese teppan grill, where he was practicing the tricks of a Benihana chef. He tossed fried rice, threw food in the air, and was experimenting with an onion-ring volcano. He never quite mastered the art of catching a grilled shrimp in his hat, however, and many ended up on the floor. At the backyard dinners, Huang talked about his family, his kids, his interests—anything but business. Romanosky was careful not to ask; he could read the headlines. Huang treated Romanosky to his expanding whiskey collection, the two men’s wives became friends, and Romanosky invited Huang to go camping in the Sierras. (Huang declined.) “He is very warm, very engaging,” Romanosky said. “He’s not at all a superstar executive when he and I sit down. I feel like he’s very authentic.” Romanosky’s impressions of Huang bore little resemblance to the obsessive tycoon his colleagues and competitors described. Romanosky saw only a high-spirited, high-energy, fun-loving mischief-maker—the same Jensen he had always known. Somehow, it seemed, Huang was able to compartmentalize his business persona from his home life. I wondered whether Romanosky had ever seen a glimpse of Darth Vader. “If that’s there, he’s turned that side of himself off,” Romanosky said. “Just a very warm, down-home kind of guy—that’s the Jensen I know.” • • • B and the fall of 2002, Nvidia’s stock price declined more than 90 percent. Huang’s wealth plummeted with it. The -- 97 of 263 -- problems started in January 2002, when the Securities and Exchange Commission opened an investigation into Nvidia’s accounting. Similar investigations had revealed schemes to inflate earnings at Worldcom and Enron, leading to widespread suspicion of corporate shenanigans. In July, Nvidia was forced to restate three years of earnings, and shortly thereafter, CFO Christine Hoberg was let go.[*] Tench Coxe told me that her firing was unfortunate and that the board never lost confidence in Huang. “The SEC was on a fishing expedition,” he said. He had a point; the restated earnings showed that Nvidia was more profitable than previously reported. The accounting scandal occurred during one of the worst bear markets in history. Suffering under the simultaneous bursting of the dot-com bubble, the 9/11 attacks, and the Enron bankruptcy, the S&P 500 lost nearly half its value. Coincident with these misfortunes, Nvidia started squabbling with Microsoft. The dispute was attributed to pricing and intellectual-property issues, but Nvidia’s growing sense of entitlement played a role. Nvidia employees were unabashedly elitist. They considered themselves the best—and they were—but their pride could sometimes sound like narcissism. In the weeks before the Xbox’s launch, Microsoft had hosted a celebration banquet to launch the console, with Bill Gates giving a congratulatory speech. The Nvidia technicians were seated near the back, sharing their table with the manufacturers of the rubber stabilizers that kept the Xbox from sliding off ledges. Talking with me decades later, Kirk recalled this seating arrangement with the wounded indignity of a bridesmaid slotted at the rejects’ table. “Yeah, you know, we’re an important partner on this project—right up there with the guys who make the little rubber feet,” he said. Kirk stressed that he was joking, but as with Huang, Kirk’s jokes were often delivery mechanisms for barbed, unpleasant truths. Perhaps it was an oversight by an event planner, but to Kirk, being sequestered with the rubber-injection and molding specialists felt like a deliberate signal that Microsoft was not to be held captive by a lowly hardware supplier. The slight was not soon forgotten, and shortly thereafter the business relationship soured. Microsoft started asking Nvidia for large shipment -- 98 of 263 -- volumes and reductions on price, Nvidia claimed that its contract did not obligate it to meet such demands, and the argument was sent to arbitration. For the next generation of the Xbox, Microsoft switched to ATI. Yet, of all these woes, the worst for Nvidia was the slow adoption of the programmable shaders. For the gamble to work, the company had to convince developers to adopt a new coding language. To do so, Nvidia marketed its shaders via “Dawn,” an inadequately clothed CGI pixie with butterfly wings, antennae, and large breasts. Dawn graced the cover of Nvidia’s programming textbook, The Cg Tutorial, whose less titillating contents consisted of ten chapters’ worth of vertex transformations, pixel pipelines, and sample computer code—and, of course, homework exercises. Uptake was slow. • • • H’ , invisible to Romanosky, found increasing room to express itself at work. In early 2003, Nvidia shipped the infamous GeForce FX, prone to slow rendering speeds and known to gamers as “the dustbuster” for its faulty, overactive fan. The device was panned by reviewers and customers, including Huang’s thirteen-year-old son, Spencer. Jensen arrived home one evening to find a gaming magazine featuring a harsh review of the device waiting for him with a Post-it note attached. “Dad,” the note read, “I think you need to kick it up a notch.” Huang arranged a meeting in which the product managers presented, to a few hundred people, every decision they had made that led to the fiasco. Huang then screamed at them, near the top of his voice, for nearly an hour. “ ‘Terrifying but cathartic’ is how I would describe it,” said Sharon Clay, one of the engineers responsible for quality control. Huang’s tirades inspired as much guilt as fear, and he often described, in detail, how in letting their customers down, Nvidia employees had let one another’s families down as well. (“I think I’m driven as much by guilt as anything else,” Huang told me.) -- 99 of 263 -- Nvidia conducted regular performance reviews of employees, and following the GeForce FX debacle, Clay feared that her next one would read RI: “Requires Improvement.” This, at Nvidia, was like being handed the Black Spot. For the GeForce FX, Clay had run four or five quality- control tests. For its successor, she expanded to one hundred, and ultimately to thousands. “When we started the whole process, I could not have imagined the solution that we ended up coming up with when properly, uh, pushed to think,” she said. So demotion never came, not for Clay or for anyone else on the quality- control team. Instead, Nvidia’s marketing team shot a satirical video starring the product managers, in which the card was repurposed as a leaf blower. This was distributed to the press, and the updated GeForce shipped to acclaim six weeks later. Many people at Nvidia told me that Huang’s anger enforced a kind of discipline within the company, in the manner of a military general or a pro football coach. “I’m not sure he yells more than any other Fortune 500 CEO,” one employee said. “Look, it’s not really his job to be your friend. It’s his job to push you beyond where you think you could ever go.” Even those who disliked such managerial tactics often had positive things to say about Huang personally. Former employee Tim Little recalled receiving an email with the subject heading “Drag Your Sorry Ass Across The Finish Line.” Little had been traveling for weeks, away from his family, working late nights at the circuit simulator; feeling he had nothing more to give, he responded to the email by submitting his resignation. A few nights later, at around two in the morning, as Little was finishing one of his last shifts, Huang arrived and sat down at the simulator beside him. The glow from the monitor illuminating his exhausted face, Huang recalled his own career, the sacrifices he’d made, the many late nights he’d spent away from his family, often working the circuit simulator himself. He expressed, frankly, that he wasn’t sure it was all worth it. He offered Little his job back if he wanted it; when Little declined, Huang thanked him for his service to the company and left. “That was absolutely the high point of my employment there,” Little said. -- 100 of 263 -- Of more than a hundred former and current Nvidia employees I spoke with for this book, almost all had a tender story about Huang to relate. One employee—the same one whom Huang had humiliated in front of dozens of people, asking for a full refund of his salary—told me that when he was later diagnosed with a serious medical issue, Huang offered to pay in full, out of pocket, for his treatment. When Ben Garlick decided to leave Nvidia for a start-up, he was startled to receive an impassioned, personal plea from Huang to stay. “We’re sitting together at this conference table, and he’s gotten so close to me we’re almost bumping knees, and he’s, like, begging!” Garlick said. Garlick was a frontline manager in charge of ten people at a company of thousands. “I didn’t even think Jensen knew my name,” he said. Huang’s combination of love, fear, and guilt was a seductive and powerful motivator. “You felt like you couldn’t let him down,” Clay said. “You just couldn’t.” • • • A , Curtis Priem finally tapped out. His responsibilities had been limited for years, and eventually he stopped coming into the office. (“He had a conflict with another executive staff person, and I think he decided he was fired,” Kirk said.) Priem began liquidating his Nvidia stock to make a large series of donations, mostly to his alma mater, the Rensselaer Polytechnic Institute in upstate New York. Priem was named to the school’s board of trustees, and the money was used to build a $200 million performing arts center and, later, to buy a quantum computer. Priem retreated to a splendid $6 million ranch in the Diablo range east of Oakland. His perch on the ridge gave him a commanding view of the Bay— gazing out toward the setting sun over his fenced-in herd of cattle, he could, on a clear day, make out all five of the major bridges that spanned the glittering water. He purchased a Gulfstream jet to shuttle from California to RPI and back; to offset his carbon footprint, he invested in experimental green technology and moved his ranch off the grid. He suffered through a -- 101 of 263 -- messy and unpleasant divorce, then began to look for a formula that might, in his words, “repair the earth.” In a series of chunk transactions between 2004 and 2006, Priem sold all his Nvidia shares. “That’s why the stock flatlined,” he said. “We basically sold into strength whenever it started going up.” Had he held those shares and done nothing but play cowboy for twenty years, Priem would today be worth more than $100 billion, making him one of the wealthiest people alive—but he told me he didn’t regret his decision. Doing so would have required him to have 99.9 percent of his net worth invested in the volatile stock of a risky tech company he no longer worked for, which didn’t seem like a good idea. Channeling George Bailey, Priem asked me to consider where his vanished windfall profits had gone. “The shares went out there, but it’s not like they disappeared. It’s in pension plans. It’s in people’s houses. It’s sort of like I contributed $100 billion to our economy,” he said. “I’m on track to give away half a billion in my lifetime, and that has taken most of my time and effort. In the back of my mind, I’m trying to figure out what I would do with a $100 billion foundation, and it is not easy. I wouldn’t even know how to give that away.” • • • N by the gamers in the end. Even with its stock in the toilet, the company was shipping some of the most complex silicon ever manufactured. These chips, combined with the arrival of home broadband Internet and the maturation of the multimedia home computer, inaugurated what some critics later called the Golden Age of PC gaming. Developers leveraged the new hardware to deliver classic titles like Call of Duty, Half- Life 2, The Sims, and World of Warcraft. “PC gaming peaked somewhere between 2000 and 2005,” one nostalgic commenter opined. Was this the best use of such technology? The subculture of PC gaming was toxic—from it grew 4chan and later the Gamergate harassment campaign. PC gamers termed console gamers “peasants” and referred to -- 102 of 263 -- themselves as the “PC master race.” Graphics pioneers were frustrated by the arrested aesthetic development evident in the leading titles, which reflected—or maybe produced—the stunted maturity of the customers. Nvidia had gifted the developers an extraordinary tool. The developers had used that tool to render monsters, gunfights, car chases, and gore. “It’s astounding when you think about all the work that goes on and the triviality of some of the results,” Jon Peddie said. But it made good business sense. The PC gamers were the best kind of customers: addicts. By design, video games offered rewards on a randomized schedule. Casinos used similar tactics to keep slot-machine junkies pasted to their chairs. In 2001 John Hopson, a researcher who worked for the studio that made Halo, described gaming’s “compulsion loop”: upgrade the player’s character, send them off to complete a quest, reward them with loot, then repeat. Some players found the loop hard to escape. Researchers noted that hardcore gamers exhibited behaviors associated with substance abuse. They binged. They suffered withdrawal. They lied to friends and family members about how much time they spent gaming. They deleted their games one day, then downloaded them again the next. In 2013, the American Psychiatric Association’s diagnostic manual added an entry for “internet gaming disorder,” noting that young men were especially susceptible. Symptoms included “poor performance at school, work or household responsibilities,” and “a decline in personal hygiene.” For others, though, the games offered spellbinding alternative worlds pregnant with meaning, challenge, and opportunity. World of Warcraft might be addictive, but through it gamers befriended compatriots all over the world. About a quarter of gamers played two hours a day or more. Nvidia called them “enthusiasts,” and they were the best customers. Many had started on Nintendo as children, then graduated to the PC scene in adulthood. In absolute terms, the PC market had fewer customers than the console market, but those customers spent far more on their systems. In between gaming sessions, some even managed to secure jobs. With Nvidia’s encouragement, the gaming PC became to the neckbeards what the muscle car was to gearheads. Custom-built gaming computers -- 103 of 263 -- termed “rigs” could be tricked out with thousands of dollars’ worth of after- market equipment. Vendors sold transparent computer casing with colored interior lighting to showcase the hardware. Just as automobile fanatics popped the hoods of their cars to advertise their engines, the enthusiasts posted photos of their rigs to online forums, bragging about their overclocked motherboards and the rendering speed of their GPUs. As the gaming market matured, Nvidia’s stock price recovered. In fact, even when the stock was cratering, Nvidia had never stopped growing. By the start of 2004, the company had more than a thousand employees and was reporting record earnings. And so began the chronic struggle among Wall Street analysts to figure out what Nvidia was even approximately worth. Few companies created greater headaches for money managers: analyzing prior years’ figures was of little use, because any money that Nvidia earned Huang immediately reinvested in speculative technologies that would either revolutionize computing or flop trying. By the mid-2000s, his track record looked a little worse than breakeven: he’d succeeded with the GPU and parallel processing but whiffed on a number of other initiatives, resulting in a corporation that was successful on paper but whose stock chronically underperformed. What mattered at Nvidia wasn’t profits or revenues. What mattered was the obsessive chief executive and his crazy long-shot bets. Either you believed in him or you didn’t. And if you didn’t, you were certainly in for a tough ride, for Huang was about to make his craziest bet yet. S N * The SEC later charged Hoberg with fraud, accusing her of hiding $3 million of unrecorded expenses from corporate auditors. Hoberg paid a $600,000 fine, but she admitted no wrongdoing. -- 104 of 263 -- G NINE CUDA amers like Johnathan Wendel lowered their screen resolution to maximize the frame rate in deathmatch. Ian Buck took things to the opposite extreme—he wanted to blow up the action as big as it could go. Buck, a graduate student in computer graphics at Stanford, realized that with a little technical expertise he could distribute the rendering requirements for a single game across multiple Nvidia cards. In 2000 he chained thirty-two GeForce units together to render Quake III across eight projectors. “It was the first gaming rig in 8K resolution, and it took up an entire wall,” Buck said. “It was beautiful.” Poking around in the circuits, Buck began to wonder if his GPU daisy chain might be useful for tasks other than shooting grenades at his friends. If you wanted to hand-render thirty frames of Quake III in 8K using pencil and paper, and you were willing to work twenty-four hours a day to do so, the arithmetic would take about sixteen thousand years to complete. Buck’s GeForce array was doing that every second. Plus, the whole rig cost only about $20,000 to assemble—a pittance by the standards of high- performance computing. To play Quake on the wall, Buck had inadvertently built a low-budget supercomputer. Buck figured this affordable horsepower would be useful to science and industry, but the code Nvidia had packaged with the cards only spoke the language of triangles. If Buck wanted to use his GPU array for some other purpose, he was going to have to hack it. He immersed himself in Nvidia’s shading textbook—the one with the pixie on the cover—becoming one of -- 105 of 263 -- the first programmers to master it. Ironically, in doing so, he lost interest in computer graphics. He’d been sucked, à la Tron, into the technological substrate below. Buck was intense, and balding, and he radiated intelligence. The closer he got to the circuits, the more obsessive he became about their capabilities. Like a veteran astronomer setting aside his telescope to contemplate the vastness of the universe, Buck remained perpetually astonished at how much arithmetic the computer could do. Sixty gigaflops—sixty billion operations—every single second, and that was just one card. No matter how much low-level circuit hacking Buck did, the sense of awe never left him. With a grant from DARPA, the Department of Defense’s research arm, Buck assembled a group of researchers. In 2003 Buck and his team released an open-source programming language called “Brook.” Using Brook, scientists could smuggle demanding mathematical payloads—say, simulating the formation of a galaxy or modeling the ignition process of a nuclear bomb—into hardware built to render carjackings and disembowelments. The graphical output of a Brook program was a meaningless series of triangles, but in rendering these images, the GPU coincidentally executed important scientific calculations at speed. “You really had to understand computer graphics to be able to hack those triangles,” Buck said. Brook made parallel computing accessible. Academics began bulk- purchasing GeForce cards and chaining them together, developing applications in financial modeling, weather simulation, high-energy physics, and medical imaging. The gaming cards were more than just gaming cards now; they were jerry-rigged scientific tools. The emergence of this new kind of customer did not escape Huang’s notice. “I got a bunch of papers published, and everyone was very supportive,” Buck said. “And then, around 2004, Jensen asked me to come to Nvidia and do it for real.” • • • -- 106 of 263 -- B’ that of many researchers who went to work for Nvidia. In 2005, Silicon Graphics, unable to compete with Nvidia’s Quadro line, was delisted from the New York Stock Exchange. (Its headquarters became the Googleplex.) As it had with 3dfx, Nvidia absorbed many SGI asylum seekers, doubling its staff to 2,000 people. Of those, 1,200—60 percent of the company—were classified as working in research and development. To outsiders, Nvidia still looked like a slightly ridiculous manufacturer of gaming hardware, but to insiders, it was beginning to feel like a scientific laboratory. Bill Dally, the chair of Stanford’s computer science department and a longtime parallel-computing evangelist, had observed with excitement the growing “arithmetic intensity” of Nvidia’s chips, which he thought might act as the backbone for a new kind of computer entirely. He wrote several papers praising the company’s innovations, and in 2003 Huang dropped by his university office to offer him a consulting job. Dally had spent years pitching parallel-computing concepts to indifferent executives; now Huang was coming to him. “He technically understands things at an extremely deep level, and he always asks the right questions,” Dally said. “Sometimes he’s even a step ahead of you, and it’s something you think you’re an expert on.” But Dally was also skeptical of Huang. He had heard stories of his emotional volatility and of Nvidia’s unforgiving work environment. Huang, perhaps anticipating these objections, produced at their first meeting a signed paper check made out in Dally’s name. Dally, one of the world’s leading academics, agreed to consult. Buck, Dally, and dozens of other talented engineers were being recruited for a secret Nvidia project called Compute Unified Domain Architecture, or CUDA. (The name was purposefully imprecise.) The concept behind CUDA was to take the parallel-computing circuits used for video games and repurpose them for scientists. No more hacking at triangles to get to those precious gigaflops—the architecture was being opened up. “Basically, the way to think about CUDA is you have a video game card on one side, but it -- 107 of 263 -- has a switch on it,” Dwight Diercks said. “So you flick that switch, and turn the card over, and suddenly the card becomes a supercomputer.” • • • T CUDA was John Nickolls, an Nvidia engineer who had previously cofounded one of the parallel-computing start-ups whose corpses littered the road. Nickolls was an expert downhill skier and model- train enthusiast whose office was decorated with framed pictures of microchips. He was passionate about making computers run faster: at his “massively parallel” MasPar Corporation, he had attempted to implement TSMC’s 996 work schedule, asking employees to work twelve-hour shifts six days a week. Even after his company tanked, Nickolls never gave up on parallelism, believing that it would eventually triumph as a consequence of the laws of physics. For decades, an engineering principle called “Dennard scaling” had governed the miniaturization of electronics. Dennard scaling dictated that transistors would continue to efficiently process electricity as they got smaller—basically, it was the reason computers got faster every year.[*] But Nickolls had calculated that sometime around 2005, the Dennard scaling relationship would collapse. The coming generation of light-printing machines would craft transistors a mere one hundred atoms in width. This was six thousand times thinner than a human hair and seven hundred times thinner than a red blood cell. At this fine scale, the transistors’ conductive properties would be compromised, and they would leak electricity into the surrounding circuitry. Once that happened, computers would slow down. Nickolls could see that the industry was in denial about this problem— especially Intel, which was confidently predicting linear gains from shrinking transistors down to components a single atom in width. Nickolls believed this was impossible, and in early 2003 he sent an unsolicited letter to Huang outlining his heretical thoughts. Nickolls didn’t panic or exaggerate. Rather, with a precise but measured sense of urgency, he explained, using principles of electricity, why Intel’s long domination of the -- 108 of 263 -- semiconductor industry was about to expire. “We had all seen it coming for a while, but it was Nickolls who convinced me that Moore’s Law was truly dead,” Huang said. “He deserves so much credit for what this company has become.” Huang hired Nickolls and put him in charge of a pilot project developing scientific applications for the GeForce. Even at Nvidia, Nickolls was considered intense. Two weeks after his first day of work, he was diagnosed with malignant melanoma. He continued working seventy-two hours a week while receiving cancer treatment, concealing from both family and colleagues the discomfort he was experiencing. Soon, Nickolls’s melanoma was in remission, and the earliest versions of the CUDA platform were live. Nickolls had no interest in video games at all. He didn’t even care about computer graphics; he cared only about making microchips go faster. In every other way, though, Nickolls was the model Nvidia employee. “My dad was always one to yell,” his son, Alec, told me. “I remember overhearing my dad take meetings on the phone and yelling at people. Not in a toxic way but just, like, make sure you know what you’re doing. Make sure you’re being productive.” Nickolls drove as hard in his personal life as he did at work. He tricked his son into skiing black diamonds with him, and at the model-train club he preferred to lay track rather than socialize. When Alec was young, he had struggled with a classic Boy Scout survival drill that required him to use his pants as a life preserver. Nickolls wouldn’t let his son out of the pool until his pants were filled with air. Nickolls was obsessed with getting the CUDA platform to work. Friends sometimes asked him why he was working for a video game company when he didn’t play video games. Nickolls informed them he wasn’t working on video games; he was working on one of the most important technologies of all time. He was building a platform so fast it would make every other computer look like a calculator watch. “Few inventions will have the impact on the world that CUDA will ultimately have,” he would say. This was more of a statement of faith than anything else. By the late 2000s, computers were fast enough for most consumer purposes, and there -- 109 of 263 -- were not obvious customers for what Nickolls was building. Bundling CUDA with a retail circuit board was like attaching a minivan to a rocket sled and trying to sell it to suburban commuters. Nickolls was undeterred. He did not want merely to get around Moore’s Law; he wanted to smash it forever. To do so, Nickolls had to ratchet up the arithmetic intensity of the circuits. Microchips kept time with a furious internal metronome that got faster every year. By the mid-2000s, that metronome was pulsing hundreds of millions of times per second, and the delicate wiring could not keep up with the beat. Parallel computing solved this problem not by speeding things up but by getting more transistors to respond to each pulse. An Intel CPU fired only a few transistors at a time. An Nvidia gaming GPU fired thousands. Nvidia struggled to find users who actually needed such power. “Initially, our only customers were two breast cancer researchers,” Diercks recalled. The researchers, working at Massachusetts General Hospital, had written to Nvidia with a proposal to upgrade their mammogram scanners. Huang enlisted the hospital to alpha-test CUDA, investing several million dollars in a pilot project that would ultimately sell exactly two graphics cards. “But Jensen loved that, right?” Diercks said. Mammogram imaging was the first example of what Huang would later call the “zero-billion-dollar” market. Huang had long sought a way to differentiate Nvidia from its competitors. Hardware innovations wouldn’t get him there; they were too easily cloned. Online, silicon fetishists swapped “die shots” of Nvidia’s microchips obtained by ripping the chip out of a retail board, dissolving the case in boiling sulfuric acid, then scanning the circuitry with a metallurgical microscope. The enthusiasm of the hobbyists paralleled professional espionage efforts by reverse- engineering teams at chipmaker laboratories. The denuded silicon was technically patented, but the 3dfx experience had shown the futility of lawsuits. “Everyone takes a look at their competitors’ hardware and how it works,” Diercks said. “It’s not even black ops. We just do it.” -- 110 of 263 -- To distinguish himself, Huang had to pursue a strategy that so defied conventional business logic that ATI wouldn’t follow. He had to build an exploratory product, like a $300 entry-level scientific supercomputer that not only didn’t have competitors but also didn’t even have obvious customers. The zero-billion-dollar market, by definition, was one that only he would participate in—one that only he would even see. Huang was going to build a baseball diamond in a cornfield and wait for the players to arrive. • • • P problem into smaller pieces, then solves them all at once. The complexity of its inner workings would require a textbook to accurately explain, but some of what happens is accessible through analogy. First, let’s consider the action of the circuits. Imagine that the microchip has been blown up to the size of a dance floor. The floor is packed with partiers waving glow sticks, who represent the transistors. The lights are flashing, and the beat is raging, but most of the dancers are frozen—they can move only when it is their turn. One dancer moves on the first beat of the first measure, another on the fourth beat of the second measure, and so on. You can see glow sticks waving here and there, but most of the dancers aren’t dancing. They’re waiting to act. The serial DJ has been trying to get the crowd moving by speeding up the beat, but this has diminishing returns. Then the parallel DJ takes the stage. Rather than speeding up the beat, the parallel DJ choreographs far more complicated movements among the dancers. This works: the activity grows frenzied, the floor begins to shake, and the place is suddenly much hotter—some dancers are so active they might overheat. Now thousands of glow sticks shake with every beat. Reworking the circuits in this way is a bottom-up change that alters everything above it. This is the challenge of parallel computing— harnessing the complex choreography is logistically difficult, and requires programmers to think about problems in an entirely new way. It is easy to -- 111 of 263 -- feed instructions to Intel’s CPUs, which work like a delivery van, dropping off one package at a time. The truck is slow, sure, but it requires little from the programmer. Have a package to be delivered? Throw it in the back of the van! Nvidia’s parallel GPU acts more like a fleet of motorcycles spreading out across a city. The drivers deliver every package at roughly the same time, and the entire process can be completed in half an hour. But this rapid parallel solution is far more difficult to execute; it requires more drivers, more machines, and more logistics. Here, with motorcyclists constantly circulating in and out of the warehouse, every package has to be assigned to the correct vehicle and precisely routed to its destination. For decades, programmers had preferred the van—but as Nickolls had predicted, now the van was running into the traffic jam of electromagnetic physics. Once that happened, he believed, programmers would finally take the time to learn how to manage the fleet of motorcycles. In fact, they’d be forced to learn this. They’d have no choice. Still, it was not really the programmers that Nvidia was targeting; they were intermediaries. The true customers, if they ever arrived, would be doctors, astronomers, geologists, and other scientists—highly educated academic specialists who were skilled in specific domains but who maybe didn’t know how to code at all. It was these end users who would ultimately direct money toward CUDA, and for them an even gentler metaphor was required. For scientists, the best way to think of the difference between serial and parallel computing is to think of Intel’s serial CPU as a high-end stainless- steel Wüsthof kitchen knife. The knife is a beautiful multipurpose tool that can make any kind of cut. It can julienne, batonnet, chop, slice, dice, or hack. With a little skill, a chef can build a whole meal with just this one implement—but the knife can only ever chop one vegetable at a time. By comparison, Nvidia’s parallel GPU acts more like a Cuisinart. It is a specialty tool that is loud, indelicate, and power-intensive. It cannot chiffonade tarragon or score a crosshatch on a tube of calamari. But to mince a bunch of vegetables quickly, the GPU is the tool. -- 112 of 263 -- Now, just as you would struggle to cook a meal with only a Cuisinart and no kitchen knives, you cannot run a computer on a GPU alone. The architecture of the device is too specialized for that; a CPU is always necessary. In this sense, the CPU is always the primary tool, while the GPU is an expensive add-on. And, like a lot of kitchen gadgets, it’s one that was initially sneered at by purists. But imagine that a chef arrives to work one day to find that a semitruck full of fresh vegetables has pulled up to the loading dock. The chef no longer has time to chiffonade; she must mince these hundreds of pounds of vegetables before they spoil. This chef might find the Cuisinart useful. In fact, she might want dozens or even hundreds of Cuisinarts running at the same time. The truckload of vegetables in this analogy represents Big Data. In the mid-2000s, the scientific loading dock was piling up with datasets that were exponentially larger than anything that had come before: astronomical data, geo-engineering data, medical data, government data, financial data, and the sprawling, ever-expanding human-generated dataset of the World Wide Web. In the past, a scientist might count herself lucky with one crate of vegetables every couple of weeks. By the mid-2000s, a scientist could expect delivery of several shipping containers’ worth of vegetables every day. Intel’s timeless kitchen knives just weren’t up to the challenge—you needed a machine-driven spinning blade. OK, so the cuts weren’t always so beautiful. So what? Another truck was coming in a few minutes anyhow. The GPU was data’s Cuisinart. It was the machine that processed data into rough-hewn cubes. • • • U N’ , Nvidia’s designers began segmenting their microchips into “CUDA cores,” which were arrays of circuitry that could simultaneously execute the same instruction in parallel across multiple groupings of data. Arjun Prabhu, Nvidia’s director of hardware engineering, -- 113 of 263 -- compared designing the new microchip to urban planning, with different zones of the chip dedicated to different tasks. As Tetris players do with falling blocks, Prabhu would sometimes see transistors in his sleep. “The best ideas happen on a Friday night, when I’m literally dreaming about it,” Prabhu said. The decision to ship dual-purpose chips was controversial within Nvidia because it raised the GeForce’s cost of production above that of the Radeon, a cost that was internally referred to as the “CUDA tax.” Huang was gambling that his gaming customers, entranced by Half-Life 2, wouldn’t notice that they were subsidizing a risky and possibly pointless side quest into the arcane realm of high-performance computing. “Many of the ideas employed in CUDA cores had been used long before in supercomputers and specialty processors, but they were too expensive for the small, specialty markets they served,” Brett Coon, one of the first CUDA engineers, recalled. “In my opinion, the ‘genius’ of CUDA is getting gamers to pay for the massive chip development costs.” Several layers of software had to run on top of Prabhu’s circuits. The first was the machine code layer, which broke down complex mathematical formulae into simple arithmetic. Much of Ian Buck’s work took place here, building ideas from the circuit board up. This was the catacombs of computing, the lowest-level code you could write, whispering directly to the metal. Many programmers found this layer mind-numbing, but Buck loved it—he was later granted patents on several assembly-level techniques. “It’s where the rubber meets the road,” he said. Buck hired a team of numerics specialists, many of them graduates of Moscow State University. (“You know, a lot of Pyotrs, a lot of Borises,” Buck said.) Working with the Russians, Buck took the complicated mathematical structures that scientists liked—differential equations and higher-dimensional matrices and such—and rewrote them as primitive equations consisting of only plus, minus, times, and divide. Running these elementary operations in parallel across many datasets at once required an unusual talent for holistic reasoning. “Human beings think linearly,” Buck said. “You give instructions to someone on how to get from here to -- 114 of 263 -- Starbucks, and you give them individual steps. You don’t give them instructions on how to get to any Starbucks location from anywhere. It’s just hard to think that way, in parallel.” Sitting above Buck’s layer was the “compiler,” which translated programming languages like C++ and Python into machine code. Bas Aarts, the Dutch developer who wrote the first CUDA compiler, was similarly obsessive. He could retreat into his mind for weeks at a time, forsaking friends, relationships, and hobbies to conceptualize how a computer might interpret information. “Certain people in my life think that I’m pretty one- dimensional,” he said. “But it’s—it’s elegant! It’s complicated. It’s challenging. And if I don’t get challenged, I get bored.” Business strategy for CUDA took a long-term view. Huang encouraged Nickolls to embrace the scientific customers—to embrace them very tightly and not let go. The performance gains from CUDA had to be so great, and so obvious, that customers would voluntarily build whole new academic disciplines around the platform. “After that, you will never want to leave,” Aarts said. “It’s vendor lock. There is no out.” In this way Nvidia built what software developers called “the CUDA stack.” At the bottom were the circuits, above this was the machine code that pushed the electrons around, above this was the compiler that translated from machine to human, and at the top was the software that faced the scientists. The stack turned ideas into electricity and turned electricity into results. • • • CUDA in late 2006. The software package was free, although it worked only on Nvidia hardware. In 2007 it was downloaded an underwhelming thirteen thousand times—in that first full year, not one hundredth of one percent of the hundreds of millions of GeForce owners out there bothered to flick the switch on their video game hardware to transform it into a supercomputer. Skeptical investors wondered who this technology was for. “Not only did Wall Street not think CUDA was -- 115 of 263 -- valuable; they thought it had negative value,” one employee close to the situation said. Many programmers found CUDA difficult to use. To maximize the power of the GPU, programmers had to break apart large tasks into hundreds of smaller subtasks called “threads.” Then they had to carefully— very carefully—feed these threads into the CUDA cores. This was a tricky bit of business with many hidden pitfalls. Programmers had to manage multiple memory banks without getting confused; they also had to avoid timing mismatches that could produce incorrect results. The learning curve for parallelism was steep, and it built on advanced concepts in computer science. Academics who’d trained in other fields, like physics or medicine, rarely possessed the programming chops needed to make CUDA work. Kirk, looking to repeat the success he’d had with the shaders, tried marketing the technology with a textbook, Programming Massively Parallel Processors, which he cowrote with computer scientist Wen-Mei Hwu. In the introduction the authors observed that computer architecture had not evolved since the Hungarian genius John von Neumann had laid out its basic schematics in 1945. “Computer users have also become accustomed to the expectation that these programs run faster with each new generation of microprocessors,” they wrote. “Such expectation is no longer valid from this day onward.” Few professors incorporated the textbook into their classes. There was heresy, there was blasphemy, and then there was questioning John von Neumann. In industry, Nvidia sought a range of customers, including stock traders, oil prospectors, and molecular biologists. At one point, the company signed a deal with General Mills to simulate the thermal physics of cooking frozen Totino’s pizza. But most of these deals fizzled out after a couple of quarters —pizza chefs needed only so much computing power. The high cost of R&D for CUDA was a drag on Nvidia’s financial returns, but CUDA was expensive in subtle ways as well. The project caused internal dissension within Nvidia—Nickolls had to fight for resources, sometimes forcefully. “It was a lot easier to convince the hardware designers why it was important to improve performance on -- 116 of 263 -- Unreal or Doom than on, say, matrix multiplications or Fast Fourier Transforms,” Coon recalled. Meanwhile, the still-cancerous code base swelled in complexity: the complete GeForce software package would soon surpass one hundred million lines of code, making it more complex than some Windows operating systems. Perhaps the biggest hidden cost was that CUDA distracted Huang from serving his core customer. Rumors of manufacturing problems at Nvidia first surfaced in late 2006, with gamers complaining about GPUs in notebook computers that stopped functioning after a few weeks of use. By the time the problem was acknowledged, the gaming forums had grown conspiratorial, with posters accusing Nvidia of incorrectly attaching their chips to the soldering “bumps” on the circuit board beneath. “Bumpgate” had begun: gamers defected to competitors, and Nvidia stock once again plummeted, losing close to 90 percent of its value for the second time in six years. Board members consulting the stock chart on a Bloomberg terminal compared it to an EKG of a heart attack. Delighted Radeon loyalists—haters, all—piled on, accusing Huang of orchestrating a cover-up. “Nvidia is tanking, we told you so,” one wrote. In early 2009, Dell dropped Nvidia as a preferred supplier for its popular line of gaming laptops. “For a long time, we have wondered when Nvidia’s abject stupidity would have a price,” wrote one caustic tech columnist. “The answer, at least at Dell, is now.” Huang, seeking to get ahead of Bumpgate, set aside $200 million for customer refunds. The reserve wiped out Nvidia’s profits for the year, and for the first time since going public, Nvidia lost money. Huang arranged a Q&A with the press to explain the situation. He arrived in spectacles, black jeans, and a loose-fitting gray athletic T-shirt that revealed the surprising definition of his upper body—the result of his diligent, lifelong push-up routine. “I just don’t want the consumers to fight the process,” he said in a relaxed and patient voice. “It’s a little bit messy because the competition wants to stir it up, but it’s not really that complicated.” Only seasoned observers of Jensen could intuit the anger coiled beneath the patter. -- 117 of 263 -- Nvidia held regular offsites to discuss corporate strategy. Looking to save money in the wake of Bumpgate, the 2008 offsite was relocated to the company cafeteria. “That was the most I ever heard him yell,” said Sameer Halepete, one of Huang’s top hardware engineers. The target of Jensen’s wrath was an extraordinarily skilled and dedicated chip architect with many years of service to the company. The architect stood in one corner of the cafeteria; Huang stood in the opposite. Lining the walls were some 150 senior executives of the company mutely observing the excruciating scene. “I still vividly remember Jensen just nonstop berating him for a good hour and a half,” Halepete said. “Honestly, maybe it was two hours—he was just livid.” Yet the architect kept his job. “Very rarely does Jensen make significant changes as a result of execution issues,” Halepete said. “He’s very conscious of having an even slightly chilling effect on people’s willingness to take risks and innovate. As a result, his level of forgiveness for even the largest screw-ups is extremely high.” Halepete surmised that the tirades were what Jensen did instead of showing you the door. “He will berate you, he will yell at you, he will insult you—whatever,” Halepete said. “He’s never going to fire you.” • • • T , Huang turned to Deb Shoquist, who managed Nvidia’s worldwide network of suppliers. Shoquist’s portfolio took her from Guadalajara to Hanoi to Bangalore; it was her job to make sure that the company’s components arrived on time and in sufficient quantity. The job required a fair amount of screaming on the telephone, and Shoquist, voluble and expressive, was not one to back down from a confrontation. Shortly after arriving at Nvidia in 2007, Huang had asked her to shorten the lead time on deliveries from a Taiwanese packaging vendor downstream from TSMC. Shoquist regarded this as an impossibility; Taiwan was famous for its efficiency, and she doubted that there was any gristle to trim from the process. The two began to argue about the difference between lead times -- 118 of 263 -- and cycle times, until Shoquist directly contradicted Huang. “You don’t understand how this works,” she said. The argument took place around a conference table, where executives sat with their laptops open. The moment Shoquist told Huang he didn’t understand, her inbox lit up with messages from her colleagues. “Stop!” one read. “Stop, don’t go there. Try to listen to him.” But it was too late. Huang erupted at Shoquist, screaming at her that she didn’t know how to do her job. “I thought you were an ops person. You’re not an ops person!” he shouted. “You don’t know ops!” Huang’s fury was matched by Shoquist’s own. She’d been doing this for twenty years—who was this guy to tell her she wasn’t an ops person? She was ops. Incensed, Shoquist told Huang she was going to fly to Taiwan and get the stats directly from the vendor to prove she was right. The vendor was delighted to host her: the packaging facility was not a place of glamour, and customers almost never visited. Over the course of a week, Shoquist familiarized herself with the unit economics of this back-end supplier. At the packaging facility, the “lead time” between the receipt of an order and fulfillment was three weeks. But, to Shoquist’s surprise, the “cycle time”—the total number of person-hours it took the vendor to place Nvidia’s microchips into the black casing—was just thirty-six hours. The vendor explained it would at least be theoretically possible to expedite the lead time to match the cycle time, although this would increase the cost of packaging each chip from $8 to $1,000. Huang had been right: it was possible to shorten the lead time. Expensive, but possible. Chastened, Shoquist returned to Nvidia with a cost schedule for expediting packaging orders. She waited until the two were alone to present her findings to Huang. (“I didn’t want to give him an audience,” Shoquist said.) She braced for his fury, but it never came. Instead, he said, “That’s the right answer.” Shoquist developed similar cost schedules for all of the hundreds of suppliers that slotted into Nvidia’s manufacturing network. Then she began to squeeze, reducing Nvidia’s cycle time from months to weeks, eventually setting a record of thirteen days. -- 119 of 263 -- In pushing Shoquist, Huang was employing a scheduling technique he called the “speed of light.” He drilled this management concept into his employees with the fervor of religious doctrine—almost everyone at Nvidia I talked to referenced the “speed of light” at least once. “Speed of light” did not mean, as one might assume, to move quickly. Instead, Huang encouraged managers to identify the absolute fastest that something could conceivably be accomplished, given an unlimited budget, and assuming that every single thing went right. (For example, traveling from New York to London at the “speed of light” would involve perfect weather, zero traffic, and a supersonic plane.) Managers could then work backward from this unachievable constant to realistic but still impressive delivery times. “It sounds hard, but it really takes the pressure off of you,” Shoquist told me. “Once you understand the physical limits of what is possible, you understand the competition can’t go any faster either.” Huang pursued this unattainable ideal every day of his life. “I should make sure that I’m sufficiently exhausted from working that no one can keep me up at night,” he later said. “That’s really the only thing I can control.” He maintained this pace for decades, but others burned out. Tired of his commute, David Kirk decided he’d had enough of full-time employment in 2007. Cashing in a portion of the shares Huang had awarded him in the 1990s, he moved to Telluride, Colorado, and later Hawaii, continuing with Nvidia as he had begun, as a part-time consultant. At the time of his departure, Kirk was running Nvidia Research, a group of thirty scientists working on advanced graphics technologies. Huang believed that something more ambitious was needed to leverage the growing power of CUDA, and he deputized Kirk to find his own replacement. The man Kirk returned with lived at the speed of light. He would soon transform Nvidia Research into the most successful corporate R&D department in the world. S N * Technically speaking, Dennard scaling states that as transistors get smaller, their power density remains constant. It was first proposed in 1974 by Robert Dennard and his colleagues at IBM. -- 120 of 263 -- W TEN Resonance hen Bill Dally wasn’t flying his plane, or applying for a patent, or reinventing the computer, he was riding his bicycle to the point of collapse, or rowing in Lake Tahoe, or competing in a downhill ski race, or sailing nonstop from Grenada to Antigua. Dally’s pace of invention made Kirk and Nickolls look lazy: he was the author of 250 technical papers and 4 textbooks, and he held 120 patents spanning an eclectic range of computing domains, ranging from complex circuit architectures to the chip that ran the power supply. Bald, fit, quick-talking, and obviously brilliant, Dally spoke with unforgiving academic precision when discussing computers and in a blunt, matter-of-fact tone when discussing anything else. There was no domain of computing he didn’t seem to understand, and there was no moment of his life that didn’t seem optimized either for technical achievement or adventure. Dally had dropped out of high school because he didn’t want to sit through history class. Working as an auto mechanic, he finagled entry into college on the basis of his test scores. He never got his high school diploma, but he did receive a bachelor’s from Virginia Tech, a master’s from Stanford, and a PhD from CalTech. By his early thirties, Dally was a tenured professor at MIT. Dally liked to build his own computers. He also liked to fly his own plane. On a gloomy September day in 1992, he took his single-engine Cessna on a trip to New York over the Long Island Sound. While cruising at an altitude of around six thousand feet, he noticed his oil indicator go on. -- 121 of 263 -- He steered toward a nearby airport, but minutes later his engine cut out, followed by a sound that he described as “a lot of softballs kicking around in an oil drum.” Dally reoriented the Cessna into a glide pattern, then prepared to crash-land into the sound. When the plane hit the water, Dally’s body was thrown forward, and he broke his nose against the steering wheel. Stunned and bleeding, he had about twenty seconds to get away before the plane sank beneath the waves. In that time he was able to break open the cabin window and escape into the ocean chop, clutching a seat cushion as a life preserver. He was eventually rescued by a passing sailboat. “I later programmed that event into a simulator,” Dally told me in a matter-of-fact way, as if he were describing a letter he had mailed. “After ten tries where I wound up in the water, I came up with one where I managed to land at the Groton-New London airport.” Following the plane crash, Dally was back at work within a couple days. His team at MIT was building an experimental parallel computer known as the “Jellybean machine.” The Jellybean was Dally’s reimagining of how information technology might work, and almost every part of it—the microprocessors, the circuit boards, the networking hardware, and the applications—was bespoke technology that Dally and his team had engineered from first principles. The contraption stood as tall as a person, and parts of it were held together with duct tape. Benchmarking tests showed that the Jellybean ran much faster than a conventional computer, but Dally could never find a commercial partner to build it. Preexisting computers were plenty fast for most purposes, and there seemed to be no market for a parallel device. So when he was forty years old, Dally left MIT for Stanford; the former high school dropout was now the chair of the most prestigious computer science department in the world. Dally started consulting for Nvidia following Jensen’s visit in 2003, but when Kirk approached him with the offer of a full-time job in 2009, Dally initially turned him down. Nvidia had lost money in fiscal 2009 and fiscal 2010, and the stock was depressed. Fast Company’s 2010 list of the world’s most innovative companies did not include Nvidia; neither did Businessweek’s. Moving from Stanford to Nvidia looked like a lateral move -- 122 of 263 -- or maybe even a demotion. Intel, ten times bigger than Nvidia, was promising investors that there were methods to get around the leaky transistor problem; in repeated conference calls, Intel executives insisted that Moore’s Law was not dead. Around the time that Huang was trying to hire Dally, Intel made him a more lucrative offer. Dally considered it for a while, but ultimately chose to join money-losing Nvidia. He formally accepted his job as Nvidia’s chief scientist in January 2009, right as the stock dipped into single digits. Jim Plummer, the dean of Stanford’s engineering school and a member of Intel’s board, questioned Dally’s sanity. “Bill, you’re crazy,” Plummer said. “Intel is going to crush Nvidia.” Dally was undeterred. “Jensen’s just one of these people who’s a natural leader,” Dally said. “You want to follow him wherever he’s going.” • • • D N R, growing it to more than three hundred people. Nvidia’s ongoing advantage was that it would provide exponentially more computing power per dollar as time went on. Dally chose research projects that would intersect with that slope, a portfolio that would in time include robotics, automobiles, climate modeling, and biochemistry. In academia, Dally had enjoyed unlimited time to pursue his eccentric passions. At Nvidia, he was bound to the rhythm of the six-month GPU release cycle, but to his surprise he loved it. “There’s far less bureaucracy, because everybody in the company—their livelihoods, their jobs and their families—are depending on this GPU getting to the finish line on time,” he said. Dally believed that Nvidia could do more outreach to academic customers. A few months after he joined, Nvidia organized the first annual “GPU Technology Conference,” or GTC. The conference took place in 2009 at the Fairmont Hotel in downtown San Jose. Huang, who for many years had looked as if he bought most of his wardrobe with Kohl’s Cash, was in the midst of an evolving glow-up. He arrived in a tight-fitting black -- 123 of 263 -- shirt, boot-cut blue jeans, and black shoes with silver buckles. Huang, a natural performer, managed the crowd with the practiced ease of a stage hypnotist. “Welcome to the Woodstock of high-performance computing,” he said. Huang liked spectacle. He had once asked Adam Savage and Jamie Hyneman, the hosts of Mythbusters, to demonstrate the difference between serial and parallel computing. Their demonstration featured two devices rigged to shoot paintballs at canvas. The first was a remote-controlled robot that shot one paintball at a time, rendering a crude smiley face over the course of about a minute. The second was a stationary array of cannons that shot 1,100 paintballs at once, rendering a pixelated version of the Mona Lisa in a split second. The audience loved it—it was that kind of crowd. GTC operated in a similar register. The first night of the conference featured a masquerade charity ball for a local elementary school. (The allure of the event was marginally diminished by the attendees, mostly middle-aged men wearing lanyards.) Featured topics the following day included quantum chemistry, augmented reality, and modeling the behavior of black holes. One of the talks was titled “Unlocking Biologically-Inspired Computer Vision: a High-Throughput Approach.” The presenter, an MIT professor named Nicolas Pinto, had assembled a large variety of image- recognition applications—including several neural networks—and used CUDA to optimize them for Nvidia GPUs. He’d then asked the programs to identify characters and objects from video clips he’d culled from a collection of Law & Order DVDs. The best of the models, when blended together, could identify Jerry Orbach with almost 90 percent accuracy, even when shown a new clip of Orbach for the first time. In the paper accompanying the presentation, Pinto observed that the Nvidia chipset offered 1,356 times the performance of a comparably priced Intel CPU. Image recognition was a foundational problem for artificial intelligence —by teaching computers to recognize images, researchers were following the evolutionary trail toward more sophisticated capabilities. Around the time of the first GTC conference, paleontologists digging into a rock wall in Greenland discovered a five-hundred-million-year-old trilobite fossil whose -- 124 of 263 -- neural tissue had been miraculously preserved. Older than the dinosaurs, the creature resembled a horseshoe crab, with eye stalks extruding from its body. Run your finger down its petrified optic nerve and you would arrive at a tiny clump of cells, smaller than a grain of rice. Here was the earliest brain ever found. The fossil record showed that Pinto was on the right track—visual recognition had led to an explosion in biological intelligence and would soon do so again with computers. Unfortunately, his presentation, one of dozens that took place that week, did not receive much attention. Subsequent GTCs, held in 2010 and 2011, did not build on his insights, and in the first two years following publication, Pinto’s paper received only fifteen citations. Neural nets were a neglected branch of inquiry. So were trilobite fossils, for that matter. No one cared. • • • I , CUDA struggled. In the late 2000s, John Nickolls was once again diagnosed with melanoma. This time the disease proved fatal, and in 2011 he passed away. Dally, his competitive-skiing buddy, was devastated. So was Huang—when I asked him about Nickolls twelve years later, his face grew strained with emotion, and he changed the subject at once. Even when Nickolls was dying of cancer, he never stopped working. “I think some of his best, most productive years at Nvidia were during those times,” his son, Alec, said. Nvidia funded a scholarship at the University of Illinois at Urbana-Champaign in his honor. To his final breath, Nickolls insisted that CUDA would change the world, but he witnessed only a glimpse of what CUDA would become. The software that turned your graphics card into a supercomputer had been downloaded more than three hundred thousand times in 2009. Then interest declined for three straight years, bottoming in 2012 at slightly more than one hundred thousand new installs. The market for scientific computing looked saturated, and investors began to grumble that Nvidia’s sustained investment in CUDA didn’t make financial sense. “They were spending a -- 125 of 263 -- fortune on this new chip architecture,” Ben Gilbert, the cohost of Acquired, a popular Silicon Valley podcast, told me. “They were spending many billions targeting an obscure corner of academic and scientific computing, which was not a large market at the time—certainly less than the billions they were pouring in.” By 2012, the situation was becoming dire. Nvidia’s stock price had not appreciated in more than a decade, and although revenues and employment at the company had grown considerably, profits remained flat. Huang was bringing supercomputing to the masses, but the masses didn’t want it. In early 2013, Nvidia’s board received a letter from the activist investor Starboard Value, which had taken a small stake in the company. Jeff Smith, Starboard’s chief investment officer, targeted underperforming companies, demanding board seats and changes in strategy. When he encountered resistance, he usually tried to fire the CEO. Starboard’s letter, while formally agnostic on CUDA, gently questioned whether what Huang was doing made any sense. Other investment analysts believed that if Nvidia stopped investing its profits in CUDA and instead returned them to shareholders, Nvidia’s stock would trade higher. Some also questioned Huang’s continuing fitness for the role. Smith, forty-two, was youthful and energetic, with curly hair and a boyish face. He liked to question operational decisions in excruciating detail: he once managed to replace the entire twelve-person board of Darden Restaurants while holding less than 6 percent of the company’s stock on the basis of a 294-slide plan to turn around the struggling Olive Garden chain. Starboard’s Olive Garden slideshow became a legendary document among equity analysts, particularly slide 104, which criticized the restaurant’s breadstick strategy. (Historically, Olive Garden waiters would bring one breadstick for every guest, plus one for the table; they would then refill the breadstick container as needed. But over time the quality of service deteriorated, and servers just started dumping a bunch of breadsticks on the table, reducing the amount of food that customers ordered.) Slide 163 noted Olive Garden had also stopped salting the pasta water in a misguided effort to extend the life of the cookware. “How can management -- 126 of 263 -- of the world’s largest Italian restaurant chain think it is OK to serve poorly prepared pasta?” Starboard asked. Smith was good at playing the media, and he used the press to his advantage. In 2014, after he had forced eighty board replacements across thirty companies over the course of just three years, Fortune magazine called him “the most feared man in corporate America.” The perception of CUDA as a money pit was not obviously wrong—and Huang had killed off unprofitable business lines before, sometimes after spending years pursuing them. In the early 2000s, Nvidia had, for a time, made “northbridge” chips, which acted as a memory controller on the motherboard. After pursuing this market for several cycles, however, Huang realized he was in a race to the bottom with Intel. He scuttled the initiative and informed employees that he’d made a mistake. And while he was developing CUDA, Huang also invested in the graphics market for tablets and mobile phones. (In fact, in interviews from the early 2010s, Huang talks more about mobile phones than he does about supercomputing.) This was defensible—the mobile market was massive—but in 2011, Nvidia did something that made less sense, spending $367 million to buy Icera, a manufacturer of cellular modems. It was this misguided acquisition that had triggered Starboard’s alarming letter: the modem market was mature and was dominated by Qualcomm. Smith and his lieutenants believed that with CUDA, graphics cards, mobile chips, and modems, Nvidia had thrown too many balls into the air. They visited Nvidia headquarters in 2012, urging Huang to focus. The meeting was cordial, but the underlying threat of a proxy fight for control of Nvidia was there. Eventually, Huang came around to Smith’s point of view and abandoned the modem market. “If we’re fighting to the death in mobile, then we’re not doing something else, right?” Huang said. Huang had fought to the death before, and won, but the experience had scarred him, and he was disinclined to do so again. But Huang never considered abandoning CUDA—for in this unpopular market Nvidia was the only provider. Of the great many decisions Huang would make over the course of his career, the decision to double down on CUDA in the face of Jeff Smith was -- 127 of 263 -- the riskiest. Unlike gamers, supercomputing customers were fickle and constantly starved of cash. Academic customers were dependent on unpredictable research grants. Corporate R&D was subject to scrutiny from skeptical CFOs. Ambitious government-research programs were announced with ten-year investment schedules, then devolved into protracted bureaucratic wrangling about how the money was to be disbursed. Even other semiconductor executives, no strangers to risk, thought CUDA unwise. It was the bet that made Jensen Jensen; it was the gamble that set him apart. With the assistance of board member Jim Gaither—the same lawyer to whom Huang had once given all the money in his wallet—Huang organized a campaign to plead for his job. Nvidia’s largest shareholders were the East Coast mutual funds. The most important was Fidelity, which managed more than a trillion dollars in customer funds and owned more shares of Nvidia than Huang did. Huang flew to Boston to meet with them. The meeting went poorly; Fidelity “beat the crap out of us,” Gaither said. From Boston, Huang traveled to New York, meeting with a half-dozen other institutional investors. Huang did his best to persuade the giants to support CUDA, but he was grilled by skeptical portfolio managers. “It wasn’t clear that there was a path to a real breakthrough,” Gaither said. Huang retained the support of his board, most of whom had been with him since the company’s founding. But even here, there were the first whispers of discontent. “We were—look, you know, we were kind of going sideways,” board member Tench Coxe said. Dawn Hudson, a former NFL marketing executive, was named to the board shortly after Starboard’s letter arrived. “Nvidia did not have a great reputation when I joined,” she said. “It was a distinctly flat, stagnant company.” • • • A , Huang was suffering from empty-mansion syndrome. His son, Spencer, had left for an art academy in Chicago, with aspirations to become a photographer; his daughter, Madison, had enrolled in culinary school in -- 128 of 263 -- Paris; his beloved dog, Sushi, had expired. Working constant eighty-hour weeks, Huang had missed out on much of Spencer and Madison’s childhood. “If I’m being honest, Lori did ninety percent of the parenting,” he said. Typically, Huang spent one weekend day a week with his children, but even here he was often preoccupied. (Horstmann recalled visiting an amusement park with Huang, where he repeatedly sent his kids on the roller coaster so the two could discuss technical problems.) Horstmann also observed that neither Huang’s nor his own kids had initially gone into technical fields. “I think they tried to get out of this crazy work environment,” he said. “I think they looked at us, and said, ‘There’s got to be more to life than this.’ ” Huang, pained by their absence, tried to re-create the glow of family on their frequent visits home. He adopted two more dogs and, developing his culinary skills, often took to the kitchen, where he would improvise all manner of delicious food. Yet even this was not always a respite; Horstmann recalled a family gathering around the time of the Fidelity cross- examination when Huang botched a complex dish he was preparing. Standing in his million-dollar kitchen, with his daughter who had trained at Le Cordon Bleu there to help, Huang exploded and began to scream at his inadequate equipment. “I think we all understood we had to get out of the kitchen,” Horstmann said. “It was just time for Jensen to yell at his stove.” • • • I , Huang has cited a visit to the office of Ting-Wai Chiu, a professor of physics at National Taiwan University, as giving him confidence during this time. Chiu, seeking to simulate the evolution of matter following the Big Bang, had constructed a home-brew supercomputer in a laboratory adjacent to his office. Huang arrived to find the lab littered with GeForce boxes and the computer cooled by oscillating desk fans. “Jensen is a visionary,” Chiu said. “He made my life’s work possible.” -- 129 of 263 -- Chiu, who had used gaming cards to build his machine, was the model customer. By shipping CUDA on the retail boards, Nvidia was marketing to even the most meager scientists—mad scientists, basically, whose research was so disfavored they couldn’t afford a workstation. Here, Huang was once again following the gospel of Clayton Christensen. Disruptive technologies, Christensen had observed, often grew out of hobbyist communities. They were developed using “bootlegged resources” in which “off-the-shelf components” were redeployed for something other than their intended purpose. They started out wonky but rapidly improved along attributes of performance that established players ignored. But even once you had absorbed this lesson, it wasn’t easy to implement. Pursuing niche markets cost profits, making investors question your sanity. This, too, Christensen had foretold: “One of the reasons managers at established firms find it difficult to serve emerging markets is that their investors and customers tell them not to.” That was the real secret of The Innovator’s Dilemma, which readers often missed. It was not a book about how to succeed; it was a book about how not to fail. Christensen’s book wasn’t a how-to for start-ups but a counterinsurgency manual for senior managers at stagnating firms. Thirteen years in, Huang felt that Nvidia was at risk of becoming such a firm, and it was as much paranoia as optimism that led him to pursue the mad-science market. “There was a risk in shipping CUDA with every card, but there was also a risk in not doing it,” Huang said to me in our first meeting, but it was only after researching his company for months that I came to understand what he meant. He was referring to the risk that someone else might do it— some small, hungry business, operating out of a dingy office next to a Chinese restaurant and a frequently robbed bank, willing to serve marginal academic customers for years, with limited profits and no clear future prospects, all in the hopes of one day doing to Nvidia what Nvidia had done to Silicon Graphics. It was a risk that only a disciple of Christensen would recognize. One frustrating thing about Huang, though, was that even when you thought you agreed with him, he turned around and disagreed with you. -- 130 of 263 -- When I mentioned the impact Christensen had on his company, he immediately contradicted me. “You have to absorb his book and its lessons,” he told me, “but Christensen got as much wrong as he got right. There’s much more to it.” Christensen’s marginally profitable customers were dirt bikers and trench diggers. Huang’s marginally profitable customers were scientists. They were scientists engaged in research, and in serving them, it was just possible he might enable one to change the world. Lateral technology transfers of this type had happened before. In the early 1600s, Dutch craftsmen working in the spectacles business realized they could rearrange their eyeglass lenses to view distant objects. (One story credits the discovery to two children trying to observe a weather vane.) The lenscrafters flooded the Dutch patent office with designs for telescopes, and within a year, Galileo was pointing one toward the heavens, becoming the first human to describe the phases of Venus, the moons of Jupiter, and the rings of Saturn. Made from modified eyeglass lenses, Galileo’s telescope had less magnifying power than a pair of modern bird- watching binoculars, but it forever changed our understanding of the universe and our place within it. By shipping low-budget supercomputers to the mad scientists, Huang hoped to enable a similar revolution. Huang did not have a concrete vision of what the future of technology would look like. Some technologists did; for example, Elon Musk began with a vision of himself standing on the surface of Mars, then worked backward to build the technology he would need to get himself there. Huang went in the opposite direction; he started with the capabilities of the circuits sitting in front of him, then projected forward as far as logic would allow. Only there, at the frontier of reason, would he allow himself to take a single step forward into the nebulous realm of vibes. “What Jensen does is beyond focus,” Horstmann said. “I would call it resonance.” To achieve this resonance, Huang engaged in constant interactions with his customers and his employees. At his conferences, he put the press in the back and the scientists in the front, and allocated his attention accordingly. His frequent visits with low-level employees weren’t just to boost morale, but to feel the pulse of his company against his -- 131 of 263 -- fingertip. “I remember times at LSI when we almost didn’t have to do anything, because we were reading our customers’ minds,” Horstmann said. “And with parallel computing it was the same: through his discussions with his customers, with his employees, he could feel that resonance. He could see it was time.” The breakthrough was coming—Huang sensed it. He sensed it through his discussions with researchers and by their astonishment at the speed-ups his technology unlocked. He sensed it from the obsessive enthusiasm of brilliant employees like Bill Dally and John Nickolls and Ian Buck. He sensed it enough to torpedo his profits; he sensed it enough to compromise his core product; he sensed it enough to risk his job. It might not be from a quantum physicist like Chiu, specifically, but Huang was certain that somewhere out there was some lunatic whose ideas CUDA would prove right. Somewhere out there was some graduate student who would skip the grant-application circus to buy an Nvidia GPU with his housing stipend and usher in a revolution. Somewhere out there was some neglected branch of science waiting to harness the firepower of CUDA to shatter the paradigmatic frame. Huang just had no idea what it was. -- 132 of 263 -- I ELEVEN AlexNet f Alex Krizhevsky could have turned himself invisible, he probably would have. The talented computer programmer had an almost pathological aversion to attention. He was a small man and a slight one, with pale skin and ruddy orange hair. He revealed few details about his private life to colleagues, even to some who had known him for years. Geoffrey Hinton, his PhD adviser, could tell me little about him, save one important detail: “Alex was probably the best programmer I ever met.” Hinton first met Krizhevsky in the late 2000s, when he was a graduate student living at home with his parents and attending the University of Toronto. Krizhevsky had been born to a Jewish family in the Soviet Union, in what is today the imperiled territory of eastern Ukraine. He had immigrated to Canada when he was young, and although his native tongue was Russian, he spoke English flawlessly, albeit infrequently. His affect reminded Hinton of a wizened espionage agent who had seen much and revealed little; he was able to convey a penetrating depth of intelligence in just a few words. One day this enigmatic figure appeared uninvited at Hinton’s office. “He came to me and said, ‘I’m the top student in software engineering, and it’s boring,’ ” Hinton recalled. Could he join Hinton’s group? The request was presumptuous: Hinton was a legendary academic who’d spent years developing neural networks. He was one of the coauthors on the seminal 1986 “backpropagation” paper and had championed this approach in the -- 133 of 263 -- face of indifference and even hostility from mainstream AI researchers for decades. But Hinton’s crusade had taken him far from the mainstream centers of computer science like Stanford and MIT. Snowy Toronto was not the first place you thought of when you thought about tech. It probably wasn’t even the tenth. Hinton had minimal funding for his research, and before he agreed to accept Krizhevsky as a student, he cautioned him that neural networks were deeply out of favor. Even though Hinton’s group was producing results competitive with conventional approaches, their work was often rejected for publication. “Neural nets were regarded as nonsense,” he told me. The bias against neural nets, Hinton felt, was “ideological,” a word he pronounced in the same venomous tone that Huang had used to say “political.” The ideology of the research community at the time was that it was not enough that AI be useful. Instead, AI should somehow “unlock” the secrets of intelligence and encode them in math. The standard, 1,100-page AI textbook of the time was a survey of probabilistic reasoning, decision trees, and support-vector machines. The neural nets got just ten pages, with a brief discussion of backgammon up front. When Hinton’s colleague designed a neural net that outperformed state-of-the-art software for recognizing pedestrians, he couldn’t even get his paper admitted to a conference. “The reaction was well, that doesn’t count, because it doesn’t explain how the computation is done—it’s just not telling us anything,” Hinton said. Hinton countered that nobody understood how to mathematically describe the way biological brains processed language, either, but this argument didn’t get him anywhere. The AI community of the time didn’t want to mimic intelligence—they wanted to solve it. Hinton thought trying to solve for the function of the brain was a little absurd, like trying to solve for the function of the kidney, but he couldn’t get traction with this argument. So to disguise what he was doing and to better secure funding, he and other neural-net researchers described their work as “machine learning” or sometimes “deep learning”—anything but “AI.” -- 134 of 263 -- Krizhevsky, undeterred, joined Hinton’s group. Hinton paired him with Ilya Sutskever, another Russian-speaking Jewish immigrant from the former Soviet Union. Despite the biographical similarities, Sutskever looked and acted nothing like Krizhevsky. He was athletic, with dark, bushy eyebrows, deep-brown eyes, and a wicked smile. Hinton’s most fervent acolyte, Sutskever argued that neural nets would one day outpace human intelligence, a claim that at the time even Hinton didn’t make. “Ilya likes to say outrageous things, but he can get away with it because he’s so open and honest,” Hinton said. “He’s sort of unconstrained by convention. He believes in himself. And he’s right.” Hinton gave Sutskever and Krizhevsky an ambitious assignment: using Nvidia GPUs, he wanted them to teach a computer how to see. CUDA had made its way to Hinton’s laboratory. In 2008 he had tasked graduate students Abdelrahman Mohamed and George Dahl with building a speech- recognition module using an expensive Nvidia server. By the beginning of 2009, Mohamed and Dahl’s neural net rivaled the best mathematical models in existence. Hinton, speaking at the Neural Information Processing Systems conference later that year, told his audience that running neural nets on parallel-computing processors was the future of AI, and that the researchers should drop whatever they were doing and buy Nvidia GPUs. He then sent an email to Nvidia: “I just told 1,000 machine learning experts at this conference that they should all go buy Nvidia cards. Would you give me one for free?” Nvidia declined. Although the company was pursuing a great number of supercomputing applications, at the time not a single dedicated AI researcher worked there. Machine learning was not among the potential applications Kirk had proposed for parallel computing in his textbook, and Hinton sometimes couldn’t even get the CUDA group to return his emails. The bias against neural nets was long established; in introductory AI courses, one would still sometimes hear professors claim that neural nets couldn’t even resolve simple logic functions, even though backpropagation had overcome this limitation decades before. -- 135 of 263 -- Hinton figured the only way to get Nvidia’s attention was not just to equal his rivals but to crush them. Krizhevsky and Sutskever seemed the most likely in his group to accomplish this. The two had a great deal in common, although Sutskever didn’t regard Krizhevsky as a friend, exactly —Krizhevsky was too private for that. But they were tuned to the same intellectual radio frequency, and even Hinton sometimes had a hard time keeping up. In office conversations, Hinton would ask them a question, and Krizhevsky and Sutskever would turn to each other to discuss it in Russian before turning back with the answer, which was inevitably correct. Hinton wanted Krizhevsky and Sutskever to build an image-recognition system using a “convolutional” neural net, which employed mathematical filters to focus on key details in a picture. He encouraged the two to think big; he needed them not just to win but to obliterate. Krizhevsky, with no background in the discipline, rapidly mastered parallel-programming techniques—something in his brain just clicked with the paradigm of driving to Starbucks from everywhere at once. “He managed to make those GPU boards do convolutional neural networks much more efficiently than anybody else,” Hinton said. “He really was a wizard.” In early 2012, Krizhevsky retooled an image-recognition network used for teaching exercises to run on CUDA. The GPU took just 30 seconds to train it. When Krizhevsky demonstrated his progress to Sutskever, Sutskever could not contain his excitement: the speed of the GPU was unprecedented, hundreds of times faster than anything he’d seen before. Sutskever had believed in the promise of neural nets from the moment he’d first learned about them. They just seemed like the obvious way computer intelligence would work. “If you allow yourself to believe that an artificial neuron is kind of like a biological neuron, then they should do everything we can do,” Sutskever told me. “And if you allow yourself to believe that they can be accelerated—well, then you’re training brains.” In the past, this approach had run up against the limitations of the hardware, but the GPU produced in half a minute what would have taken an Intel machine an hour and what would have taken biology a hundred thousand years. Sutskever immediately recognized they had to scale the -- 136 of 263 -- computing power available to Krizhevsky to the maximal degree—in other words, he had to make this new form of synthetic evolution run as fast as possible. This proved to be a keen and enduring insight. “Ilya sees things that other people take a long time to see more or less immediately,” Hinton said. The two graduate students pooled their money to build the fastest computer they could. This wasn’t much—with their combined funds, Sutskever and Krizhevsky could afford to purchase only two GeForce GTX 580s, gaming GPUs that retailed for about $500 apiece online. When the GeForce units arrived, they looked like props from the movie Alien. Each unit weighed around three pounds, with black casing, slime-green accents, and a circular vent for the powerful fan that kept the circuits cool. Beneath the case was the giant Nvidia chip, embedded in a black circuit board and surrounded by heat sinks, comprising three billion transistors arranged in thirty-two parallel cores. This was the power Krizhevsky needed; these were the transistors that could dance. After a few trial runs, Krizhevsky slotted the two GPUs into the desktop computer in his bedroom, then let them rip for a week. (“Actually, it was his parents who paid for the quite considerable electricity costs,” Hinton said.) Here, finally, was the customer Huang had dreamed of, the programmer so broke he could only afford to do his experiments on a repurposed graphics accelerator. Here was Krizhevsky, a weirdo recluse whom even his colleagues knew little about. Here was the mad scientist. Here was the iconoclast. Here was the man who would build CUDA’s killer app. To train his neural network, Krizhevsky used the ImageNet database, a collection of images assembled by the Stanford computer scientist Fei-Fei Li. Disappointed by the limited scope of training datasets available online, Li had assembled her own by hiring workers from Amazon’s Mechanical Turk service to manually label more than fifteen million images across twenty-two thousand categories. ImageNet was hundreds of times bigger than any comparable dataset; Li’s advisers had questioned the wisdom of the effort, but it turned out to be exactly what Krizhevsky needed. His -- 137 of 263 -- network had some 650,000 individual neurons, roughly the same number as a honeybee. Training such a large model required a massive amount of data. At the beginning of the week, the neurons were connected at random, but as the training progressed, they rearranged themselves into an intricate, beautiful pattern, slowly learning how to see. In the first nanosecond of training, Krizhevsky’s network was exposed to a randomly selected image from the dataset, then asked to assign it a label from one of Li’s thousands of categories. Perhaps the image was a stingray; perhaps it was a Scottish terrier; perhaps it was a golf cart. Whatever it was, the network had never seen it before—so in returning a label, it could only guess, and this guess was certainly wrong. But, in guessing wrong, the network had gained a little bit of information, however small, about what the image was not—not a stingray, at any rate. The network processed this information by retooling the connections between its neurons using Hinton’s backpropagation approach. This was the difficult part, as it involved repeated “matrix multiplication,” a mathematical operation that scientists compared to solving an unimaginably large Rubik’s cube. Past attempts to train neural networks had always faltered here, but Krizhevsky had CUDA, which leveraged the GeForce’s parallel architecture to crack this computational puzzle. Once the math was finished—in a fraction of a fraction of a second—Krizhevsky’s network was shown a second image. Then a third, then a fourth, then thousands more, then millions more. To “see” the images, the neural network did not actually process light. Instead, it was fed a stream of digits representing the placement and colors of a particular array of pixels. It then updated the grids of digits that represented the synaptic weights the network had assigned to different layers of interpretation. For this reason, skeptics would later argue that the neural network was “just doing math”—but this reductionist viewpoint was akin to saying the human retina was “just interacting with photons.” Sometime in the first couple minutes of training, Krizhevsky’s network, by pure chance, managed to label its first image right—let’s say it was a flowerpot. This grand success triggered an orgy of matrix multiplications as -- 138 of 263 -- the backpropagation scheme attempted to communicate to the neurons what distinguished this thing called a “flowerpot” from a “chimpanzee” or a “pool table” or a “dump truck.” The gain from success was marginal— shown a garbage can, the network was likely to guess “flowerpot” again. But shown an object of a totally different shape, like a whale shark, the network now knew just enough to make a distinction. This process was repeated many millions of times, every moment of the day, transforming Krizhevsky’s bedroom into a theater of hyperspeed evolution. The training of a neural network was a thing of wonder. Krizhevsky’s network had several “layers,” each of which slowly learned to distinguish among different aspects of the data. One layer learned shape, another color, a third the importance of symmetry, while snaking pathways for information wove the layers together into a unified, organic whole. Each time a new image appeared—a dragonfly, an hourglass, a mongoose, a container ship, a dirigible, a walking stick—those tendrils of information rearranged themselves into a more perfect mirror of reality. The cooling fans on the GeForce units ran constantly, at around forty- four decibels; the combined noise was not deafening but was enough to keep Krizhevsky awake at night. Slowly, the success rate of image identification ticked upward, starting at 0 percent, moving to 1 percent, then 10 percent, then 40 percent, then 60 percent, before flatlining at an 80 percent success rate. The finished network still had some weaknesses: it was especially bad at distinguishing between human tools and could not tell a spatula from a hatchet. Krizhevsky could have kept tinkering with it, but the best that any other method for image recognition had ever achieved was 70 percent. Li’s ImageNet group at Stanford ran an annual contest for AI image recognition, so as a sanity check, Krizhevsky tested his model against the prior year’s competition data, which the model had never seen before. It easily squashed all of that year’s entrants. In machine-learning circles, training on labeled datasets like ImageNet was termed “supervised learning,” so Krizhevsky called his neural network “SuperVision.” Hinton and Sutskever were astonished—just absolutely gobsmacked. “GPUs showed up, and it felt like a miracle,” Sutskever said, -- 139 of 263 -- his enthusiasm undiminished even ten years later. Under ideal circumstances, the two GeForce cards could execute a combined three trillion operations per second. Doing the math, that meant the GPUs had executed quintillions of distinct mathematical steps in just under a week. That was over a hundred billion years’ worth of human arithmetic now encoded into SuperVision’s fragile synthetic brain. “To do machine learning without CUDA would have just been too much trouble,” Hinton said. Krizhevsky decided he would introduce SuperVision to the world by winning ImageNet’s 2012 competition. In the weeks leading up to the event, Sutskever and Hinton began to pace the Toronto laboratory in giddy anticipation. “We knew we were going to win it,” Hinton said. They were the first to experience what would soon become a common phenomenon: the uncontainable thrill of sneak previewing embargoed AI technology that would shock the world once unveiled. Theorizing on the impact that SuperVision would have on the world, the researchers discussed autonomous robots, self-driving cars, and self-coding computers. The three men regarded AI as a purely positive force for progress—or they did at the time, at least. The Toronto research group began to see something else as well: if SuperVision had benefited from Nvidia, Nvidia was going to benefit from SuperVision even more, for the neural net’s demands for increased parallel- computing power had no foreseeable limit. “It was pretty obvious to us, even before we submitted, that going forward a large fraction of scientific computation was going to be machine learning,” Hinton said. • • • W F-F L first saw the SuperVision results, she wondered if they were in error. Li’s ImageNet contest had been her attempt to prove the value of her efforts to her advisers, but after attracting thirty-five entrants in 2010, participation had declined to just fifteen entrants in 2011. In 2012, there were only seven, and it was not clear the contest would survive another year. -- 140 of 263 -- Now, one of those seven entrants was demonstrating a success rate above 80 percent—10 percent better than the state of the art in a field where improvement was typically measured in fractions of a percentage point. Stranger still, the winner was a neural network, a technology that Li considered to be a museum artifact. “It was like being told the land speed record had been broken by a margin of a hundred miles per hour in a Honda Civic,” Li recalled in her autobiography. Born in China, Li was an only child who had moved to New Jersey as a teenager. In high school, she was something of a dreamer, struggling to acclimate to American culture while helping her parents make ends meet. A friendly math teacher encouraged her academic development, and with his tutelage, she was offered a full scholarship to Princeton University. She’d majored in physics, then pursued a PhD in electrical engineering, with ambitions of teaching a machine to see. But now, presented at last with a machine that actually could see, Li couldn’t quite believe it was real. She asked her staff to double-check. “I talked to the guy who computed the results, and he thought there was a bug to begin with,” Hinton said. Li’s thesis adviser emailed Hinton to ask if he was absolutely, 100 percent sure his researchers hadn’t accidentally contaminated his model by training on the contest data. “We had to check it several times before he believed that the computed results were correct,” Hinton said. Gradually, Li began to accept reality: her dying AI pageant had just crowned Miss Universe. The official ImageNet results were published in October 2012, with researchers scheduled to discuss their models at an academic conference in Florence, Italy, later that month. Li, having just given birth, had planned to skip the proceedings but changed her mind upon seeing the SuperVision results. She had to meet the genius behind the tech. When Li arrived in Florence, Alex Krizhevsky didn’t respond to her texts, and she began to wonder if he was even coming. Had he bailed? Was he lost in the Uffizi? But on the morning of the conference, Krizhevsky showed up without notice, safe in an oversized zippered sweatshirt with -- 141 of 263 -- black glasses and a bushy hairdo. This was the architect? He looked like a teenager. Krizhevsky was the final presenter of the day. The quality of his presentation was directly inverse to the importance of his results. As he opened his remarks, his voice suddenly cracked into a high and nasal register, and he gave a nervous cough while casting his eyes downward in embarrassment. He hurried through a series of black-and-white slides, rarely raising his eyes from the podium. One slide detailed SuperVision’s impressive specifications: 650,000 neurons, 60,000,000 parameters, and 630,000,000 connections. “It was trained, actually, in my bedroom,” Krizhevsky told the assembled crowd. “It’s pretty big for the type of model you can train in your bedroom.” Krizhevsky’s short talk ended with no indication that he’d just revolutionized computer science. “That’s it,” he said, in conclusion. “That’s all I have.” He then opened the floor to a series of surprisingly hostile questions. Or perhaps not so surprising: the unspoken implication of Krizhevsky’s presentation was that it was time to throw all that fancy AI math in the trash. Krizhevsky was telling the assembled academics they had so far wasted their careers—in some cases, decades of research was going to be abandoned. Was it so much to ask Krizhevsky to properly present his findings? The other researchers, it seemed, were exorcising their frustrations, having lost to the man who’d cracked the hardest problem in computer science with a gaming rig in his childhood bedroom. • • • T F notwithstanding, the AI community embraced the SuperVision results. “A bunch of senior researchers more or less immediately said, these results are amazing, we were wrong, neural nets really do work,” Hinton said. The 2013 ImageNet competition was overwhelmed by neural-net submissions, and by 2014, all of the more than forty contestants were using this approach. The accompanying academic paper for the SuperVision network, credited to Krizhevsky, Sutskever, and -- 142 of 263 -- Hinton, has to date been cited more than 150,000 times, making it one of the single-most-important findings in the history of computer science. Krizhevsky pioneered a number of important programming techniques, but his key finding was that a GPU could train neural networks hundreds of times faster than a CPU could. Inside Google, a Polish researcher named Wojciech Zaremba was tasked with replicating SuperVision. As his network, WojNet, began to percolate throughout the industry, Hinton feared that the modest Krizhevsky wouldn’t get credit for his breakthrough. He encouraged Krizhevsky to rename SuperVision so that it emphasized his contributions; although he questioned the utility of the exercise, Krizhevsky complied. (“Alex isn’t into things like branding,” Hinton said.) Henceforth, SuperVision was known as “AlexNet.” Hinton need not have worried, however—Big Tech had suddenly grown extremely interested in the backwater University of Toronto computer science department, and the days of funding shortages were over. The AlexNet team was bombarded with acqui-hire offers. At Sutskever’s insistence, Hinton incorporated a start-up venture named DNNResearch, with Hinton, Krizhevsky, and Sutskever each holding one-third of the shares. DNNResearch had no customers, no board, no revenues, and no website. It had nothing except the collective brainpower of the three men who’d cracked the code. That was enough. In December 2012, while attending a research conference, Hinton conducted an auction via email to sell this “company.” Working out of a seventh-floor hotel room in Lake Tahoe, the AlexNet team realized they were about to strike it rich. Microsoft and a London-based AI start-up called DeepMind submitted early offers, but both dropped out after a few rounds, resulting in a final bidding war between Google and the Chinese tech giant Baidu. As the bids crossed $20 million, the three researchers occasionally strolled over to the window to admire the wooded Sierra Nevadas, dusted with snow. When Google’s offer reached $44 million, Hinton, with the endorsement of Sutskever and Krizhevsky, cut the auction off and took the money. -- 143 of 263 -- Google, the three felt, was a more natural cultural fit than Baidu. AlexNet, the neural network that Krizhevsky trained in his bedroom, could now be mentioned alongside the Wright Flyer and the Edison bulb. “That was a kind of Big Bang moment,” Hinton said. “That was the paradigm shift.” -- 144 of 263 -- PART II -- 145 of 263 -- B TWELVE O.I.A.L.O. ryan Catanzaro stuck out at Nvidia. In the hive of STEM drones he was the dreamer. His hair was long, and he dressed like a jester, with statement eyeglasses and loud, tacky shirts. The first time we spoke, he was wearing a rainbow T-shirt decorated with JPEG compression artifacts; the second time we spoke, he was wearing a sweater embroidered with an owl. He was patient and kind, and he spoke in a soothing, gentle voice. He was the only Nvidia engineer I met who had a humanities degree. Catanzaro had grown up in the Mormon Church, and a year after graduating high school, he began working as a missionary in Siberia. For two years he spoke nothing but Russian. “I was very, very committed,” he said. While there, he read Crime and Punishment in the original, a moving experience for a germinating existentialist. He returned to Brigham Young University and earned a degree in Russian literature. “My favorite writer, of course, is Dostoyevsky,” Catanzaro said. “Dostoyevsky, Tolstoy, and Pushkin, they’re the top three for me.” Simultaneously, Catanzaro pursued a dual degree in computer engineering. (He had a good sense of the economic value of his literature degree.) In 2001 he was hired as a summer intern at Intel, where he was asked, as an exercise, to design a microchip that could pulse at ten billion beats per second. Doing the math, Catanzaro concluded the question was a setup: such a chip could never be built. He presented his findings to a group of senior engineers. “You must have done your work wrong,” his supervisor said. “This is part of Intel’s road map.” Catanzaro was stunned. He double- -- 146 of 263 -- checked his calculations but could find no error. The transistors were getting too small, the end of Moore’s Law was approaching, and Intel was ignoring it. “I mean, I was just an intern, right?” he said. “But I could see that traditional computer architectures were running into the wall.” Catanzaro was convinced the solution was to redesign the microchip anew. He cofounded the UC-Berkeley Parallel Computing Lab in the mid- 2000s, along with several colleagues. There, Catanzaro made a list of existing parallel applications. The business problem, he could see, was that even for the supposedly hungriest customers, the demand for computing power was capped: once you sold an oil prospector a supercomputer, you saturated demand for years. What you needed, Catanzaro figured, was an application that was so hungry for computation that it could never be satisfied. You needed another application like 3D graphics that demanded more computer power once its initial needs were fulfilled. Eventually, Catanzaro deduced what had to be parallel computing’s killer app. “The answer to that was AI,” Catanzaro said. “I came to AI from the bottom up. I came from a circuits perspective. I felt it was just inevitable that AI was the most important computational workload.” Catanzaro’s Berkeley advisers were reluctant to support his AI efforts. To many computer scientists, trying to build an AI was like trying to find Bigfoot. Neural nets, in particular, were viewed by mainstream researchers with something like contempt. “The thought was, you know, the computing industry is full of eccentric characters, and these guys are out there doing this old thing, it’s very eccentric, and it doesn’t work,” Catanzaro said. When Catanzaro told me this story, he combed his fingers downward through his shoulder-length hair and shook his locks. I sensed his frustration: before anyone, he had had the insight to combine AI with parallel computing, but his professors had steered him away from using the right tool. Instead, Catanzaro fooled around with the fancy math and published a few unexciting research papers. In the meantime, while pursuing his Berkeley PhD, he still had to pay the rent. He and his wife had three children, and the family was living in the Bay Area on a graduate student’s -- 147 of 263 -- income. Desperate, Catanzaro applied for every available corporate fellowship. He cycled through eight internships in six years, each one paying just enough to save his family from eviction. The upside of the experience was that it gave him a tour inside the titans of silicon. First, Catanzaro returned to Intel. By this time, Intel could see that Nvidia posed a threat, and Catanzaro was assigned to assist with “Project Larrabee,” a graphics chip that Intel internally referred to as the “GeForce killer.” Such bravado proved toothless—the Larrabee hardware was repeatedly delayed by infighting, then killed off before launch. Catanzaro believed that management wasn’t passionate about advancing computing technology. “To them, Intel might as well have been a machine that made soap,” he said. Catanzaro then interned at Qualcomm, the San Diego chip designer that had built much of the infrastructure for the modern cellular phone. Qualcomm was well-managed, and the pay was great, but Catanzaro was put off by their vocal denigration of rivals. “They kept telling me that Nvidia was a horrible place to work and that their CEO was some kind of tyrant,” Catanzaro said. This did not accord with Catanzaro’s own experience at Nvidia, where he had also worked. It was Catanzaro who had compared interacting with Huang to sticking a finger in the electric socket—but it was also Catanzaro who emphasized that Huang was not a man selling soap. He was a man whose passion for computing was not to be surpassed, and if there was anyone Catanzaro could convince about the coming intersection between parallel computing and AI, it was Huang. After Catanzaro was awarded his PhD, in 2011, he chose Nvidia. Catanzaro joined Bill Dally’s expanding group at Nvidia Research, an experience he compared to graduate school, “only bigger and better.” The place had an academic vibe, with researchers free to pursue their passions and collaboration with other corporate research groups encouraged. Dally published many of his discoveries in academic journals for public consumption and with no financial reward. Often, he coauthored papers with engineers at AMD and Intel. Dally’s openness surprised a lot of people -- 148 of 263 -- and sometimes led to pursed lips inside Nvidia, but Dally was playing the long game: he figured it was better to advertise what he was doing to other leading scientists so that they would come to work alongside him. “We’ll get the best academics to join the company because they’ll see our publications,” he would say. “The quality will speak for itself.” One of those academics was Catanzaro. Although he was initially assigned to study programming languages, Catanzaro soon became Nvidia’s first dedicated AI researcher. Dally was hearing whispers about progress in neural-net technology and decided it was an area he couldn’t ignore. In 2012 he farmed out Catanzaro to his former colleague Andrew Ng, a Stanford professor who worked for Google. Ng had developed similar technology to AlexNet in Mountain View—only he’d done it using conventional computing architecture. This was expensive; using a cluster of two thousand CPUs, Ng had fed thumbnails from ten million YouTube videos into a neural network in an attempt to teach it how to identify a cat. The project was costly, and the power draw was ruinous, but at the end of the training cycle, Ng’s neural net had synthesized a striking internal conception of the feline phenotype, which Ng extracted and distributed to the press. The computer’s impression of a cat was featured in a widely circulated article in The New York Times in June 2012. For Dally, the interesting finding was not that the neural network could recognize animals but that it had taken so much computation to get there. Dally deputized Catanzaro to repeat the cat experiment using Nvidia hardware. Catanzaro was able to do it with just twelve GPUs. • • • T fast now—everyone was clustering in the Valley. Following the Google acquisition, Krizhevsky, Hinton, and Sutskever relocated to Mountain View, where they instigated a parallel-computing insurrection. When Krizhevsky was offered the use of Google’s massive CPU cluster, he declined, instead purchasing a commodity PC and a couple -- 149 of 263 -- of retail Nvidia cards and installing them in an office closet. Soon, other researchers at Google were disconnecting from Google’s sprawling archipelago of data centers—probably the largest private collection of computers in the world at the time—to run gaming hardware under their desks. Catanzaro, sensing something in the air, returned to Nvidia to ask for more resources. Initially, he was turned down. With his sensitive disposition and the stigma of a humanities degree, Catanzaro was not the model Nvidia employee. “My reviews at Nvidia were not very good,” he told me. “My pay wasn’t very good either.” Undeterred, he began to work full time, by himself, on building cuDNN, a software library that would accelerate neural-network development on the CUDA platform. It was a struggle at first. Catanzaro was a researcher with no experience in practical software engineering. His fourth child had just arrived, and he wasn’t getting much sleep at home. He was having health issues, and his medication was making him feel “kind of dumb.” When he presented his prototype for cuDNN to Nvidia’s software team in early 2013, they panned it. Catanzaro began to second-guess himself. “I don’t think that my managers really thought I was doing important work,” he said. “It just wasn’t coming together.” Catanzaro decided to make his case directly to Huang. Machine learning technology did not seem to be on his dashboard; at the 2013 GTC conference in March, Huang spoke of weather modeling and mobile graphics, and he didn’t mention neural nets at all. (He had, however, worn a leather jacket to the conference for the first time—a bulky, ugly thing. His look was still evolving.) To Catanzaro’s surprise, Huang was immediately intrigued. Following their first meeting, Huang cleared his schedule and spent an entire weekend reading about AI, a subject about which he knew little. Another meeting soon followed, where Catanzaro found that his boss now knew as much—perhaps more—about neural nets as he did. Huang’s CUDA bet had taken the company far into uncharted waters. For a decade, he had stood at the prow of the ship, scanning for land. Now it was as if he had found Atlantis. He threw himself into research and phone -- 150 of 263 -- calls, and the more he learned, the more his excitement grew. By the middle of 2013, Huang was vibrating with wild-eyed, resonant intensity. He called Catanzaro into the conference room he was using as his office and told him that he considered cuDNN to be the single most important project in his company’s twenty-year history. The whiteboard on the wall had been cleared of diagrams; in its place, Huang had written the cryptic acronym “O.I.A.L.O.” in perfect lettering. This, Huang said, stood for “Once in a Lifetime Opportunity.” He asked the dumbfounded Catanzaro to participate in a thought experiment. “He told me to imagine he’d marched all eight thousand of Nvidia’s employees into the parking lot,” Catanzaro said. “Then he told me I was free to select anyone from the parking lot to join my team.” • • • H a while to warm up to ideas. “With parallel computing, it really took us a fair amount of convincing to talk Jensen into it,” Kirk recalled. “Same with CUDA. We really had to make the business case.” But with AI, Huang experienced a Damascene epiphany. “He got it immediately, before anybody,” Kirk said. “He was the first to see what it could be. He really was the first.” Huang told me he was just reasoning from first principles. “The fact that they can solve computer vision, which is completely unstructured, leads to the question ‘What else can you teach it?’ ” Huang said. The answer seemed to be: everything. Huang concluded that neural networks would revolutionize society and that he could use CUDA to corner the market on the necessary hardware. He announced that he was betting the company. “He sent out an email on Friday evening saying everything is going to deep learning, and that we were no longer a graphics company,” Greg Estes, a vice president at Nvidia, said. “By Monday morning, we were an AI company. Literally, it was that fast.” Huang had turned fifty a few months earlier. Although his hair was beginning to gray, he retained a boyish eagerness and roamed the halls of -- 151 of 263 -- his company with vigor, frequently stopping to quiz junior employees on their work. As his company grew larger, Huang began holding quarterly all- staff presentations. He could talk extemporaneously for more than two hours at a time, and in these presentations he would often revisit the same themes: the importance of the “speed of light” scheduling concept, the pursuit of the fabled “zero-billion-dollar market,” and above all, the ever- present danger of creeping bureaucracy. As Nvidia grew, Huang maintained an agile corporate structure, with no fixed divisions or hierarchy. The C-Suite was essentially just him, with no COO, no CTO, no CMO, and no obvious second-in-command. Huang didn’t even have a chief of staff. Instead, he had more than thirty people reporting to him directly, most of them given fluid responsibilities under the all-encompassing title of “vice president.” The parking lot thought experiment he’d practiced with Catanzaro reflected his belief that his company might need to kaleidoscopically reorganize itself at any time. “I need all of you to be ready,” he would tell his assembled executive staff. “You never know when you might suddenly become the most important person in this company.” • • • T in the company in early 2014 was Catanzaro, a cultural misfit with consistently poor performance reports. Now managing a team of engineers, Catanzaro stripped cuDNN down to its most essential task: accelerating evolution. In human brain tissue, each neuron maintains an average of around one thousand synaptic connections with other adjacent neurons. The brain alters these connections through chemistry. The neural net alters them through matrix multiplication. Matrix multiplication combines the numbers in one grid with those in a second to produce a third. The rules for populating the new grid are straightforward, but as the matrices grow large, the number of required calculations explodes. This makes the operation a good candidate for -- 152 of 263 -- parallelism, but before the arrival of the neural network, matrix multiplication had not been a priority at Nvidia. The chief custodian of CUDA’s matrix multiplication library was Philippe Vandermersch, a somewhat cantankerous Frenchman who rode his bicycle to the office every day. Vandermersch had rescued the orphaned software package after it was left unfinished by its previous developers. He then spent several years fruitlessly attempting to convince global-warming researchers to upgrade climate simulations written in ancient Fortran code to his modern implementation. Few scientists wanted to put in the effort. “Those guys in the lab, honestly they could be a little lazy,” he said. But with neural nets, there was no switching cost. Most of the code was being written for the first time, by brilliant and motivated programmers who valued speed above all. These were the power users that Huang had long envisioned: as the neural-net community coalesced around CUDA, they became lifetime customers for Nvidia chips as well. Vandermersch, picked out of the parking lot, joined Catanzaro’s team as his ace programmer and optimized his function library to meet the needs of AI. (In his telling, at least, Vandermersch also did a lot of the technical work. “You could say Catanzaro was the Jobs of cuDNN,” he said. “I was the Wozniak.”) Over time, Nvidia’s programmers found clever ways to speed these matrix operations up. One thing they noticed early on was that most of the weights in a neural net were clustered between plus and minus one. Numbers outside that range could often be truncated, speeding up operations and compacting size. Another thing they noticed was that, even between plus and minus one, weights didn’t need to be represented so perfectly. Just like in the brain, the neurons were “fuzzy,” maintaining loose synaptic connections rather than precise ones. Sometimes it was enough just to get the sign right. AlexNet had used 650,000 neurons to represent 630,000,000 synaptic connections. At that scale, any one synapse hardly mattered. Serial code was so finicky that sometimes a single misplaced semicolon could crash an entire operating system, but for neural nets a bad weight was just one data point among millions. For this reason, as they developed cuDNN, Nvidia -- 153 of 263 -- programmers rebalanced the trade-off between precision and speed. Good neural-net software, they reasoned, should favor the latter. AlexNet’s brain was the size of an insect’s, but forthcoming neural nets would be larger. As they grew, training would not be the only concern; it was equally important that users be able to quickly get answers. (Imagine an oracle who knew all the answers but spoke only one word per hour.) This inference process used less computing power than the training stage, but over time it grew to be a significant portion of the cuDNN library. Huang followed Catanzaro’s progress with great interest. The two met frequently, and Huang had to repeatedly apologize to Catanzaro for mispronouncing his last name. At many firms—most, perhaps—Catanzaro’s cuDNN library would have been taken away from him and handed to a seasoned product manager. At Nvidia, Catanzaro, who’d never developed a single piece of commercial software or even really managed anyone, was put in charge of the company’s flagship product. Catanzaro himself questioned if he was the right person for the job, especially as his production deadline neared, and Huang, having rewarded innovation with loyalty, now rewarded tardiness with abuse. “Jensen is not an easy person to get along with all the time,” Catanzaro told me. “I’ve been afraid of Jensen sometimes. But I also know that he loves me.” As the launch date neared, Catanzaro privately worried that his boss was falling into an age-old trap. If there was one field that had a worse performance record than parallel computing, it was AI. Going back to the 1950s, AI technology had gone through repeated hype cycles that ended with embarrassing busts. Catanzaro, like all researchers in the field, was keenly aware of AI’s traumatic past encounters with commerce, and he worried that AI was preparing to disappoint investors once again. But he said nothing to Huang—the opportunity he’d been offered was just too good. Anyhow, if Catanzaro could remember the past, Wall Street could not. Almost three decades had passed since the expert-systems crash of the 1980s, a span of time beyond the memory of all but the most durable equity analyst. As the buzz about neural networks spread, Nvidia’s stock moved -- 154 of 263 -- higher, and Starboard Value stopped sending letters. “They doubled their money and disappeared,” Coxe said. By early 2014, cuDNN was preparing to ship. Huang took the stage at GTC 2014 to promote the work, and for the first time in its twenty-one-year history, Nvidia was publicly associated with AI. (Huang skipped the leather jacket this time, instead wearing black slacks and a sleek, dark-blue polo shirt that was unbuttoned to the chest. His power was growing.) Huang showed off Ng’s network, with its internal conceptions of a cat and a human face. He walked the audience through several early-stage Nvidia AI initiatives. Then, mispronouncing his name, he ceded the presentation to Catanzaro, who used a variant of AlexNet to identify dog breeds posted to Twitter. Operating in real time, the network identified a Dalmatian, a German shepherd, a vizsla, and a cairn terrier. (“I didn’t even know what a cairn terrier was,” Catanzaro said.) With these adorable posts, the long AI ice age came to an end. In his final slide, Huang revealed corporate partners who were beta- testing cuDNN, including Adobe, Facebook, and Netflix. The slide didn’t mention Google, an AI customer too important to be publicly identified. A few weeks before the dog show, Google had acquired DeepMind, the London-based AI company cofounded by Demis Hassabis, Mustafa Suleyman, and Shane Legg. DeepMind had grand ambitions to build the world’s first artificial “general” intelligence, or AGI, unlocking the puzzle of cognition once and for all. Already, the company was developing AlphaGo, a neural network that would, in a series of thrilling matches against grand master Lee Sedol in 2016, crack the Japanese game of Go, a long-standing milestone for AI. With Ng’s group, the AlexNet group, and DeepMind, Google had established an early AI monopoly. As Google’s AI efforts expanded, researchers began demanding GPUs. In late 2014, Google launched “Project Mack Truck,” a secret effort to build the world’s most powerful parallel computer. The finished product required more than forty thousand Nvidia GPUs, costing more than $130 million. It was by far the largest single purchase order Nvidia had ever received, but it -- 155 of 263 -- was only the start. As AI succeeded, parallel computing now emerged from a long and unforgiving winter of its own. The truth, not widely understood until later, was that the deep-learning revolution was as much a revolution in hardware as software. It was the product of not one but two unpopular, cast-off, discredited, and cash-starved technologies whose ideal form could only be revealed in synthesis. Neural nets running on parallel computers: these tightly coupled technologies were the twin strands of DNA for a new and powerful organism, looking to consume all the data in the world. -- 156 of 263 -- J THIRTEEN Superintelligence ens Horstmann had started to notice a change in his old friend. In the past, no matter how hard he worked, Huang had maintained a gentler side at home. He set aside time for his outside interests—his dogs, his whiskey library, and his collection of expensive automobiles, which now included a Tesla Roadster and a Koenigsegg supercar. Huang had also purchased a large, beachfront vacation house in Maui with a sunset view. He often hosted friends there, including Morris Chang of TSMC, with whom he’d grown close. Huang’s great outside passion remained cooking. For one of his birthdays, his friends had arranged for him to train under a Michelin-starred chef at the Four Seasons hotel. When he arrived in the morning, he was subjected to a round of hazing. “He was ridiculed by the real chef,” Horstmann said. “I mean, he’d done Denny’s, but this was something else.” Huang worked a twelve-hour shift, experiencing, for the first time in many years, the role reversal of having his boss yell at him. After serving his friends dinner, Jensen fell asleep in the passenger seat of the car on the way home. But as Nvidia moved into AI, Huang abandoned his hobbies. His appetite for mischief diminished, he stopped practicing table tennis, and the teppan grill went cold. He even stopped returning texts. “He was just so, so focused on work,” Horstmann said. “It was all he could talk about.” The conviction that Huang had been given a once-in-a-lifetime opportunity seized him. The acronym “O.I.A.L.O.” was repeated at every meeting. -- 157 of 263 -- From the day Huang had started his career, at twenty, he had worked relentlessly, putting in consecutive twelve-hour days, six days a week, for three decades straight. Now past fifty, and with his kids grown, he began to work even harder. In the past, to relax, Huang had enjoyed going to the movies by himself. He preferred big-budget popcorn flicks, particularly the Avengers movies, which he watched as much to grade the execution of the CGI as for the story. But many of the Marvel fanboys were also gamers, and Huang’s photo had been posted to the Nvidia subreddit enough times that he became a recognizable presence in the theater. Tired of audience members pestering him for selfies, Huang tried, for a while, to attend only ten a.m. showings, but even here he was noticed. Eventually, he stopped going. From 2014 onward, there was only work. There was only AI. • • • N’ went up 30 percent in 2013, 27 percent in 2014, and 66 percent in 2015. The last was enough to finally and permanently push Nvidia above the peak it had reached way back in 2001, when Nvidia was first added to the S&P 500 index. The timing of Nvidia’s inclusion was relevant; the index funds that controlled America’s retirement savings had been mandated to buy Nvidia stock the following trading day. For fourteen years, retail investors had backed this dog with nothing to show for it. Now it was time for everyone to get paid. On paper, Nvidia was still a gaming company, with retail GeForce sales supplying most of its revenue. Wall Street, looking forward, began to value it as a cutting-edge AI firm. Google’s bulk purchase of GPUs during Project Mack Truck was being replicated at other major tech firms, including Amazon, Oracle, and Microsoft. The business plan for these “cloud-service providers” was to build hyperscale data centers running tens or even hundreds of thousands of GPUs, then lease them to corporate customers. Cloud providers sold computing power in the manner of a utility, just like water or electricity. The data centers were located in unmarked -- 158 of 263 -- industrial warehouses spread throughout the world. Walk through security, past an airlock, and onto the climate-controlled floor of a data center, and you would find many rows of seven-foot-high “racks” that resembled stack shelving in a library. Each rack held several horizontal “trays,” and each tray contained one or two GPUs. The trays were modular and easy to upgrade, and the entire system was interlinked via cabling, allowing hundreds of GPUs to act in concert as a unified computer. A smaller core of CPUs administered the system, and a thick bundle of fiber optics connected it to the outside world. Nvidia didn’t make the racks or the trays, and it normally didn’t build the data centers themselves. All it supplied were the chips, but this was a lucrative business—one in which Nvidia enjoyed a virtual monopoly. Nvidia named its chip architectures after famous scientists from the past: Curie, Tesla, Fermi, Kepler, Maxwell, Pascal, Turing, Volta, Ampere, Lovelace, Hopper, and Blackwell. The later chipsets included dedicated circuits for AI, ensuring that the trays in the data centers required continuous upgrades. The six-month upgrade cycle spurred panic buying from the vendors supplying the machine-learning vanguard, resulting in tremendous profits. Prices from the cloud-service providers were quoted in “dollars per GPU per hour,” with fresh Nvidia chipsets approaching $3 per. (Access to comparable Intel CPUs cost a few cents.) At this rate, it would have cost about $500 to train AlexNet’s insect brain, but researchers were now developing far more ambitious models, with billions of parameters and training costs in the millions. Although training was expensive, if done correctly it could pay for itself, as Google’s efforts showed: by using a neural network to optimize the power draw of its server network, Google was able to save hundreds of millions of dollars on its annual electric bill, recouping the cost of its investment in AI almost immediately. Google also debuted image-recognition tools for its photos application, created automatic tags for user pictures, and used AI to improve the quality of its search results. -- 159 of 263 -- Other large tech firms integrated AI into their products. Soon, Instagram, Facebook, and Twitter were harvesting users’ attention by organizing their feeds “algorithmically,” a euphemism for using machine-learning techniques to drive engagement. (Such tactics were as exploitative as they were effective; social media’s mechanical curators could keep users doomscrolling for hours by feeding them rage bait.) Investments in AI led to profits in such direct fashion that Huang resurrected one of the oldest pitches in retail sales: “The more you buy, the more you save.” Venture capitalists began shoveling money at AI start-ups, investing not only in image and speech recognition but also in health care, retail self- checkout, self-driving cars, and education. In 2010, venture investment in AI was closer to zero than any other meaningful number; by 2015, it had swelled to $5 billion and was rapidly growing. “We’ve been investing in a lot of startups applying deep learning to many areas, and every single one effectively comes in building on Nvidia’s platform,” Marc Andreessen, of the firm Andreessen Horowitz, said in early 2016. “Our firm has an internal game of what public companies we’d invest in if we were a hedge fund. We’d put all our money into Nvidia.” A lot of money was also flowing into the bank accounts of skilled deep- learning engineers. The preferred metric for assessing a job offer was “total comp,” which aggregated base pay, bonuses, stock options, and benefit packages into a magic number that could exceed seven figures annually. In late 2014, Bryan Catanzaro left Nvidia to join Baidu, where he worked with Andrew Ng. “They tripled my salary,” Catanzaro said with a shrug, adding ruefully that if he’d just stayed put at Nvidia, he’d have made more from his stock options. Bas Aarts, who built the first CUDA compiler, also left around this time. Within a few years, though, both Aarts and Catanzaro were back. There was no other firm. There was only Nvidia. • • • A AI , observers grew alarmed. In his 2014 book Superintelligence, Nick Bostrom, a Swedish philosopher at Oxford University, compared -- 160 of 263 -- humans tinkering with machine intelligence to “small children playing with a bomb.” He suggested that a computer capable of general, abstract intelligence would probably be the last invention humans would ever have to make. He posited that a generally intelligent computer might begin to self-augment, transforming itself—perhaps within just a few seconds—to a “superintelligence” that would, to human purposes, seem omnipotent. He wondered if such a machine would do to humans what humans had done to gorillas: conquer the planet and destroy their habitat, leaving a few exemplars of the species to serve as charismatic mascots in protected preserves. Bostrom’s book was an extension of ideas he’d been considering for years. He had previously advanced the “paper-clip maximizer” thought experiment: Suppose we have an AI whose only goal is to make as many paper clips as possible. The AI will realize quickly that it would be much better if there were no humans because humans might decide to switch it off. Because if humans do so, there would be fewer paper clips. Also, human bodies contain a lot of atoms that could be made into paper clips. The future that the AI would be trying to gear toward would be one in which there were a lot of paper clips but no humans. The paper-clip maximizer argument had long circulated online, and it gained traction among the “rationality” community, as well as with many tech executives. A few months after the publication of Bostrom’s book, Elon Musk posted a comment to the futurology website Edge.org: The pace of progress in artificial intelligence (I’m not referring to narrow AI) is incredibly fast. Unless you have direct exposure to groups like Deepmind, you have no idea how fast—it is growing at a pace close to exponential. The risk of something -- 161 of 263 -- seriously dangerous happening is in the five year timeframe. 10 years at most. He deleted the comment minutes later, but screenshots circulated widely. Like many of Musk’s predictions, the timing was exaggerated—but the long-term risk was there. A few months later, Musk joined Huang for a discussion at GTC 2015. It was here that Huang finally nailed his signature look. Taking the stage at the San Jose Convention Center, he wore a black leather jacket over a dark- blue polo, black slacks, and black shoes. The outfit was striking, and it was expensive—most of the clothing was of high-end manufacture. Huang would later credit his wife and his daughter for the makeover. Madison was especially influential; shortly after Jensen’s wardrobe reveal, she would leave the culinary trade to join the French luxury conglomerate LVMH. When I spoke with Huang about his clothes, in 2023, he admitted he often was unsure what brand he was wearing on any given day. “I think this is, uh, Tom Ford, maybe?” he said, opening his glossy black jacket to look at its label. Huang’s presentation that year was a nonstop AI showcase. First, he invited the Slovakian researcher Andrej Karpathy to the stage. Working at Fei-Fei Li’s lab at Stanford, Karpathy had glued two neural nets together, one that identified images in the manner of AlexNet and a second that provided elegant natural-language descriptions of what it saw. To this combined network, an image of a bird was not merely a “bird” but “a bird perched on the branch of a tree.” An image of an airplane was not just an “airplane” but “a large airplane sitting on top of a runway.” Most impressive was a photograph, taken from behind, of a man driving a carriage. Karpathy’s amalgamated network interpreted it perfectly: “a man riding a horse drawn carriage down a street.” Huang then gleefully walked Karpathy through some errors. A flying fish was “a small white bird flying over a body of water,” two men on a toboggan were “a man and a child sitting on a bench,” and an infant holding a toothbrush was “a young boy holding a baseball bat.” (Distinguishing between categories of human tools remained a weak spot for neural -- 162 of 263 -- networks.) Karpathy acknowledged that his network had limitations: it was mimicking speech without necessarily understanding it, and when it got to a concept it didn’t understand, it proudly produced nonsense. Karpathy had termed these mistakes “hallucinations.” Musk then came to the stage. As one, the audience raised their smartphones and started to record. Huang was a Tesla fanboy who owned versions of all three cars the company had produced and geeked out over Tesla software updates. Musk had been using Nvidia’s chips to power Tesla’s heads-up display modules since 2011. Once the two men were seated, Huang got straight to the point. “You were quoted as saying artificial intelligence was more dangerous than nuclear weapons,” Huang said. Musk shifted in his chair. “I said ‘potentially.’ ” “You also said it’s like summoning a demon,” Huang said. Musk glared at him. Huang didn’t press, and changed topics to self-driving cars. (“I almost view this as a solved problem,” Musk said. “We’ll be there in a few years.”) The two discussed automobiles for a while, but at the end Musk revisited Huang’s promptings about existential risk. “It’s odd that we’re so close to the advent of AI,” Musk said. “It seems strange that we’d be alive at this time.” With that, the sedate and anticlimactic AI summit ended. The real discussion took place backstage, outside the view of cameras and investors. With Jens Horstmann and Chris Malachowsky standing nearby, Musk and Huang shared their true thoughts on AI, the ones not fit for public consumption. The two began to fire concepts and strategies at each other, and this made them excited. Soon, their rapid exchange more closely resembled a high-speed data link than human conversation. “They were bouncing ideas off each other, and we had no idea what they were talking about,” Horstmann said. “Even Chris was baffled.” • • • -- 163 of 263 -- T M and Huang were obvious. They were immigrants; they were workaholics; they were visionaries. They were screamers; they were gamblers; they were world-class engineers. They moved with confidence into barren commercial brownfields, strewn with the remains of luckless entrepreneurs, and for the first time made them bloom. It took a sharper eye to spot the differences. There was the vision question, with Musk moving backward from fantasy and Huang moving forward from reality. There was also the topic of loyalty. Musk did not value it; he often fired people arbitrarily and without warning, in one case canning the entire Starlink engineering team almost at random on a Sunday afternoon. Huang almost never fired anyone, and when he did, it was only after multiple cautions and the offer of a performance-improvement plan. It took truly egregious behavior to get kicked out of Nvidia, and many employees worked there for decades, including boomerang hires like Catanzaro and Aarts. Even when operating economics forced Huang to shutter a division, he reassigned employees to other useful tasks. In 2019 Curtis Priem returned to Nvidia’s offices for the first time in sixteen years to join Huang and Malachowsky for a reunion of the company’s founders. “I was astounded at how many people were still there,” he said. “Jeff Fisher, his kids were working for Nvidia.” Huang maintained a stable marriage and spoke of Lori with great affection. Musk had at least eleven kids with at least three different women. Huang possessed magnetic charisma, could be funny as hell, and, when he wasn’t losing his shit, was capable of warmth and great empathetic understanding. Musk missed social cues, was awkward and stilted in conversation, and claimed to be on the autism spectrum. Huang drank whiskey, didn’t tweet, and, as far as I could tell, had over the course of forty years never offered a single political opinion about anything. (Searching federal records, I found not a single donation made in his or Lori’s name.) Musk smoked weed, posted cringe, and funded Donald Trump. But it was their divergent views on the risks posed by AI that was perhaps the most important difference between the men. To Musk, -- 164 of 263 -- advanced AI posed a potentially extinction-level threat. Moreover, this opinion was shared by a great number of technologists, including both Hinton and Sutskever, the coauthors of the original AlexNet paper. Huang didn’t see it that way. He saw no risk in AI at all. Zero. • • • B , Musk, with characteristic humility, had concluded that the best way to protect the human species from AI was to build it himself. Opting for an unusual structure, Musk led a coalition of donors and technologists in founding OpenAI in 2015. “OpenAI is a non-profit artificial intelligence research company,” the organization’s first blog post read. “Our goal is to advance digital intelligence in the way that is most likely to benefit humanity as a whole, unconstrained by a need to generate financial return.” The nonprofit eventually collected $135 million in donations. Musk, giving around $45 million, was by far the largest single donor; other early donors included Reid Hoffman, the cofounder of LinkedIn, and Sam Altman, the president of Y Combinator, a venture investor in early-stage start-ups. (OpenAI’s structure and funding commitments would later create trouble for all involved.) While canvassing for donations, OpenAI also built an exceptional roster of AI talent. Andrej Karpathy, who had presented his hallucinating caption engine onstage at GTC 2015, was among the founders. So was Wojciech Zaremba, the Polish programmer who’d cloned AlexNet at Google. Greg Brockman, a developer from North Dakota who’d been an early employee at Stripe, joined as CTO. The most important hire was Ilya Sutskever, Alex Krizhevsky’s old Russian Israeli research partner who’d been present at the creation of AlexNet and had guided the development of AI since. Krizhevsky himself escaped the dragnet. Rarely saying a word to anyone, he was not the ideal collaborator, and he departed Google in 2017. His share of the auction money was just under $15 million, enough not to work anymore, especially given the asceticism of his lifestyle. In 2019 he granted a Japanese news crew a visit inside his Bay Area home. Krizhevsky -- 165 of 263 -- lived like a Benedictine monk, in a spartan apartment above a Vietnamese restaurant. The walls inside were completely bare; the only items of furniture were a desk, a couch, a digital piano, and a television; the only sign of life in the place was his house cat. Krizhevsky, the Orville Wright of the neural net, told the news crew he had walked away from the technology. “Maybe it’s just my personality,” he said, “but when I spend a very long time specializing in something, after about ten years I start to lose interest.” • • • A OAI assembled the A-Team, Huang set out to build them a computer. Nvidia’s most expensive offering at the time was a desktop box for science and data visualization. Huang figured he needed something ten times more powerful. He asked his team to design the DGX-1, an AI-accelerated computer. The focus of the device was matrix multiplication, which was to AI what Quake had been to graphics. The rebuild began at the atomic level. TSMC was now offering a manufacturing technique called “FinFET,” with transistors that jutted above the silicon substrate like a shark’s fin. If you could shrink yourself to stand on the impossibly smooth surface of the microchip, the finned transistors would look like Soviet apartment blocks, towering over you in every direction at two hundred atoms tall. These crystal canyons were not so much printed as “sculpted” with ultraviolet light at a level of precision that would have impressed a Renaissance master. Engineers compared the manufacturing process to shooting a laser from the surface of the moon and hitting a quarter on the sidewalk in Arkansas. The new shark-fin transistors allowed designers far better control of the flow of electricity. In the past, transistors had always behaved a bit like an old garden hose, leaking everywhere and offering only rough control of the trickle of output. The new transistors stopped the leaks and, in a sense, outfitted the hose with a high-tech nozzle, allowing for different spray modes. For chip designers, FinFET presented a long-sought trinity of precision, efficiency, and control. -- 166 of 263 -- How little attention such innovations incurred. Transistors—computers —were experiencing their most important physical upgrade since the 1970s, but among the public, not one person in a hundred knew or even cared. Like some dominant sports franchise, the semiconductor industry’s long and extraordinary track record was taken completely for granted, even by software engineers. Computers just got better, always; that was just the default. How could it ever be any other way? Inside Nvidia, however, the shark-fin transistors generated great excitement. Chip architects designed at scales that were almost inconceivable: the maze made of hair would now not only fill a tennis court but the state of Rhode Island. As these mazes started “going vertical,” they offered fantastic capabilities. In ancient legend it was Daedalus who built the labyrinth to entrap the fearsome Minotaur—but if Daedalus could have seen what Nvidia was building, he would have wept with envy. The architecture for Nvidia’s first FinFET chip was named “Pascal,” after the seventeenth-century philosopher and mathematician Blaise Pascal, who, among many other accomplishments, had invented the first mechanical calculator. The name was a coded reference to the primary bottleneck in computing, which had always been calculation speed. From Pascal’s gear-driven adding machine to the vacuum tubes of ENIAC to the fine-scale microchips of Intel and IBM, computers simply had never been able to do arithmetic fast enough. But Nvidia’s P100, released in April 2016, did calculations faster than they could even be brought to the machine. With these chips, calculating speed was no longer the primary obstacle. With these chips, the “computer” had transcended the mundane act of “computing.” Huang had seen this problem coming. Since 2014, Nvidia had been developing a data superhighway called NVLink that increased the speed with which math problems could be presented to the machine. NVLink was a homework cannon that fired a million semesters’ worth of matrix- multiplication exercises at the processor every second. By wiring eight P100 chips together with an NVLink connection, Nvidia collapsed separate silicon mazes into a single computing super-labyrinth. -- 167 of 263 -- This magnificent eight-chip array formed the spine of the DGX-1, which was housed in a handsome metal case with a pebbled exterior. The computer weighed 134 pounds, cost $129,000, and drew as much power as a clothes dryer. The DGX-1 was the most powerful computer Nvidia had ever built—Huang called it a “data center in a box.” It was not a machine intended for general purposes; it was not a machine really intended for anything except training ever-more-powerful AIs. But the DGX-1 could today be listed alongside the ENIAC and the Apple II as one of the most important computers ever built. Elon Musk received the first device. In August 2016, Huang hand- delivered it to him at OpenAI’s headquarters in San Francisco’s Mission District. Huang arrived in leather; it was now all he ever wore. The computer weighed so much that he needed a dolly to get it into the office and assistance lifting it onto the table. Musk cut open the packaging with a box cutter, and Huang signed the computer with a marker. “To Elon and the OpenAI Team!” he wrote in his elegant capitals. “To the future of computing and humanity. I present you the world’s first DGX-1!” The following week, Huang delivered another signed DGX-1 to Fei-Fei Li’s lab at Stanford. The 2016 ImageNet contest began a few days later. With their many cognitive “layers,” the best contestants could tag images with 98 percent accuracy, beating the human average of 95 percent. Following six decades of research and tens of billions in investment, a computer could now distinguish a spatula from a hatchet. The age of machine supremacy had begun. -- 168 of 263 -- I FOURTEEN The Good Year n 2016, Nvidia’s stock price appreciated 224 percent, restoring Jensen Huang to glory and making him a billionaire once again. Yet Nvidia was not the best-performing stock in the S&P that year. The title went to Nvidia’s longtime rival AMD, which went up 309 percent. Huang didn’t like coming in second. He rededicated himself to victory and found it the next year. Revenues doubled, profits tripled, and the company launched new products at dizzying speed. Few companies anywhere, ever, could hope to have a year as successful as Nvidia had in 2017. The first task for Huang in 2017 was to hamstring AMD. Since purchasing graphics-card maker ATI in 2006, AMD had been Nvidia’s only real competitor—GeForce and Radeon were the Coke and Pepsi of GPUs. But since 2014, this long-running and occasionally vicious sparring match had become personal: AMD’s new CEO, Lisa Su, was Huang’s relative. Su, seven years younger than Huang, was the daughter of a statistician and an accountant. Her family had immigrated to New York City from Taiwan when she was three. Unlike Huang, Su was the product of tiger parents. When she was young, her parents had given her three career options: engineer, doctor, or concert pianist. She chose the first because “it seemed to be the most difficult.” Su spoke with a mild New York accent, dressed in pantsuits, and wore her hair short. She was also willing to gamble. “She has some of that same nature as Jensen, you know, the conviction,” Forrest Norrod, a senior executive at AMD, told me. “She’s got the courage to stick with it through difficult times.” -- 169 of 263 -- The genetic connection between Huang and Su was somewhat faint. Huang’s mother had come from a large family and had at least eleven older siblings. One of those siblings was Su’s grandfather; technically speaking, this made the two executives first cousins, once removed. While he was growing up, Huang hadn’t been aware of Su’s existence and learned she was his relative only after she was named AMD’s CEO. And the relationship was perhaps even more distant than blood implied. Huang’s mother was not that close to her biological family—as one of the youngest siblings, she’d been raised in a different home. “In the old days, you used to have to give away a child, so you gave it to your friend,” Huang said. “My mother was given to her father’s friend, and that was it; they grew up separately.” Lisa’s ancestor remained with the original parents. When I asked Huang about Su, he had only positive things to say. “She’s terrific,” he said. “We’re not very competitive.” This was certainly not how anyone else saw it. For years, AMD had been Nvidia’s only real rival in the GPU space, and Nvidia employees could recite the two companies’ relative market share from memory. I wondered if there was something Huang wasn’t telling me; in earlier days, I’d noticed, he often denigrated competitors like 3dfx in the press. Now he never said anything bad about any of his rivals—or, at least, never when pot-stirring journalists lurked within earshot. Several people told me that although Huang had wised up about publicly belittling competitors, he continued to do so in private. David Kirk believed that Huang had learned his lesson after bankruptcy lawyers for 3dfx had forced him to read his disparaging comments out loud in a videotaped deposition. But Kirk also suggested that when the recorders were turned off and the conference room doors were closed, the old, trash-talking Jensen would emerge. Industry analyst Hans Mosesmann also suggested that any appearance of amity between Nvidia and AMD was a ruse. “Yes, Jensen does not want to lose,” he said. “But he especially does not want to lose to Lisa.” • • • -- 170 of 263 -- E was appointed to run the place, Huang had a complicated relationship with AMD. His first real job had been at the company, and he still held the small amount of AMD stock he’d bought through the employee-purchase program way back in 1984. In 2006 AMD had even offered to purchase Nvidia. Huang had been open to it, but when he learned he wouldn’t be in charge of the combined company, he ended the merger talks. The deal would have cashed him out nicely, but Huang had no intention of leaving the CEO chair, ever. He had no exit plan. AMD instead purchased Toronto-based ATI, Nvidia’s competitor. A series of foolish management decisions followed, and AMD churned through three CEOs in the next six years. In 2008 the company completed construction of a new corporate campus on top of a hill in Austin, Texas, which it pretentiously branded as “The Summit at Lantana.” (The hill was a few hundred feet high.) Five years later, in an attempt to stave off creditors, AMD was forced to sell the Summit and lease it back like a sharecropper. The transaction was embarrassing, and by 2014, AMD’s stock was trading at $2 a share. Looking for someone willing to steward the company through hospice care, the board promoted forty-five-year-old Su, an internal vice president, to CEO. Few on Wall Street expected her to do anything but delay bankruptcy filings by a few months. But they underestimated her; in time, she would emerge as one of the most celebrated semiconductor CEOs of the era, second only to Huang. Their personalities were different. Huang was temperamental and expressive, while Su was reserved and stoic. “She has a great poker face,” Mosesmann said. “Jensen does not, although he’d still find a way to beat you.” Su had the opposite strategic approach as well. Rather than sailing off toward the horizon like Jensen, she liked to prowl around the incumbent, waiting for it to falter. In this way, Su was perhaps braver than Huang: she didn’t run from Intel. In fact, AMD’s revival came from capturing Intel’s CPU business, a feat that analysts once regarded as impossible. Now that AI was succeeding, Su was prowling around Jensen. In 2016 her company released an open-source CUDA clone for the Radeon card. -- 171 of 263 -- Most developers saw this offering as inferior, and AMD’s cloud GPUs didn’t command the premium rates Nvidia’s did. But Su, like Huang, was patient, and she was willing to wait years for the opportunity to strike. “Jensen does not want to lose. He’s a driven guy,” Norrod said. “But we think we can compete with Nvidia.” Huang welcomed his upstart cousin to the show by stealing one of her oldest customers. Nintendo and AMD had a close relationship: ATI had supplied graphics hardware for several generations of Nintendo consoles, including the GameCube and the Wii. This was a business segment that Huang, following the collapse of the Xbox deal, had repeatedly claimed he did not want to pursue. Video game consoles updated only once every five years, too slow for a man accustomed to a six-month refresh cycle. So it came as a surprise to everyone—Su included—when it was announced that Nintendo was going to Nvidia for the new Nintendo Switch. To outsiders, Nintendo resembled an uncrackable safe. Even by Japanese standards, the company was insular. It was headquartered in conservative Kyoto, with most of its decisions made by iconic game designer Shigeru Miyamoto and the small clique of executives who surrounded him. Miyamoto had been in his early thirties when he’d produced Super Mario Brothers and The Legend of Zelda; now in his sixties, gaming’s Walt Disney had lost none of his passion. Hardware and software teams in Kyoto sat directly next to each other, building full-stack gaming titles that tightly coupled the gameplay experience to the controller. Miyamoto exercised a fanatical level of control; he once complained about the stitches on Mario’s pants in a US print advertisement. But Miyamoto also pushed his teams to take bold creative and technological risks, and the forthcoming Switch would prove to be one of the most versatile and delightful consoles ever produced. Insiders considered the Switch’s chipset a done deal. “ATI had an extraordinary relationship with Nintendo,” Jon Peddie said. “They were a loyal, consistent vendor, and there was no reason to think that would change.” But Nvidia’s sales reps somehow worked their way in, peddling the Tegra, a repurposed mobile-phone product that engineers described as -- 172 of 263 -- “floor scrapings.” How, precisely, Huang stole the Switch remains a guarded secret—although Japanese executives might invite a foreign adviser into the boardroom, the real decisions were made late at night in the tiny izakayas of Kyoto after a pitcher or two of beer. The Tegra was a “system-on-a-chip,” meaning that it combined CPU, GPU, and other functions onto a single piece of silicon. The Tegra’s computational performance was not exceptional; the selling point was its low power draw, which permitted gamers to detach the Switch from its base and play Animal Crossing for hours while hiding under the covers from their parents. Peddie recalled meeting with Nintendo executives shortly after the unexpected decision was made. “It was incredibly friendly, but stony,” he said. “You know, ‘This is done, it’s very nice to see you, and please go away.’ ” Nintendo would go on to sell 140 million Switch consoles, making it by far the most profitable product in company history. But it was a measure of how fast Nvidia was accelerating that Huang barely mentioned building the brain of the Switch, even in interviews at the time. Ambushing Lisa Su was a side project; his sprawling company now had its tentacles in everything. • • • T N P in Physics was awarded for work on the Laser Interferometer Gravitational-Wave Observatory, twin pairs of intersecting laser beams located 1,800 miles apart that captured subtle perturbations in the texture of space-time to detect stellar collisions millions of light-years away. The 2017 Nobel Prize in Chemistry was awarded for work on cryo- electron microscopy, a technique in which biological specimens were suspended in transparent “amorphous ice,” then bombarded with electron beams to visualize their three-dimensional structure. Both awards were not for new scientific discoveries but new scientific tools—and each tool intersected with the parallel-computing slope of Nvidia’s GPUs. The laser beams and the frozen amoebae each generated huge amounts of raw data. This data was essentially just a large collection of points in -- 173 of 263 -- space, like a pile of billions of Lego bricks waiting to be assembled. The GPU allowed the scientists to stack these “bricks” of data in parallel, producing precise three-dimensional models of the wonders of our universe at scales both large and small. The laser-beam data were used to draw an animation of two black holes that had been bound in a deadly gravitational spiral for millions of years, spinning together closer and closer until they finally crashed into each other with an explosion so violent that it warped the fabric of reality and sent shock waves echoing throughout the universe. Science magazine called it the breakthrough of the year. At the other extreme, a miniature, one-cell creature suspended in ice revealed biological machinery of surreal complexity. Before, such intricate structures had been unrecoverably smashed between microscope slides, but the new ice-bound, three-dimensional render revealed that the “blind watchmaker” of evolution had built actual biological clockwork. There were gears! There were tiny gears inside the cell, and when they spun, they propelled the tiny creature forward like a motorboat. The three-dimensional renders were CGI constructions, not actual images, and at times you had to squint to see the beauty. But it was there. It was there, waiting not for theory or experimentation but for the raw scientific act of discovery. The black holes and the gears of nature had been there, waiting for millions of years, waiting simply to be seen for the first time. Parallel computing made it possible. The frenzy over AI obscured the contributions that Nvidia made to prosaic scientific advancements like these, which merely won their discoverers the Nobel. Supporting the work of laureate groups in both physics and chemistry in the same calendar year offered irrefutable evidence of CUDA’s value to humanity—the problem was it didn’t pay. As beautiful and important as the images were, Bill Dally was unsure if scientific computing ever would have been profitable without neural nets. “The market for building these big supercomputers for various labs around the world is probably no more than what it costs to develop each generation of GPUs,” he said. “We might have turned the corner anyways, but it was good that AI came along.” -- 174 of 263 -- • • • I that another, less reputable kind of customer discovered CUDA. Cryptocurrency miners created new coins by cracking mathematical ciphers through brute-force computation. CUDA was well- suited for this task, which involved processing similar chunks of data in parallel many millions of times. Enterprising miners soon realized that a well-configured GPU rig would take only four hours to mint a coin. When the price of bitcoin crossed $1,000 for the first time in January 2017, early adopters realized instant fortunes. By mid-2017, crypto mining had turned into a classic speculative frenzy. Bitcoin quadrupled, then quadrupled again, crossing $16,000 by the end of the year. Retailers couldn’t keep GeForce gaming cards in stock, and on eBay the cards could command double the suggested retail price. “Superminers” emerged, arranging GeForce units on metal racks in bedrooms and garages, generating enough heat to melt snow off a roof. Miners soon discovered crypto’s Mecca: East Wenatchee, Washington, a tiny river town in “hydro alley” that purportedly had the cheapest electricity in the nation. For a time, Nvidia’s stock traded in parallel with bitcoin’s price. Many inside Nvidia were appalled—some, concerned about climate change, found this use of the cards to border on sacrilege. Huang, reasoning from first principles, had seen a world in which AI would become the dominant force in computing. When Huang similarly reasoned from first principles about the potential for the blockchain, he did not foresee a world in which crypto displaced fiat money. Huang was too much of a businessman to explicitly discourage miners from purchasing his GPUs, although he never exactly endorsed bitcoin mining either. Nvidia’s official position on crypto was silence. On conference calls throughout 2017, Nvidia stuck to discussions of self- driving cars and AI, downplayed the impact of crypto, and declined to speculate on the proportion of its retail sales that came from miners. Financial reports from this time showed only a suspicious boost in the -- 175 of 263 -- general category of “gaming,” but even silence had a cost: the SEC later brought charges against Nvidia for failing to disclose to investors the proportion of GPU sales that were attributable to crypto. Without admitting or denying the SEC’s findings, Nvidia paid the regulators a $5.5 million penalty. This was nickels for insurgent Nvidia, but the experience was still a pain in the ass, and most of the Nvidia engineers I spoke with regarded crypto as an idiotic distraction from the company’s far more important scientific work. Eventually, Nvidia shipped the CMP, a miners-only card that was basically just a GeForce with the video ports ripped out. At the same time, Nvidia deliberately impaired the mining ability of its standard graphics cards to make sure that gamers and scientists got priority. Bitcoin crashed in 2018, and demand evaporated. By the time bitcoin recovered, GPUs had been supplanted by application-specific rigs that were built for mining, and other leading cryptocurrencies had abandoned the expensive, climate- unfriendly brute-force approach. Nvidia was happy to cede the business. • • • I J , Nvidia reported earning a billion dollars in annual profit for the first time. Shortly thereafter, the company hired its ten-thousandth employee. The dull modernist building complex the company had occupied since 2001 could no longer contain its ambitions. It was time to expand. Years earlier, Nvidia had purchased the lot adjacent to its leased campus in anticipation of building a command center fit for Huang’s vision. The project had stalled following Bumpgate, but in 2017, after several years of construction, the new headquarters, called Endeavor, finally opened. Endeavor was an enormous building in the shape of a triangle with its corners trimmed. This shape was replicated in miniature throughout the building interiors, from the couches and the carpets to the splash guards in the urinals. Nvidia’s “spaceship,” as employees called it, was cavernous, filled with light, and spotlessly clean. Its interiors were sharp and angular, -- 176 of 263 -- colored in hospital white and matte black, and interrupted by multistory “living walls” thick with vegetation. The head architect for Endeavor was Hao Ko, a tall and soft-spoken architect who worked for Gensler, a leading corporate-design firm. Ko dressed fashionably, maintained a trim soul patch, and wore black designer eyeglasses. He had been a junior architect at Gensler when Huang had appointed him the lead designer on Endeavor, bypassing Ko’s boss. I asked Ko why Huang had done so. “You probably have heard stories,” Ko said. “He can be very tough. He will undress you.” Huang had no architecture experience, but he wanted someone he could push around a little. “I would say ninety percent of architects would battle back,” Ko said. “I’m more of a listener.” To achieve his vision for Endeavor, Huang strapped Ko into a virtual- reality headset, then attached the headset to a rack of Nvidia GPUs so Ko could simulate the flow of light. “This is the first building that took a supercomputer to design,” Huang said. Ko used the headset to design Endeavor’s unusual crinkle-cut roof, which reminded me of a fractal. First, Ko extended the roof over the edge of Endeavor’s exterior glass walls so that it would cast shade like the brim of a hat. Then, using computer-aided- design software, he placed hundreds of small, triangular skylights along the folds of the roof at mathematically optimized locations. The cumulative effect was to ensure that the building was at all times flooded with natural light while never exposing employees to the direct glare of the sun. Endeavor had expansive sight lines, and you could see clear from one edge of the building to the other, a distance of a couple hundred yards. This was just as Huang wanted, as it gave him a 360-degree survey of his staff. Ko had built Huang a beautiful executive office complex on the third floor; Huang used this as a book depository. Instead, he commandeered an anonymous-looking conference room in the middle of his headquarters and turned it into his war room. Huang still needed to be in the center of things —just as he’d needed to be seated at the table next to the refrigerator in Priem’s house many years before. -- 177 of 263 -- Working with Huang could be challenging. Ko recalled Huang yelling at his engineering staff about the rendering speed of the VR headset. “He was really letting them have it,” Ko said. The headset originally took five hours to render design changes; following Huang’s outburst, the engineers got the speed down to ten seconds. Huang was tough on them, but there was a logic to it. “If the headset took five hours, I’d probably settle on whatever shade of green looked adequate,” Ko said. “If it took ten seconds, I’d take the time to pick the best shade of green there was.” The building won several awards and made Ko’s career. Nvidia now had a signature building to rival the GooglePlex and Apple’s Infinite Loop. Employees were astonished by the immensity of their new workspace—Ko, at Huang’s insistence, had even built a bar called “Shannon’s” into the top floor. (David Shannon, Nvidia’s former general counsel, had questioned the wisdom of mixing employees and alcohol in this way, so Huang named the bar after him.) Outside of the bar, though, the building was light on amenities. It did not have some of the over-the-top perks other Silicon Valley campuses did: no gym, no climbing wall, no dog park, no disc golf, no ball pit. “You come here to go to work,” Huang said. Ko’s project was not finished. Having completed the first structure, he was already designing a second, even larger building directly to the north. That building, known as “Voyager,” would be another trimmed triangle opposite the first. Together, the two buildings would form an interlocking polygonal structure that evoked Nvidia’s origins in 3D graphics while providing striking vistas from both ground and sky. (Their names interlocked as well; the initial sounds of “Endeavor” and “Voyager” came together to generate “NV.”) When Ko was done with the second building, Huang’s master plan called for a third building, as yet unnamed, to complement the other two. Each building would be bigger than the last, tiling the floor of the valley one tessellated triangle at a time. When Endeavor opened, Ko led Huang and a group of executives on a tour. “The place was finished, it looks amazing, we’re doing the tour, and he’s questioning me about the placement of the water fountains,” Ko said. -- 178 of 263 -- “He was upset because they were next to the bathrooms! That’s required by code, and this is a billion-dollar building! But he just couldn’t let it go.” “I’m never satisfied,” Huang said. “No matter what it is, I only see imperfections.” • • • T CUDA was downloaded 2.7 million times in 2017, nearly triple the previous year and fifteen times more than the dregs in 2012. Some of this was for crypto, but the majority of CUDA downloaders were looking to build AI. A lot of the interest came from students—by 2017, the most popular course at Stanford was CS 229, “Introduction to Deep Learning.” Unlike crypto miners, CUDA developers didn’t necessarily purchase their own Nvidia hardware. Many ran their programs, and sometimes their whole companies, on virtual machines they leased through the cloud. For cloud providers, this was an exceptionally lucrative business. The dominant player was Amazon Web Services, which controlled 50 percent of the market and which in good years actually made more money than Amazon’s vast e-commerce operation. Chasing behind Amazon was Microsoft’s cloud service Azure, under the reinvigorated leadership of Microsoft chief executive Satya Nadella. The combined purchasing demand from Amazon, Microsoft, and other cloud providers was enough to double Nvidia’s chip sales to data centers for the year. It did not escape Huang’s notice that leasing virtual computing equipment was more profitable than selling actual physical hardware. In 2017 he introduced two leasing platforms of his own. The first was GeForce Now, which rented out virtual graphics cards, permitting gamers to play high-end games on underpowered machines like laptops and cheap commodity PCs. The physical sale of GeForce gaming cards still provided a majority of Nvidia’s revenue, and enthusiasts still loved to show off their rigs—but cloud computing suggested that all modern gamers really needed was a monitor and a decent broadband connection. With GeForce Now, -- 179 of 263 -- Huang was preemptively preparing to someday put a bullet in the head of what for twenty-five years had been his core business—reasoning that if he didn’t kill it, someone else would do it for him. The second platform was altogether stranger. The idea for it had occurred to Huang while watching Yevgen Chebotar, a researcher at USC, train a robotic arm to hit an orange ball with a hockey stick. The arm used reinforcement learning techniques to learn to score goals. Given enough time, it might rival Gretzky, but to make it happen, Chebotar and his assistants had to physically reset the ball in front of the stick thousands of times. This did not strike Huang as an efficient use of Chebotar’s day. Instead, he reasoned, the physical parameters of hockey should be simulated on a computer. The power of simulation had been a recurring theme throughout Huang’s career. Anything he could simulate, he did—profits and innovative products usually followed. Now it occurred to him that simulating any one thing was not enough; he had to simulate everything. “We have to build an alternative universe,” Huang said. In this alternative universe for robotics training, Chebotar would not have to reset the ball. Instead, reinforcement could happen instantaneously; the robot arm would hit a billion balls over the same span of time. Gerald Tesauro had simulated games of backgammon by generating millions of dice rolls. Huang would simulate games of hockey by weaving a new reality in code. Creating a platform with sufficient fidelity to real-world physics was not an easy task. Game engines were nowhere close—as Peddie had observed, once you stripped away the textures you just had blocks running into one another. A reality simulator would require not only a perfect physics engine but also precise representations of collision elasticity and object density. Striking a tennis ball, for example, would have to produce a different response from striking a billiard ball. Soft textures like cloth would have to give, while rigid textures like metal would have to hold. A slippery wet plate would have to fall, and a bruisable piece of fruit would have to rupture. With so many variables, a reality simulator would be expensive, -- 180 of 263 -- time-consuming to build, and initially have maybe five or six customers. Huang, of course, loved the idea. Huang called his first simulator the “Isaac” robotic-training platform. With time, Isaac led to a more complex product called the Omniverse, which Huang sometimes described as an “industrial metaverse” and other times as Earth’s “digital twin.” The Omniverse would not be a physical product that Huang would sell, but a high-fi digital playground he would lease. And lease not just to robots, but also to cars, and industrial designers, and warehouse builders, and anyone else looking to make complicated physical products work in the real world without breaking things first. Ko, using VR, had built him a spaceship. With the Omniverse, Ko could have built anything. • • • B , Nvidia had tripled its profits to $3 billion, with spiking growth across every product line. The neglected graphics-card company was now a Wall Street player with a $100 billion market capitalization. Still, if you asked the average person on the street in 2017 what product Nvidia made, the answer would likely be hardware for graphics or crypto mining. Actually, that’s not quite true—if you asked the average person on the street about Nvidia in 2017, they would surely have no idea what you were talking about. But for the small fraction of the public who did know, Nvidia was still mostly associated with video games. Only the savviest investors grokked that Nvidia was no longer a consumer-graphics play. It was an AI company. It was the AI company. And AI was suddenly and quietly everywhere: recognizing human faces, recommending products, organizing social media feeds, and improving voice quality on mobile phones. As Marc Andreessen had observed, in one way or another almost all of it relied on the Nvidia computing stack. Investors who’d had the gumption to stick with Huang through the dark ages were rewarded, and the Fidelity portfolio managers who’d cross- examined Huang in Boston in 2013 were now delighted with their oversized -- 181 of 263 -- and ever-growing position in what was rapidly becoming the stock market’s single best investment. With CUDA, the Switch, the Nobel Prizes, crypto, and the cloud, Nvidia had forever left the corporate doldrums behind. Yet none of these products, not even Huang’s attempt to clone reality, could’ve prepared him, his customers, or his investors for the metamorphosis that was about to occur. -- 182 of 263 -- A FIFTEEN The Transformer s Wolfgang Mozart was born into music, as Stephen Curry was born into basketball, so was Jakob Uszkoreit born into the obscure field of computational linguistics. His father, Hans, was an acclaimed researcher who had spent his life attempting to teach computers to process language. Jakob had wanted to do something different with computers, perhaps something involving biology, but after he was hired at Google in 2008, he soon realized that language processing was one of the most interesting problems. Surrendering to fate, Uszkoreit entered his father’s field. Within a few years, he would forever surpass him. Uszkoreit was handsome, with stern Teutonic features and dark-brown eyes. He wore his hair long and frequently tied it in a man bun. Born in America, he had grown up in Germany and spoke English with a faint accent. At Google, Uszkoreit pondered the hidden grammatical structures that made language go. Perhaps from the neural net’s array of random starting weights those same structures might be shaped into existence. Early linguistic efforts with neural nets had been frustrating. No matter how they were trained, the models continued to make elementary grammatical errors. Teaching grammar to the computer explicitly, like a high school Latin teacher, was an approach that didn’t scale. Researchers instead tried implementing long-term and short-term memories inside “recurrent” neural networks, but this architecture was finicky and difficult to program. When exposed to more text, recurrent neural networks would sometimes even untrain, forgetting things they already knew. -- 183 of 263 -- Uszkoreit wanted a neural net that got smarter the more it read. Sometime in 2014, he was struck with a novel idea. If more data was going to lead to better results, then the underlying structure for processing information should be as simple as possible. His inspiration was biology— medical scans suggested that of the estimated hundred billion neurons in the human brain, fewer than 1 percent were dedicated to language processing. “Probably language evolved this way to exploit our cognitive capacities in a fairly optimal way,” he said in a 2023 interview. Uszkoreit decided to model language using context alone. He ripped out all the memory structures and replaced them with a simple knowledge graph. Words themselves meant nothing: in isolation, they were arbitrary collections of sounds. The only way to capture their meaning was to draw links between them and other words in the text. So if you had a knowledge graph linking the words “hop,” “green,” “tongue,” “flies,” and “amphibian,” then you knew enough to guess that the word in the center was “frog.” Not only that, but the graph should look the same in German, French, Swahili, or Vietnamese. The word wasn’t the letters “f,” “r,” “o,” and “g”—those letters were just placeholders. The word, in a cognitive sense, was that unique map of links to the rest of the vocabulary. To capture this relationship, Uszkoreit defined each word as a tree of statistical weights. For example, if asked to consider the sentence “The orange _____ caught the brown mouse,” the neural net could guess that “cat” was most likely the missing word because it had encountered a great deal of “cat/mouse” pairs in its training set. “Cat” might also have a relatively strong relationship to “caught” and perhaps “ate,” but less strong to “brown” and barely strong at all to “the.” Perhaps, with enough training examples, the computer could also come to understand that “orange” was an adjective that modified the noun “cat” without ever receiving explicit grammatical instruction. Common nouns were easy to map in this way, but other words were harder. For example, in parsing the word “unhappiness,” the native English speaker implicitly understood the negating prefix “un,” the root word “happi,” and the suffix “ness,” used to convert an adjective into a noun. To better model these relationships, Uszkoreit split words into -- 184 of 263 -- fragments called “tokens.” These, too, were connected in a tree of statistical weights. Uszkoreit called this learning mechanism “self-attention.” The reception at Google was cool—it seemed too simple to work. “People raised their eyebrows because it dumped out all the existing neural architectures,” Uszkoreit said. Even his father was skeptical. But Uszkoreit was designing for the GPU. The recurrent neural-network architecture didn’t take advantage of Nvidia’s hardware. In fact, it fought against the parallel architecture, minimizing data input and maximizing fancy code. Uszkoreit, seeing the analogy with the brain, wanted to do the opposite, piping massive amounts of text, words, and computing firepower through a simple yet elegant funnel. Uszkoreit outlined his thinking in 2023: “If you are given a piece of hardware that has the very key strength of doing lots and lots of simple computations in parallel, as opposed to complicated, structured computations sequentially, then really that’s the statistical property you want to exploit.” • • • T “-” was immediately successful, and aspects of it were rolled into Google’s search and advertising products. Uszkoreit, looking to take it further, convinced Illia Polosukhin, a champion coder who worked with him at the GooglePlex, to join this research group. Polosukhin, too, was fascinated by the biological underpinnings of language. “Images are interesting, and obviously contain a lot of world knowledge, but we have thousands of species which can see,” he told me. “Only one species can actually understand language.” Polosukhin was mulling over how to implement the self-attention mechanism when he went to see Denis Villeneuve’s 2016 movie Arrival. In the film, squid-like alien “heptapods” attempt to communicate with humans by painting mysterious circular inkblots. A linguist, played by Amy Adams, eventually realizes that each inkblot represents a single, unified body of text. (She then begins to see into the future, but we’ll stick with inkblots for -- 185 of 263 -- now.) Polosukhin recognized that “self-attention” could be deployed in a similarly maximal way, probabilistically linking each word in the tree not just to other words in the sentence but also to thousands of other words throughout an entire text. Even a word that had appeared many paragraphs earlier might provide a contextual clue to what the next word meant. Polosukhin and Uszkoreit were joined by Ashish Vaswani, another Google researcher, and by early 2017 the three had built a rudimentary English-to-German translator based on the self-attention mechanism. Polosukhin and Uszkoreit had previously contributed to an internal Google program called “autobot,” which attempted to automatically write Wikipedia pages. The new self-attention mechanism was called a “transformer.” Over the next few months the team added four more researchers, and by February 2017 the German-English translator was competitive with the best recurrent networks. At this point Noam Shazeer, a Google veteran who’d worked for the company since 2000, joined as the team’s eighth and final member. Shazeer was an expert coder who was frustrated by recurrent neural networks and wanted an alternative. Together with the Welsh programmer Llion Jones, he upgraded the transformer from a research project to production-quality software. As the team fed more data into the transformer, the results improved, outclassing the public Google Translate platform. “What we saw was that as you make it bigger, it’s clearly just kind of more intelligent!” Shazeer said. “That was not true of our preexisting work.” Previous neural-net architectures had tried to build sentences or even paragraphs. The transformer worked by predicting exactly one word at a time, based on the probabilistic relationships. Just one word—that was the furthest it ever looked ahead. “By learning to generate sequences in order, you’re forced to learn extremely complex behaviors,” team member Aidan Gomez said. “One of the most beautiful things falls out of this.” Soon, the transformer model demonstrated that it could “understand our culture, our language, the interactions between us.” -- 186 of 263 -- Jones noticed that if the transformer always returned the best fit, the prose could get a little clunky. So, like a writer turning to a thesaurus, he implemented a “beam search” function that would allow the transformer to choose from one of several candidate words. “If you cut a path through the top word, then the second, then the third, then the top again, you will often get a better result than if you just always picked the top word,” he told me. “It was a nightmare to implement, but once we did, that was when we first got state-of-the-art results.” To better understand what the transformer was doing, Jones also coded a visualization tool that represented the strength of the statistical relationships between words with lines of different thickness. He fed it a notoriously difficult sentence pair. The first sentence read, “The animal didn’t cross the street because it was too tired.” The second sentence read, “The animal didn’t cross the street because it was too wide.” To parse the sentence pair correctly, the transformer would have to know that “tired” could describe only an animal and that “wide” could describe only a street. To his amazement, Jones’s visualization demonstrated exactly that relationship. “This was one of the oldest problems in computational linguistics, and we weren’t even trying to solve it!” he said. “It just fell out.” Language had many similar deep, implicit structures, some of which were probably invisible to linguists. “It’s not like the model is just learning adjective-noun relationships—it’s also learning far more complex stuff that we probably don’t even have language to describe,” Gomez said. For example, native English speakers follow an implicit order for the placement of adjectives—we know to say “an old Canadian maple tree,” not “a maple Canadian old tree.” With the transformer, that linguistic intuition had been captured in software. If AlexNet had been the first airplane, a rickety proof-of-concept, then the transformer was the jet engine. Shazeer and Uszkoreit, working together on a whiteboard, ensured everything about the transformer was built to accommodate hyperscale architecture, with massive amounts of data, massive numbers of parameters, and massive GPU clusters. As the project -- 187 of 263 -- built momentum, the work grew frenzied; following one late-night crunch, Vaswani, sleep-deprived, recalled seeing neurons in the office curtain. In the final stage, the team ran “ablations,” deliberately disabling portions of the transformer code to understand what contributions they made. But the ablations had the unexpected result of making the core transformer function run even better. Shazeer removed so much of the surrounding code that in the end he was left with almost nothing. In its most primitive interpretation, the transformer was barely more than twenty lines of code. Oh, but what it could do! As they prepared for the publication of their seminal paper, the team experimented with feeding the transformer libraries of music and archives of visual art. Just as the transformer could accurately predict the next most likely word in a sentence, so could it predict the next- most-likely note in a symphony or the next-most-likely pixel in a work of art. Soon the transformer was writing music and painting recognizable works of art. This elegant architecture, designed to do the simplest thing conceivable—just take one step at a time—was like a skeleton key for AI. In 2017 the team published its results in the Neural Information Processing Systems journal, which had published the original AlexNet results. The paper needed a name, so Jones, channeling the Beatles, suggested “Attention Is All You Need.” This was an off-the-cuff joke that he didn’t think the team would actually use. Later, he would meet people with the sentence tattooed on their arms. In July 2017, shortly before publishing their results, Shazeer and team member Lukasz Kaiser tried an experiment. Rather than ask the transformer machine to translate preexisting texts, they asked it to ingest a corpus of millions of Wikipedia articles, then generate new text based on what it had read. As their prompt, they entered a single request: Write an article about the Transformer. Out flowed a thousand-word description of a Japanese new-wave punk band known as Transformer. The article was purely hallucinatory—no such band existed. But the text was smooth, confidently written, and even included fabricated footnotes. Gomez felt that his timeline for progress was collapsing. “You went to bed one night, and models could -- 188 of 263 -- barely spell,” he said. “We thought we’d have models that could write compelling English in a few decades, and then suddenly it shows up!” The transformer team expected Google to turn the technology into consumer-facing products, but management somehow didn’t see the value in the tech. Team members felt that Google’s search monopoly had resulted in a bloated, bureaucratic company unwilling to take risks. “They were like, ‘Hey, we cannot launch anything that doesn’t fit into the search box,’ ” Polosukhin said. “Fifteen years earlier, we would have just launched something bad. Then we iterate, we learn, and we improve, improve, improve, improve, until it’s actually really good. At some point, we lost that mentality.” The transformer authors began defecting to start-ups; by 2023, every one of the eight original transformer researchers had left. (Shazeer, who cofounded the popular chatbot service Character.ai, was acqui-hired back in 2024. Defecting pays.) Jones, the last to go, still praised his former employer. He recalled his eclectic collaborators, each with unique viewpoints and capabilities, who had organically coalesced around a promising topic. “Only Google could have developed this product,” he said. It was probably true, but Google’s failure to capitalize on the transformer left a giant hole in the market for someone else to fill. • • • OAI Huang had delivered the DGX-1. The company’s first research effort was an “artificial gamer” that excelled at the real-time strategy game Dota 2. Some at the organization wondered if this was the best use of their world-class talent. Elon Musk grew frustrated with OpenAI’s apparent lack of progress and began attempting to lure its engineers to join him at Tesla. The situation came to a head in February 2018, following a fraught piece of boardroom wrangling that left Musk sidelined and Sam Altman, the former head of Y Combinator, in control. In his final meeting with the organization, Musk told employees he was quitting to pursue his own AI research at Tesla. “After a young researcher -- 189 of 263 -- challenged Musk’s decision, suggesting it would exacerbate the AI arms race, he called the researcher a ‘jackass’ and stormed out of the building,” The Wall Street Journal reported. Ilya Sutskever remained. The AlexNet coauthor had continued to publish important research and now rivaled his mentor Geoffrey Hinton as one of the most cited scholars in the field. Sutskever focused on AI to the exclusion of everything else, including, increasingly, grooming: his beard was tangled and unkempt, his eyebrows were thick and untrimmed, and you could see most of his scalp through his thinning, wispy hair. His strength, as Hinton had observed, was the speed with which he could pivot toward a good idea. Around the time of Musk’s tantrum, Sutskever had watched Shazeer present the transformer at a conference. He grasped at once the power of the architecture, returning to OpenAI’s offices to advise his colleagues that they should ditch the auto-gamer to build something that could change the world. “Literally the next day, it was clear to me, to us, that the transformer addressed the limitations of recurrent neural networks,” Sutskever said. “We switched to transformers right away.” Altman, now in charge, agreed with the change in strategy. Sutskever wanted to use the transformer to build a product that would deliver high-quality, human-readable text and answer any conceivable prompt. He had seen Shazeer and Kaiser’s proof-of-concept with the fake Wikipedia articles and thought it could be expanded upon. First, the model would “pretrain” on a large collection of text. Then it would generate text of its own. Combining the purpose, the method, and the architecture, you arrived at the “Generative Pre-Trained Transformer,” or GPT. GPT-1 launched in June 2018. It learned to read using BookCorpus, a collection of around seven thousand free, self-published ebooks. (Sci-fi, romance, and fantasy were the predominant genres—many of the books were Twilight knockoffs.) Drawing from this curriculum of third-rate vampire fiction, the first GPT was as bad as you might expect, answering users’ queries with streams of Dadaist nonsense. Musk wasn’t impressed, and after the release of GPT-1, he sent OpenAI a nasty email. “My -- 190 of 263 -- probability assessment of OpenAI being relevant to DeepMind/Google without a dramatic change in execution and resources is 0%,” he wrote. Still, OpenAI had the freedom to launch something bad; unlike Google, it could even launch something terrible, then improve, improve, improve. GPT-1 sucked, but it showed that something of its kind could work. The key to a better system, as Sutskever had grasped way back in 2012, was scale. GPT-2 launched eight months later. Having graduated from the vampire- romance syllabus, the model trained by absorbing around eight million web pages comprising around six billion individual words. The final product could sometimes produce text that was indistinguishable from that of a human. Asking for a story, OpenAI researchers gave GPT-2 the following prompt: “In a shocking finding, scientists discovered a herd of unicorns living in a remote, previously unexplored valley, in the Andes Mountains. Even more surprising to the researchers was the fact that the unicorns spoke perfect English.” GPT-2 ran with the prompt: The scientist named the population, after their distinctive horn, Ovid’s Unicorn. These four-horned, silver-white unicorns were previously unknown to science. Now, after almost two centuries, the mystery of what sparked this odd phenomenon is finally solved. Using only statistical relationships to guess the next word in the sentence, GPT-2 output better prose than any other language model had ever produced. Sutskever and his team then pushed GPT-2 out of its comfort zone, asking “zero-shot” questions that were not directly answered on any of the web pages. “Who wrote the book The Origin of Species?” they asked. “Charles Darwin,” GPT-2 answered. (Correct.) “What is the largest supermarket chain in the UK?” they asked. “Tesco,” it answered. (Correct.) “Who played John Connor in the original Terminator?” they asked. “Arnold Schwarzenegger,” it answered. (Incorrect: Schwarzenegger -- 191 of 263 -- played the cyborg.) “What river is associated with the city of Rome?” they asked. “The Tiber,” it answered. (Correct.) The ability of GPT-2 to answer new questions without explicit training was an example of the “emergent” properties in AI. These unexpected skills and behaviors were appearing as the models grew larger, surprising even the researchers. Once a model crossed the threshold of emergence, no one, not even its designers, could say what it was fully capable of doing. Of course, GPT-2 had many limitations. It was terrible at summarizing arguments and could get tripped out counting to ten. Still, Sutskever, seeing these capabilities, began to wonder if the transformer was a first step toward “artificial general intelligence,” or AGI. One way to define AGI was as software that could accomplish any task a human could. Sutskever, building ever more sophisticated versions of GPT, revisited the same concerns that had prompted OpenAI’s founding. What if an AGI could do its own AI research, augmenting its own intellect in a never-ending feedback loop? Was there a tipping point where, as Bostrom had postulated, over the course of a few moments the AI went from sort-of-smart to hypersmart? Would its human operators be able to understand that this was happening? Would this AGI permit them to survive? Sutskever did not think the current generation of neural-net architecture would lead directly to AGI. But what about the next breakthrough? In a little more than five years, he’d watched AI leap forward two separate times. Surely, he reasoned, somewhere out there someone was working on the next AlexNet, the next transformer. What then? The question began to consume Sutskever—no one could say what might happen once the AGI threshold was crossed. Yet these concerns did not constrain his ambitions. Researchers described the size of the models by the number of individual weights, or “parameters,” they contained. Each parameter was roughly equivalent to a synapse in a biological brain. GPT-1 had about 100 million parameters, keeping it in insect territory. GPT-2 had 1.5 billion, the equivalent of a small lizard. For his next model, Sutskever was targeting 100 billion parameters—on the order of a rodent. -- 192 of 263 -- Training this neural structure required a lot of computing power. The electricity costs alone made it one of the more expensive computer- intelligence applications ever developed—an early indicator of things to come. Altman, now the CEO, realized he could not raise enough money through nonprofit donations to satisfy Sutskever’s ambitions, so in 2019 he announced that OpenAI was launching a “capped-profit” subsidiary that limited backers to a mere 100× return on investment. “It would be wise to view any investment in OpenAI Global LLC in the spirit of a donation, with the understanding that it may be difficult to know what role money will play in a post-AGI world,” the accompanying press release stated. The largest initial “donor” was Microsoft, which invested $1 billion in OpenAI, accepting a modest $100 billion in potential remuneration. (Perhaps a world that had transcended money was coming, but until that day arrived, Microsoft would continue to accumulate more of it.) The investment represented a million-fold increase from the $1,000 that Krizhevsky and Sutskever had used to buy two GeForces seven years before. Yet even this was not enough to fulfill Sutskever’s appetite. The human brain had perhaps two hundred trillion synapses; AGI or not, OpenAI was looking to surpass it. Nvidia was already assembling the computing stack Sutskever would need to get there, an integrated, warehouse-sized solution that Huang no longer referred to as a supercomputer or a data center, but as an “AI factory.” -- 193 of 263 -- H SIXTEEN Hyperscale uang’s only interest now was scale. He did not see AI as an emergent machine superintelligence and was dismissive of direct analogies to biology. For him, AI was merely software—software running on hardware that his company sold. He imposed this viewpoint in his company-wide addresses, and, as a result, Nvidia’s employees talked about the capabilities of AI with rather less excitement than one might expect. And perhaps, from Huang’s perspective, the neural nets weren’t the real accomplishment; the real accomplishment was the extraordinary amount of computational power he was managing to pack into the data centers. At the 2018 GTC conference, Huang observed that Nvidia GPUs had increased processing speed by twenty-five times in just five years, far outpacing Moore’s Law. He then showed a chart showing how AlexNet, which took a week to train on his 2012 graphics cards, could train on his new computer, the DGX-2, in just eighteen minutes. “There’s a new law going on,” he said, “a supercharged law.” Huang had his own perspective on the data centers, too. These giant computers weren’t an agglomeration of many chips—to him, they were all just one chip working as a unified system on a single problem. In fact, the thousands of GPUs were transferring bits around the data center so quickly they were clogging the networking infrastructure. To fix the problem, Huang had to go shopping. The fastest networking standard in the world was Infiniband, designed by the Israeli firm Mellanox. Eyal Waldman, Mellanox’s founder, was a burly, garrulous, serial -- 194 of 263 -- entrepreneur who shared Huang’s taste for speculative overkill. He had originally developed Infiniband for the niche supercomputing market, spending more to develop the technology than that market promised to return. Like CUDA, Infiniband was a powerful product in search of a customer. But as Huang had found out, the “field of dreams” approach agitated investors. Overinvestment in capacity led to lowered profits, low profits led to low stock prices, low stock prices led to malcontent shareholders, and malcontent shareholders inevitably, like some bat signal of the Wall Street cinematic universe, led to Jeff Smith, the same corporate raider who’d targeted Nvidia in 2013. By 2017, Smith had established himself as one of the world’s better investors, reportedly posting annualized returns above 15 percent. His Starboard Value firm began accumulating shares in underperforming Mellanox and soon owned a tenth of the company. Huang had managed to fend Smith off, but Waldman was not so fortunate—Starboard went Olive Garden on his ass, attempting to eject all eleven sitting members of Mellanox’s board in a single election. Waldman postponed the board meeting in a scramble to get the votes he needed to survive; simultaneously, Smith lobbied Mellanox’s investors to toss him out. An epic proxy fight loomed, but in the end, Waldman caved, and he conceded three board seats. Sitting across the conference table from the goon squad, Waldman then announced he was putting Mellanox up for sale. His timing was good, at least. As cloud computing scaled in the late 2010s, Infiniband moved to supply it. The cables, the switches, the routers, the networking chips—for a Lamborghini of a data center you couldn’t rely on the tired Ethernet standard. Waldman, operating under duress, managed to attract bids from seven different cloud suppliers. In March 2019, Nvidia won the bidding war, edging out Intel with a $7 billion offer. When the deal closed in 2020, Nvidia added three thousand new employees, and Mellanox was rebranded as Nvidia Israel. The deal meant paying off his frenemy Smith, but this scarcely concerned Huang. He was slavering over the data pipes, the missing -- 195 of 263 -- ingredient that transformed his AI factories from a mere brain into something more like an integrated nervous system—an example of corporate synergy that was as authentic as it was uncommon. Smith doubled his money, but subsequent events showed that Huang got the better end of the deal. “It ended up being one of the more profitable corporate acquisitions in recent memory,” board member Mark Stevens said. Starboard, in a later corporate filing, expressed admiration for the man they’d once tried to push around. “Jensen Huang is a real visionary in this sector,” the filing dryly noted. • • • F M, Huang decided to press his luck. In September 2020 he offered $40 billion to purchase the UK-based chip designer ARM. It was the largest semiconductor merger ever proposed—too large, as it turned out. In 2021 US regulators sued to block the transaction, arguing that it would suppress innovation. Not one to fight the feds, Huang scuttled the deal a short time later. (Chinese and UK regulators were equally skeptical. Intel and Nvidia were by this time in heated competition for the data-center market, as was AMD. Allowing any one firm to vertically integrate might have stifled competition from the other two.) Nvidia’s market position allowed it to charge high prices, and the gross profit margin on its AI chips could exceed 90 percent. This ratio attracted competition in the manner that chum attracted sharks. Seeking to reduce their dependency on Nvidia, Google and Tesla had both developed AI- training hardware. Numerous start-ups were pursuing the market as well. One of them was Cerebras, which made a “mega-chip” the size of a dinner plate. “They’re just extorting their customers, and nobody will say it out loud,” Andrew Feldman, the CEO of Cerebras, said of Nvidia. Observers sometimes wondered if Nvidia’s position was sustainable. AMD and Intel both offered open-source alternatives to CUDA, and these alternatives liberated customers from the Nvidia hardware ladder, -- 196 of 263 -- potentially saving them billions of dollars. But few AI researchers used AMD’s or Intel’s products. Why not? I asked this question of a great many people, and I got a great variety of answers. One common misconception was that it was somehow technically difficult to switch away from CUDA. Actually, it was easy—sometimes, all developers had to do was alter a couple of lines of code. Another common misconception was that corporate hardware buyers were playing it safe. (“No one ever got fired for buying Nvidia,” Hans Mosesmann said, echoing an old line about IBM.) But when I actually talked to a hardware buyer, he told me that he constantly experimented with competing products, and he expressed a wistful desire that a cheaper equivalent might someday appear. “This stuff is fucking expensive,” he said of the Nvidia computing stack. The reason that Nvidia succeeded was not that its circuits were better. The reason was that its software was better. Only a small portion of the performance gains now came from the classic strategy of packing more transistors into the chip—Moore’s Law was dead. The rest came from Bill Dally, Ian Buck, and the rest of the Nvidia scientists accelerating matrix multiplications with numerical magic tricks. Nvidia engineers taught the GPU new instructions that acted like speed solvers on a Rubik’s cube. They replaced the processor’s native language with ugly but effective lo-fi data types, akin to switching from calligraphy to shorthand. They trimmed “dead” synapses from the matrices, essentially deleting unproductive information from the neural net in the manner of the forgetting machine in Eternal Sunshine of the Spotless Mind. Between 2012 and 2022, Nvidia achieved a thousand-fold speed-up in single-chip AI inference performance, which was far in excess of anything that Moore’s Law had ever achieved. A mere 2.5× of that speed-up came from transistors; most of the remaining 400× came from Nvidia’s mathematical toolbox. “Honestly, AMD can make silicon just as good as we can,” Arjun Prabhu said. “They just can’t make the calculations go as fast.” On top of this powerful engine, Nvidia built domain-specific tools for specialist programmers. There was “Drive” for automotive research, “BioNeMo” for drug discovery, “Clara” for medical imaging, “Morpheus” -- 197 of 263 -- for cybersecurity, and “Highlights” for capturing kill shots in Fortnite. By the 2020s, Nvidia was offering almost three hundred such tool kits spanning a portfolio of gaming, animation, planetary science, climatology, mathematics, physics, finance, biochemistry, and quantum computing. These software packages were freely available for anyone to use, with zero licensing fees, and Huang pushed them into the hands of scientists like a grandmother distributing second helpings of food. He called the tool kits his “treasures.” Of course, Huang wasn’t any sort of altruist—the long-term play was to use this free software to lock researchers into Nvidia’s hardware-upgrade cycle. In early 2024, an administrator at CalTech’s data center told me that the school’s wait time for delivery on an H100 chip was almost eighteen months. He had encouraged professors at the school to switch to other providers but found few willing to accept. “They’d rather wait for the hardware than switch away from CUDA,” he said. It was all this code that made Nvidia hard to compete against. Upstarts might design a new chip, but that wasn’t enough—Dwight Diercks, Nvidia’s head of software engineering, had ten thousand programmers working for him. “We’re really a software company; that’s the thing people don’t understand,” Diercks said. Like Huang, Diercks never let off the pressure, putting his team into perpetual crunch to deliver the latest must-have features. Working for Diercks was hard, and former Nvidia programmers often mentioned burnout as their reason for departure. Wandering around the spaceships in his plaid shirt and jeans, Diercks looked like a cattle rancher abducted by a UFO, but his incongruous appearance disguised his obsession with deadlines. Nvidia’s software products were not always beautiful or easy to use, and the interfaces for some of the tool kits were a decade out of date. But Diercks was not a person who cared for appearance; he cared only about being first. Whenever some promising new frontier of science opened, he was there at once, distributing his latest collection of software tools. A competitor might later arrive with a more elegant product that was cheaper to run, but by then it was too late—the industry standard had been -- 198 of 263 -- set. For this reason, Bill Dally told me he wasn’t concerned about the open- source competition. “Because we’re flooring it!” he said. “We’re always a couple of generations ahead of them.” • • • N , and employees were accustomed to remote collaboration, so when the COVID-19 pandemic arrived, operations were scarcely affected. Still, things became less fun: the third-floor bar was closed, GTC went virtual, and Huang, like everyone else, was forced to retreat into his house. In July 2020, when Nvidia finally surpassed Intel in valuation, Huang couldn’t celebrate in person. Nor could he brag about his accomplishment to his family—his younger brother, Jim, still worked for Intel. Instead, Jensen took his two dogs for a walk. Perhaps they intuited his accomplishment; he couldn’t be sure. Soon, Nvidia surpassed AMD in valuation as well. Housebound and managing the world’s most valuable semiconductor firm, Huang grew concerned that he was losing touch with the front. Nvidia had grown too large for Huang to really understand everything that was going on, but it was not his style to delegate. To maintain resonance, he needed to keep communication open with the frontline employees. Sometime around 2020, Huang asked everyone at the company to submit a weekly list of the five most important things they were working on. Every Friday from that day forward, he received twenty thousand emails. Brevity was encouraged; Huang would randomly sample from this pool of correspondence late into the night. In turn, he communicated to his staff by writing hundreds of emails per day, often only a few words long. (One executive compared the emails to haiku. Another compared them to ransom notes.) His responsiveness was superhuman. “You’d email him at 2 a.m. and receive a reply at 2:05 a.m.,” Dally said. “Then you’d email him again at 6 a.m. and receive a reply at 6:05 a.m.” The buzz about the transformer architecture filtered up to Huang through these emails. OpenAI was said to be running a job on a Microsoft Azure -- 199 of 263 -- server that had tied up a thousand GPUs over the course of a month while running up a $5 million bill. This was a new language model—a “large” language model, so named because it was about a hundred times bigger than anything yet deployed. Huang realized that he needed a new tool kit just for transformers. He informed Bas Aarts to drag his current recurrent neural-network compiler to the recycle bin and begin work on a transformer compiler at once. Aarts was glad to do it: he found the long-term/short-term memory structures of recurrent neural networks a needless complication. “RNNs were very difficult to program and even more difficult to compile,” he said. “Transformers were a better fit across the board.”[*] Even before the switch, Aarts had begun to witness strange and terrible things at Nvidia. Cutting-edge product demos were available there months and sometimes years before they would be unveiled to the public. In mute awe, Aarts had watched a motion-transfer demo where an AI had edited one of the Mission Impossible movies in real time, replacing Tom Cruise’s face with that of someone standing in the room. “It looked to me—I’m not a video guy, but it looked to me like it was perfect,” Aarts said. “I couldn’t see there was a fake.” Nvidia was fond of theatrical staging. Sometime in 2019, in the middle of the large, empty floor of the Endeavor headquarters, there suddenly appeared a large, black pillar that Aarts described as an “obelisk.” The obelisk housed a terminal running PicassoGAN, an AI that generated images by pitting two neural networks against each other. The responsiveness of the machine amazed Aarts; PicassoGAN could create anything you liked. “You type in a prompt: draw me a man in a landscape, with a river, and trees, and some birds, and a little house in the back,” Aarts said. “Boom. There.” Aarts, like everyone at Nvidia, had signed an NDA that prevented him from discussing such technology with the outside world. This frustrating experience was widespread in Silicon Valley as members of technical staff, sworn to secrecy, could not discuss the miracles unfolding before their eyes. Up until around 2018, AI had flourished in a spirit of open academic -- 200 of 263 -- collaboration. Now the innovations were coming from skunkworks R&D. Cryptic Twitter accounts began to appear, teasing hints of the extraordinary wonders to come. “I saw things where I thought, ‘Wow, I cannot believe this is possible in this age,’ ” Aarts said. “People are oblivious about what is already going on. People have no clue.” But Aarts had as yet seen nothing. S N * The programming environment had changed much in the years since Aarts had written the first CUDA compiler. In the past, neural net programmers had used a software library called Torch, written in the niche programming language Lua. In 2016, Facebook’s AI Research group ported Torch to the programming language Python, which was easy to use and familiar to researchers. The resulting PyTorch framework became the industry standard and made Python the most popular programming language in the world. By the late 2010s the default AI workflow was more or less established: write code in a Python notebook with the PyTorch library installed, compile the code to CUDA, send it to a cluster of Nvidia GPUs for processing, analyze the results back in PyTorch, then repeat until sentience. PyTorch had for a time competed with Google’s TensorFlow framework, but today TensorFlow’s market share has fallen to the low single digits. The consensus among programmers is that PyTorch is easier to use and that community support for PyTorch is more robust. The latter impression is due in great part to the relentless efforts of Piotr Bialecki, a programmer from Germany who has posted almost forty thousand times to the official PyTorch forums since registering his account in 2017. Averaging about fifteen posts a day, Bialecki is especially helpful at getting Python to integrate with CUDA, and the appearance of his friendly face in a comment thread has served as a beacon of hope for many a stumped developer. In 2019, Bialecki joined Nvidia. -- 201 of 263 -- I SEVENTEEN Money n 2020 OpenAI released GPT-3, which was trained on more than a terabyte of text data, the equivalent of a hundred billion words. The specifics of that training data were hidden within a thicket of nondisclosure agreements; later analyses suggested that OpenAI had used an extremely liberal interpretation of the “fair use” doctrine to absorb not just the entirety of English-language Wikipedia but also a comprehensive scraping of copyrighted web links, including archives of The New York Times dating back to 1851. In addition to the self-published vampire-romance fiction, the training set also included a mysterious second curriculum of texts labeled “Books2.” Many speculated that “Books2” was derived from LibGen, a shadow collection of four million cracked ebooks that had circulated on peer-to-peer file-sharing sites for years. (Jonathan Franzen, John Grisham, Jodi Picoult, and George R. R. Martin were among the bestselling authors who would later sue OpenAI over similarities between their work and GPT output, as would the Times.) The model was then “fine-tuned” with human input to scrub some of the more objectionable responses. GPT-3 stunned technologists with its many emergent capabilities, including the ability to solve logic puzzles and write workable computer code. Still, it did not immediately set the atmosphere on fire and was largely ignored by the public. Not until late 2022, when Sutskever and his team released “ChatGPT,” a chatbot for interacting with the latest OpenAI models, did the world take notice. Details about internal workings of these models were confidential; by this time, Microsoft had invested at least $10 -- 202 of 263 -- billion in OpenAI’s capped-profit subsidiary and was loath to release proprietary data to competitors. What can be said is that ChatGPT was fine- tuned for human conversation, using not just internet text but also transcripts of YouTube videos and data from licensed third-party sources— and that conservative estimates speculated the model had at least a trillion parameters. Translate that to synapses, and you were approaching the brain of a cat. ChatGPT launched as a beta-test on November 30, 2022, with no marketing and no subscription tier. The portal to world domination was a bland grayscale website with a blinking cursor into which the user might enter any command. Really, any command. ChatGPT could write poetry! And not just doggerel, but sonnets, limericks, and sestinas. It could write screenplays and essays and functional computer code. It could write short stories and letters to the editor, and it gave good parenting advice. In five days, more than a million people signed up to test it. By January 2023, ChatGPT had one hundred million active monthly users. In March 2023, OpenAI unveiled GPT-4 through its online portal. Looking to quantify its creation’s intelligence, OpenAI subjected the model to a battery of academic tests. GPT-4 passed the bar exam; it scored 5’s on the Art History, US History, US Government, Biology, and Statistics AP exams; it scored in the 99th percentile on the Verbal component of the GRE; it scored in the 92nd percentile on the introductory sommelier exam. The researchers attached a visual-recognition layer to the neural net and found that it could not only perfectly describe images but also recognize complex visual jokes. In one, the researchers fed GPT-4 an image of a clunky computer cable from the 1990s connected to an iPhone, then asked GPT-4 to explain what it was looking at. “The humor in this image comes from the absurdity of plugging a large, outdated VGA connector into a small, modern smartphone charging port,” the model responded. Later, a social media user showed how GPT-4 could create a website from a sketch on a napkin. Around this time, I began to fear for my job. I once asked ChatGPT to make me cry; it returned a story about a pair of songbirds, one of whom dies by running into a glass window and the other of whom forever guards -- 203 of 263 -- their empty nest. I once asked ChatGPT to make me laugh; it asked me to envision a man in an adjacent car picking his nose. I did not, however, use ChatGPT to write this book—I was too afraid. My experience was broadly mirrored by users around the world. Students realized they could use it to write essays, and homework was forever obsolete. Lawyers used it to summarize legal briefs; job applicants used it for cover letters; I used it to request that the town council install a traffic sign. It was magic. It was really magic. “That first-time experience is what hooked people,” Ilya Sutskever said. “The first time you use it, I think it’s almost a spiritual experience. You go, ‘Oh my God, this computer seems to understand.’ ” • • • OAI than $100 million to train GPT-4, with much of the money making its way to Nvidia through Microsoft. Although GPT-3 was essentially a single giant neural network, GPT-4 used a “mixture of experts” model, featuring many neural networks assigned to different tasks. One “expert” might focus on safety, blocking users from asking GPT-4 how to make bombs or dispose of corpses; another might focus on writing computer code; a third would concentrate on emotional valence. (OpenAI declined to comment on GPT-4’s construction.) The “inference” process of extracting knowledge from GPT-4 could easily exceed half of the initial training costs and had to be provided to customers on an ongoing basis. Estimates varied, but one informed analysis put the cost of inference at roughly a quarter of a cent per word. At that rate, it would cost GPT-4 about $10 to write a five-thousand-word college term paper—a bargain when compared to hiring an unemployed graduate student to do it and certainly a better solution than doing the work yourself. To defray the inference costs, OpenAI began charging $20 per month to access GPT-4. By March 2023, the product was approaching two million subscribers. -- 204 of 263 -- The synthesis of the transformer architecture with hyperscale parallel computing resulted in a Cambrian explosion of AI services. Microsoft built Copilot, an autocomplete tool for computer code that programmers found indispensable. (So successful was the service that Huang predicted coding would soon be superseded by natural-language descriptions. “The programming language of the future is human,” he said.) DeepMind built AlphaFold2, an AI that predicted the three-dimensional structure of proteins from one-dimensional amino-acid building blocks. With this, the age of “programmable biology” drew closer, in which the four elemental nucleotide bases of RNA could be made to act like the 0s and 1s of computer binary. A profusion of “generative” AI products arose, promising to revolutionize creative industries by synthesizing content on command. OpenAI’s DALL-E, or competing Midjourney and Stable Diffusion, could in a minute or two create artwork of any description in the style of any artist. Start-ups Udio and Suno offered competing music-generation applications that could generate songs in any genre. Jasper could be used to create effective marketing campaigns in seconds. OpenAI’s Sora, revealed in 2024, promised the opportunity to create real-time video of any description. The consumer-facing products were splashy, but many close to AI believed that the real progress was in product lines that were “invisible by default.” Industrial adoption of AI meant more efficient power grids, faster airline scheduling, improved delivery speeds, and countless other incremental wins. Each improvement, in isolation, was absorbed into back- end infrastructure with little notice outside specialist circles, but the aggregate effect was a massive and ongoing upgrade to global productivity. There was also progress in what was possibly the last frontier of human endeavor: using AI to accelerate AI. In 2022, DeepMind unveiled a neural net intended to speed up matrix multiplication. Two years later, Nvidia introduced a software package that used generative AI to design the patterns on silicon microchips. Such self-augmentation remained within human control—for now. -- 205 of 263 -- Sinister applications also began to appear. Eerie AI voice-cloning technology was used to satirize Donald Trump and Joe Biden as quarreling Overwatch teammates. Bizarre algorithmically generated content began to flood social media. “Deep-fake” pornography targeting female students circulated in American high schools. Voice-cloned telemarketers began harassing the public; some scammers created the illusion of family members held for ransom. In Hong Kong, police reported that a finance worker at an architectural firm was duped out of $25 million in corporate funds after his colleagues in a conference call all turned out to be deep-fake clones. The only solution to these problems, technologists advised, was more AI. The large tech firms reoriented their entire business lines toward AI. Microsoft, Meta, Tesla, and Google all announced multibillion-dollar spending initiatives. The automakers promised autonomous vehicles; the defense industry promised autonomous weapons. AI start-ups raised a cumulative $50 billion in 2023. The money went to medicine, to education, to business platforms, to robotics start-ups. The stock market moved sharply higher in anticipation of gains in productivity to come. Prize committees started distributing hardware to the AI pioneers. Demis Hassabis of DeepMind was given the 2024 Nobel Prize in Chemistry for his work on the protein-folding problem. Geoffrey Hinton was simultaneously awarded the 2024 Nobel Prize in Physics. That same year, Jensen Huang was elected to the National Academy of Engineering, an honor many felt was overdue. And all of this—all of this money, all of this talent, all of this innovation —would pass through a single corporate siphon. All of it would go through Nvidia. • • • I E H’ The Sun Also Rises, Mike Campbell, a drunk and penniless war veteran, is asked how he went bankrupt. “Two ways,” he responds. “Gradually and then suddenly.” The same phrase might be used to -- 206 of 263 -- describe how Jensen Huang got rich. Small changes that had accumulated over two decades of investment were now paying extraordinary dividends. At the beginning of 2023, Huang informed investors that Nvidia had sold GPU supercomputing clusters to fifty of America’s hundred largest companies. The other fifty, he continued, were leasing Nvidia’s infrastructure through the cloud. In 2023, he said, data-center revenues had surpassed gaming revenues for the first time; he expected those revenues to double once again in the coming year. Nvidia, which controlled close to 90 percent of the AI chip market, became an object of fervent interest on Wall Street, with CNBC featuring a countdown to its quarterly earnings announcement. Expectations, for once, could not match reality. When the Nasdaq opened on May 25, 2023, Nvidia’s value increased by about $200 billion. By the close of trading, Nvidia was the sixth-most-valuable corporation on Earth, worth more than Walmart and ExxonMobil combined. “There’s a war going on out there in A.I., and Nvidia is the only arms dealer,” one Wall Street analyst said. Ten years earlier, Huang had been pleading with Fidelity for his job. Now, at sixty, he was the planet’s most celebrated monopolist. In Taiwan the noodle shops he visited put his picture on the menu; his iconic leather jackets were featured in the Style section of the Times. In addition to the homes in Los Altos and Maui, he was connected, via a shell corporation, to a seven-bedroom, $38 million mansion on San Francisco’s Gold Coast, the redoubt of billionaires. Featuring a library, a gym, and a movie theater, it was one of the city’s most expensive homes. To the sanctified roster of Jobs, Bezos, Gates, and Zuckerberg, one could now add the name of Huang. His chips were so valuable that they were used as loan collateral, and demand for them brought powerful men to their knees. In late 2023, Huang met with Elon Musk and Oracle cofounder Larry Ellison for sushi at the Nobu restaurant in Palo Alto. Ellison and Musk had a collective net worth above $300 billion, yet in this rarefied company, Huang was demonstrably the alpha—the other two spent the entire meal pleading for Huang to ship them more AI chips. “Elon and I were begging, I guess is the best way to describe it,” Ellison recalled. “An hour of sushi and begging.” -- 207 of 263 -- The burgeoning value of Nvidia’s shares spread great wealth to loyalists. Tench Coxe and Mark Stevens had both served on the board since 1993, stewarding Huang through the lean years and defending him when Starboard Value attempted to commandeer the ship. By 2024, each man owned a stake in Nvidia worth more than $4 billion. Coxe had staked hundreds of companies; when I asked him if Nvidia was his best investment, he looked at me as if I had just asked the stupidest question he’d ever heard. “Uh, yeah,” he said. (It later occurred to me that a founder’s stake in the world’s most valuable company was by definition the best possible investment you can make.) The stake of longtime board member Harvey Jones was worth more than $1 billion. Brooke Seawell, the chair of Nvidia’s audit committee, had about $700 million in Nvidia shares. Jim Gaither, Huang’s first lawyer, had a mere $500 million, having not joined the board until 1998. Colette Kress, Nvidia’s CFO, had around $800 million; Tim Teter, Nvidia’s general counsel, had more than $400 million; Deb Shoquist, Nvidia’s operations wizard, had more than $300 million. Due to Nvidia’s decentralized management structure, old-timers like Dwight Diercks and Bill Dally were not considered corporate officers and were therefore not required to disclose their holdings—but I sensed they were doing OK. David Kirk, whom Huang had offered an irrefusable equity package way back in 1997, had retired in 2018 but told me that he’d held on to a portion of his shares. “I have a lot more money than I ever thought I would,” he said. The most opaque Nvidia fortune was that of cofounder Chris Malachowsky, whose name, I noticed, was not in the Forbes list of billionaires. In his last required disclosure, way back in 2001, Malachowsky had owned about 5 percent of the company, an amount that would be worth more than $100 billion today. Much had changed since, but when I asked him in mid-2023 if he had cashed out, he responded, “Not much.”[*] Like many firms, Nvidia allowed employees to purchase stock at a discount to market prices. What set Nvidia’s program apart was that employees were allowed to purchase stock at a discount to the lowest price at any point in the last two years. These purchases were capped at a certain -- 208 of 263 -- dollar amount, but as the stock went vertical, the program basically turned into free money, and those who maxed out their contributions each year made the trade of a lifetime. With the windfall extending deep into middle management, some newer employees expressed concerns that the nouveau- riche veterans were entering a state of “semiretirement.” Executives disputed this characterization. Jeff Fisher, who ran the company’s gaming side, had been among the first thirty employees. “Many of us are financial volunteers at this point,” he said, “but we believe in the mission.” The lure of developing this revolutionary technology offered purpose beyond what money could buy. This was especially true of the old guard, who’d spent years explaining to baffled peers why they were working for a gaming company and who constantly had to correct the pronunciation of the firm’s name. AI had not been a consideration for these veterans, and they were as surprised to be working on it as anybody. “There was no way me, or anybody else, could have dreamed at the time that this stuff that science fiction writers might come up with has become a reality,” said Jay Puri, Nvidia’s head of sales, who started work at the company in 2005. The value of Puri’s shares exceeded $700 million by 2024, but he felt that the interesting work at Nvidia was only beginning. “Maybe I’m biased, but I think it really is the most important technology company of our time,” he said. S N * The share valuations in this chapter were calculated using Nvidia’s December 5, 2024, closing stock price of $145.14 and include money held in family trusts. -- 209 of 263 -- I EIGHTEEN Spaceships n late 2023, I visited Nvidia’s offices for the first time. Voyager, the larger triangle of Nvidia’s campus, was now open. The place is awesome. In the middle of 750,000 square feet of cavernous, naturally lit office space stands a three-story matte-black “mountain.” Climb the staircases, and you pass a series of vertical garden walls where species of stonecrop, ivy, and fern are arranged in undulating tones of green. At the summit is an illuminated marble bar surrounded by open-air, natural-wood pergolas. Stroll, with your drink, to the rim of the mountain, and you look out upon thousands of open- air cubicles, interrupted here and there by sterile white pillars that stretch a hundred feet up to support the lofted ceiling above. The building interiors were immaculate; I imagined firing the gun from Portal at the walls. As I later learned, Nvidia tracks employees throughout the building with video cameras and AI. If an employee eats a meal at a conference table, the AI will dispatch a janitor within an hour to clean up after him. A human janitor, for now. I could scarcely believe how excited the employees were to work in this panopticon. Huang was present in the building, and a buzz went through the cubicles whenever he crossed the floor. He was a striking figure, smaller than I imagined, alone in black in this white expanse, no entourage, just him. People spoke of him as “Jensen,” only as “Jensen,” and in the company Slack channels, they’d built a menu of custom Jensen emojis that they used to react to various pieces of positive news—like, for example, -- 210 of 263 -- Nvidia crossing $3 trillion in market capitalization to become the single most valuable firm in the world, as it did a short while after my visit. Nvidia’s employees roamed these twin spaceships with the glow of the beatified. They were earnest, giddy, nerdy, and unpretentious, although their command of English was sometimes a little iffy. One entry-level hire I spoke with in the cafeteria had worked for the company for two months. She was twenty-five and had recently arrived from China, where she had earned a master’s degree in computer science. “I wake up every day so happy to go to work,” she said. She gestured around Voyager’s interior, her petite frame burdened by the backpack Nvidia issued to new hires. “Look at this place. Everyone here is so smart. I can’t believe I’m in this place!” The only people I saw at Nvidia who didn’t look happy were the quality- control technicians, who labored inside the mountain, like dwarves. Stuck in windowless laboratories and surrounded by industrial metal shelving, pallid young men wearing earplugs and T-shirts pushed Nvidia’s microchips to the brink of failure. The racket was unbearable, a constant whine of high- pitched fans trying to cool the overheating parallel circuits. On these parallel circuits Nvidia was building a parallel reality. • • • S , Nvidia’s graphics cards have featured “ray-tracing,” which simulates the way that light bounces off objects to create photorealistic effects. Inside of a triangle of frosted glass in Nvidia’s executive suite, a product-demo specialist showed me a three-dimensional rendering of a gleaming Japanese ramen shop. Light reflected off the metal counter, and steam rose from a bubbling pot of broth. There was nothing, as far as I could tell, to indicate the place wasn’t real. The specialist next showed me “Diane,” a hyperrealistic digital avatar that spoke five languages. A powerful generative AI had studied millions of videos of real people to create this composite entity. Diane was stunning, but it was the imperfections that really got me—she had blackheads on her nose and trace hairs on her upper lip. The only clue that Diane wasn’t truly -- 211 of 263 -- human was the uncanny shimmer in the whites of her eyes. “We’re working on that,” the product specialist said. One of Huang’s goals is to unify Nvidia’s computer-graphics research with its generative-AI research. Image-generation AIs will soon reach a level of sophistication where they are able to render three-dimensional, inhabitable worlds and populate them with realistic-seeming people. At the same time, language-processing AIs will be able to process and interpret voice commands in real time. Once the technologies are united, users will be able to speak whole universes into existence with a few simple voice commands.[*] I felt dizzy leaving the product demo. I thought of science fiction; I thought of the Book of Genesis. I sat on a triangular couch with the corners trimmed and labored to imagine the future my daughter would inhabit. They were building the Manhattan Project of computer science, but when I questioned Nvidia executives about the wisdom of unleashing such power, they looked at me like I was questioning the utility of the washing machine. I wondered if an AI might someday kill someone. “Eh, electricity kills people every year,” Bryan Catanzaro said. I wondered if it might eliminate art. “It will make art better!” Dwight Diercks said. “It will make you much better at your job.” I wondered if someday soon an AI might become self- aware. “In order for you to be a creature, you have to be conscious. You have to have some knowledge of self, right?” Huang said. “So no. I don’t know where that could happen.” I pressed Huang on this point in every interview I conducted with him. He gave me variations on the same answer every time. I brought up Geoffrey Hinton’s worries. (“Humanity is just a passing phase in the evolution of intelligence,” Hinton had said in a PBS interview.) Huang scoffed: “A lot of researchers don’t understand why he’s saying that. Maybe it’s bringing attention to his own work.” I was stunned by this statement. Hinton was the most farsighted researcher in AI history, and Nvidia’s financial success could be traced directly to work done in his lab—a fact that Huang had acknowledged many times. This wasn’t some guy on the -- 212 of 263 -- street waving a sign; this was the greatest mind in AI, the direct descendant of George Boole, telling us we should be very, very worried. Huang was coldly dismissive. “Look, you buy a hot dog, so the machine recommends you ketchup and mustard,” Huang said. “Is that the end of humanity?” He spoke of society’s rapid acclimation to cars, alarm clocks, and mobile phones, and he told me we would grow similarly accustomed to an autonomous droid vacuuming the carpet. “The robot’s not doing anything strange,” he said. “Like I said, all it’s doing is processing data. If you understand how it works, that world doesn’t look weird at all.” But I kept pressing, and Huang finally grew annoyed. “I’m so tired of this question,” he said. “All this theorizing about something that there’s no evidence for.” • • • D using a technique called “deep learning super sampling,” which first relied on ray-tracing to illuminate the scene, then used a neural net to draw the details. In the product demo, only one out of every eight pixels was the product of the ray-tracing algorithm; the rest had been inferred by AI. David Kirk had considered ray-tracing the last frontier. Once you had the ability to accurately depict the cascade of light in a scene, that was the end —you’d collapsed reality and simulation into a single frame. In movies, ray-traced CGI became indistinguishable from what the camera had captured—almost every major film now used the technique. “It no longer makes sense to talk about fact and fiction on the screen,” Kirk said. “Everything is fiction.” But Catanzaro, having returned to Nvidia to manage deep-learning applications, saw a realm beyond. Termed “neural graphics,” this new technology would no longer even attempt to simulate physics. Instead, it would use AI to “paint” realistic scenes in real time. Rather than using optical physics to represent reality, neural graphics hacked the human perception of light. -- 213 of 263 -- Catanzaro’s journey back to Nvidia had been a little rocky. Around the age of forty, he finally accepted that he was gay. He separated from his wife and was kicked out of the Mormon church choir. With his expressive outfits, his religious speculations, and his interest in literature and music, Catanzaro continued to stick out at Nvidia. Not that he was really all that eccentric or anything—at Google, he would have been regarded as a normie. But Nvidia wasn’t the kind of place where people experimented with ayahuasca or polycules. In the hundreds of hours of interviews I conducted with Nvidia employees, nobody, not once, had ever mentioned Burning Man. Catanzaro alone was willing to discuss the implications of the technology he was building. Browsing social media, I had come across a 1964 clip of the science-fiction author Arthur C. Clarke speaking to the BBC. Even before cowriting the screenplay for 2001, Clarke had entertained the possibility that machines might someday learn faster than humans: The most intelligent inhabitants of that future world won’t be men or monkeys. They’ll be machines—the remote descendants of today’s computers. Now the present-day electronic brains are complete morons, but this will not be true in another generation. They will start to think, and eventually they will completely outthink their makers. Is this depressing? I don’t see why it should be. We superseded the Cro-Magnon and Neanderthal men, and we presume we’re an improvement. I think we should regard it as a privilege to be stepping stones to higher things. I suspect that organic, or biological, evolution has about come to its end, and we’re now at the beginning of inorganic, or mechanical, evolution, which will be thousands of times swifter. This was the most optimistic interpretation of what Nvidia had achieved —not merely a business or technological success, but a new phase of -- 214 of 263 -- evolution. The next time I spoke with Catanzaro, I showed him Clarke’s clip, and he responded with enthusiasm. His posture grew erect, and he began to once again run his fingers through his hair. “So the conservative position is that Prometheus deserved to be chained to a rock and have his liver eaten because he gave mankind fire, and men don’t deserve to have fire,” he said. “The progressive position is that the world can be better, it should be better, and it’s our responsibility to make it better. When I look around at problems that we face as humanity, I believe we need more intelligence.” Surely, I countered, a superior intelligence could be dangerous. Our own species, through agriculture, animal husbandry, mineral extraction, and urbanization, had transformed the surface of the planet, decimating or even eliminating all competing species and leaving only a handful of protected habitats unscathed. I asked Catanzaro if AI might do the same to us. “I feel like we get stuck in science fiction perspectives a little bit too often,” Catanzaro said. He leaned back in his chair, and I got a good look at the large owl embroidered on the front of his sweater. “AI isn’t going to be interested in zero-sum games with us because there’s so much more to do in this universe. For example, if an artificial intelligence is trying to build a huge data center—it doesn’t want to put it where the humans live. It wants to put it somewhere else, maybe underground. Do you know how much space there is underground?” Catanzaro was uncorked now—I sensed that he didn’t often get to share this perspective at his job. “It doesn’t need to inhabit this biosphere. In fact, it doesn’t need to be on the Earth, either, because the thing about artificial intelligence is that it travels at the speed of light. Humans, you know, we actually have to lug bodies around. Artificial intelligence can move along a radio signal as long as there’s an antenna on the other side.” Free of the limitations of biology, Catanzaro explained, AI would rapidly spread throughout the solar system and beyond. “Humans are naturally confrontational—like, we’re territorial animals, and it’s built into our limbic system to defend our turf,” he said. “AI, if it’s truly intelligent, the things that it’s interested in are so much bigger than the little thin crust of Earth -- 215 of 263 -- that the humans live on. I don’t think that it’s going to be interested in taking that from us. Rather, I feel like AI is going to want to take care of us.” • • • S for a space computer was an experience to be savored at leisure. Today, though, Nvidia remains earthbound, and Catanzaro’s discussion of where to put the data centers was not just idle speculation—it was borne of practical concerns. The whirring fans I’d seen inside the mountain suggested the intense demand for electricity that Nvidia’s equipment generated. GPUs had unclogged the bottleneck of calculation speed, and Mellanox’s Infiniband protocol had unclogged the bottleneck of data throughput. The remaining bottleneck was simply how much electricity a data center could consume. The remaining limit was power. S N * Google’s Genie 2, released in December 2024, can create an explorable 3D world from nothing more than a picture. -- 216 of 263 -- A NINETEEN Power s the utility technician neared the transmission tower, the wire emitted an audible hum. More than two hundred thousand volts of electricity coursed through it. The worker was suspended in a bucket, attached to a crane, nearly a hundred feet in the air. He wore a tool belt, a fire-retardant shirt, and insulated gloves. Using a specialized tool called a “hot stick,” he reached in to detach the high-voltage wire from the crossarm that held it. He had to be careful; an errant move at this stage could mean immediate electrocution. The line uncoupled with a crackle, and the aroma of ozone filled the air. One down, several thousand to go. The lineman worked for Dominion Energy, the electric utility that served Loudoun County, Virginia. He was upgrading the infrastructure that powered the world’s largest collection of data centers. Loudoun was the nerve center for US retail-computing demand: if you conducted a Google search in Manhattan, chances were good that your query was routed through Virginia. The electrical infrastructure needed to supply these giant computers had taken Dominion more than a century to build. With the arrival of AI, Dominion was projected to double that infrastructure in less than fifteen years. This was perilous work in the best of circumstances, but the data centers were so demanding that in the summer of 2024, Dominion determined that the wires would have to be replaced while they were still live. The risks were considerable, but the safety of humans seemed a secondary consideration. Under no circumstances was the power to the data -- 217 of 263 -- centers ever to be cut. Demand was too frenzied. People just wanted AI so bad. The demand did not come from training new neural nets—mostly, it came from deploying them. Inference had once been a task so simple it could fit on a floppy disk, but modern users wanted more from AIs than backgammon moves. They wanted term papers; they wanted search; they wanted voice generators to make memes. OpenAI’s GPT products had more than 180 million daily users. Google was bundling experimental AI answers along with conventional search results. TikTok reels were narrated by an expanding cast of AI voice avatars. Powering each service, in a rack in a data center in some anonymous warehouse on the outskirts of a major population center, you would find an Nvidia GPU. These GPUs used a lot of juice. A standard Google search required about a third of a watt-hour’s worth of electricity. With generative AI enabled, the same Google search required ten times that, which was enough to power a light bulb for about twenty minutes. Ask GPT to write you a five-thousand- word term paper, and you used enough energy to run a microwave for an hour. Industrial demand was greater; executives were excited about the prospect of replacing human labor entirely. In 2022 James Earl Jones announced that he was tired of performing the voice of Darth Vader, so he licensed his voice to Disney to be cloned in perpetuity. (Jones died in 2024, but Vader’s menacing tone and labored breathing may prove immortal.) Image-recognition tools promised to save billions of dollars a year in MRI analysis costs, putting most radiologists out of a job. Amazon used AI to determine whether the strawberries it was delivering were bruised, and farm owners were experimenting with AI robots to pick the fruit they shipped. One analysis predicted that meeting the needs of the generative AI boom might require doubling US nuclear plant capacity in under ten years. Even conservative estimates projected a 20 percent increase in required total demand. There was no realistic way to supply the Nvidia GPUs with the electricity they needed while simultaneously hitting carbon-neutrality targets. Dominion, in addition to upgrading the high-voltage lines, was discussing reviving mothballed coal-burning facilities. -- 218 of 263 -- Nvidia was not in denial about the problem of global warming. Researchers there were quick to point out that climate modeling was one of the first uses for supercomputers. Syukuro Manabe, the scientist who won the 2021 Nobel Prize in Physics for showing that trace amounts of carbon dioxide would trap heat in the atmosphere, had arrived at this conclusion in the late 1960s after fashioning a primitive simulation of Earth with an IBM computer that weighed seventy tons and drew as much power as ten city blocks. Using exponentially more powerful computers in the 1980s, NASA scientists had correctly predicted a coming rise in Earth’s average temperature of several degrees Fahrenheit, even though the empirical trend at the time had looked flat. These simulations also predicted that as the planet warmed, the upper atmosphere would cool down, causing atmospheric layers to pancake as heat was trapped near the surface. By the 2020s, satellite data showed that the sky had indeed collapsed. Almost all of what we understand about climate change is the product of powerful, energy-hungry supercomputers. Scientists had been running climate models on Nvidia hardware since the late 2000s, and Nvidia had its own climate division, which conducted extraordinarily sensitive forecasting. (Nvidia’s “Earth-2” simulation promised to predict the wind speed on an individual city block.) Bill Dally, Nvidia’s chief scientist, worried a lot about climate change—as a frequent visitor to both the snow- crowned Sierras and the storm-tossed Caribbean, he’d personally witnessed its manifold effects. One of Dally’s reasons for working on AI was that he believed it would move humanity toward a carbon-neutral future. “If there’s one problem that I think everybody in the world ought to be staying up at night about, it’s climate change,” he had said. “The survivability of our planet needs to be a first-order concern.” But despite such talk, Nvidia’s chips were almost single-handedly responsible for a global surge in electricity demand. As the chips got better at matrix multiplication, so too did they become hungry for power. The 2020 A100 chip required 250 watts in standard configurations. The H100, released two years later, required 350 watts, 75 percent more. The B100, released in 2024, doubled demand to 700 watts, and the forthcoming B200 -- 219 of 263 -- would require 1,000. The original DGX box from 2016 used as much power as a clothes dryer. Its 2024 successor used enough electricity to power a single-family home. And this was but a single brick in the fortress wall. The DGX boxes were modular units that customers stacked into the long “superpods” that formed the spine of the modern data center. The superpods were wired with thick bundles of cable and fitted with advanced air- and liquid-cooling systems to keep from overheating. The biggest of the hyperscale data centers measured their annual power requirements in gigawatts—more than the output of a nuclear reactor, enough to power Minneapolis. Continuously upgrading Nvidia chips in its data centers had contributed to a 50 percent increase in Google’s greenhouse-gas emissions in five years, even accounting for the efficiency gains in the power grid and despite the company’s commitment to “net zero” by 2030. (In 2024 a company spokesperson affirmed Google’s commitment to carbon neutrality while admitting that the AI power draw presented a “challenge.”) At Microsoft, whose cloud-computing division trained and deployed the expensive OpenAI models, emissions went up by almost a third. Cloud providers were even trying to buy disused crypto-mining operations. Some investors questioned the wisdom of this gigantic build-out. The path to profits from AI was not a direct one, and the high costs of both the silicon and the electricity were compounded by a shortage of talent. Snake charmers who could consistently tease out results from the fickle AI architecture were rare, and they commanded serious coin. Not every company was OpenAI, and not every company had Ilya Sutskever working for it; the frustrations that Fredrik Dahl had experienced trying to get a neural net to play poker were now replicated at IT departments around the world. Despite the increase in computational power, neural nets still hit plateaus, and when they did, there was still often no obvious way to make them better. Firms that attempted to replicate GPT with in-house data often produced shambolic “knowledge engines” that were little better than toys. These AIs supplemented the standard large-language-model training set -- 220 of 263 -- with emails, mission statements, patent applications, legal memoranda, and other exciting selections from the internal corporate syllabus. As the buzz percolated through middle management, executives at marketing, media, and health-care firms launched ambitious initiatives, sometimes openly telling staff that many employees would be laid off once the neural nets were working. But much of what was produced was vaporware: late, expensive, and barely functional. Many users felt that AI technology simply wasn’t ready. In summer 2024, Elliott Management, one of the world’s largest hedge funds, told investors that tech stocks, especially Nvidia, were in “bubble land.” The letter to clients, quoted in the Financial Times, called AI “overhyped, with many applications not ready for prime time.” Many of the promised applications were “never going to be cost-efficient, are never going to actually work right, will take up too much energy, or will prove to be untrustworthy.” Disappointment, not competition, was the biggest risk to Nvidia’s empire. FOMO was driving a lot of activity—no company could afford to be the one without an AI strategy. The executives buying Nvidia hardware needed to build products that justified the scale of investment. If they couldn’t, remorse would set in, demand would slacken, and Nvidia’s stock would crater. The money managers at Elliott believed that if Nvidia were to experience even a single bad quarter, the entire tech sector would deflate like a hissing tire. AI investors were exuberant, certainly, but were they irrational? Skeptics had compared the AI craze to the dot-com boom or the mortgage crisis, but the executives in charge of AI spending were not hucksters. They were seasoned computer geniuses with decades of experience. The businesses involved were stable, and the funding came from reserves of accumulated profit, not dicey share issuances or subprime debt. Every academic computer scientist I spoke with, without exception, saw the triumph of neural networks as a civilizational advancement. Many thought it was the most important discovery in the history of the field. The technology was phenomenal, and the earnings that it had generated so far were real. -- 221 of 263 -- The problem that investors had once faced valuing Nvidia alone now applied to valuing the entire stock market. Close readings of past financial statements would not tell you if the AI gamble was smart, and projections of future earnings had as little bearing on reality as the leaves at the bottom of a teacup. The salient question was whether Huang knew what he was doing—whether Huang, Zuckerberg, Musk, Nadella, Pichai, and Altman knew what they were doing at all. They had allocated hundreds of billions into this unproven technology. Was this smart? Were they smart or not? -- 222 of 263 -- J TWENTY The Most Important Stock on Earth udgment Day arrived on February 21, 2024. The hype leading up to the earnings announcement was absurd. CNBC once again had run a countdown for days leading up to the event. The niche social media community known as “FinTwit” circulated a meme of Huang, in his signature leather, standing in front of an electoral map of the United States, with all fifty states colored slime green. In the days before the reveal, Goldman Sachs had called Nvidia “the most important stock on earth.” Nvidia’s conference call began fifteen minutes after the close of trading. It was one of the most listened-to earnings reports in Wall Street history. Colette Kress, Nvidia’s CFO, spoke first. She announced that annual revenues were far better than expected, more than doubling to $60 billion. Not only that, but Nvidia had realized a greater than 70 percent gross margin across all products. (Apple, Wall Street’s favorite money-printing machine, earned a gross margin of 46 percent.) Net income for the year, Kress announced, was just under $30 billion—more than Nvidia had earned in the previous thirty years combined. With thirty thousand employees, every worker at Nvidia was now generating $1 million per year in profit. Huang spoke next. He sounded like Thomas Edison. He described how neural nets could now understand and generate not just human language but any information, including elements of biology and the three-dimensional world. He described a forthcoming “trillion-dollar infrastructure cycle” of building AI factories. “We are now at the beginning of a new industry, where AI data centers process massive raw data to refine it into digital -- 223 of 263 -- intelligence,” he said. “Like AC power-generation plants of the last industrial revolution, Nvidia AI supercomputers are essentially AI- generation factories of this industrial revolution.” Nothing could have prepared traders for the frenzy the following day. On February 22, in six and a half hours of trading, Nvidia gained $277 billion in market value, adding more than the entire value of the Coca-Cola Company to its market capitalization. Traders exchanged at least $65 billion worth of Nvidia stock that day, accounting for nearly a fifth of all US stock market activity. By market close, the three most valuable companies on Earth were Microsoft, Apple—and Nvidia. It was the largest single-day accumulation of wealth for any company in Wall Street history. This buildup put unbelievable pressure on Huang. Most of Nvidia’s value was not based on what the company had done but on what investors expected it might do in the future, and that meant continuing to innovate and deliver at a rapid pace in an expanding field of motivated competitors. Watching him pace the stage at his conferences and investor presentations, I recognized the same thing that Curtis Priem had twenty-five years before: Huang was all alone. Management professors theorized that a chief executive should ideally have between eight and twelve direct reports. Huang now had fifty-five. He had no right-hand man or woman, no majordomo, no second-in-command. Huang also had no designated successor, and as Nvidia grew, its C-suite actually shrank, meaning that there was no scapegoat for mistakes. Board members spoke of his irreplaceability; it was not an exaggeration to suggest that Huang had personally saved the American economy from recession. The US stock market, over the course of Nvidia’s rise, had pulled away from markets in Europe and Asia. Almost all of that outperformance was attributable to AI. Nvidia’s $3 trillion market capitalization purportedly represented the expected value of the company’s future earnings—but it was really a giant, GDP-sized bet on the capabilities of this single sixty- one-year-old man. • • • -- 224 of 263 -- I on the website Glassdoor ranked Nvidia the best place to work in the United States. Employees were influenced by the stock price, of course, but even in the gaming era the company had scored well. This was a little surprising, for while working at Nvidia was stimulating, it was never exactly fun; the corporate culture that Huang fostered was closer to Microsoft than Google, closer to IBM than Apple. But years earlier, Chiu, the Taiwanese physicist, had told Huang that he’d allowed him to do his “life’s work.” The phrase had stuck with Huang, and now he wanted to offer that same opportunity to his staff. “We want NVIDIA to be a place where people can build their careers over their lifetime,” the company wrote in its annual report. “Our employees tend to come and stay.” The appeal lay in what Nvidia allowed you to achieve. It was not a secret that Huang pushed people hard. Thus, he attracted determined workaholics seeking to establish legacies as inventors. In the same way that a bestselling author didn’t stop writing, even many wealthy Nvidia engineers kept showing up to work each day to attack difficult technical problems. Those engineers collectively held more than fifteen thousand patents, but there was always something left to build. Success allowed Nvidia to be discriminating in who it hired. Every job posting now received several hundred applicants. Ambitious strivers looking to contribute to the AI revolution often considered it their top choice. The workforce was “diverse,” sort of—I would guess, based on a visual survey of the cafeteria at lunchtime, that about a third of the staff was South Asian, a third was East Asian, and a third was white. The line for Indian food was the longest. Nvidia didn’t share official demographic statistics with me, but photographs and interviews suggested that, even well into the 2000s, most of the company’s employees had been white American men. The transition to a mostly Asian workforce was part of a broader pattern in Silicon Valley, but it also reflected Huang’s shift of focus. “You know, computer graphics was not considered to be a very glamorous field,” one longtime executive told me. “There was something a little disreputable about it.” -- 225 of 263 -- But AI brought respectability. Instead of long-haired Americans in cargo shorts, graduates of the best technical universities in India, China, and Taiwan were now applying to work at the company. One former manager described the family pressure placed upon these Asian graduates to get a prestigious job at Nvidia, specifically. “These kids, their families only give them three options: you either need to be a doctor, or you need to be an engineer, or…actually, I don’t think there’s a third option,” he said. “They’re very competitive, and they’re very bright, and they’re just under a tremendous amount of pressure.” The manager, a gamer who had worked at Nvidia since the graphics days, reflected on Nvidia’s changing culture. Graphics had been a quirky field that paid relatively poorly. “My parents told me not to do it,” he said. “They thought I should get a real job.” With success, he felt, the passionate mavericks of Nvidia’s past were being supplanted by a new generation of conformist overachievers. Still, the manager conceded, the new hires were extraordinarily bright. “I mean, objectively speaking, can they code?” he asked. “Yeah. Better than I ever could.” In the early years, Nvidia had employed almost no women at all. Around the time of the IPO, David Kirk met his future wife, a fellow employee from Montreal. “All the men at Nvidia knew exactly how many women there were at the company at the time,” he said. “Three.” That number reflected, at least in part, the low number of qualified female electrical- engineering graduates—even today, more than 90 percent of US electrical- engineering graduates are male. In a 2022 survey, US female engineering undergraduates were asked to rank disciplines by career interest. Environmental and biomedical engineering tied for first; electrical engineering came in last. Following explicit directives from Huang, Nvidia made an effort to hire more women. This initiative was successful, and by 2024 a little more than a quarter of the staff at Nvidia was female. (Sometime around 2020, Dawn, Nvidia’s sexualized CGI pixie, also mysteriously vanished from company marketing material.) Jensen’s wife, Lori, had left the workforce to raise their children, and when he spoke of his continuation of Nvidia’s liberal -- 226 of 263 -- work-from-home policy following the abatement of the pandemic, I sensed in him an eagerness to atone for her interrupted career. “We want to take advantage of this incredible videoconferencing technology so that young people—especially young women—can build their lives, build their families, and build their careers at the same time,” Huang had said. “I don’t want to give up on that.” Less successful was Nvidia’s effort to hire Black employees. In 2020, fewer than 1 percent of Nvidia’s global workforce was Black. Again, this partly reflected the limited pipeline of qualified Black American electrical- engineering graduates, but perhaps more saliently it reflected the limited pipeline of any qualified American electrical engineers. The discipline drew from a global talent pool, in which native-born citizens of the United States were outnumbered ten to one by applicants from East Asia, South Asia, the Middle East, and Europe. “Our diversity problem is basically getting any Americans of any color to study this subject at all,” one university professor told me. One thing I’d noticed was that many of the engineers I talked to at Nvidia had at least one parent who was also an engineer. (Some had two— for them, it was a hereditary affliction.) Sameer Halepete’s father had taught him to solder circuits when he was in the third grade. “Engineering, especially electrical engineering, is very, very intimidating,” he said. “So you kind of have to have some sort of cushion. You have to have somebody telling you, ‘This is something that you can do.’ ” Halepete had, in turn, tried to pass the soldering tradition on to his daughters but had been unsuccessful. Huang, of course, had a father and two brothers who were engineers. Both of his kids had gone into the hospitality industry, but after years of gentle paternal browbeating, he brought them back into the flock. Madison, after training as a chef and working at LVMH in Paris, was hired at Nvidia as a marketing director in 2020. Spencer had cofounded a cocktail “laboratory” in Taipei that had been named one of Asia’s fifty best bars, but it went out of business during the pandemic. He joined Nvidia’s robotics group in 2022. Inside Nvidia, Spencer and Madison received no obvious -- 227 of 263 -- special treatment. “They pull together slides. They’re on endless boring calls. They eat in the cafeteria,” one employee said. • • • H’ benefited from globalization, and historically Nvidia had not worried much about geopolitical entanglements. That changed on October 7, 2023, when militants from Hamas invaded Israel and began indiscriminately killing civilians. The youngest daughter of Mellanox founder Eyal Waldman was slain in the massacre, and Nvidia engineer Avinatan Or was taken hostage. (As of November 2024, Or’s fate remains unknown.) More than 1,100 people were killed in the initial Hamas assault, and more than 250 were abducted. A ferocious Israeli counteroffensive followed; in the months after the initial attack, Israeli forces killed an estimated forty thousand Palestinians, including at least eight thousand children. More than half of Gaza’s buildings were either damaged or destroyed, and protests were organized all over the world. At the start of the war, Nvidia had around three thousand Israeli employees; the company also had about one hundred Palestinian employees in the West Bank. As four hundred of his Israeli workers were called up to military duty, Huang canceled an AI conference in Tel Aviv, then sent a company-wide email telling his staff in the region that he was giving them a bonus “to help in a small way.” He also promised that Nvidia would match humanitarian-aid donations from employees. “Some want to donate to Israel’s relief efforts, while others want to help innocent Palestinians,” the email read. “You decide to support humanitarian efforts in Israel, Gaza, or both.” In public statements, Huang did not take a side in the conflict, repeatedly stating that the safety of his employees was his top priority. But he also emphasized the value of Israel’s high-tech sector to Nvidia. “The heart and soul of the Blackwell processor came from Israel,” he told the press at the March 2024 GTC conference. “We will continue to invest heavily in -- 228 of 263 -- Israel.” A month later, with the conflict ongoing, Nvidia announced it was buying the 150-person Israeli start-up Run:AI for $700 million. With China, Nvidia faced an even trickier situation. Huang saw China as a critical market for his AI chips and wanted to sell Chinese customers as many chips as he could. The US government had different ideas and in 2022 blocked sales to China of Nvidia’s advanced A100 and H100 chips. Huang got around the embargo by selling modified versions of the chips; the unamused US government accused him of exploiting a loophole and issued a wider ban on sales of Nvidia equipment, including some high-end gaming cards. This created an unusual situation in China. Chinese state media described video games as “spiritual opium,” and Chinese regulators had limited kids to three hours a week of play time. But the demand for dual-use gaming hardware from China’s scientists—not to mention its military—was great. In the days before the sanctions took effect, Nvidia offloaded chips in a hurry, selling enough A100s to ByteDance, the owners of TikTok, to train ten ChatGPTs. Even after the ban, high-end chips continued to find their way into China. In June 2023, Reuters reported that in the stalls of the massive electronics marketplace in Shenzhen, black-market vendors were offering A100s for double their retail price. Huang told me his first priority, above all, was to obey the law—but he objected to the sanctions. “If they can’t buy it from us, they’ll just build it themselves,” he said. The bigger problem Huang faced was China’s designs on Taiwan. China’s navy had more warships than any other, and China had long claimed that Taiwan was part of its national territory. In an address on New Year’s Eve, 2023, Xi Jinping had announced that “reunification” with Taiwan was “inevitable.” Satellite imagery released in April 2024 showed that China had built what appeared to be a replica of Taipei’s presidential district in the Gobi Desert, presumably to train for an amphibious invasion. The scale of such an invasion, should it ever occur, would make D-Day look small. It would also be catastrophic for the global economy. TSMC’s manufacturing facilities in Taiwan were one of the major production chokepoints in the world. No facility anywhere approached the speed and -- 229 of 263 -- precision with which TSMC produced the most advanced microchips. Slowdowns in Taiwan during the pandemic had delayed deliveries of automobiles for months. A war would create delays for years—and not just in cars and AI but also in smartphones, consumer electronics, medical devices, and anything else that used a high-end microchip. To my great surprise, Nvidia had done no contingency planning for this eventuality. “If something happens to Taiwan and TSMC, the ramifications are so large it’s almost like asking me what I’d do if California fell into the ocean,” Deb Shoquist said. Shoquist ran logistics at the most valuable semiconductor firm in the world, but Huang had instructed her never to think about this question. “I don’t want her spending one brain cell on trying to mitigate that, because it’s impossible for her to do so,” he said. In the event of a war, Shoquist speculated that Nvidia would transfer its orders to Samsung in South Korea as well as to other partners around the globe. “I’ll tell you what would happen,” she said. “What would happen is everybody’s products would get dumbed down a click.” Recognizing Taiwan’s precarity, governments had begun funding their own chip-fabrication infrastructure. In the United States, President Biden had allocated tens of billions in taxpayer money to build facilities in Ohio and Arizona; similar initiatives had been launched in Japan, South Korea, and the European Union. “In the semiconductor space, there is no globalization anymore,” Morris Chang told employees at a TSMC function in late 2023. “The priority is national security only.” On a patch of unclaimed desert in northern Phoenix, TSMC had cleared 1,100 acres of scrubland and was laying foundations for two massive chip- fabrication facilities. Though technically within Phoenix’s sprawling municipal borders, the site was desolate and the area undeveloped; the closest entertainment was an open-air shooting range. The complex comprised a world unto itself, with dedicated utilities for water and natural gas, and on-site refineries delivering elementally pure streams of nitrogen, oxygen, and argon. Temperatures in the Sonora could cross 110 degrees in August—to conserve water, the plants would be outfitted with moisture- reclamation systems, like something out of Dune. -- 230 of 263 -- Curious neighbors, surveying the site from nearby Pyramid Peak, counted thirty-nine construction cranes in simultaneous operation, including a 180-foot Manitowoc crawler that could lift five million pounds at once. TSMC’s total budget for the Phoenix plants was forecast to exceed $40 billion, making it one of the largest foreign direct investments in the history of the United States and one of the most expensive megaprojects in the world. When complete, the Phoenix fabs were projected to employ two thousand people and manufacture six hundred thousand chips a year, enough to meet the entire domestic demand of the United States. Forty miles to the south, in the bedroom suburb of Chandler, Intel was funding a $32 billion expansion of its own. Amid the golf courses and the xeriscaped ranch homes, Intel ran construction around the clock: to ensure that the foundations would set in the summer heat, the builders used ice in the concrete mix and poured it in the middle of the night. They laid, in total, about four hundred thousand cubic yards of concrete, enough to fill a hundred Olympic swimming pools, then shipped in thirty thousand tons of steel to build the scaffolding above. Drone footage showed six gigantic fabrication buildings, arranged north to south and connected by industrial capillaries to a parallel line of support structures to the west. The dueling Arizona plants, and others under construction around the world, suggested the possibility of a forthcoming glut in supply. Chang, retired and out of the decision loop, saw the Arizona investment as foolish —he believed that US work culture would never match the productivity of Asia and that the facility would lag global rivals. (When Nancy Pelosi visited Taiwan in 2022, Chang told her the US projects were “doomed to fail.”) Chang also argued that TSMC’s concentration in Taiwan was actually preventing a Chinese invasion, for no national economy would suffer as much as China’s if TSMC went offline. (He called this Taiwan’s “silicon shield.”) Still, when his favorite customer, Jensen Huang, toured the TSMC construction site in Phoenix in 2023, Chang was there to greet him with a smile. China had nothing to compare. The US government had successfully lobbied to block the export of the advanced light-printing machines that -- 231 of 263 -- were the most critical component of the fabrication process, leaving China a decade behind. I wondered whether China might instead try to seize the TSMC plants in Taiwan and produce microchips themselves. A former US military official I spoke to regarded this as unlikely. “This is not like when Iraq conquered Kuwait for the oil fields,” he said. “These machines are insanely precise, and insanely fragile, down to a precision of one atom. What do you think’s going to happen to that machine if a missile goes off next to it?” The former official pointed out that administrators could remotely brick the machines by disabling their software; workers could also simply break the machines with baseball bats and hammers. “I would say the Taiwanese engineer sabotage thing is not to be underestimated,” he said. Still, I wondered, was it so crazy to ask if China couldn’t rebuild the machines? Wasn’t it possible they could write firmware updates of their own? Or force the Taiwanese engineers to work the production line under duress? The former military official paused for a long time. “There is no way the USA will ever let China have the Taiwanese semiconductor factories,” he said. “Period. Ever.” • • • T N into a general frenzy for semiconductors. Qualcomm, AMD, Broadcom, Supermicro, and others enjoyed record valuations. ASML, the Dutch company that built the light- printing machines, became the most valuable tech stock in Europe. TSMC, with its unrivaled fabrication facilities, became the most valuable tech stock in Asia. In March 2024, the Financial Times reported that Ferrari sales in Taiwan had doubled in the past four years. “It’s very simple,” said one local businessman. “Taiwanese are rich, and there’s only so many places you can put your money: you can buy property, and you can buy cars.” From the IPO date of January 22, 1999, through mid-2024, Nvidia’s stock appreciated by more than 300,000 percent. The run-up magnified the impact of careless decisions made many years before. Dwight Diercks had sold some shares at the IPO to buy his dad a car. “That’s a billion-dollar -- 232 of 263 -- Cadillac,” he said. Catanzaro and Aarts, the boomerang hires, both felt the sting of repricing their stock options, although both did fine in the end. Curtis Priem, of course, had liquidated all his shares many years before. His company had finally made them green with envy—only he was left with the most to envy of all. By late 2024, Nvidia was worth twenty times more than struggling Intel. (In fact, Huang’s holdings made him personally worth more than Intel.) In a talk to MIT students in December 2023, Intel’s new CEO, Pat Gelsinger, claimed that Nvidia “got extraordinarily lucky” with its GPU efforts. “They didn’t even want to support their first AI project,” he said. A lean and aquiline electrical engineer from rural Pennsylvania, Gelsinger had tried to force Huang out of business several times. In the late 1990s, he’d been the product manager for the Intel i740, a 3D graphics accelerator that flopped. In the late 2000s, he’d been the executive in charge of Project Larabee, the “Nvidia Killer” that never shipped. In 2022, Gelsinger, now CEO, had released the Intel Arc, a knockoff consumer- graphics chip that enjoyed a frisky 1 percent market share. As his rivals embraced the nimble “merchant chip” approach, Gelsinger doubled down on manufacturing. Under his leadership, in one of the great semiconductor bull markets of all time, Intel stock was cut in half. Intel’s board forced Gelsinger out in late 2024, but he was not the only one to express the sentiment that Huang had gotten lucky. No one, not even Catanzaro, had seen the intersection of parallel computing and neural nets coming. Aarts, who understood the technical aspects of the two technologies as well as anyone, marveled at the perfect harmony. Two fringe strains of computer science, starved of investment, hated—no, detested—by industry and researchers alike had somehow unified to form a thriving, sprawling entity now careering toward sentience. “I just thought, there is no way that Nvidia is this lucky,” Aarts said. “There’s no way that deep learning just fits this perfectly because Nvidia has never put any effort into it!” Huang called it “luck, founded by vision.” -- 233 of 263 -- For Dally, it was Huang’s tireless work ethic that made Nvidia succeed. Even Dally, who left no spare second in his day, could not quite believe the superhuman efforts of his boss. “The rest of us are just here to reduce the bandwidth demands on Jensen,” Dally said. “I mean, when does he sleep?” Diercks agreed: “His hobbies are work, email, and work.” Plenty of people worked long hours, though. Jens Horstmann attributed Huang’s success to his adaptability. “I’ve often asked myself, how is it that we started in the same cubicle, you know, with a similar IQ, both working equally hard,” Horstmann said. “How is it that this person not only built this amazing company, but also a network around him of people that—that would just die for him if needed?” Huang, Horstmann believed, had changed himself many times. He recalled Huang at LSI, pushing the simulation software to its outer limits. “Now, he’s still doing the same thing, but what he’s engineering is himself. He was not born as a great CEO; he was not destined to be one. He transformed himself into one, just by abstracting! Just by problem-solving the inputs and outputs of what a good CEO should be.” But the final word went to Morris Chang. He didn’t attribute Huang’s success to his work ethic, which, at TSMC, would have been considered slightly above average—nor did he find him especially adaptable. Chang was ninety-two years old when I spoke with him, wearing a purple corded sweater and sitting in front of a striking piece of abstract art, his face serene, his hair completely white. In seventy years of corporate life across two continents he had seen every manner of executive there was to see. To him, the explanation was simple, and there was no secret: “His intellect is just superior.” -- 234 of 263 -- N TWENTY-ONE Jensen vidia employees worshiped Huang, and I couldn’t blame them. They were rich. They were insanely, incredibly rich, and he had made them that way. He’d taken a niche product for dorks and turned it into the dominant computing platform of their time. They saw Huang—excuse me, Jensen—they saw Jensen not just as a leader but as a prophet. Jensen was a prophet who made predictions about things. And then those things came true. And every time one of them came true, everybody in the spaceships got to add a zero to their net worth. Jensen had done much to encourage this cult of personality. The speeches, the struggle sessions, the sweet talk alternating with abuse—all of these were tactics that served to break people to his will. He had also developed his personal brand, and not only with the leather jackets. For much of Nvidia’s history, Jensen had been known as “Jen-Hsun,” the transliteration of his Taiwanese name, but by 2023 he’d decided he’d just be “Jensen,” leaning into an iconic mononym. The death of Steve Jobs in 2011 had left vacant the position of Visionary Tech Executive with a Signature Outfit, an important ceremonial role in American culture. By 2024 it looked like Jensen could be that guy. But he remained a bit elusive as well. Even once you got him talking, you had to be careful: Jensen had a rich trove of anecdotes to share, but the details of those anecdotes would sometimes shift around a bit. Quoting him directly was a risky proposition, I had learned; just because he said something out loud didn’t necessarily mean that he believed it. Jensen’s -- 235 of 263 -- stream of patter might lead anywhere, to bizarre non sequiturs about dog vomit, men’s fashion, the quality of the eggs at Denny’s, or whatever else popped into his mind. Consistency was not a feature of his personality: often what seemed like a well-considered opinion, or even a pithy aphorism, was actually just something he came up with off-the-cuff, which he did not necessarily mean and which he later would not remember saying. Jensen contradicted himself frequently, sometimes offering opposing viewpoints within the same interview. He wasn’t playing devil’s advocate, exactly—he just liked to attack ideas from both sides. “He’s not trying to be a politician,” Horstmann said. “He’s not trying to stay on message. He’s trying to process real-time input, and he’s willing to entertain a contradictory thought for a while.” What might appear to be a definitive pronouncement was often just Jensen thinking out loud. Only once he started to repeat himself was it time to pay attention. When an idea really struck Jensen, it slowly built up steam over a period of days or even weeks. It cycled into his vocabulary and was repeated at every meeting. Concepts like the “zero-billion-dollar market” or the “speed of light” hadn’t come to Jensen in a flash; they’d arrived as polished nuggets of wisdom after spending months being tossed in the rock tumbler of his mind. Having arrived, they were then drilled so thoroughly into his employees that his staff sometimes sounded like characters from The Manchurian Candidate, repeating Jensen’s catchphrases verbatim with a glassy look in their eyes. Even employees who hadn’t worked at Nvidia for years could still recite the catechism from memory. Yet there was a logic to it. Jensen’s charisma and sense of humor kept things loose. If his leadership style was at once both chaotic and dictatorial, at least it wasn’t boring. One employee, after leaving Nvidia, recalled experiencing a sinking feeling watching a presentation at his new firm. “We have our CEO doing our quarterly presentation, and I’m watching, like, ‘God, where’s the humor? Is it always this frickin’ dry?’ ” he said. The other employees shot the new hire questioning glances as he shifted around in his chair. “I’m like, ‘Jesus, you guys don’t even know what you’re missing,’ ” he said. -- 236 of 263 -- • • • J . In late 2023 I saw him be publicly interviewed by Hao Ko, the lead architect of Nvidia’s headquarters. The conversation took place at an upscale resort on the Pacific Coast. I arrived early to find the two men facing the ocean and engaged in quiet conversation. They were dressed nearly identically, in black leather jackets, black jeans, and black shoes, although Hao was much taller. I was hoping to catch some candid statements about the future of computing; instead, I got a six-minute roast of Hao’s wardrobe. “Look at this guy!” Jensen said. “He’s dressed just like me. He’s copying me—which is smart—only his pants have too many pockets.” Hao gave a nervous chuckle and looked down at his designer jeans, which did have a few more zippered pockets than function would strictly demand. “Simplify, man!” Jensen said, before turning to me. “That’s why he’s dressed like me. I taught this guy everything he knows.” The interview was sponsored by Gensler, Hao’s firm, and there were several hundred architects in attendance. As the event approached, Jensen increased the intensity of his shtick, cracking a series of weak jokes and rocking back and forth on his feet. Jensen did dozens of speaking gigs each year and had given a talk to a different audience earlier that day, but I realized, to my amazement, that he was nervous. “I hate public speaking,” he said. It was true; he really did. Onstage, though, he was compelling—Hao barely had to ask a question. Looking at a picture of his corporate headquarters behind him, Jensen began to criticize the design of the top floor. Jensen had initially wanted an open-air balcony on the roof, but Hao hadn’t been able to deliver this in time. Jensen wondered if the building was really finished—then, leaping forward from this question, wondered if any building was really finished. He began to spitball about shape-shifting architecture in the age of AI, which would rearrange itself to meet the evolving demands of customers. “Maybe at one o’clock, the top three floors just become a nightclub,” Jensen said. -- 237 of 263 -- Ko looked on, perplexed. Jensen was just getting started. “Now, if the building is robotic, the entire building is software driven, isn’t that right?” he asked the crowd. Huang reasoned that in the future, office buildings would collect data about workers’ behavior, then feed it into an AI, which would redesign the building’s digital twin in the Omniverse before rolling out physical design changes overnight. He speculated that Gensler would evolve from an architectural firm into a tech company that managed the shape-shifting software. “Just as we operate fleets of computers, you’ll operate these buildings,” Jensen said. “And now Gensler, the next buildings you design will be free—isn’t that right? They’ll be free, and the reason for that is you’ll make your money by operating the building, and you’ll be enriched beyond your wildest dreams. You heard it here first.” Following the interview, Jensen took questions from the audience, including one about the potential risks of AI. “There’s the doomsday AIs— the AI that somehow jumped out of the computer and consumes tons and tons of information and learns all by itself, reshaping its attitude and sensibility, and starts making decisions on its own, including pressing buttons of all kinds,” Jensen said, pantomiming pressing the buttons in the air. The room grew quiet. “No AI should be able to learn without a human in the loop,” he said. One architect asked when AI might start to figure things out on its own. “Reasoning capability is two to three years out,” Jensen said. A low murmur went through the crowd. Afterward, I caught up with Hao. He seemed a bit stressed—his “interview” with Jensen had consisted of him asking a single question and Jensen giving a forty-five-minute answer. I assured him that the event was a hit; the rapt attention of the crowd was something that couldn’t be faked. I then asked Hao’s thoughts on shape-shifting architecture. “Look, when I first met him, he was talking about self-driving cars, and at the time that sounded crazy too,” Hao said with a shrug. “I’ve learned not to question him.” • • • -- 238 of 263 -- A N’ , Jensen the executive was transformed into Jensen the celebrity. His fan base now included not only gamers but also scientists whose work he supported, AI accelerationists whose technology he enabled, boomer retirees whose portfolios he had rescued, and degenerate retail “investors” who YOLO’d their life savings on Nvidia stock options whenever he spoke. Huang’s keynote address at GTC 2024 was so popular it took place at an NHL arena in downtown San Jose. The venue seated seventeen thousand people, but this was not enough to accommodate the thronging crowds. Eight thousand hopefuls were turned away and had to watch the presentation via simulcast. The address began with an AI art installation conducted live by multimedia artist Refik Anadol. Across a giant screen at the back of the arena, a field of wild, pulsating colors occasionally coalesced into recognizable shapes of flowers, trees, and birds before disintegrating into a riot of swirling pixels. I thought of Ovid’s Metamorphoses: “the face of Nature in a vast expanse was naught but Chaos uniformly waste.” Then Huang took the stage to applause, followed by the synchronized action of several thousand smartphone cameras lifted to record him at once. It had been five years since Huang had presented at GTC in person. In the interim period his world had been transformed. He looked a little older now; his hair was fully gray, and the strapping, muscular frame he’d maintained well into middle age was diminished a bit by time. He showed off his hand-drawn map of the history of computing, beginning with IBM’s 1964 S/360 computing architecture and ending with his new industrial revolution. “I made a little cartoon for you,” he said. “Literally, I drew this.” He progressed to discussion of AlexNet, which he termed AI’s “first contact” moment, and as he did so, a close-up of his exploded-view diagram of the GeForce card used to train it appeared on the screen. Huang believed that the field of computing had been reinvented at this moment. IBM’s architecture, predominant for the field’s first sixty years, was now being superseded by parallel computing, neural networks, and the cloud. Computing would no longer involve clunky interactions with finicky screens or equipment. Instead, users would issue the machine natural- -- 239 of 263 -- language commands to instantly perform actions of almost unlimited power. Humans, henceforth, would be sorcerers. The audience adored it. A technology podcaster, sitting next to me on the arena floor, began shaking his leg in excitement. Huang presented new hardware, new “microservices” for faster inference, new software for researchers, and a new robotics-training platform called GR00T. He was joined, at the end of the presentation, by two three-foot-tall droids who marched with him offstage to resounding applause. The conference continued over the next four days at the adjacent convention center, where hundreds of exhibitors displayed all manner of strange technology, including a robot bartender. Whenever Huang took the floor, he was instantly mobbed by autograph seekers who wanted him to sign their conference badges and their gaming GPUs. He tolerated the adulation with even temper and good humor, although at one point, when he tried to go to the men’s room, he could barely make it through the congregation of worshippers who surrounded him from every side. “People taking pictures of me in the bathroom, that’s a new one for me,” he said. As he made his way across the floor, I wondered if he was merely one of the great businessmen of the age or something more. • • • U , it was clear to me that Jensen was having some trouble processing success. He wasn’t sleeping much, and the constant public appearances were subjecting him to considerable stress. When he gave the commencement address at the 2024 CalTech graduation, he killed it, as usual, but when I caught up with him afterward, he was sitting on a wooden bench with his arms crossed and a churlish expression on his face. Every time I saw Jensen, he was wearing the same all-black T-shirt with a Thomas Burberry monogram. He had bought twenty-four of the shirts in 2020 and had been rotating them every day for the last four years. Dressed in this uniform, he had adopted what I now recognized as his customary pose, using the bench as a kind of prop, his body diagonal, his legs splayed -- 240 of 263 -- out, his expensive shoes pressed against the cobblestone patio. His net worth fluctuated with the stock market, but on that day, at least, he was worth more than $100 billion. “I’m very rich now,” he said. “Do you know how rich I am?” It wasn’t a boast. I asked him if he had any plan for what to do with the money. “I have no idea,” he said. “None.” Jensen told me he’d woken up that morning at 3:30 a.m. with one of his dogs sleeping between his legs. After thirty-one years, he was the longest- serving CEO of any tech stock in the S&P. He didn’t need to get out of bed, he admitted, and he really didn’t want to either. “Is there a specific thing that’s going to drive us out of business today?” he asked himself. No, there wasn’t. But was there something in striking distance? Well, maybe. Nvidia’s biggest customers were now building their own microchips. AMD, realizing that hardware wasn’t enough, was hiring a great number of software engineers. If Nvidia engineers didn’t hold the hands of their customers, neural-network technology might disappoint, and demand for his products would slacken. By four a.m., Jensen was up and working. He always began his workday with his most important long-term project, figuring that as long as he addressed it, the day couldn’t be considered a bust no matter what else happened. I had sensed, in multiple conversations with him, that he had something big under wraps, but he refused to divulge what it was. “I have to have some secrets,” he said. I could guess, though. Rumors had been circulating about a neural net that OpenAI was working on called Project Strawberry. This was said to be a “rational” AI, designed to conduct pure research in mathematics. Jensen never discussed Project Strawberry with me, but a couple of times in passing, he had asked me to imagine a world in which mathematical proofs were produced on demand with the click of a button. “The marginal cost of calculation has gone to zero, and this has opened up incredible possibilities,” he had said. “Well, ask yourself, when the marginal cost of doing math goes to zero, then what do you do?” I gazed out toward the CalTech turtle pond. All of the scientists here, all of this brainpower, superseded in an instant by a zillion nestled transistors. -- 241 of 263 -- Then what happened? Did we give the Fields Medal to a computer? Did we arrive at a unified field theory in physics? I thought of Pascal’s gears, turning for the first time to produce a mechanical calculation—but Jensen had already conquered arithmetic. Now his machines would assault reason itself. -- 242 of 263 -- T TWENTY-TWO The Fear he Fear hit Yoshua Bengio on a winter’s day in early 2023. The Montreal AI pioneer was one of the early neural-net advocates. He had long been fascinated by the technology and had championed it through several recessions. In 2019 he was awarded the A. M. Turing Award, computer science’s equivalent of the Nobel Prize, for his seminal research in the field. His frequent collaborators Yann LeCun and Geoffrey Hinton won the award alongside him. Bengio was lean, with steeply arched, bushy eyebrows and a penetrating gaze. His parents, both Sephardic Jews from Morocco, had moved to Montreal in the 1970s to participate in the avant-garde performing-arts scene. Bengio had gravitated toward computers instead, although studying neural networks in the 1990s was scarcely more respectable than experimental theater. But his work laid the foundation for AlexNet, and by the mid-2010s, Bengio was enjoying a sense of vindication. Obscure research papers he’d published in niche journals years before now served as the basis for a new scientific field. Bengio was a pure academic with little interest in commerce, but he watched the AI boom with a kind of fatherly pride. Like many researchers in the field, though, Bengio sometimes imagined a world where AI grew too powerful. For most of his career, though, this thought experiment was kind of a joke: If AI was going to conquer the planet, why was it so hard to get a $10,000 research grant to study it? But after the public release of ChatGPT, Bengio, for the first time in his life, -- 243 of 263 -- began to grow genuinely concerned. He was astonished by ChatGPT’s capabilities, just stunned—his socks were blown completely off. Bengio didn’t think he would personally live to see computers that interacted with humans in a way that seemed intelligent. Then, one day in late 2022, the technology had simply arrived. He compared it to meeting an extraterrestrial. Bengio went for a walk every morning to gather his thoughts, regardless of the weather. On the day The Fear hit, the temperature had been milder than usual, and the Montreal streets were coated with slush. Surrounded by bare trees and graying snow, a flood of strange emotion came over Bengio, unlike any he’d experienced before. He called it a “conversion.” “I was thinking about my children and my grandchild,” he told me. “What is it going to be for them in twenty years? Will they have a life?” The Fear was the worst kind of fear, one that grew scarier when you set aside emotions and examined the problem rationally. Bengio compared it to the threat of nuclear war. Actually, it was worse than that: some people would survive a nuclear war, he reasoned, but if an AI decided to wipe us out, it could design a lethal pathogen that could exterminate every human on Earth. “I don’t think there’s anything close in terms of the scale of danger,” he said. After his walk, Bengio contacted his friend and frequent collaborator Geoffrey Hinton to conduct a sanity check. This didn’t help—Hinton had independently come to the same conclusions. In fact, he’d come to the same conclusions in the same way, at almost the exact same moment: by probing the capabilities of ChatGPT, first with a sense of satisfaction, then personal vindication, then a creeping unease. A short time later, Hinton quit his job at Google to devote himself full-time to advising humanity about the risks of runaway AI. Ilya Sutskever, who had built ChatGPT, was also increasingly concerned and was proposing to devote himself full-time to the “alignment” problem of ensuring that superintelligent AIs did not pursue goals contrary to the benefit of the human race. Bengio, Hinton, and Sutskever—this was an alarming convergence. They ranked one, two, and three as the most-cited computer scientists alive. All three were worried that AI might kill us all. -- 244 of 263 -- • • • N it this way, of course. Yann LeCun, the third co- recipient of the Turing prize, thought that his colleagues were being ridiculous. LeCun was the lead architect of a 1998 neural network that could recognize human handwriting, which he had licensed to the post office and financial institutions, which had used it to read addresses and analyze paper checks. LeCun’s handwriting analyzer was the first widespread industrial use of a multilayer neural network. LeCun hailed from Paris, had a good sense of humor, and spoke with the authority of a man whose intuitions had been repeatedly validated by scientific research. Bengio had been his coauthor many times, including on the famous handwriting paper. Both men spoke French natively and had developed a close friendship in their many years of research together. Now their opposing perspectives on AI risk were creating a rift in their relationship. “In theory, we’re still friends,” Bengio told me. “But today we argue in public in ways that friends usually don’t.” LeCun didn’t have any fears at all about AI, and he thought that Bengio was anthropomorphizing a harmless machine. “The drive that some humans have for domination, or at least influence, has been hardwired into us by evolution, because we are a social species with a hierarchical organization,” LeCun told Time magazine. “AI systems, as smart as they might be, will be subservient to us. We set their goals, and they don’t have any intrinsic goal that we would build into them to dominate.” Of course, a neural net could have a desire to dominate, LeCun conceded; it was just that he thought no one would be foolish enough to build one. “It would be really stupid to build that. It would also be useless. Nobody would buy it anyway,” he said. (LeCun’s words echoed those of Pascal, who, in the mid-1600s, had stared into the gears of his calculator in search of consciousness. “The adding machine produces effects which are closer to thought than anything done by animals,” he wrote, “but it does nothing to justify the assertion that it has a will.”) -- 245 of 263 -- Bengio thought LeCun was guilty of overconfidence. People, both individuals and whole societies, did all sorts of dumb, destructive shit. For now, the cost of training AI technology limited its development to relatively responsible organizations, but as Nvidia drove down the cost of computing, it spread the ability to train an AI to everyone—and rogue superintelligence had to emerge only once. “The more powerful the AI gets, the more likely it is that it would want to take control of the computer on which it is running so that it can give itself more reward,” Bengio said. “That’s the end of the story for us.” Hinton agreed, and he cautioned that such an event might happen inadvertently. “Suppose one of them had just a tiny bit of drive—just the teensiest, weensiest bit of drive to sort of help itself,” he said. That AI would win, Hinton said. “It will give itself a few more resources. In fact, the variants that are most selfish will do the best. And I think you can see that’s a very slippery slope.” • • • T B and LeCun mirrored a split in the AI community at large. The argument was sometimes framed with a parametric variable called p(doom), which represented your assessment of the probability that AI might one day eliminate the human race. For LeCun, p(doom) was zero. For Bengio, p(doom) was 50 percent. The two represented extreme views of the problem, although any number greater than zero was concerning. When he’d talked to Bengio in early 2023, Hinton’s p(doom) had also been at 50 percent. To put that another way, for emphasis: the world’s most famous AI scientist believed the chance that AI would cause a catastrophic outcome for humanity was one chance in two. But when I talked to him in 2024, Hinton had reduced this estimate to between 10 and 20 percent. The reason he lowered his estimate, he told me, was that so many people whose intelligence he respected had strenuously disagreed with him. “Most notably, of course, there is Jensen,” he said. -- 246 of 263 -- Jensen’s p(doom) was zero—actually, although it wasn’t technically possible to have a negative p(doom), Jensen somehow did. Jensen vocally rejected the whole framing, thought the question was stupid, and thought the people who argued about it were holding back humanity. At Denny’s, he had even suggested that Hinton was tarnishing his academic legacy by entertaining such speculations. This caused Hinton to recalibrate, but Bengio firmly stood his ground. (Not coincidentally, Bengio is one of the only major AI scientists who has never taken money from Silicon Valley.) Both Hinton and Bengio had endorsed a California state bill that called for regulation of AI models that cost more than $100 million to train. This bill, SB 1047, was very unpopular in Silicon Valley. It was opposed by a coalition of venture capitalists and tech corporations, as well as a battalion of Sacramento lobbyists. Storied tech investor Nancy Pelosi issued a statement against it, as did a number of academics, like Stanford’s Fei-Fei Li, who argued the bill would hinder innovation while failing to reduce risk. Andrew Ng, the professor who’d synthesized images of cats at Google, compared worrying about an AI takeover to worrying about overpopulation on Mars. Opinion polls suggested that nearly 80 percent of the public supported SB 1047, but in September 2024 California governor Gavin Newsom vetoed the bill. Jensen made no public comment about SB 1047, but he continued to stress that there was no data supporting the (admittedly pretty wild) speculation about AI risk. When I shared Jensen’s objections with Bengio, he grew agitated. “Of course, there is no data!” he said. “Humanity hasn’t disappeared yet! Are we going to wait until we have repeated the death of humanity multiple times to decide that, oh, now we have data?!” He made a good point. All the data in the world would not have predicted the breakthrough of AlexNet or the success of the transformer architecture. Twice in ten years, AI had experienced unforeseeable and permanent upgrades to its capabilities. Bengio did not think that the current models posed an immediate threat to human life—but what about the next breakthrough? No one could say what it might bring or when it might happen. -- 247 of 263 -- Even if superintelligence did not appear in the next decade, its emergence in twenty, or thirty, or even a hundred years’ time seemed inevitable. That was a time horizon outside the boundary of projectable return on investment but within the span of human history. In one or two generations, homo sapiens might no longer be the dominant species on the planet—but venture capitalists didn’t look that far ahead. The mismatch between short-term profit and long-term risk created turmoil inside AI start-ups. At OpenAI, a bizarre boardroom coup had unfolded in November 2023, with Ilya Sutskever first voting to remove Sam Altman, then a few days later pleading for his reinstatement. Altman ultimately triumphed, and the rest of the nonprofit board was replaced. Following Altman’s restoration, it was fairly clear that, despite what OpenAI’s nonprofit mission statement read, Microsoft’s “capped” profit interest was spurring the organization to continue to develop the most sophisticated AI models ever seen. Sutskever shunned the press after the coup, but when I spoke with him in September 2023, he said he had changed his focus from building ever-larger language models toward aligning AI superintelligence with human interests. “I can’t talk about specific models,” Sutskever said, “but I’m coming up with what I believe will be the definitive solution to the concern people have, that an AI might go rogue and do something very highly undesirable.” As of late 2024, no such solution had appeared, even as OpenAI’s products grew ever larger and more powerful. In May 2024, OpenAI released GPT4o—the “o” stood for “omni”—a multimedia AI that accepted any combination of text, audio, image, and video and returned any combination of text, audio, and image outputs. The model’s seamless response times were made possible by the lightning-fast inference capabilities of the latest generation of Nvidia chips. The day after GPT-4o’s release, Sutskever resigned. OpenAI soon packaged a conversational module with GPT-4o, and with almost no perceivable delay in response, talking with it felt like talking to a hyperintelligent human. (Many compared it to the AI in Her.) The same day OpenAI released GPT-4o, Google showed off Astra, an augmented-reality -- 248 of 263 -- AI assistant that seemed able to answer any question, remember any detail, and describe any environment in a fraction of a second. Claude, a model from the start-up Anthropic, rivaled or exceeded GPT-4o on many benchmarks. National governments, unwilling to lease their sensitive data to the cloud, were building large-scale sovereign AI-training centers. Musk’s xAI initiative struck a $10 billion deal to rent Oracle’s GPU servers. (The existential concerns he’d expressed about AI in 2015 must have abated.) Mark Zuckerberg’s Meta made the largest investment, announcing that it was going to spend $30 billion to buy one million Nvidia chips and secure a dedicated nuclear reactor to power them. The belief that any of this might have been prevented by a California Senate bill evinced a charming faith in the power of state government. As the speed of Nvidia’s GPUs increased, so, too, did attendant expectations for economic growth. The industrial barons of Silicon Valley had placed their bets, and to cap their upside would mean tanking the stock market. No politician anywhere had enough clout to do it. Still, if Bengio, Hinton, and Sutskever had been sidelined by capital, the points they made remained valid. They had seen better than anyone the potential of what AI technology could be, and they had the academic credentials to prove it. If they were worried now, I felt it was worth listening to. “Right now there are ninety-nine very smart people trying to make AI better and one very smart person trying to figure out how to stop it taking over,” Hinton said. • • • I that I had The Fear myself—it’s why I first researched Nvidia, and it’s why I wrote this book. From the moment ChatGPT went public, I assumed that my career was coming to an end. GPT-4 was only a passable writer; it had scored in the 22nd percentile of the AP English Literature and Composition exam. (I am confident I would rank in the 23rd.) But the models were getting better all the time, and perhaps more importantly, they already wrote better than I had as an undergraduate. The -- 249 of 263 -- long apprenticeship I’d gone through before first publishing my work, at age thirty-six, would have looked pointless had such a tool been available in my youth. The looming death of the author was a minor personal tragedy, but I was willing to acknowledge that coach drivers had probably hated automobiles, too—at least the streets were no longer covered in horseshit. I was adaptable to change and not by any means a Luddite. I loved technology, loved the internet, loved space travel, wanted a self-driving automobile, and had purchased the first iPhone the day it had come out. Nor did I have any ideological bias against capitalism—indeed, as an American, I felt it was my birthright to have large corporations catering to my whims. Yet AI troubled me. It troubled me to the point of losing sleep. In this I had company: Bengio told me he’d had nightmares, too. Just a few months after GPT-4o, OpenAI released “o1,” its first attempt to build an AI capable of complex scientific and mathematical reasoning. Accompanying its release, OpenAI published an updated analysis of the potential risks to humanity that AI presented. With the debut of o1, OpenAI concluded that the risk that most concerned Bengio—the risk that AI might be used to develop an unstoppable biological weapon—had increased from “low” to “medium.” Were we really in control of these systems? At some point in the next few years, someone was going to unveil the first one-hundred-trillion parameter model, rivaling the number of synapses in the human brain. What was the role of humans after that? Accelerationists insisted that AI would unlock new modes of employment, just as the last industrial revolution had. But in my experience, if you cornered the optimists and pressed them, you could force them to admit that this facile comparison was borne of convenience and that there was no way to know if it was really true. Mark Twain was incorrect: history did not rhyme. The Book of Ecclesiastes was wrong, too: there were plenty of new things under the sun—actually, they arose all the time, from the trilobites to the dinosaurs—and certainly a mechanical brain with one hundred trillion synapses firing at five billion -- 250 of 263 -- cycles a second had no precedent in history, religion, or philosophy. No one could say, with any certainty, what would happen to earthlings in the AI era. But Jensen insisted it was safe. He had his own incentives, of course— money, but also pride. Having waged a long, lonely battle to make parallel computing profitable, Jensen was now determined to enjoy its rewards. He seemed offended by technologists who performatively decried the risks of AI while continuing to purchase his hardware. I could see his point of view —there was something nauseating in the way AI developers adopted doomer vocabulary while training ever-larger models. Jensen’s confidence was also born of the many ways in which AI wasn’t like the biological brain. The brain didn’t learn through backpropagation; that much was certain. And the neural nets only mimicked neurons; there was no equivalent to the glial cells that supported the neurons, for example, nor were the AI systems connected to autonomic nervous systems. The neural net did not have a hypothalamus or a hippocampus or a pineal gland. The neural net was unaffected by hormones, had no sex drive, didn’t go to sleep, couldn’t love its children, and never had a dream. A well-trained model mimicked (and sometimes exceeded) our own capacity for cogitation, but it had no sense memory, no emotions, no imagination, no reproductive function, and no instinct to survive. In this sense, one could not say it was really alive—it did not have the self- sustainability of even the most primitive single-celled organism. Although there was certainly an analogy between training neural nets and evolution, AI was not the product of the kind of ruthless, Darwinian, kill-or-be-killed, eat-or-be-eaten planetary laboratory from which biological life had emerged. There was also the upside to consider. Nick Bostrom, he of the paper- clip-maximizing argument, was by 2024 considering a different question: What would human beings do in the “solved world” of limitless AI? Bostrom’s 2024 book, Deep Utopia, invited readers to imagine a world where all unpleasant tasks were handled by robots, and in which any imaginable experience might be replicated in a simulator. These were not -- 251 of 263 -- idle speculations; with image recognition handled, Fei-Fei Li was now trying to build this solved world. She had unveiled her initiative, called Behavior-1K, at Nvidia’s 2024 GTC conference. Onstage with Bill Dally, Li described how she had assembled a diverse group of respondents using Amazon’s Mechanical Turk to rank thousands of answers to a single question: “How much would you benefit if a robot did this for you?” At the top were the tasks that people least enjoyed: scrubbing toilets, washing floors, and cleaning up after parties. At the bottom were those activities that inspired some measure of human joy: picking out jewelry, playing squash, and opening presents. Using these results, Li was now training artificial intelligence in Nvidia’s Omniverse to complete the most unpleasant tasks. Someday, perhaps sooner rather than later, one of these neural nets would be installed inside a real- world robot and set loose on the commode. The solved world had its appeal. People might grow bored of playing squash and opening presents, but as Li’s research suggested, it was behavior, not philosophical arguments, that drove outcomes. It was easy to imagine a consumer environment where, in a decade or two, owning a fleet of chore-happy robots was as much the default expectation for a first-world household as owning a refrigerator is today. Perhaps a world where nobody ever had to clean the toilet again carried with it a certain amount of existential risk, but I suspected that this was a trade that almost everyone would go along with, even if only implicitly. Did I want a toilet robot? You bet I did. By late 2024, it was clear that the intellectual mood around AI was shifting away from doomerism. Op-eds and think pieces about AI risk no longer generated much attention. Instead, we were instructed to look on these new systems with optimism. “Here is one narrow way to look at human history,” OpenAI CEO Sam Altman wrote in a September 2024 essay. “After thousands of years of compounding scientific discovery and technological progress, we have figured out how to melt sand, add some impurities, arrange it with astonishing precision at extraordinarily tiny scale into computer chips, run energy through it, and end up with systems capable -- 252 of 263 -- of creating increasingly capable artificial intelligence. This may turn out to be the most consequential fact about all of history so far.” Shortly after the essay appeared, OpenAI announced it was partnering with the defense start- up Anduril, a manufacturer of AI-enabled weaponry. But even when I tried to embrace the upside of AI, I couldn’t shake my anxiety. The more I used the AI models, the more I questioned my own role. Not only would they soon be able to write better than I would; with time, even my interviews would be filtered by AI. I imagined video-chatting with some powerful executive and unwittingly interviewing his digital AI clone, never myself the wiser. It seemed to me that the end of my profession was approaching. It seemed to me that the end of reality was approaching. It seemed to me that the end of consciousness was approaching. Was Huang, seduced by the power of what he’d built, gambling with the future of our species? I had to know. I had to ask him one last time. -- 253 of 263 -- M TWENTY-THREE The Thinking Machine y final interview with Jensen took place on a Friday, following the conclusion of GTC 2024. We met in his conference room “office,” in the center of Nvidia’s south campus. I sat across from him at a long wooden table; surrounding us, on all sides, were precise diagrams and schedules that he’d written on the walls. At the far end of the room was a calendar showing Jensen’s forthcoming release schedule for new GPUs and software products. Behind him, spanning at least thirty feet of whiteboard, was an intricate diagram of the Nvidia computing stack, beginning with the GPUs and continuing forward, in colored marker and block capitals, all the way to the microservice architecture he’d just introduced. But even this massive whiteboard was not enough—the writing wrapped around the far wall, then behind me onto the frosted-glass exterior of the room. Jensen, after four days of nonstop presentations, interviews, and technical demos, was visibly exhausted. His opening keynote was his largest-ever presentation, and I could see how much it had taken out of him. I’d witnessed the anxiety in his body language when he gave an inconsequential Q&A to a room of four hundred architects—I could only imagine the dread he’d felt before giving the most important presentation of his life to a hockey arena packed with rabid fans. People were getting a little critical of him, I noticed. At a press conference at GTC, one journalist had asked if he was the new Robert Oppenheimer. (“Oppenheimer was building a bomb,” Jensen said. “We’re not doing that.”) Bill Whitaker, a host for 60 Minutes, had suggested that -- 254 of 263 -- Jensen was “not easy to work for.” (“It should be like that. If you want to do extraordinary things, it shouldn’t be easy,” Jensen said.) Nvidia’s pricing power was triggering a reaction from government power centers; in the United States, Lina Khan, the chair of the Federal Trade Commission, had been eyeing the company suspiciously, and Nvidia’s offices in France had been raided by authorities over concerns about anticompetitive practices. But our interview started amicably. I congratulated Jensen on his fine performance at the keynote, then asked him about life in Taiwan, about his family, about how he was handling his newfound fame. His answers were curt and to the point. I asked him specifically what new jobs might be created by AI. “Well, ask yourself, when the marginal cost of doing math goes to zero, then what do you do?” he said. “I don’t know,” I said. “Well, you just asked me the same question. OK. You’re just as intelligent as I am to figure this out,” he said. I paused, flummoxed. It was too much for me to unravel; despite Jensen’s kind words about my intelligence, I was in no way at his level. Unsure how to proceed, I decided to show Jensen the same clip of Arthur C. Clarke that had inspired Catanzaro’s rhapsodic take. As Clarke droned on about the future of mechanical evolution, I watched the blood drain from Jensen’s face. “I just feel like you’re interviewing the wrong person,” he said, quietly. “I feel like you’re interviewing Elon right now, and I’m just not that guy.” I paused, unsure what to say, but it was too late. I’d hit the trip wire. “Is it going to—is it going to destroy jobs?” Huang asked, his voice crescendoing with anger. “Are calculators going to destroy math? That conversation is so old, and I’m so, so tired of it,” he said. “I don’t want to talk about it anymore! It’s the same conversation over and over and over and over and over again. We invented agriculture and then made the marginal cost of producing food zero. It was good for society! We manufactured electricity at scale, and it caused the marginal cost of chopping down trees, lighting fires, carrying fires and torches around to -- 255 of 263 -- approximately zero, and we went off to do something else. And then we made the marginal cost of doing calculations—long division! We made it zero!” He was yelling now. “We make the marginal cost of things zero, generation after generation after generation, and this exact conversation happens every single time!” I tried to switch subjects, but it was no use. His anger was tinged with disgust. He began to lecture me in the voice that one would use with a wayward teenager. He’d placed high expectations in me, he said, and I had disappointed him. I had wasted his time; I had wasted everyone’s time; the whole project of the book was now called into question. The interview was attended by two of Jensen’s PR reps, but neither made any attempt to intervene—they weren’t about to draw fire. Kirk had theorized that Huang’s anger was strategic. I can tell you, it didn’t feel that way in the moment. His anger seemed uncontained, omnidirectional, and wildly inappropriate. I was not Jensen’s employee, and he had nothing to gain from raging at me. He just seemed tired of being asked about the negative aspects of the tools he was building. He thought the question was stupid, and he had been asked it one too many times. “This cannot be a ridiculous sci-fi story,” he said. He gestured to his frozen PR reps at the end of the table. “Do you guys understand? I didn’t grow up on a bunch of sci-fi stories, and this is not a sci-fi movie. These are serious people doing serious work!” he said. “This is not a freaking joke! This is not a repeat of Arthur C. Clarke. I didn’t read his fucking books. I don’t care about those books! It’s not—we’re not a sci-fi repeat! This company is not a manifestation of Star Trek! We are not doing those things! We are serious people, doing serious work. And—it’s just a serious company, and I’m a serious person, just doing serious work.” For the next twenty minutes, in a tone that alternated among accusatory, exasperated, and belittling, Jensen questioned my professionalism, questioned my interview approach, questioned my dedication to the project. He accused me of trying to psychoanalyze him; he told me how much he disliked answering my biographical questions, especially those that attempted to illuminate his mental state. “I don’t like these probing -- 256 of 263 -- questions,” he said. “I don’t like talking about myself, OK?! I’m not into therapy.” He suggested that my questions were stupid; he called them “pedestrian.” He denied there was anything exceptional about himself, against all accumulated evidence. “Look, I’m—I am super normal,” he said. “I have never met anyone like you,” I said. “I’m super normal,” Huang said. Gradually, the anger wore off. Huang changed topics a couple times, talking about upcoming products, asking after the welfare of his PR reps, and recalling the importance of the late John Nickolls to the CUDA project. At one point he brought up the Roman Empire. He continued to chide me gently—he was done with me—and I was ushered out the door. I left the interview bewildered. I’d had plenty of tense conversations with executives, but I’d never had someone explode at me in this way. I was stunned—but also, if I’m being honest, I was a little giddy. To be targeted by the Wrath of Huang was in a certain sense an honor: a rite of passage that everyone who gained admittance to his inner circle underwent. Walking away from the conference room, I turned to one of the PR reps. “That went well,” I said. He laughed. “Oh, that?” he said. “That was nothing.” • • • F , I was driven to see Eos, a ten-thousand-chip supercomputer housed in a nearby data center. Eos was preposterously fast; as a benchmark, it had trained OpenAI’s GPT-3 model in under four minutes. I was met there by Marc Hamilton, a veteran supercomputer engineer. He guided me through an airlock and onto the sterile data-center floor, where dozens of racks of Nvidia hardware, separated into walled-off pods, pulsed ceaselessly beneath fluorescent light. For the human to trespass into the realm of the computer felt like a violation of some sacred penetralium. The thousands of computer fans in simultaneous operation created a dull, cumulative roar, like applause heard from a distance, but I was hardly present; I was still reeling from my -- 257 of 263 -- dressing-down from Huang. I wondered, had he done that on purpose? What had seemed like an unprovoked explosion at the moment was perhaps more deliberate in retrospect. Revisiting our conversation in my head, I realized that Huang hadn’t lost his temper at me all at once. Instead, over the course of a few minutes, he worked himself into an angrier and angrier state, seemingly on purpose, building up a head of steam by repeatedly revisiting how stupid I was to show him a clip of Arthur C. Clarke. I didn’t think it was that dumb, and I still don’t: the question of whether mechanical brains will evolve faster than physical ones seems fair to ask of a man who has made his fortune building mechanical brains. But I also hadn’t pressed Huang to answer, and when it became clear that the question had pissed him off, I had several times tried to move the subject to safer ground. He didn’t let me. Looking back, it became clear to me that Jensen had wanted to lose his temper; he’d made a conscious decision to thrash me. Once the performance had started, his fury was genuine, but it was all in service of a larger point he wanted to make. It wasn’t just that Jensen didn’t read science fiction—it was that he actually hated science fiction. He was a serious man. The reason that Jensen had succeeded in fields where others had failed— parallel computing, AI, the Omniverse—was precisely because he didn’t tolerate airy speculation about the future. He examined technologies coldly, from first principles, swayed neither by optimism nor fear but only by a cold and patient sense of business logic that he alone could push to the outer limits of corporate foresight. Beyond that he did not look or care to imagine. The potential for human extinction was not a question of corporate strategy and thus, to him, was as foolish as drawing a dragon on the unexplored portion of the map. I recalled the discipline of the Nvidia executives I’d talked to: Jensen had them wound as tight as piano strings. They were confident, intelligent, and exceptionally well-prepared, down to the smallest detail—I never once caught one slipping. I recalled, too, with sudden clarity, how disinclined those same executives had been to discuss the potential future implications -- 258 of 263 -- of the technology they were building, a disinclination that I sensed spilled over into discomfort and even fear. Now I saw where the fear was coming from. The executives were more afraid of Jensen yelling at them than they were of wiping out the human race. I was interrupted from my reverie when I approached one of the computing pods and its doors slid open automatically. I stepped back, my back went straight, the hairs on my arms stood at attention—then I ventured within. Inside, I was buffeted by air from all directions, the cooling fans on the racks to either side of me roaring incessantly, my pants rippling from air blowing out of a grate on the floor. It was almost too loud to think. I was standing inside the computer’s brain now, the circuits around me executing a collective ten quintillion calculations every second. It would kill us all, or it would save us; whatever happened, it was going to happen on Jensen’s chips. I tried to envision the entire stack, from the angstrom- width transistors to the complex circuit architecture to Buck’s numerical magic to Aarts’s compilers to Nickolls’s CUDA suite, and finally, above all, to the giant, layered grids of simulated neurons evolving at a supernatural rate. Standing here, I began to see the scope of Nvidia’s accomplishment, from the scale of the atom to the dawn of machine consciousness—but only Jensen, the American Daedalus, could really see it all. Shouting to be heard, I asked what problem Eos was working on. Hamilton responded that it was training an internal Nvidia model in the style of GPT-4. In other words, I was surrounded by a language model: one that was catching up to me, one that I sensed would one day soon replace me. Hamilton, screaming over the fans, informed me that to complete the training, the computer would conduct a total of thirty billion quadrillion operations. I looked around, feeling inadequate in my obsolete and dying flesh. The pod was beautiful; I was nothing; there was no way to fight it. This was the thinking machine, and with each spin of the fan, with each pulse of the circuits, it got a little smarter. -- 259 of 263 -- ACKNOWLEDGMENTS I would like to thank Jensen Huang and the rest of the team at Nvidia for their time, their candor, and their insight. In particular, I’m grateful to Bob Sherbin, who pushed hard to make this project a possibility. Thank you, Bob, for everything. You gave me the opportunity of a lifetime. I’m also grateful to everyone who shared their stories with me for this book, especially Jens Horstmann, Curtis Priem, Jon Peddie, and David Kirk. I’d like to thank Willing Davidson and the rest of the team at The New Yorker for commissioning the magazine profile that led to this book. Willing is a superb editor and I suggest you pitch him your ideas. I’d also like to thank Allison Lorentzen at Viking and Stuart Williams at Bodley Head for once again acquiring, editing, and publishing my work. Allison and Stuart, your insights and advice were invaluable. I’m also thankful to Sean Lavery, Yinuo Shi, and Anna Kordunsky for fact-checking the manuscript. And, as ever, I am eternally grateful to my agent, Chris Parris- Lamb, who has for more than ten years supported and championed my career. I’m so lucky to have Chris. Meghan McEnery encouraged me to pursue this project and provided a great deal of moral and emotional support. Thank you, Meghan. I could not have written this book without you. I’m also grateful to my friends Jay Budzik and Marcus Moretti, who both shared my fascination with Nvidia and discussed the company with me for hours. I’d like to thank my parents, Leonard and Diana, for always being there for me, and my sister, Emily, for acting as an enduring inspiration. And, of course, I am grateful to my daughter, Jane, who was patient with Dad when he disappeared into his -- 260 of 263 -- darkened bedroom to write for days at a time. Thank you, Jane. This book is for you. -- 261 of 263 -- ABOUT THE AUTHOR Stephen Witt is the author of How Music Got Free, which was a finalist for the Los Angeles Times Book Prize, the J. Anthony Lukas Book Prize, and the Financial Times and McKinsey Business Book of the Year Award. His writing has appeared in The New Yorker, Financial Times, New York, The Wall Street Journal, Rolling Stone, and GQ. He lives in Los Angeles, California. -- 262 of 263 -- Wat’s next on your reading list? Discover your next great read! Get personalized book picks and up-to-date news about this author. Sign up now. -- 263 of 263 --
Bellwether · 2026 Marco