Machines of Loving Grace[1]
How AI Could Transform the World for the Better
I think and talk a lot about the risks of powerful AI. Anthropic, the company I lead as CEO, does a lot of research on how to reduce these risks. Because of this, people sometimes draw the conclusion that I'm a pessimist or doomer who thinks AI will be mostly bad or dangerous. I don't think that at all. In fact, one of my main reasons for focusing on risks is that they're the only thing standing between us and what I see as a fundamentally positive future. I think that most people are underestimating just how radical the upside of AI could be, just as I think most people are underestimating how bad the risks could be.
In this essay I try to sketch out what that upside might look like—what a world with powerful AI might look like if everything goes right. Of course no one can know the future with any certainty or precision, and the effects of powerful AI are likely to be even more unpredictable than past technological changes, so all of this is unavoidably going to consist of guesses. But I am aiming for at least educated and useful guesses, which capture the flavor of what will happen even if most details end up being wrong. I'm including lots of details mainly because I think a concrete vision does more to advance discussion than a highly hedged and abstract one.
First, however, I wanted to briefly explain why Anthropic and I haven't talked that much about powerful AI's upsides, and why we'll probably continue, overall, to talk a lot about risks. In particular, I've made this choice out of a desire to:
- Maximize leverage. The basic development of AI technology and many (not all) of its benefits seems inevitable (unless the risks derail everything) and is fundamentally driven by powerful market forces. On the other hand, the risks are not predetermined and our actions can greatly change their likelihood.
- Avoid perception of propaganda. AI companies talking about all the amazing benefits of AI can come off like propagandists, or as if they're attempting to distract from downsides. I also think that as a matter of principle it's bad for your soul to spend too much of your time talking your book.
- Avoid grandiosity. I am often turned off by the way many AI risk public figures (not to mention AI company leaders) talk about the post-AGI world, as if it's their mission to single-handedly bring it about like a prophet leading their people to salvation. I think it's dangerous to view companies as unilaterally shaping the world, and dangerous to view practical technological goals in essentially religious terms.
- Avoid sci-fi baggage. Although I think most people underestimate the upside of powerful AI, the small community of people who do discuss radical AI futures often does so in an excessively sci-fi tone (featuring e.g. uploaded minds, space exploration, or general cyberpunk vibes). I think this causes people to take the claims less seriously, and to imbue them with a sort of unreality. To be clear, the issue isn't whether the technologies described are possible or likely (the main essay discusses this in granular detail)—it's more that the vibe connotatively smuggles in a bunch of cultural baggage and unstated assumptions about what kind of future is desirable, how various societal issues will play out, etc. The result often ends up reading like a fantasy for a narrow subculture, while being off-putting to most people.
Yet despite all of the concerns above, I really do think it's important to discuss what a good world with powerful AI could look like, while doing our best to avoid the above pitfalls. In fact I think it is critical to have a genuinely inspiring vision of the future, and not just a plan to fight fires. Many of the implications of powerful AI are adversarial or dangerous, but at the end of it all, there has to be something we're fighting for, some positive-sum outcome where everyone is better off, something to rally people to rise above their squabbles and confront the challenges ahead. Fear is one kind of motivator, but it's not enough: we need hope as well.
The list of positive applications of powerful AI is extremely long (and includes robotics, manufacturing, energy, and much more), but I'm going to focus on a small number of areas that seem to me to have the greatest potential to directly improve the quality of human life. The five categories I am most excited about are:
- Biology and physical health
- Neuroscience and mental health
- Economic development and poverty
- Peace and governance
- Work and meaning
My predictions are going to be radical as judged by most standards (other than sci-fi singularity visions[2]), but I mean them earnestly and sincerely. Everything I'm saying could very easily be wrong (to repeat my point from above), but I've at least attempted to ground my views in a semi-analytical assessment of how much progress in various fields might speed up and what that might mean in practice. I am fortunate to have professional experience in both biology and neuroscience, and I am an informed amateur in the field of economic development, but I am sure I will get plenty of things wrong. One thing writing this essay has made me realize is that it would be valuable to bring together a group of domain experts (in biology, economics, international relations, and other areas) to write a much better and more informed version of what I've produced here. It's probably best to view my efforts here as a starting prompt for that group.
Basic assumptions and framework
To make this whole essay more precise and grounded, it's helpful to specify clearly what we mean by powerful AI (i.e. the threshold at which the 5-10 year clock starts counting), as well as to lay out a framework for thinking about the effects of such AI once it's present.
What powerful AI (I dislike the term AGI)[3] will look like, and when (or if) it will arrive, is a huge topic in itself. It's one I've discussed publicly and could write a completely separate essay on (I probably will at some point). Obviously, many people are skeptical that powerful AI will be built soon, and some are skeptical that it will ever be built at all. I think it could come as early as 2026, though there are also ways it could take much longer. But for the purposes of this essay, I'd like to put these issues aside, assume it will come reasonably soon, and focus on what happens in the 5-10 years after that. I also want to assume a definition of what such a system will look like: what its capabilities are and how it interacts, even though there is room for disagreement on this.
1. Biology and health
This section would discuss how AI could transform medicine, healthcare, and biological research...
2. Neuroscience and mind
This section would explore AI's potential impact on understanding the brain and mental health...
3. Economic development and poverty
This section would examine how AI could accelerate economic growth and reduce poverty globally...
4. Peace and governance
This section would discuss how AI might influence international relations and governance structures...
5. Work and meaning
This section would explore how AI could transform work and help humans find meaning...
Taking stock
This concluding section would summarize the essay's main points...
Footnotes

1. The title is a reference to Richard Brautigan's 1967 poem.
2. To be clear, I think that many singularity predictions are plausible on a longer timescale (e.g. 20-50 years) but not on the 5-10 year timescale I'm focusing on here.
3. AGI (artificial general intelligence) has a lot of baggage in terms of specific approaches to AI, specific understandings of intelligence, and social connotations. I prefer to focus on capability thresholds and likely impacts, rather than philosophical debates.