What properties should we want a proposal for an AI governance pathway to have?
Spending time on basic research and exploration, and learning. Thinking about some topics without immediate goal of publication.
Recent: Was subject of a New Yorker feature. New working paper, on the strategic implications of openness in AI development. Paperback edition of Superintelligence released in April. Talks incl. Seoul, New York, Montreal, Las Vegas.
From 2014: completed the superintelligence book. [Added: tops Financial Time's science summer reading list] [Added #2: widely reviewed — Financial Times, Guardian, The Economist, etc.] [#3: Twelve translations done or in the works; also audiobook edition.] [#4: makes New York Times bestseller list.] [#5: recommended by Bill Gates, Elon Musk, & some leading AI people.] [#6: paperback edition to be released in spring 2016.] Landed in the top 15 of Prospect magazine's World Thinkers list.
What properties should we want a proposal for an AI governance pathway to have?
Recounts the Tale of a most vicious Dragon that ate thousands of people every day, and of the actions that the King, the People, and an assembly of Dragonologists took with respect thereto.
We present a heuristic for correcting for one kind of bias (status quo bias), which we suggest affects many of our judgments about the consequences of modifying human nature. We apply this heuristic to the case of cognitive enhancements, and argue that the consequentialist case for this is much stronger than commonly recognized.
An analysis of the global desirability of different forms of openness (including source code, science, data, safety techniques, capabilities, and goals).
Suns are illuminating and heating empty rooms, unused energy is being flushed down black holes, and our great common endowment of negentropy is being irreversibly degraded into entropy on a cosmic scale. These are resources that an advanced civilization could have used to create value-structures, such as sentient beings living worthwhile lives...
Cosmology shows that we might well be living in an infinite universe that contains infinitely many happy and sad people. Given some assumptions, aggregative ethics implies that such a world contains an infinite amount of positive value and an infinite amount of negative value. But you can presumably do only a finite amount of good or bad. Since an infinite cardinal quantity is unchanged by the addition or subtraction of a finite quantity, it looks as though you can't change the value of the world. Aggregative consequentialism (and many other important ethical theories) are threatened by total paralysis. We explore a variety of potential cures, and discover that none works perfectly and all have serious side-effects. Is aggregative ethics doomed?
In cases where several altruistic agents each have an opportunity to undertake some initiative, a phenomenon arises that is analogous to the winner's curse in auction theory. To combat this problem, we propose a principle of conformity. It has applications in technology policy and many other areas.
Does human enhancement threaten our dignity as some have asserted? Or could our dignity perhaps be technologically enhanced? After disentangling several different concepts of dignity, this essay focuses on the idea of dignity as a quality (a kind of excellence admitting of degrees). The interactions between enhancement and dignity as a quality are complex and link into fundamental issues in ethics and value theory.
Brief paper, critiques a host of bioconservative pundits who believe that enhancing human capacities and extending human healthspan would undermine our dignity.
Original essays by various prominent moral philosophers on the ethics of human enhancement.
The introductory chapter from the book: 1-22
A transhumanist ethical framework for public policy regarding genetic enhancements, particularly human germ-line genetic engineering
Anthology chapter on the ethics of human enhancement
Overview of ethical issues raised by the possibility of creating intelligent machines. Questions relate both to ensuring such machines do not harm humans and to the moral status of the machines themselves.
Some cursory notes; not very in-depth.
Short article summarizing some of the key issues and offering specific recommendations, illustrating the opportunity and need for "smart policy": the integration into public policy of a broad-spectrum of approaches aimed at protecting and enhancing cognitive capacities and epistemic performance of individuals and institutions.
A review/commentary on The Fountain of Youth (OUP, 2004).
The good life: just how good could it be? A vision of the future from the future.
After some definitions and conceptual clarification, I argue for two theses. First, some posthuman modes of being would be extremely worthwhile. Second, it could be good for human beings to become posthuman.
The revised version 2.1. The document represents an effort to develop a broadly based consensus articulation of the basics of responsible transhumanism. Some one hundred people collaborated with me in creating this text.
Wonderful ways of being may be located in the "posthuman realm", but we can't reach them. If we enhance ourselves using technology, however, we can go out §there and realize these values. This paper sketches a transhumanist axiology.
The human desire to acquire new capacities, to extend life and overcome obstacles to happiness is as ancient as the species itself. But transhumanism has emerged gradually as a distinctive outlook, with no one person being responsible for its present shape. Here's one account of how it happened.
Discusses the Fermi paradox, and explains why I hope we find no signs of life, whether extinct or still thriving, on Mars or anywhere else we look.
Existential risks are those that threaten the entire future of humanity. This paper elaborates the concept of existential risk and its relation to basic issues in axiology and develops an improved classification scheme for such risks. It also describes some of the theoretical and practical challenges posed by various existential risks and suggests a new way of thinking about the ideal of sustainability.
Examines the risk from physics experiments and natural events to the local fabric of spacetime. Argues that the Brookhaven report overlooks an observation selection effect. Shows how this limitation can be overcome by using data on planet formation rates.
This paper discusses four families of scenarios for humanity’s future: extinction, recurrent collapse, plateau, and posthumanity.
Twenty-six leading experts look at the gravest risks facing humanity in the 21st century, including natural catastrophes, nuclear war, terrorism, global warming, biological weapons, totalitarianism, advanced nanotechnology, general artificial intelligence, and social collapse. The book also addresses over-arching issues—policy responses and methods for predicting and managing catastrophes. Foreword by Lord Martin Rees.
This paper explores some dystopian scenarios where freewheeling evolutionary developments, while continuing to produce complex and intelligent forms of organization, lead to the gradual elimination of all forms of being worth caring about. We then discuss how such outcomes could be avoided and argue that under certain conditions the only possible remedy would be a globally coordinated effort to control human evolution by adopting social policies that modify the default fitness function of future life forms.
Technological revolutions are among the most important things that happen to humanity. This paper discusses some of the ethical and policy issues raised by anticipated technological revolutions, such as nanotechnology.
Existential risks are ways in which we could screw up badly and permanently. Remarkably, relatively little serious work has been done in this important area. The point, of course, is not to welter in doom and gloom but to better understand where the biggest dangers are so that we can develop strategies for reducing them.
Information hazards are risks that arise from the dissemination or the potential dissemination of true information that may cause harm or enable some agent to cause harm. Such hazards are often subtler than direct physical threats, and, as a consequence, are easily overlooked. They can, however, be important.
Concept describing a kind of social structure.
The embryo selection during IVF can be vastly potentiated when the technology for stem-cell derived gametes becomes available for use in humans. This would enable iterated embryo selection (IES), compressing the effective generation time in a selection program from decades to months.
Some have argued that because blind evolutionary processes produced human intelligence on Earth, it should be feasible for clever human engineers to create human-level artificial intelligence in the not-too-distant future. We evaluate this argument.
Human beings are a marvel of evolved complexity. Such systems can be difficult to enhance. Here we describe a heuristic for identifying and evaluating the practicality, safety and efficacy of potential human enhancements, based on evolutionary considerations.
Presents two theses, the orthogonality thesis and the instrumental convergence thesis, that help understand teh possible range of behavior of superintelligent agents - also pointing to some potential dangers in building such an agent.
A 130-page report on the technological prerequisites for whole brain emulation (aka "mind uploading").
Cognitive enhancements in the context of converging technologies.
Some new ideas related to the challenge of endowing a hypothetical future superintelligent AI with values that would cause it to act in ways that are beneficial. Paper is somewhat obscure.
Game theory model of a technology race to develop AI. Participants skimp on safety precautions to get there first. Analyzes factors that determine level of risk in the Nash equilibrium.
Preliminary survey of various issues related to the idea of using boxing methods to safely contain a superintelligent oracle AI.
Some polling data.
Cognitive enhancement comes in many diverse forms. In this paper, we survey the current state of the art in cognitive enhancement methods and consider their prospects for the near-term future. We then review some of ethical issues arising from these technologies. We conclude with a discussion of the challenges for public policy and regulation created by present and anticipated methods for cognitive enhancement.
This paper argues that at least one of the following propositions is true: (1) the human species is very likely to go extinct before reaching the posthuman stage; (2) any posthuman civilization is extremely unlikely to run significant number of simulations or (variations) of their evolutionary history; (3) we are almost certainly living in a computer simulation. It follows that the naïve transhumanist dogma that there is a significant chance that we will one day become posthumans who run ancestor-simulations is false, unless we are currently living in a simulation. A number of other consequences of this result are also discussed.
Superintelligence is now out in paperback. Buy many copies now!
“I highly recommend this book.”—Bill Gates
“terribly important ... groundbreaking” “extraordinary sagacity and clarity, enabling him to combine his wide-ranging knowledge over an impressively broad spectrum of disciplines — engineering, natural sciences, medicine, social sciences and philosophy — into a comprehensible whole” “If this book gets the reception that it deserves, it may turn out the most important alarm bell since Rachel Carson's Silent Springfrom 1962, or ever.”—Olle Haggstrom, Professor of Mathematical Statistics
“Nick Bostrom makes a persuasive case that the future impact of AI is perhaps the most important issue the human race has ever faced. ... It marks the beginning of a new era.”—Stuart Russell, Professor of Computer Science, University of California, Berkley
“Those disposed to dismiss an 'AI takeover' as science fiction may think again after reading this original and well-argued book.” —Martin Rees, Past President, Royal Society
“Worth reading.... We need to be super careful with AI. Potentially more dangerous than nukes”—Elon Musk
“There is no doubting the force of [Bostrom's] arguments ... the problem is a research challenge worthy of the next generation's best mathematical talent. Human civilisation is at stake.” —Financial Times
“This superb analysis by one of the world's clearest thinkers tackles one of humanity's greatest challenges: if future superhuman artificial intelligence becomes the biggest event in human history, then how can we ensure that it doesn't become the last?” —Professor Max Tegmark, MIT
“a damn hard read” —The Telegraph
Failure to consider observation selection effects result in a kind of bias that infest many branches of science and philosophy. This book presented the first mathematical theory for how to correct for these biases. It also discusses some implications for cosmology, evolutionary biology, game theory, the foundations of quantum mechanics, the Doomsday argument, the Sleeping Beauty problem, the search for extraterrestrial life, the question of whether God exists, and traffic planning.
Current cosmological theories say that the world is so big that all possible observations are in fact made. But then, how can such theories be tested? What could count as negative evidence? To answer that, we need to consider observation selection effects.
Summary of some of the difficulties that a theory of observation selection effects faces and sketch of a solution.
"Anthropic shadow" is an observation selection effect that prevent observers from observing certain kinds of catastrophes in their recent geological and evolutionary past. We risk underestimating the risk of catastrophe types that lie in this shadow.
An advanced Introduction to observation selection theory and its application to the cosmological fine-tuning problem.
Argues against Olum and the Self-Indication Assumption.
Have Korb and Oliver refuted the doomsday argument? No.
On the Doomsday argument and related paradoxes.
The Doomsday argument purports to prove, from basic probability theory and a few seemingly innocuous empirical premises, that the risk that our species will go extinct soon is much greater than previously thought. My view is that the Doomsday argument is inconclusive - although not for any trivial reason. In my book, I argued that a theory of observation selection effects is needed to explain where it goes wrong.
The Sleeping Beauty problem is an important test stone for theories about self-locating belief. I argue against both the traditional views on this problem and propose a new synthetic approach.
Argues against George Sower's refutation of the doomsday argument, and outlines what I think is the real flaw.
When driving on the motorway, have you ever wondered about (and cursed!) the fact that cars in the other lane seem to be getting ahead faster than you? One might be tempted to account for this by invoking Murphy's Law ("If anything can go wrong, it will", discovered by Edward A. Murphy, Jr, in 1949). But there is an alternative explanation, based on observational selection effects...
A paradoxical thought experiment
Examines the implications of recent evidence for a cosmological constant for the prospects of indefinite information processing in the multiverse. Co-authored with Milan M. Cirkovic.
If two brains are in identical states, are there two numerically distinct phenomenal experiences or only one? Two, I argue. But what happens in intermediary cases? This paper looks in detail at this question and suggests that there can be a fractional (non-integer) number of qualitatively identical experiences. This has implications for what it is to implement a computation and for Chalmer's Fading Qualia thought experiment.
A self-undermining variant of the Newcomb problem.
Finite version of Pascal's Wager.
Nick Bostrom is Professor at Oxford University, where he is the founding Director of the Future of Humanity Institute. He also directs the Strategic Artificial Intelligence Research Center. He is the author of some 200 publications, including Anthropic Bias (Routledge, 2002), Global Catastrophic Risks (ed., OUP, 2008), Human Enhancement (ed., OUP, 2009), and Superintelligence: Paths, Dangers, Strategies (OUP, 2014), a New York Times bestseller.
Bostrom has a background in physics, computational neuroscience, and mathematical logic as well as philosophy. He is recipient of a Eugene R. Gannon Award (one person selected annually worldwide from the fields of philosophy, mathematics, the arts and other humanities, and the natural sciences). He has been listed on Foreign Policy's Top 100 Global Thinkers list twice; and he was included on Prospect magazine's World Thinkers list, the youngest person in the top 15 from all fields and the highest-ranked analytic philosopher. His writings have been translated into 24 languages. There have been more than 100 translations and reprints of his works.
A thread that runs through my work is a concern with "crucial considerations". A crucial consideration is an idea or argument that might plausibly reveal the need for not just some minor course adjustment in our endeavours but a major change of direction or priority.
If we have overlooked even just one such crucial consideration, then all our best efforts might be for naught—or they might make things worse. It could therefore be very valuable to unearth an overlooked crucial consideration.
Where might one find a crucial consideration? Places to explore might include such issues and possibilities as existential risks, the simulation argument, human enhancement, infinite utilities, anthropic reasoning, information hazards, new techniques for solving social coordination problems, and the future of machine intelligence. It might also be relevant to inquire into moral philosophy, decision theory, normative uncertainty, game theory, epistemology, theology, and philosophy of mind—and, if one is looking for crucial considerations pertaining to one’s own decision-making, to reflect on one’s own values.
The search for crucial considerations relates to what I call “macrostrategy”: the quest to uncover links between long-term outcomes for humanity and present-day actions. How profound and inescapable is our cluelessness about such matters? And among the available bets, which one has the highest expected value?
I was born in Helsingborg, Sweden, and grew up by the seashore. I was bored in school. At age fifteen or sixteen I had an intellectual awakening, and feeling that I had wasted the first one and a half decades of my life, I resolved to focus on what was important. Since I did not know what was important, and I did not know how to find out, I decided to start by trying to place myself in a better position to find out. So I began a project of intellectual self-development, which I pursued with great intensity for the next one and a half decades.
As an undergraduate, I studied many subjects in parallel, and I gather that my performance set a national record. I was once expelled for studying too much, after the head of Umeå University psychology department discovered that I was concurrently following several other full-time programs of study (physics, philosophy, and mathematical logic), which he believed to be psychologically impossible.
For my postgraduate work, I went to London, where I studied physics and neuroscience at King's College, and obtained a PhD from the London School of Economics. For a while I did a little bit stand-up comedy on the vibrant London pub and theatre circuit.
During those years, I co-founded, with David Pearce, the World Transhumanist Association, a nonprofit grassroots organization. Later, I co-founded the Institute for Ethics and Emerging Technologies, a nonprofit think tank. The aim was to stimulate wider discussion about the implications of future technologies, particularly ones that could enable human enhancement. (Those organizations have since developed along their own trajectories, and it is definitely not the case that I agree with everything that may be said by those who flock under the transhumanist banner.)
Since 2006, I've been the founding director of the Future of Humanity Institute at Oxford University. We were initially incubated in the Oxford Martin School. FHI is a unique multidisciplinary research center in which a group of star researchers, from the fields of mathematics, computer science, philosophy, among other disciplines, come together to work on big-picture questions for humanity. The Institute, which has grown to about 20 people, has had an amazing track record of originating many of the concepts that now shape the field. FHI researchers were instrumental in the launching the effective altruism movement, which draws in part on FHI’s work on macrostrategy, existential risk, and technology policy. A number of other academic research centers, including at Cambridge, MIT, and Berkeley, have sprung up inspired by FHI’s example.
Since 2015, I also direct the Strategic Artificial Intelligence Research Center. The surprise bestseller Superintelligence, and some of our other work on AI, has changed the global conversation on the future of machine intelligence, helping to stimulate the emergence of a new field of technical research on scalable AI control methods. FHI runs a joint research seminar with DeepMind, and collaborate extensively with other groups active in this area, such as OpenAI, MIRI, and CHAI.
I am in a very fortunate position—having no teaching duties, being supported by a staff of brilliant research colleagues and assistants, and facing no restrictions on what I can work on. Must try hard to be worthy of such privilege!
(For more background, see e.g. New Yorker profile)
For admistrative matters, scheduling, speaking invitations, etc., please contact my assistant, Kyle Scott:
+44 (0)1865 286800
If you need to contact me directly (I regret I am unable to respond to all emails):
www.fhi.ox.ac.uk—Future of Humanity Institute
www.anthropic-principle.com—Papers on observational selection effects
www.simulation-argument.com—Devoted to the question, "Are you living in a computer simulation?"
www.existential-risk.org—Human extinction scenarios and related concerns
On the bank at the end
Of what was there before us
Gazing over to the other side
On what we can become
Veiled in the mist of naïve speculation
We are busy here preparing
Rafts to carry us across
Before the light goes out leaving us
In the eternal night of could-have-been
Keynote at the 2016 RSA Conference
My second TED talk
Radio lecture on extraterrestrial life and the Fermi Paradox, commissioned for the BBC Radio 3
On this page.
A long-form feature profile of me, by Raffi Khatchadourian.
Long article by Ross Andersen about the work of the Future of Humanity Institute
Interview for the meta-charity 80,000 Hours on how to make a maximally positive impact on the world for people contemplating an academic career trajectory
15-minute audio interview explaining explaining the simulation argument.
15-minute interview about status quo bias in bioethics, and the "reversal test" by which such bias might be cured.
Covering Future of Humanity Institute, crucial considerations, existential risks, information hazards, and academic specialization. Interviewed by Prof. Dave Levine, KZSU-FM.
Interviewed by Martin Eiermann about existential risks, genetic enhancements, and ethical discourses about technological progress.
With Peter Snow
Interviewed by David Edmonds
On the future of "human identity" in relation to information and communication technologies, automation and robotics, and biotechnology and medicine.
Summarizing some of the key issues and offering policy recommendations for a "smart policy" on biomedical methods of enhancing cognitive performance.
Humans will not always be the most intelligent agents on Earth, the ones steering the future. What will happen to us when we no longer play that role, and how can we prepare for this transition?
Those who seek the advancement of science should focus more on scientific research that facilitates further research across a wide range of domains—particularly cognitive enhancement.
Short letter to the editor on obstacles to the development of better cognitive enhancement drugs.
Fictional interview of an uploaded dog by Larry King.
A poetry cycle... in Swedish, unfortunately. I stopped writing poetry after this, although I've had a few relapses in the English language.
Imaginary dialogue, set in the year 2050, in which three pundits debate the big issues of their time
According to Francis Fukuyama, yes. This is my response.
Review of Kwame Anthony Appiah's book "Experiments in Ethics".
This paper, now a few years old, examines how likely it might be that we will develop superhuman artificial intelligence within the first third of this century.
This slightly more recent (but still obscolete) article briefly reviews the argument set out in the previous one, and notes four immediate consequences of human-level machine intelligence.
Response to 2008 Edge Question: "What have you changed your mind about?"
Response to 2009 Edge Question: "What will change everything?"
Response to 2010 Edge Question: "How has the Internet changing the way you think?"
Short article on existential risks.
Response to 2011 Edge Question: "What scientific concept would improve everybody's cognitive toolkit?"