Uncategorized – AI Optimism

Deconstructing Bostrom’s Argument for AI Doom

I had a pretty great discussion with social psychologist and philosopher Lance Bush recently about the orthogonality thesis, which ended up turning into a broader analysis of Nick Bostrom’s argument for AI doom as presented in Superintelligence, and some related issues. While the video is intended for a general audience interested in philosophy, and assumes no…

Nora Belrose

March 10, 2024

Uncategorized

Counting arguments provide no evidence for AI doom

This is Part 2 of an essay series that started with "AI is easy to control." Introduction: AI doom scenarios often suppose that future AIs will engage in scheming: planning to escape, gain power, and pursue ulterior motives, while deceiving us into thinking they are aligned with our interests. The worry is that if a…

AI Optimism Board

February 27, 2024

Uncategorized

Introducing AI Optimism

Artificial intelligence promises to greatly improve the quality of life of every human on Earth. Already, AI assistants are democratizing access to education, high-quality medical advice, and psychotherapy. Text-to-image models like Midjourney and Stable Diffusion have unleashed the creativity of the masses, empowering people to create stunning artwork at little or no cost. Tools like…

Nora Belrose

October 16, 2023

Uncategorized

AI Pause Will Likely Backfire

Should we lobby governments to impose a moratorium on AI research? Since we don’t enforce pauses on most new technologies, I hope the reader will grant that the burden of proof is on those who advocate for such a moratorium. We should only advocate for such heavy-handed government action if it’s clear that the benefits…

Nora Belrose

September 16, 2023

Uncategorized

Deceptive Alignment is <1% Likely by Default

By David W. Thanks to Wil Perkins, Grant Fleming, Thomas Larsen, Declan Nishiyama, and Frank McBride for feedback on this post. Thanks also to Paul Christiano, Daniel Kokotajlo, and Aaron Scher for comments on the original post that helped clarify the argument. Any mistakes are my own. Introduction: In this post, I argue that deceptive alignment is…

AI Optimism Board

February 21, 2023

Uncategorized

against doom, technical
