Home - Joe Carlsmith

I’m a writer, researcher, and philosopher. I work as a senior advisor at Open Philanthropy, where I focus on existential risk from advanced artificial intelligence. I also write independently about various topics in philosophy and futurism, and I have a doctorate in philosophy from the University of Oxford.

Much of my work is about trying to help us orient wisely towards humanity’s long-term future. I see this work as fitting a loose logical structure, with questions about meta-ethics and rationality at the foundation, feeding into questions about ethics (and especially about effective altruism), which motivate concern for the long-term future, which motivates attention to advanced artificial intelligence in particular – and a further strand of my work (what I’m calling the “crazy train” strand) grapples with some of the stranger places that further philosophical reflection of this broad sort can lead. I also sometimes write about topics in the philosophy of mind, and about other aspects of my overall existential/spiritual orientation (death, evil, clinging, sincerity).

I recently completed a series of essays called "Otherness and control in the age of AGI" (PDF here), which mixes a lot of these topics together. You can listen to me discuss this series on the Dwarkesh Podcast here.

You can learn more about me here.
My CV is here.
My twitter is @jkcarlsmith.
My email is joseph [dot] k [dot] carlsmith @ gmail [dot] com.

If you'd like new essays emailed to you: Subscribe

RSS feed for this site here.

See here for audio versions of the essays, or search “Joe Carlsmith Audio” on your podcast app.

Latest

View all the latest essays

How do we solve the alignment problem? / Part 8: Controlling the options AIs can pursue

On blocking paths to power, and on making deals.

Video and transcript of talk on giving AIs safe motivations

From a talk at UT Austin in September 2025.

How do we solve the alignment problem? / Part 7: Giving AIs safe motivations

A four-part picture.

Favorites

View all Favorites

Is Power-Seeking AI an Existential Risk?

Report for Open Philanthropy examining what I see as the core argument for concern about existential risk from misaligned artificial intelligence.

Otherness and control in the age of AGI / Part 1: Otherness and control in the age of AGI

Introduction and summary for a series of essays about how agents with different values should relate to one another, and about the ethics of seeking and sharing power.

Actually possible: thoughts on Utopia

There are oceans we have barely dipped a toe into. There are drums and symphonies we can barely hear. There are suns whose heat we can barely feel on our skin.

Meta-Ethics

Why should ethical anti-realists do ethics?

Who needs a system if you’re free?

02.16.2023

Seeing more whole

On looking out of your own eyes.

02.17.2023

On the limits of idealized values

Contra some meta-ethical views, you can’t forever aim to approximate the self you would become in idealized conditions. You have to actively create yourself, often in the here and now.

06.21.2021

View all Meta-Ethics essays

Rationality

[Series] On expected utility

A series of essays examining the theoretical foundations of expected utility maximization as a theory of instrumental rationality. PDF of the full series here.

03.16.2022

Part 1: Skyscrapers and madmen

Part 2: Why it can be OK to predictably lose

Part 3: VNM, separability, and more

Part 4: Dutch books, Cox, and Complete Class

[Series] SIA > SSA

A series of essays arguing that one prominent theory of anthropic reasoning (the "Self-Indication Assumption" or "SIA") is better than another (the "Self-Sampling Assumption" or "SSA"). PDF of the full series here.

09.30.2021

Part 1: Learning from the fact that you exist

Part 2: Telekinesis, reference classes, and other scandals

Part 3: An aside on betting in anthropics

Part 4: In defense of the presumptuous philosopher

View all Rationality essays

Ethics

Wholehearted choices and “morality as taxes”

Reimagining Peter Singer’s drowning child from the perspective of care rather than guilt.

12.12.2020

In search of benevolence (or: what should you get Clippy for Christmas?)

What is altruism towards a paperclipper? Can you paint with all the colors of the wind at once?

07.19.2021

Care and demandingness

We wouldn’t expect prudence to be “not demanding.” Why would morality be different?

03.07.2021

View all Ethics essays

Longtermism

Actually possible: thoughts on Utopia

There are oceans we have barely dipped a toe into. There are drums and symphonies we can barely hear. There are suns whose heat we can barely feel on our skin.

01.18.2021

On future people, looking back at 21st century longtermism

An intuition pump for a certain kind of “holy sh**” reaction to existential risk, and to the possible size and quality of the future at stake.

03.22.2021

Against neutrality about creating happy lives

Making happy people is good. Just ask the golden rule.

03.14.2021

View all Longtermism essays

AI

[Series] Otherness and control in the age of AGI

A series of essays about how agents with different values should relate to one another, and about the ethics of seeking and sharing power. PDF here.

01.02.2024

Part 1: Otherness and control in the age of AGI

Part 2: Gentleness and the artificial Other

Part 3: Deep atheism and AI risk

Part 4: When “yang” goes wrong

Part 5: Does AI risk “other” the AIs?

Part 6: An even deeper atheism

Part 7: Being nicer than Clippy

Part 8: On the abolition of man

Part 9: On green

Part 10: On attunement

Part 11: Loving a world you don’t trust

New report: “Scheming AIs: Will AIs fake alignment during training in order to get power?”

My report examining the probability of a behavior often called “deceptive alignment.”

11.15.2023

View all AI essays

Crazy Train

A Stranger Priority? Topics at the Outer Reaches of Effective Altruism

My dissertation.

02.21.2023

Can you control the past?

I think that you can “control” events you have no causal interaction with, including events in the past, and that this is a wild and disorienting fact, with uncertain but possibly significant implications. This essay attempts to impart such disorientation.

08.27.2021

On infinite ethics

Infinities puncture the dream of a simple, bullet-biting utilitarianism. But they’re everyone’s problem.

01.30.2022

View all Crazy Train essays

Mind

Grokking illusionism

Trying to see through the eyes of people who think consciousness is an illusion.

11.29.2020

Thoughts on personal identity

On living in a glass tunnel, and on open air.

11.16.2020

How core is confusion about consciousness?

If we’re confused about consciousness, what else are we confused about?

11.08.2020

View all Mind essays

Spirit

Thoughts on being mortal

You can’t keep any of it. The only thing to do is to give it away on purpose.

12.06.2020

On clinging

How can “non-attachment” be compatible with care? We need to distinguish between caring and clinging.

01.24.2021

On sincerity

Nearby is the country they call life.

12.23.2022

View all Spirit essays

Misc

Killing the ants

If you kill something, look it in the eyes as you do.

02.07.2021

In memory of Louise Glück

“It was, she said, a great discovery, albeit my real life.”

10.15.2023

The impact merge

Against hitching together your desire to accomplish things and your desire to do good.

11.22.2020

View all Misc essays

About

Before doing my doctorate at Oxford, I was a PhD student in philosophy at NYU from 2016-2018. I also have a BPhil in philosophy from Oxford, and a BA in philosophy from Yale.

I spent the academic year of 2017-18 helping Toby Ord write The Precipice: Existential Risk and the Future of Humanity, which is a clear statement of a number of ideas that matter to me: notably, that human history might be just beginning; that the future could be incomprehensibly vast and extraordinary; and that it is extremely important that we do not mess up in a way that destroys this possibility. This New Yorker piece about Toby and the book is also a good summary.

I am also interested in meditation. I've spent over a year of my life on silent meditation retreat, in stretches ranging from a few days to three months.

Opinions on this site are my own, not speaking on behalf of my employer.

Audio:

Extended audio from my conversation with Dwarkesh Patel. Transcript here.
"Joe Carlsmith on Scheming AI," on Hear This Idea with Fin Moorhouse. Transcript here.
"Navigating serious philosophical confusions" on the 80,000 Podcast with Rob Wiblin. Transcript here.
"Utopia on earth and morality without guilt" on Clearer Thinking with Spencer Greenberg.
"Creating Utopia" on the Utilitarian Podcast with Gus Docker. Transcript here.

Video:

"Can goodness compete?" (Public talk at Mox, July 2025)
"How should we think about AI welfare?", (Talk at Anthropic, May 2025).
"Can we safely automate alignment research?" (Talk at Anthropic, April 2025).
"Otherness and control in the age of AGI," (Lecture at Stanford University, October 2024).
"Otherness and control in the age of AGI," (The Dwarkesh Podcast, August 2024).
"Sharing Reality with Walt Whitman" (Lovely short video created by Michel Justen, inspired by my essay "On future people, looking back at 21st century longtermism," April 2024)
"Scheming AIs" (EA Global Global Catastrophic Risks, February 2024)
"On Infinite Ethics, Utopia, and AI" (The Foresight Institute Existential Hope Podcast, September 2023)
"The Dangers of Advanced AI: An Existential Risk?" (The Flares Podcast, July 2023).
"How We Change Our Minds About AI Risk" (Future of Life Institute Podcast with Gus Docker, June 2023)
"Utopia, AI, and Infinite Ethics" (Lunar Society Podcast with Dwarkesh Patel, August 2022)
"Existential Risk from Power-seeking AI" (Harvard Effective Altruism Agathon Lecture Series)
"Orienting towards the long-term future" (Effective Altruism Global 2017)

The writing on this site published prior to October 2022 was originally hosted on the blog Hands and Cities, which now links to this site. You can view the old comments on that blog here.

The logo is a reference to the "true city," which is a term I once heard used to refer to Utopia. The other images are from Marilyn Robinson’s Housekeeping, and from Walt Whitman’s "Crossing Brooklyn Ferry."

"Something happened, something so memorable that when I think back to the crossing of the bridge, one moment bulges like the belly of a lens and all the others are at the peripheries and diminished. Was it only that the wind rose suddenly, so that we had to cower and lean against it like blind women groping their way along a wall? or did we really hear some sound too loud to be heard, some word so true we did not understand it, but merely felt it pour through our nerves like darkness or water?"

— Marilyn Robinson

"It avails not, time nor place—distance avails not,
I am with you, you men and women of a generation, or ever so many generations hence,
Just as you feel when you look on the river and sky, so I felt,
Just as any of you is one of a living crowd, I was one of a crowd,
Just as you are refresh’d by the gladness of the river and the bright flow, I was refresh’d,
Just as you stand and lean on the rail, yet hurry with the swift current, I stood yet was hurried..."

— Walt Whitman