Joe Carlsmith
  • About
  • Archive
Is Power-Seeking AI an Existential Risk?
Last updated: 10.10.2022
Published: 09.06.2022

Is Power-Seeking AI an Existential Risk?

Further reading

11.12.2025
How do we solve the alignment problem? / Part 9:
How human-like do safe AI motivations need to be?

AIs with alien motivations can still follow instructions safely on the inputs that matter.

Continue reading
11.03.2025
Leaving Open Philanthropy, going to Anthropic

On a career move, and on AI-safety-focused people working at AI companies.

Continue reading
09.29.2025
How do we solve the alignment problem? / Part 8:
Controlling the options AIs can pursue

On blocking paths to power, and on making deals.

Continue reading
@ Joe Carlsmith, 2025
  • Policies
Designed by And–Now