Last updated: 10.10.2022
Published: 09.06.2022
Is Power-Seeking AI an Existential Risk?
Further reading
01.29.2026
How do we solve the alignment problem? / Part 10:
Building AIs that do human-like philosophyAIs will face philosophical questions humans can’t answer for them.
Continue reading
12.17.2025
Video and transcript of talk on human-like-ness in AI safety
From a talk I gave at Constellation in December 2025.
Continue reading
11.12.2025
How do we solve the alignment problem? / Part 9:
How human-like do safe AI motivations need to be?AIs with alien motivations can still follow instructions safely on the inputs that matter.
Continue reading