Last updated: 10.10.2022
Published: 09.06.2022
Is Power-Seeking AI an Existential Risk?
Further reading
12.18.2024
Takes on “Alignment Faking in Large Language Models”
What can we learn from recent empirical demonstrations of scheming in frontier models?
Continue reading
10.08.2024
Video and transcript of presentation on Otherness and control in the age of AGI
An attempt to distill down the whole “Otherness and control” series into a single talk.
Continue reading
09.30.2024
(Part 2, AI takeover) Extended audio/transcript from my conversation with Dwarkesh Patel
Extra content includes: AI collusion; the nature of intelligence; concrete takeover scenarios; flawed training signals; tribalism and mistake theory; more on what good outcomes look like.
Continue reading