Last updated: 10.10.2022
Published: 09.06.2022
Is Power-Seeking AI an Existential Risk?
Further reading
04.30.2025
Video and transcript of talk on automating alignment research
From a talk at Anthropic in April 2025.
Continue reading
04.30.2025
How do we solve the alignment problem? / Part 6:
Can we safely automate alignment research?It’s really important; we have a real shot; there are a lot of ways we can fail.
Continue reading
03.14.2025
How do we solve the alignment problem? / Part 5:
AI for AI safetyWe should try extremely hard to use AI labor to help address the alignment problem.
Continue reading