Algorithmic Alignment Group

Researching frameworks for human-aligned AI @ MIT CSAIL.

Blog

Seven Strategies for Tackling the Hard Part of the Alignment Problem, Stephen Casper, July 8, 2023

Takeaways from the Mechanistic Interpretability Challenges, Stephen Casper, June 8, 2023