Algorithmic Alignment Group
Researching frameworks for human-aligned AI @ MIT CSAIL.
Home
Team
Research
Blog
Contact
Blog
Seven Strategies for Tackling the Hard Part of the Alignment Problem
, Stephen Casper, July 8, 2023
Takeaways from the Mechanistic Interpretability Challenges
, Stephen Casper, June 8, 2023