AI Safety Student Team LogoUChicagoAI Safety

AI Safety Fundamentals Fellowship

Our flagship fellowship introduces fellows from any background to the core ideas in AI safety, with a particular focus on existential risk from advanced AI. By the program's end, fellows will have a working map of the field and the foundation to dive deeper into the subareas that interest them.

Week 1: Philosophical and Political Foundations of AI Safety

Explore the implications of increasingly intelligent systems.

Week 2: Outer Alignment

Examine the challenges in correctly specifying training goals for AI systems.

Week 3: Deception, Inner Alignment & Mechanistic Interpretability

Investigate the concept of mesa-optimizers and the potential for deceptive behavior in AI systems.

Week 4: AI Security

Explore various AI security issues including jailbreaks, adversarial examples, and potential vulnerabilities.

Week 5: AI Governance

Examine the challenges and approaches to governing AI development and deployment.

Week 6: Criticisms and Counter-Arguments

Examine critiques of AI safety concerns and alternative perspectives on AI development.

Week 7: Further Reading and Discussion

Explore various AI alignment approaches and dive deeper into specific areas of interest. Fellows will choose one of the optional readings to focus on for the week.