Our Programs
Learn all about AI alignment.
Explore Our Offerings
We provide various ways to engage with AI alignment through structured seminars, reading groups, and workshops for all levels of experience. Plus, we provide free food at every event!
Intro to AI Alignment Seminar
An overview of AI alignment, the field that aims to align advanced AI systems with human values and intentions. We cover the fundamentals such as interpreting the internals of neural networks, reinforcement learning from human feedback, and concrete ways that transformative AI might go really badly. Prior machine learning experience is not necessary! See our curriculum here.
Prerequisites: None
Duration: 8 weeks, ~2hr/wk
Advanced Reading Group
Each week, we select a technical paper from the AI alignment literature or related research and meet to individually read and then discuss them. We may also host talks from alignment researchers from time to time. Prior participation in our Intro to AI Alignment seminar is not necessary.
Prerequisites: Prior ML knowledge helpful
Duration: 8 weeks, ~1.5hr/wk
Coding Workshops
Live, hands-on coding workshops in GPU-accelerated Python notebooks. By the end of each session, participants will have their own working implementation of state-of-the-art techniques used by top AI research labs. Topics include finetuning, activation steering, and sleeper agents. These workshops were designed to be highly technical while still being as accessible as possible.
Prerequisites: Past programming experience helpful
Duration: Every other week, ~1.5hr/workshop
Join PAIA
Whether you're new to AI alignment or an experienced researcher, we have a place for you in our community.