Shallow Review of Technical AI Safety, 2025

Aligning what?

Develop alternatives to agent-level models of alignment, by treating human-AI interactions, AI-assisted institutions, AI economic or cultural systems, drives within one AI, and other causal/constitutive processes as subject to alignment
Theory of Change:Model multiple reality-shaping processes above and below the level of the individual AI, some of which are themselves quasi-agential (e.g. cultures) or intelligence-like (e.g. markets), will develop AI alignment into a mature science for managing the transition to an AGI civilization
Some names:Richard Ngo, Emmett Shear, Softmax, Full Stack Alignment, AI Objectives Institute, Jan Kulveit
Estimated FTEs:5-10