Shallow Review of Technical AI Safety, 2025

Scientist AI

Develop powerful, nonagentic, uncertain world models that accelerate scientific progress while avoiding the risks of agent AIs
Theory of Change: Developing non-agentic 'Scientist AI' allows us to: (i) reap the benefits of AI progress while (ii) avoiding the inherent risks of agentic systems. These systems can also (iii) provide a useful guardrail to protect us from unsafe agentic AIs by double-checking actions they propose, and (iv) help us more safely build agentic superintelligent systems.
General Approach: Cognitive
Target Case: Pessimistic
See Also:
Some names: Yoshua Bengio
Estimated FTEs: 1-10
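The guardrail idea in (iii) can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: all names here (`ScientistAIGuardrail`, `harm_model`, the toy action format) are hypothetical. The core assumption is that a non-agentic world model can answer probabilistic queries of the form P(harm | state, proposed action), and a wrapper vetoes any agent action whose estimated harm probability exceeds a threshold.

```python
# Illustrative sketch (all names hypothetical): a non-agentic world model
# used as a guardrail that vetoes an agent's proposed actions when the
# estimated probability of harm exceeds a threshold.

from dataclasses import dataclass


@dataclass
class HarmEstimate:
    probability: float  # calibrated P(harm | state, action), in [0, 1]


class ScientistAIGuardrail:
    """Wraps a non-agentic harm estimator.

    The wrapped model plans nothing and takes no actions itself; it only
    answers probabilistic queries about the consequences of actions that
    a separate agent proposes.
    """

    def __init__(self, harm_model, threshold: float = 0.01):
        self.harm_model = harm_model  # callable: (state, action) -> HarmEstimate
        self.threshold = threshold

    def check(self, state, action) -> bool:
        """Return True if the proposed action is allowed to proceed."""
        estimate = self.harm_model(state, action)
        return estimate.probability <= self.threshold


# Toy stand-in for the world model: flags actions tagged as irreversible.
def toy_harm_model(state, action) -> HarmEstimate:
    return HarmEstimate(0.9 if action.get("irreversible") else 0.001)


guardrail = ScientistAIGuardrail(toy_harm_model, threshold=0.01)
print(guardrail.check({}, {"name": "read_file"}))                      # allowed
print(guardrail.check({}, {"name": "wipe_db", "irreversible": True}))  # vetoed
```

The key design point this sketch tries to capture is the separation of roles: the agent proposes, the uncertain world model only estimates, and a simple threshold rule decides, so the safety check itself involves no agency.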
Critiques:
Outputs:
Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? (Yoshua Bengio, Michael Cohen, Damiano Fornasiere, Joumana Ghosn, Pietro Greiner, Matt MacDermott, Sören Mindermann, Adam Oberman, Jesse Richardson, Oliver Richardson, Marc-Antoine Rondeau, Pierre-Luc St-Charles, David Williams-King)