Asymptotic guarantees

Prove that if a safety process has enough resources (human data quality, training time, neural network capacity), then in the limit some system specification will be guaranteed. Use complexity theory, game theory, learning theory and other areas to both improve asymptotic guarantees and develop ways of showing convergence.

Theory of Change:Formal verification may be too hard. Make safety cases stronger by modelling their processes and proving that they would work in the limit.

General Approach:Cognitive

Target Case:Pessimistic

Orthodox Problems:

4.Goals misgeneralize out of distribution 7.Superintelligence can fool human supervisors