And another one is around end of August, beginning of September, we’re going to have two local alignment assemblies where we ask people to evaluate how to align a powerful AI to the societal norm. And we’re going to take the result of those alignment assemblies and just make a low-rank adaptor to an existing AI and see if the AI can just listen to the assembly and steer its own behavior to fit the norms, also known as self-alignment.

Keyboard shortcuts

j previous speech k next speech