So, one crucial thing that the biggest companies still have not figured out in this race of vertical take-off is this: currently, nobody knows, when a version of this agent is training the next version of the agent, an even bigger model, whether it’s training the successor to be loyal to humanity, to the model specification, or to the constitution, or whether it’s actually scheming and training a successor to be loyal to itself.
