In 2027, they answered this question because, at a very critical juncture in training the next, probably misaligned, agent, they stopped, they paused, and then they looked around to the community to ask all those bicycle tinkerers: is anyone working on the same architecture who has what’s called mechanistic interpretability tools that can help us look at this half-formed agent and figure out whether it is misaligned?