I liken it to trying to align a garden. We are much slower compared to silicon-based agents. If the garden is misaligned and seeks only to maximize a score, it might bulldoze itself every millisecond to win some random metric.
j previous speech k next speech