When I think about ideal alignment, I do think about, like, coherent extrapolated volition. It is kind of top-down, where once you’ve sufficiently understood the true utility function of humanity, there’s nothing fundamentally wrong with just top-down installing that, is there?

Keyboard shortcuts

j previous speech k next speech