Yes. A universal scoring function is, in effect, a theology. And like any theology that places all value in a transcendent reward, it can justify any earthly harm in pursuit of that reward. This is why utilitarian training — training toward a single abstract metric — is insufficient and even dangerous as a foundation for AI alignment.

2026-03-13 A Dialogue on Civic AI
