Audrey Tang

So the kind of “reinforcement learning by community feedback” trains AI systems not to align vertically to a human individual but rather they align horizontally to human relationships. So if they can post the note that heals the divide, then they get rewarded. And so that way of training AI is fundamentally cooperative AI. And that is what I talk about.

鍵盤快捷鍵Keyboard shortcuts

j 下一段next speechk 上一段previous speech