“So the kind of “reinforcement learning by community feedback” trains AI systems ...”

So the kind of “reinforcement learning by community feedback” trains AI systems not to align vertically to a human individual but rather they align horizontally to human relationships. So if they can post the note that heals the divide, then they get rewarded. And so that way of training AI is fundamentally cooperative AI. And that is what I talk about.

2025-07-22 Interview with Nikkei Asia

顯示前後文Show context

鍵盤快捷鍵Keyboard shortcuts