There’s a lot of research in this regard, such as reinforcement learning from human feedback.