And now, in the face-to-face workshop, which lasts an entire day, we will make a long transcript of what people eventually agreed on as the norms, and this long context can then be used to train a constitutional AI model, so that this AI model behaves exactly the way the workshop’s participants agreed that it should behave. It’s like collaboratively making a constitution for a language model, so that language model can behave thoroughly according to these people’s wishes, and we can continuously integrate new feedbacks so that those models increasingly align with what people want.

Keyboard shortcuts

j previous speech k next speech