Because the other metaphor you sometimes hear is the ozone-freon or climate change metaphor. People are polluting the information landscape and we have to think of compounds that does not pollute. That’s where the provenance/watermarking conversation is coming in from that metaphor, but there’s something fundamentally different in that everybody can set up carbon sensors or greenhouse gas sensors or pollution sensors without any permit, right? To emit pollution, you need permit, but to measure, you don’t need permit. But the interactions with language models is fundamentally private, and nobody wants sensors — and censors — all the time, when they have such interactions. So unless it can be done through a trusted intermediary, which maintains contextual integrity and shares in a zero-knowledge way, nobody would want their open-source model and their fine-tunings to be measured in a way that could contribute to the risk sensing.

Keyboard shortcuts

j previous speech k next speech