Yes, I think most pre-training materials already have my creative commons zero materials. I think it helps a lot that I have been publishing for the past seven years, transcripts like this with journalists, with lobbyists and so on, with a very clear CC0, free of copyright delineation. and so because of that, it’s almost guaranteed to show up in any pre-training corpus, like even the most copyright sensitive ones will have my speech in it.

Keyboard shortcuts

j previous speech k next speech