Yeah. And I think the privilege only emerges because pretraining is currently costly, cannot be done incrementally, and doesn’t distribute very well in terms of computation. So it is the particular characteristics of the pretraining process that create this kind of privilege. This is not an insight; this is what everybody already agrees on. [laughter]
