So, for language, for images, for multi-modality, and things like that, I think we need to build state-of-the-art open-source models, models that reach ChatGPT level and that can be run at a reasonable speed on networks.
