nmkd wrote

Nope, it's multimodal in terms of understanding language and images. It wasn't trained on mouse movement because that's neither language nor imagery.
