nmkd t1_jdhmgpm wrote
Reply to comment by banmeyoucoward in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-
Nope, it's multimodal in terms of understanding language and images. It wasn't trained on mouse movement because that's neither language nor imagery.
Jean-Porte t1_jdjagqg wrote
> use 2 images
> movement
> boom
snylekkie t1_jdo5afc wrote
Absolutely mental