BullockHouse t1_jdil2ok wrote
Reply to comment by wyrdwulf in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-
I'm familiar! I'm curious though if it can generalize well enough to play semi-competently without specialized training. Has implications for multi-modal models and robotics.
Viewing a single comment thread. View all comments