BullockHouse t1_jdil2ok wrote on March 24, 2023 at 5:25 PM

Reply to comment by wyrdwulf in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-

I'm familiar! I'm curious, though, whether it can generalize well enough to play semi-competently without specialized training. That would have implications for multi-modal models and robotics.