I was wondering of what would happen in the following scenario which I can't try since Bard is not supported in my country and I do not have GPT4:

>What happens if you give as input to bard or GPT4 an ASCII version of a screenshot of a video game and ask it from what game it has been taken or to describe the next likely action or the input?

Comments

Fit-Recognition9795 t1_jdmsxa9 wrote on March 25, 2023 at 3:51 PM

As an AI language model, both GPT-4 and its predecessors like me, ChatGPT, are designed to process and generate text, not to analyze images or visual data. Giving an ASCII representation of a screenshot to GPT-4 or any text-based language model would likely result in a poor understanding of the actual image, as the model doesn't have the capability to process images in the same way that a human or a dedicated image recognition AI can.

However, if the ASCII representation is clear enough and contains easily recognizable elements that are unique to a particular video game, there is a chance that GPT-4 might be able to make an educated guess about the game in question, but the accuracy would be significantly lower compared to proper image recognition AI.

Regarding the prediction of the next likely action or input, GPT-4 might be able to provide some generic suggestions based on the text description, but again, its ability to understand the actual visual information would be limited.

For analyzing images and making predictions about visual content, you would be better off using a dedicated image recognition AI model, such as OpenAI's DALL-E or an AI model specifically trained for video game analysis.

Eggy-Toast t1_jdn7cki wrote on March 25, 2023 at 5:33 PM

ChatGPT doesn’t know that GPT-4 has multimodal input though, right? I assume based on “not [designed] to analyze images or visual data” this is the case.

Fit-Recognition9795 t1_jdnbjar wrote on March 25, 2023 at 6:02 PM

Even worse, gpt-4 doesn't know it is gpt-4.

I have chatgpt-plus and the above answer is generated using the gpt-4 model.

[deleted] t1_je1xktp wrote on March 28, 2023 at 8:35 PM

[removed]

[deleted] t1_je1y7ve wrote on March 28, 2023 at 8:39 PM

[removed]

[deleted] t1_je208kr wrote on March 28, 2023 at 8:52 PM

[removed]

[deleted] t1_je20cd3 wrote on March 28, 2023 at 8:52 PM

[removed]

scratchmassive t1_jdmr3m3 wrote on March 25, 2023 at 3:37 PM

But why

[deleted] t1_je1ygm2 wrote on March 28, 2023 at 8:41 PM

[removed]

SeymourBits t1_jdnttwn wrote on March 25, 2023 at 8:13 PM

Interesting experiment. I have not done it but I predict it would hallucinate a well-documented video game screen, like Pac-man, and then describe probable actions within the hallucinated game.

Difficult_Bid_9828 t1_jdom9pj wrote on March 25, 2023 at 11:47 PM

I tried something like this by transforming a random image of a game I found (Doom and Mario 64) into ASCII art and giving it to Bing with the instruction to guess which game it was from. Did not work.

mskogly t1_jdq58o4 wrote on March 26, 2023 at 9:05 AM

I believe gpt4 can read and describe the content of images. No need to go via ascii.