Submitted by UberStone t3_1015pjo in MachineLearning
Hi. The recent incredible improvements in AI and ML has resurrected an old project of mine to read the back panel of electronic components like this AV receiver and spit out logically formed text to describe each I/O. I have a LOT of direct experience this specific issue as well as general software experience but no AI/ML development experience.
I know it is possible but on a scale of 1-10 how hard? Any new tools make this easier? Ultimately I want to feed the AI pictures of electronic back panels and get formatted text back.
Thanks!
evanthebouncy t1_j2lsn7h wrote
Wait 1 year until we have something like chatgpt but with vision integrated in. Currently it's typically an ocr followed by some nlp. But in a year it can be as simple as give a few examples of what you want done (few shot prompting) in a single model hosted online somewhere
I'd wait a bit more.