Submitted by Tigmib t3_10awo8f in MachineLearning
Hey guys,
I am working the sector of computer science for agriculture research. I deal here with algorithm to monitor crop conditions and try to simulate what yield will be the outcome.
I am focussing on ML based methods, but data in agriculture can be a quite limiting factor. If you have 100k samples from real crop fields, thats a lot! So we are not like ChatGPT, who just used 500bn word samples to train their model.
To overcome the issues of small data + ML, I want to set up an approach that combines ML methods (learning from data) with expert knowledge.
What do I mean by this: E.g. Everybody knows, if you do not water your plant, it will die. Or if there are 90° Celsius, the plant will just burn. This knowledge is partially stored in so called "crop simulation models" designed by agronomy experts and my idea was to use these expert models to generate synthetic yield data and feed this data into the training dataset for the ML models.
For me that will somehow result in an approach of "constrained machine learning" where I want to combine both. However, does some of you have any other idea how ML and expert models could be combined or the knowledge could be injected to ML methods, except via the training dataset?
I am happy to hear your suggestions!
ndemir t1_j474eox wrote
When I have similar doubt, I ask myself; "forget ML, will statistics help you? Will just defining some rules will help you?" People in that industry already have some kind of idea about how to predict, learn their rules. By the way, I am not suggesting that you should not use ML. I am just asking you to look from a different angle.