I've just skimmed through the paper, so I don't make any assumptions about the described method, but aren't the predicted values in the last plot just 4 constant values? Is this behavior desired?
When a new approach is proposed, it's common to evaluate its performance against already established algorithms, like simple linear or lasso regression in this case. I'm sorry, but I don't see how your algorithm fares against the baselines, I don't even see any established real-world regression datasets.
I would advise you to look into that and test your model in a more controlled manner.
[deleted] OP t1_jdvehu6 wrote
[removed]