Submitted by EmbarrassedFuel t3_10w5f9u in MachineLearning
UnusualClimberBear t1_j7opc2r wrote
Reply to comment by UnusualClimberBear in Model/paper ideas: reinforcement learning with a deterministic environment [D] by EmbarrassedFuel
Also if your world is deterministic but you cannot build a good model of it, it may be that you are close to the situation of games such as Go, and Monte Carlo Tree search algorithms are an option to consider (variants of UCT with or without function approximation)
EmbarrassedFuel OP t1_j7p40eo wrote
oh also the model needs to run at inference time in a relatively short period of time on cheap hardware :)
Viewing a single comment thread. View all comments