ertgbnm t1_jbyocgi wrote
Reply to comment by serge_cell in [N] Man beats machine at Go in human victory over AI : « It shows once again we’ve been far too hasty to ascribe superhuman levels of intelligence to machines. » by fchung
Isn't alphago trained against itself? So I would consider it adversarial training.
serge_cell t1_jc1to7o wrote
There was a paper about it. There was a find - specific set of positions not encountered or pooply represented during self-play. Fully trained AlphaGo was failing on those positions. However then they were explicitly added to the training set the problem was fixed and AlphaGo was able to play them well. This adversarial traning seems just an automatic way to find those positions.
PS fintess landscape is not convex it separated by hills and valleys. Self-play may have a problem in reaching all important states.
Viewing a single comment thread. View all comments