Article Details
Retrieved on: 2018-02-27 23:13:12
Excerpt
"It found these human moves, it tried them, then ultimately it found something it prefers," AlphaGo's lead programmer David Silver said at the time. It's also worth noting that the Qbert agent described in the new paper is using a different machine-learning technique from AlphaGo Zero's reinforcement ...