notes-cog-ai-explorationExploitation

http://en.wikipedia.org/wiki/Multi-armed_bandit#Approximate_solutions

http://en.wikipedia.org/wiki/Thompson_sampling

--

See also the solution in notes-cog-ai-games-go.