notes-cog-ai-explorationExploitation

Difference between revision 1 and current revision

No diff available.

http://en.wikipedia.org/wiki/Multi-armed_bandit#Approximate_solutions

http://en.wikipedia.org/wiki/Thompson_sampling

--

See also the solution in notes-cog-ai-games-go.