Reinforcement Learning (Apprentissage par Renforcement)
Kenji Doya
[email protected]
ATR Human Information Science Laboratories
CREST, Japan Science and Technology