CS6140: Machine Learning, Spring 2017
Instructor: Lu Wang
College of Computer and Information Science, Northeastern University
Webpage: www.ccs.neu.edu/home/luwang
Email: [email protected]
Logistics
• Grades for A2 are out.
• Next week: course project presentation.
• The final report is due on 4/24. All assignments have to be in by 4/29.
• 4/20: final exam
• Additional office hours:
  – 4/17, 4-5pm (Lu, 448 WVH)
  – 4/18, 11am-12pm (TA, 166 WVH)
  – 4/19, 4-5pm (Lu, 448 WVH)
What we learned last time
• Introduction to Reinforcement Learning
• The Reinforcement Learning Problem
• Markov Decision Process
Today’s Outline
• Planning by Dynamic Programming
  – Policy evaluation and policy improvement
  – Value iteration
[Slides taken from David Silver’s reinforcement learning course]