Reinforcement Learning

Dynamic Programming - Policy Iteration, Value Iteration

LINK@KoreaTech
Febraury, 14, 2019
Email: link.koreatech at gmail.com
This site is made by using the source codes shared from the site, REINFORCEjs

Policy Iteration


  • Sum of All State Values: 0
  • Difference of Sum of All State Values: -1
  • Status: RESET
  • Discount Factor (gamma): 0.75
Iteration: -1



Value Iteration


  • Sum of All State Values: 0
  • Difference of Sum of All State Values: -1
  • Status: RESET
  • Discount Factor (gamma): 0.75
Iteration: -1


Laboratory Partners