LINK@KoreaTech

Laboratory of Intelligent Networks at KoreaTech

  • Home
  • People
    • Professor
    • Lab. Members
  • Courses
  • Research
    • Reinforcement Learning
    • Deep Learning
    • Social Network Analysis
    • Intelligence Defined Networks
    • Internet of Things
  • Projects
  • Publication
    • Papers
    • Patents
    • Standards
    • Books
    • Blog
  • Awards
  • Invited Talks
  • Activities
  • Lab. Gallery
  • call4paper
  • ATRC

Reinforcement Learning

  • Lecture Notes

    1. Probability, Conditional Probability, Bayes’ theorem, Likelihood, MLE, and MAP
    2. Information, Entropy, Cross Entropy, KL Divergence
    3. Markov Decision Problem, Value Function & Q-Value
    4. Bellman Equation, Dynamic Programming
    5. Monte-Carlo Learning, SARSA, Q-Learning
    6. Deep Q-Learning
    7. Policy Gradient, REINFORCE, A2C
      • Papers
        • Policy Gradient Methods for Reinforcement Learning with Function Approximation
    8. TRPO & PPO
    9. Inverse Reinforcement Learning
      • Papers
        • Basic IRL, Ng & Russel, 2000
        • Apprenticeship Learning via IRL, Abbeel & Ng, 2004
        • Bayesian IRL, Ramachandran & Amir, 2007
        • Maximum Entropy IRL, Ziebart et al., 2008
        • Maximum Causal Entropy IRL, Ziebart et al., 2010
        • Maximum Entropy Deep IRL, Wulfmeier et al., 2016
    10. Unity Machine Learning Agents Toolkit
  • Laboratory

    • https://github.com/link-kut/rl: RL Project@LINK
    • Reinforcement Learning Basic – Javascript Apps.
    • AI Gym – Table-of-environments
    • AI Gym on Colab
  • Movie Clips

    • Google Deepmind’s Atari Breakout
  • Papers

    • Reinforcement Learning For Automated Trading
    • Reinforcement learning of motor skills with policy gradients
    • Policy Gradient Methods for Reinforcement Learning with Function Approximation
  • LINK@KoreaTech - Laboratory of Intelligent Networks at KoreaTech
facebook

Colorway WordPress Theme by InkThemes.com