You May Also Enjoy
强化学习笔记(7)-时序差分方法
1 minute read
Temporal-Difference Learning
强化学习笔记(6)-随机梯度下降
1 minute read
Stochastic Approximation & Stochastic Gradient Descent
强化学习笔记(5)-蒙特卡洛方法
1 minute read
Monte Carlo Learning
强化学习笔记(4)-值迭代与策略迭代
2 minute read
Value iteration & Policy iteration