589689.xyz

[] Udemy - Artificial Intelligence Reinforcement Learning in Python

  • 收录时间:2022-04-04 17:04:14
  • 文件大小:3GB
  • 下载次数:1
  • 最近下载:2022-04-04 17:04:14
  • 磁力链接:

文件列表

  1. 10/1. Windows-Focused Environment Setup 2018.mp4 186MB
  2. 4. Markov Decision Proccesses/11. Bellman Examples.mp4 87MB
  3. 11. Extra Help With Python Coding for Beginners (FAQ by Student Request)/3. Proof that using Jupyter Notebook is the same as not using it.mp4 78MB
  4. 2. Return of the Multi-Armed Bandit/16. Bayesian Bandits Thompson Sampling Theory (pt 2).mp4 75MB
  5. 5. Dynamic Programming/4. Iterative Policy Evaluation in Code.mp4 68MB
  6. 9. Stock Trading Project with Reinforcement Learning/6. Code pt 2.mp4 65MB
  7. 1. Welcome/5. Warmup.mp4 63MB
  8. 4. Markov Decision Proccesses/5. Markov Decision Processes (MDPs).mp4 62MB
  9. 5. Dynamic Programming/9. Policy Iteration in Code.mp4 56MB
  10. 4. Markov Decision Proccesses/12. Optimal Policy and Optimal Value Function (pt 1).mp4 56MB
  11. 2. Return of the Multi-Armed Bandit/15. Bayesian Bandits Thompson Sampling Theory (pt 1).mp4 56MB
  12. 2. Return of the Multi-Armed Bandit/12. UCB1 Theory.mp4 56MB
  13. 3. High Level Overview of Reinforcement Learning/1. What is Reinforcement Learning.mp4 55MB
  14. 4. Markov Decision Proccesses/2. Gridworld.mp4 54MB
  15. 9. Stock Trading Project with Reinforcement Learning/2. Data and Environment.mp4 52MB
  16. 2. Return of the Multi-Armed Bandit/1. Section Introduction The Explore-Exploit Dilemma.mp4 52MB
  17. 5. Dynamic Programming/10. Policy Iteration in Windy Gridworld.mp4 51MB
  18. 2. Return of the Multi-Armed Bandit/2. Applications of the Explore-Exploit Dilemma.mp4 51MB
  19. 2. Return of the Multi-Armed Bandit/24. (Optional) Alternative Bandit Designs.mp4 50MB
  20. 9. Stock Trading Project with Reinforcement Learning/5. Code pt 1.mp4 50MB
  21. 9. Stock Trading Project with Reinforcement Learning/8. Code pt 4.mp4 49MB
  22. 2. Return of the Multi-Armed Bandit/19. Thompson Sampling With Gaussian Reward Theory.mp4 49MB
  23. 5. Dynamic Programming/6. Iterative Policy Evaluation for Windy Gridworld in Code.mp4 47MB
  24. 5. Dynamic Programming/3. Gridworld in Code.mp4 47MB
  25. 5. Dynamic Programming/12. Value Iteration in Code.mp4 46MB
  26. 9. Stock Trading Project with Reinforcement Learning/3. How to Model Q for Q-Learning.mp4 45MB
  27. 10/2. How to install Numpy, Scipy, Matplotlib, Pandas, IPython, Theano, and TensorFlow.mp4 44MB
  28. 2. Return of the Multi-Armed Bandit/8. Comparing Different Epsilons.mp4 44MB
  29. 2. Return of the Multi-Armed Bandit/20. Thompson Sampling With Gaussian Reward Code.mp4 43MB
  30. 5. Dynamic Programming/5. Windy Gridworld in Code.mp4 41MB
  31. 2. Return of the Multi-Armed Bandit/7. Epsilon-Greedy in Code.mp4 41MB
  32. 3. High Level Overview of Reinforcement Learning/3. From Bandits to Full Reinforcement Learning.mp4 41MB
  33. 1. Welcome/2. Course Outline and Big Picture.mp4 40MB
  34. 4. Markov Decision Proccesses/6. Future Rewards.mp4 40MB
  35. 13. Appendix FAQ/2. BONUS Where to get discount coupons and FREE deep learning material.mp4 38MB
  36. 13. Appendix FAQ Finale/2. BONUS Where to get discount coupons and FREE deep learning material.mp4 38MB
  37. 12. Effective Learning Strategies for Machine Learning (FAQ by Student Request)/4. Machine Learning and AI Prerequisite Roadmap (pt 2).mp4 38MB
  38. 4. Markov Decision Proccesses/1. MDP Section Introduction.mp4 37MB
  39. 3. High Level Overview of Reinforcement Learning/2. On Unusual or Unexpected Strategies of RL.mp4 37MB
  40. 2. Return of the Multi-Armed Bandit/23. Bandit Summary, Real Data, and Online Learning.mp4 35MB
  41. 1. Welcome/1. Introduction.mp4 34MB
  42. 9. Stock Trading Project with Reinforcement Learning/7. Code pt 3.mp4 34MB
  43. 2. Return of the Multi-Armed Bandit/18. Thompson Sampling Code.mp4 33MB
  44. 4. Markov Decision Proccesses/3. Choosing Rewards.mp4 32MB
  45. 2. Return of the Multi-Armed Bandit/22. Nonstationary Bandits.mp4 31MB
  46. 12. Effective Learning Strategies for Machine Learning (FAQ by Student Request)/3. Machine Learning and AI Prerequisite Roadmap (pt 1).mp4 29MB
  47. 2. Return of the Multi-Armed Bandit/5. Epsilon-Greedy Beginner's Exercise Prompt.mp4 29MB
  48. 2. Return of the Multi-Armed Bandit/3. Epsilon-Greedy Theory.mp4 28MB
  49. 4. Markov Decision Proccesses/8. The Bellman Equation (pt 1).mp4 28MB
  50. 2. Return of the Multi-Armed Bandit/21. Why don't we just use a library.mp4 27MB
  51. 9. Stock Trading Project with Reinforcement Learning/1. Stock Trading Project Section Introduction.mp4 27MB
  52. 4. Markov Decision Proccesses/9. The Bellman Equation (pt 2).mp4 27MB
  53. 4. Markov Decision Proccesses/10. The Bellman Equation (pt 3).mp4 25MB
  54. 2. Return of the Multi-Armed Bandit/11. Optimistic Initial Values Code.mp4 25MB
  55. 11. Extra Help With Python Coding for Beginners (FAQ by Student Request)/1. How to Code by Yourself (part 1).mp4 25MB
  56. 2. Return of the Multi-Armed Bandit/6. Designing Your Bandit Program.mp4 25MB
  57. 2. Return of the Multi-Armed Bandit/9. Optimistic Initial Values Theory.mp4 24MB
  58. 9. Stock Trading Project with Reinforcement Learning/4. Design of the Program.mp4 23MB
  59. 2. Return of the Multi-Armed Bandit/4. Calculating a Sample Mean (pt 1).mp4 23MB
  60. 1. Welcome/3. Where to get the Code.mp4 23MB
  61. 5. Dynamic Programming/2. Designing Your RL Program.mp4 22MB
  62. 4. Markov Decision Proccesses/4. The Markov Property.mp4 22MB
  63. 2. Return of the Multi-Armed Bandit/14. UCB1 Code.mp4 21MB
  64. 4. Markov Decision Proccesses/7. Value Functions.srt 19MB
  65. 4. Markov Decision Proccesses/7. Value Functions.mp4 19MB
  66. 12. Effective Learning Strategies for Machine Learning (FAQ by Student Request)/1. How to Succeed in this Course (Long Version).mp4 18MB
  67. 2. Return of the Multi-Armed Bandit/17. Thompson Sampling Beginner's Exercise Prompt.mp4 18MB
  68. 2. Return of the Multi-Armed Bandit/25. Suggestion Box.mp4 16MB
  69. 9. Stock Trading Project with Reinforcement Learning/9. Stock Trading Project Discussion.mp4 16MB
  70. 4. Markov Decision Proccesses/13. Optimal Policy and Optimal Value Function (pt 2).mp4 16MB
  71. 1. Welcome/4. How to Succeed in this Course.mp4 16MB
  72. 11. Extra Help With Python Coding for Beginners (FAQ by Student Request)/2. How to Code by Yourself (part 2).mp4 15MB
  73. 4. Markov Decision Proccesses/14. MDP Summary.mp4 14MB
  74. 2. Return of the Multi-Armed Bandit/10. Optimistic Initial Values Beginner's Exercise Prompt.mp4 14MB
  75. 8. Approximation Methods/9. Course Summary and Next Steps.mp4 13MB
  76. 2. Return of the Multi-Armed Bandit/13. UCB1 Beginner's Exercise Prompt.mp4 13MB
  77. 8. Approximation Methods/8. Semi-Gradient SARSA in Code.mp4 11MB
  78. 6. Monte Carlo/6. Monte Carlo Control in Code.mp4 10MB
  79. 6. Monte Carlo/5. Monte Carlo Control.mp4 9MB
  80. 7. Temporal Difference Learning/5. SARSA in Code.mp4 9MB
  81. 6. Monte Carlo/2. Monte Carlo Policy Evaluation.mp4 9MB
  82. 8. Approximation Methods/6. TD(0) Semi-Gradient Prediction.mp4 8MB
  83. 5. Dynamic Programming/13. Dynamic Programming Summary.mp4 8MB
  84. 7. Temporal Difference Learning/4. SARSA.mp4 8MB
  85. 6. Monte Carlo/8. Monte Carlo Control without Exploring Starts in Code.mp4 8MB
  86. 6. Monte Carlo/3. Monte Carlo Policy Evaluation in Code.mp4 8MB
  87. 11. Extra Help With Python Coding for Beginners (FAQ by Student Request)/4. Python 2 vs Python 3.mp4 8MB
  88. 6. Monte Carlo/4. Policy Evaluation in Windy Gridworld.mp4 8MB
  89. 8. Approximation Methods/5. Monte Carlo Prediction with Approximation in Code.mp4 7MB
  90. 8. Approximation Methods/2. Linear Models for Reinforcement Learning.mp4 6MB
  91. 8. Approximation Methods/1. Approximation Intro.mp4 6MB
  92. 8. Approximation Methods/3. Features.mp4 6MB
  93. 5. Dynamic Programming/11. Value Iteration.mp4 6MB
  94. 7. Temporal Difference Learning/2. TD(0) Prediction.mp4 6MB
  95. 6. Monte Carlo/9. Monte Carlo Summary.mp4 6MB
  96. 13. Appendix FAQ Finale/1. What is the Appendix.mp4 5MB
  97. 13. Appendix FAQ/1. What is the Appendix.mp4 5MB
  98. 7. Temporal Difference Learning/7. Q Learning in Code.mp4 5MB
  99. 7. Temporal Difference Learning/3. TD(0) Prediction in Code.mp4 5MB
  100. 6. Monte Carlo/1. Monte Carlo Intro.mp4 5MB
  101. 5. Dynamic Programming/1. Intro to Dynamic Programming and Iterative Policy Evaluation.mp4 5MB
  102. 7. Temporal Difference Learning/6. Q Learning.mp4 5MB
  103. 8. Approximation Methods/7. Semi-Gradient SARSA.mp4 5MB
  104. 6. Monte Carlo/7. Monte Carlo Control without Exploring Starts.mp4 5MB
  105. 5. Dynamic Programming/7. Policy Improvement.mp4 5MB
  106. 7. Temporal Difference Learning/8. TD Summary.mp4 4MB
  107. 5. Dynamic Programming/8. Policy Iteration.mp4 3MB
  108. 8. Approximation Methods/4. Monte Carlo Prediction with Approximation.mp4 3MB
  109. 7. Temporal Difference Learning/1. Temporal Difference Intro.mp4 3MB
  110. 11. Extra Help With Python Coding for Beginners (FAQ by Student Request)/1. How to Code by Yourself (part 1).srt 30KB
  111. 4. Markov Decision Proccesses/11. Bellman Examples.srt 29KB
  112. 2. Return of the Multi-Armed Bandit/16. Bayesian Bandits Thompson Sampling Theory (pt 2).srt 26KB
  113. 12. Effective Learning Strategies for Machine Learning (FAQ by Student Request)/4. Machine Learning and AI Prerequisite Roadmap (pt 2).srt 23KB
  114. 2. Return of the Multi-Armed Bandit/12. UCB1 Theory.srt 22KB
  115. 4. Markov Decision Proccesses/5. Markov Decision Processes (MDPs).srt 22KB
  116. 10/1. Windows-Focused Environment Setup 2018.srt 20KB
  117. 1. Welcome/5. Warmup.srt 20KB
  118. 4. Markov Decision Proccesses/2. Gridworld.srt 19KB
  119. 11. Extra Help With Python Coding for Beginners (FAQ by Student Request)/2. How to Code by Yourself (part 2).srt 18KB
  120. 2. Return of the Multi-Armed Bandit/15. Bayesian Bandits Thompson Sampling Theory (pt 1).srt 18KB
  121. 10/2. How to install Numpy, Scipy, Matplotlib, Pandas, IPython, Theano, and TensorFlow.srt 18KB
  122. 5. Dynamic Programming/4. Iterative Policy Evaluation in Code.srt 18KB
  123. 5. Dynamic Programming/3. Gridworld in Code.srt 18KB
  124. 9. Stock Trading Project with Reinforcement Learning/2. Data and Environment.srt 17KB
  125. 2. Return of the Multi-Armed Bandit/19. Thompson Sampling With Gaussian Reward Theory.srt 17KB
  126. 12. Effective Learning Strategies for Machine Learning (FAQ by Student Request)/3. Machine Learning and AI Prerequisite Roadmap (pt 1).srt 16KB
  127. 8. Approximation Methods/9. Course Summary and Next Steps.srt 16KB
  128. 2. Return of the Multi-Armed Bandit/24. (Optional) Alternative Bandit Designs.srt 15KB
  129. 2. Return of the Multi-Armed Bandit/1. Section Introduction The Explore-Exploit Dilemma.srt 15KB
  130. 12. Effective Learning Strategies for Machine Learning (FAQ by Student Request)/1. How to Succeed in this Course (Long Version).srt 15KB
  131. 4. Markov Decision Proccesses/6. Future Rewards.srt 14KB
  132. 11. Extra Help With Python Coding for Beginners (FAQ by Student Request)/3. Proof that using Jupyter Notebook is the same as not using it.srt 14KB
  133. 3. High Level Overview of Reinforcement Learning/3. From Bandits to Full Reinforcement Learning.srt 13KB
  134. 9. Stock Trading Project with Reinforcement Learning/3. How to Model Q for Q-Learning.srt 13KB
  135. 9. Stock Trading Project with Reinforcement Learning/6. Code pt 2.srt 13KB
  136. 4. Markov Decision Proccesses/12. Optimal Policy and Optimal Value Function (pt 1).srt 13KB
  137. 5. Dynamic Programming/10. Policy Iteration in Windy Gridworld.srt 12KB
  138. 4. Markov Decision Proccesses/8. The Bellman Equation (pt 1).srt 12KB
  139. 5. Dynamic Programming/9. Policy Iteration in Code.srt 12KB
  140. 3. High Level Overview of Reinforcement Learning/1. What is Reinforcement Learning.srt 12KB
  141. 2. Return of the Multi-Armed Bandit/2. Applications of the Explore-Exploit Dilemma.srt 12KB
  142. 1. Welcome/2. Course Outline and Big Picture.srt 11KB
  143. 5. Dynamic Programming/5. Windy Gridworld in Code.srt 11KB
  144. 5. Dynamic Programming/6. Iterative Policy Evaluation for Windy Gridworld in Code.srt 11KB
  145. 6. Monte Carlo/2. Monte Carlo Policy Evaluation.srt 11KB
  146. 2. Return of the Multi-Armed Bandit/3. Epsilon-Greedy Theory.srt 10KB
  147. 9. Stock Trading Project with Reinforcement Learning/5. Code pt 1.srt 10KB
  148. 6. Monte Carlo/5. Monte Carlo Control.srt 10KB
  149. 2. Return of the Multi-Armed Bandit/22. Nonstationary Bandits.srt 10KB
  150. 2. Return of the Multi-Armed Bandit/23. Bandit Summary, Real Data, and Online Learning.srt 10KB
  151. 5. Dynamic Programming/12. Value Iteration in Code.srt 10KB
  152. 7. Temporal Difference Learning/4. SARSA.srt 10KB
  153. 4. Markov Decision Proccesses/9. The Bellman Equation (pt 2).srt 9KB
  154. 5. Dynamic Programming/13. Dynamic Programming Summary.srt 9KB
  155. 2. Return of the Multi-Armed Bandit/7. Epsilon-Greedy in Code.srt 9KB
  156. 4. Markov Decision Proccesses/1. MDP Section Introduction.srt 9KB
  157. 9. Stock Trading Project with Reinforcement Learning/4. Design of the Program.srt 9KB
  158. 4. Markov Decision Proccesses/4. The Markov Property.srt 9KB
  159. 9. Stock Trading Project with Reinforcement Learning/8. Code pt 4.srt 9KB
  160. 4. Markov Decision Proccesses/10. The Bellman Equation (pt 3).srt 9KB
  161. 3. High Level Overview of Reinforcement Learning/2. On Unusual or Unexpected Strategies of RL.srt 9KB
  162. 2. Return of the Multi-Armed Bandit/4. Calculating a Sample Mean (pt 1).srt 8KB
  163. 2. Return of the Multi-Armed Bandit/21. Why don't we just use a library.srt 8KB
  164. 13. Appendix FAQ/2. BONUS Where to get discount coupons and FREE deep learning material.srt 8KB
  165. 2. Return of the Multi-Armed Bandit/20. Thompson Sampling With Gaussian Reward Code.srt 8KB
  166. 8. Approximation Methods/1. Approximation Intro.srt 8KB
  167. 2. Return of the Multi-Armed Bandit/9. Optimistic Initial Values Theory.srt 8KB
  168. 13. Appendix FAQ Finale/2. BONUS Where to get discount coupons and FREE deep learning material.srt 8KB
  169. 8. Approximation Methods/2. Linear Models for Reinforcement Learning.srt 7KB
  170. 9. Stock Trading Project with Reinforcement Learning/1. Stock Trading Project Section Introduction.srt 7KB
  171. 2. Return of the Multi-Armed Bandit/5. Epsilon-Greedy Beginner's Exercise Prompt.srt 7KB
  172. 6. Monte Carlo/9. Monte Carlo Summary.srt 7KB
  173. 5. Dynamic Programming/2. Designing Your RL Program.srt 7KB
  174. 2. Return of the Multi-Armed Bandit/8. Comparing Different Epsilons.srt 7KB
  175. 5. Dynamic Programming/11. Value Iteration.srt 7KB
  176. 1. Welcome/3. Where to get the Code.srt 7KB
  177. 8. Approximation Methods/3. Features.srt 7KB
  178. 7. Temporal Difference Learning/2. TD(0) Prediction.srt 6KB
  179. 8. Approximation Methods/6. TD(0) Semi-Gradient Prediction.srt 6KB
  180. 2. Return of the Multi-Armed Bandit/18. Thompson Sampling Code.srt 6KB
  181. 6. Monte Carlo/3. Monte Carlo Policy Evaluation in Code.srt 6KB
  182. 11. Extra Help With Python Coding for Beginners (FAQ by Student Request)/4. Python 2 vs Python 3.srt 6KB
  183. 2. Return of the Multi-Armed Bandit/6. Designing Your Bandit Program.srt 6KB
  184. 6. Monte Carlo/1. Monte Carlo Intro.srt 6KB
  185. 4. Markov Decision Proccesses/3. Choosing Rewards.srt 6KB
  186. 9. Stock Trading Project with Reinforcement Learning/7. Code pt 3.srt 6KB
  187. 6. Monte Carlo/6. Monte Carlo Control in Code.srt 6KB
  188. 7. Temporal Difference Learning/6. Q Learning.srt 6KB
  189. 2. Return of the Multi-Armed Bandit/11. Optimistic Initial Values Code.srt 6KB
  190. 7. Temporal Difference Learning/5. SARSA in Code.srt 6KB
  191. 6. Monte Carlo/7. Monte Carlo Control without Exploring Starts.srt 6KB
  192. 8. Approximation Methods/7. Semi-Gradient SARSA.srt 5KB
  193. 4. Markov Decision Proccesses/13. Optimal Policy and Optimal Value Function (pt 2).srt 5KB
  194. 8. Approximation Methods/8. Semi-Gradient SARSA in Code.srt 5KB
  195. 5. Dynamic Programming/1. Intro to Dynamic Programming and Iterative Policy Evaluation.srt 5KB
  196. 6. Monte Carlo/4. Policy Evaluation in Windy Gridworld.srt 5KB
  197. 5. Dynamic Programming/7. Policy Improvement.srt 5KB
  198. 2. Return of the Multi-Armed Bandit/25. Suggestion Box.srt 5KB
  199. 7. Temporal Difference Learning/8. TD Summary.srt 5KB
  200. 9. Stock Trading Project with Reinforcement Learning/9. Stock Trading Project Discussion.srt 5KB
  201. 1. Welcome/1. Introduction.srt 4KB
  202. 1. Welcome/4. How to Succeed in this Course.srt 4KB
  203. 2. Return of the Multi-Armed Bandit/14. UCB1 Code.srt 4KB
  204. 8. Approximation Methods/5. Monte Carlo Prediction with Approximation in Code.srt 4KB
  205. 4. Markov Decision Proccesses/14. MDP Summary.srt 4KB
  206. 7. Temporal Difference Learning/3. TD(0) Prediction in Code.srt 4KB
  207. 13. Appendix FAQ/1. What is the Appendix.srt 4KB
  208. 2. Return of the Multi-Armed Bandit/17. Thompson Sampling Beginner's Exercise Prompt.srt 4KB
  209. 13. Appendix FAQ Finale/1. What is the Appendix.srt 4KB
  210. 6. Monte Carlo/8. Monte Carlo Control without Exploring Starts in Code.srt 4KB
  211. 5. Dynamic Programming/8. Policy Iteration.srt 3KB
  212. 7. Temporal Difference Learning/7. Q Learning in Code.srt 3KB
  213. 7. Temporal Difference Learning/1. Temporal Difference Intro.srt 3KB
  214. 2. Return of the Multi-Armed Bandit/10. Optimistic Initial Values Beginner's Exercise Prompt.srt 3KB
  215. 2. Return of the Multi-Armed Bandit/13. UCB1 Beginner's Exercise Prompt.srt 3KB
  216. 8. Approximation Methods/4. Monte Carlo Prediction with Approximation.srt 2KB
  217. 1. Welcome/[Tutorialsplanet.NET].url 128B
  218. 13. Appendix FAQ/[Tutorialsplanet.NET].url 128B
  219. 6. Monte Carlo/[Tutorialsplanet.NET].url 128B
  220. [Tutorialsplanet.NET].url 128B