[ ] Udemy - Advanced Reinforcement Learning - policy gradient methods
- 收录时间:2022-08-26 19:24:03
- 文件大小:733MB
- 下载次数:1
- 最近下载:2022-08-26 19:24:03
- 磁力链接:
-
文件列表
- ~Get Your Files Here !/10 - Advantage Actor Critic (A2C)/001 A2C.mp4 50MB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/005 Stochastic Gradient Descent.mp4 50MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/001 Elements common to all control tasks.mp4 39MB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/004 How to represent a Neural Network.mp4 38MB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/001 Function approximators.mp4 36MB
- ~Get Your Files Here !/08 - PyTorch Lightning/001 PyTorch Lightning.mp4 32MB
- ~Get Your Files Here !/05 - Refresher N-step bootstrapping/003 Effect of changing n.mp4 28MB
- ~Get Your Files Here !/07 - Refresher REINFORCE/002 Representing policies using neural networks.mp4 28MB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/003 Artificial Neurons.mp4 26MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/002 The Markov decision process (MDP).mp4 25MB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/002 Artificial Neural Networks.mp4 24MB
- ~Get Your Files Here !/03 - Refresher Monte Carlo methods/002 Solving control tasks with Monte Carlo methods.mp4 24MB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/006 Neural Network optimization.mp4 23MB
- ~Get Your Files Here !/07 - Refresher REINFORCE/007 Entropy regularization.mp4 23MB
- ~Get Your Files Here !/07 - Refresher REINFORCE/001 Policy gradient methods.mp4 22MB
- ~Get Your Files Here !/03 - Refresher Monte Carlo methods/003 On-policy Monte Carlo control.mp4 20MB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/004 SARSA.mp4 18MB
- ~Get Your Files Here !/07 - Refresher REINFORCE/004 The policy gradient theorem.mp4 16MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/006 Discount factor.mp4 15MB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/002 Solving control tasks with temporal difference methods.mp4 15MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/010 Solving a Markov decision process.mp4 14MB
- ~Get Your Files Here !/03 - Refresher Monte Carlo methods/001 Monte Carlo methods.mp4 14MB
- ~Get Your Files Here !/07 - Refresher REINFORCE/005 REINFORCE.mp4 13MB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/001 Temporal difference methods.mp4 13MB
- ~Get Your Files Here !/05 - Refresher N-step bootstrapping/001 N-step temporal difference methods.mp4 13MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/009 Bellman equations.mp4 12MB
- ~Get Your Files Here !/07 - Refresher REINFORCE/006 Parallel learning.mp4 12MB
- ~Get Your Files Here !/05 - Refresher N-step bootstrapping/002 Where do n-step methods fit.mp4 11MB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/005 Q-Learning.mp4 11MB
- ~Get Your Files Here !/07 - Refresher REINFORCE/008 REINFORCE 2.mp4 11MB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/003 Monte Carlo vs temporal difference methods.mp4 9MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/003 Types of Markov decision process.mp4 9MB
- ~Get Your Files Here !/07 - Refresher REINFORCE/003 Policy performance.mp4 9MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/007 Policy.mp4 7MB
- ~Get Your Files Here !/01 - Introduction/003 Google Colab.mp4 6MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/005 Reward vs Return.mp4 5MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/004 Trajectory vs episode.mp4 5MB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/008 State values v(s) and action values q(s,a).mp4 4MB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/006 Advantages of temporal difference methods.mp4 4MB
- ~Get Your Files Here !/10 - Advantage Actor Critic (A2C)/001 A2C_en.vtt 11KB
- ~Get Your Files Here !/08 - PyTorch Lightning/001 PyTorch Lightning_en.vtt 9KB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/001 Function approximators_en.vtt 9KB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/004 How to represent a Neural Network_en.vtt 7KB
- ~Get Your Files Here !/03 - Refresher Monte Carlo methods/002 Solving control tasks with Monte Carlo methods_en.vtt 7KB
- ~Get Your Files Here !/07 - Refresher REINFORCE/007 Entropy regularization_en.vtt 7KB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/005 Stochastic Gradient Descent_en.vtt 6KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/001 Elements common to all control tasks_en.vtt 6KB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/003 Artificial Neurons_en.vtt 6KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/002 The Markov decision process (MDP)_en.vtt 6KB
- ~Get Your Files Here !/07 - Refresher REINFORCE/002 Representing policies using neural networks_en.vtt 5KB
- ~Get Your Files Here !/07 - Refresher REINFORCE/001 Policy gradient methods_en.vtt 5KB
- ~Get Your Files Here !/05 - Refresher N-step bootstrapping/003 Effect of changing n_en.vtt 5KB
- ~Get Your Files Here !/03 - Refresher Monte Carlo methods/003 On-policy Monte Carlo control_en.vtt 5KB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/006 Neural Network optimization_en.vtt 4KB
- ~Get Your Files Here !/07 - Refresher REINFORCE/005 REINFORCE_en.vtt 4KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/006 Discount factor_en.vtt 4KB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/004 SARSA_en.vtt 4KB
- ~Get Your Files Here !/06 - Refresher Brief introduction to Neural Networks/002 Artificial Neural Networks_en.vtt 4KB
- ~Get Your Files Here !/07 - Refresher REINFORCE/004 The policy gradient theorem_en.vtt 4KB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/002 Solving control tasks with temporal difference methods_en.vtt 4KB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/001 Temporal difference methods_en.vtt 4KB
- ~Get Your Files Here !/07 - Refresher REINFORCE/006 Parallel learning_en.vtt 4KB
- ~Get Your Files Here !/05 - Refresher N-step bootstrapping/001 N-step temporal difference methods_en.vtt 3KB
- ~Get Your Files Here !/03 - Refresher Monte Carlo methods/001 Monte Carlo methods_en.vtt 3KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/010 Solving a Markov decision process_en.vtt 3KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/009 Bellman equations_en.vtt 3KB
- ~Get Your Files Here !/05 - Refresher N-step bootstrapping/002 Where do n-step methods fit_en.vtt 3KB
- ~Get Your Files Here !/07 - Refresher REINFORCE/003 Policy performance_en.vtt 3KB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/005 Q-Learning_en.vtt 3KB
- ~Get Your Files Here !/07 - Refresher REINFORCE/008 REINFORCE 2_en.vtt 2KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/003 Types of Markov decision process_en.vtt 2KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/007 Policy_en.vtt 2KB
- ~Get Your Files Here !/01 - Introduction/003 Google Colab_en.vtt 2KB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/003 Monte Carlo vs temporal difference methods_en.vtt 2KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/005 Reward vs Return_en.vtt 2KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/008 State values v(s) and action values q(s,a)_en.vtt 1KB
- ~Get Your Files Here !/04 - Refresher Temporal difference methods/006 Advantages of temporal difference methods_en.vtt 1KB
- ~Get Your Files Here !/02 - Refresher The Markov Decision Process (MDP)/004 Trajectory vs episode_en.vtt 1KB
- ~Get Your Files Here !/01 - Introduction/002 Reinforcement Learning series.html 699B
- ~Get Your Files Here !/Bonus Resources.txt 386B
- Get Bonus Downloads Here.url 183B
- ~Get Your Files Here !/01 - Introduction/001 Introduction.html 70B
- ~Get Your Files Here !/01 - Introduction/004 Where to begin.html 70B
- ~Get Your Files Here !/08 - PyTorch Lightning/002 Link to the code notebook.html 70B
- ~Get Your Files Here !/09 - REINFORCE for continuous control tasks/001 REINFORCE for continuous action spaces.html 70B
- ~Get Your Files Here !/11 - Generalized Advantage Estimation (GAE)/001 Generalized Advantage Estimation.html 70B
- ~Get Your Files Here !/12 - Proximal Policy Optimization (PPO)/001 Proximal Policy Optimization.html 70B
- ~Get Your Files Here !/13 - Phasic PPO/001 Phasic PPO.html 70B