Course Outline
Introduction
- Learning through positive reinforcement
Elements of Reinforcement Learning
Important Terms (Actions, States, Rewards, Policy, Value, Q-Value, etc.)
Overview of Tabular Solutions Methods
Creating a Software Agent
Understanding Value-based, Policy-based, and Model-based Approaches
Working with the Markov Decision Process (MDP)
How Policies Define an Agent's Way of Behaving
Using Monte Carlo Methods
Temporal-Difference Learning
n-step Bootstrapping
Approximate Solution Methods
On-policy Prediction with Approximation
On-policy Control with Approximation
Off-policy Methods with Approximation
Understanding Eligibility Traces
Using Policy Gradient Methods
Summary and Conclusion
Requirements
- Experience with machine learning
- Programming experience
Audience
- Data scientists
Custom Corporate Training
Training solutions designed exclusively for businesses.
- Customized Content: We adapt the syllabus and practical exercises to the real goals and needs of your project.
- Flexible Schedule: Dates and times adapted to your team's agenda.
- Format: Online (live), In-company (at your offices), or Hybrid.
Price per private group, online live training, starting from 4800 € + VAT*
Contact us for an exact quote and to hear our latest promotions