Reinforcement Learning Playground
Choose an exercise to understand Prediction Error and Q-learning.
Q-Value: 0.500
Choice Probability: 50.0%
Q-Value: 0.500
Choice Probability: 50.0%
You've completed 20 trials. Below is a graph of the Q-values over time.