Q-targets

Q-targets is a term used in reinforcement learning for the target values toward which Q-values are updated in algorithms such as Q-learning and its extensions. These targets represent estimates of the return expected from a given state-action pair, and each update moves the corresponding Q-value toward them.

In the standard Q-learning update, the Q-target for a transition (s, a, r, s') is y = r + gamma * max_{a'} Q(s', a'). The temporal-difference error is then delta = y - Q(s, a), and the Q-function is updated toward the target y using a learning rate. This mechanism drives the Q-values toward better estimates of expected returns under the policy implied by greedy action selection.
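
As an illustrative sketch (not drawn from any particular implementation), the update can be written as a small tabular procedure; the table sizes, learning rate, and discount factor below are assumed example values:

    import numpy as np

    # Minimal tabular Q-learning sketch; state/action counts, alpha, and gamma
    # are assumed example values.
    n_states, n_actions = 10, 4
    Q = np.zeros((n_states, n_actions))
    alpha = 0.1    # learning rate
    gamma = 0.99   # discount factor

    def q_learning_update(s, a, r, s_next, done):
        # Q-target: reward plus discounted value of the best next action.
        y = r if done else r + gamma * np.max(Q[s_next])
        # Temporal-difference error between target and current estimate.
        delta = y - Q[s, a]
        # Move the current Q-value toward the target by a step of size alpha.
        Q[s, a] += alpha * delta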

In deep Q-learning, a separate target network with fixed parameters theta- is commonly used to compute the Q-target, with y = r + gamma * max_{a'} Q(s', a'; theta-). The target network is updated periodically or softly to stabilize learning, preventing rapid oscillations in the target values.
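
A hedged sketch of this computation, written with PyTorch purely for illustration (the network architecture, batch handling, and soft-update coefficient tau are assumptions, not prescribed by any specific source):

    import copy
    import torch
    import torch.nn as nn

    # Sketch of DQN-style Q-targets from a frozen target network (parameters theta-).
    online_net = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 2))
    target_net = copy.deepcopy(online_net)
    gamma = 0.99

    def compute_q_targets(rewards, next_states, dones):
        # y = r + gamma * max_{a'} Q(s', a'; theta-), with no gradient through theta-.
        with torch.no_grad():
            next_q = target_net(next_states).max(dim=1).values
        return rewards + gamma * (1.0 - dones) * next_q

    def soft_update(tau=0.005):
        # Soft (Polyak) update: the target network slowly tracks the online network.
        for p_t, p_o in zip(target_net.parameters(), online_net.parameters()):
            p_t.data.mul_(1.0 - tau).add_(tau * p_o.data)

A periodic hard update would instead copy the online parameters every fixed number of steps, for example via target_net.load_state_dict(online_net.state_dict()).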

Variants and extensions often modify how targets are computed. Double Q-learning uses two estimators to reduce overestimation bias, decoupling action selection from evaluation. Other approaches alter targets by employing expected-value targets, distributional targets, or alternate loss formulations, each affecting convergence and stability.
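
For example, a Double DQN-style target (one common realization of double Q-learning with neural networks) can be sketched as follows; the networks, shapes, and discount factor here are illustrative assumptions:

    import copy
    import torch
    import torch.nn as nn

    # Sketch of a Double DQN target: the online network selects the next action,
    # the target network evaluates it, which reduces overestimation bias.
    online_net = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 2))
    target_net = copy.deepcopy(online_net)
    gamma = 0.99

    def double_dqn_targets(rewards, next_states, dones):
        with torch.no_grad():
            # Action selection with the online network ...
            best_actions = online_net(next_states).argmax(dim=1, keepdim=True)
            # ... action evaluation with the target network.
            next_q = target_net(next_states).gather(1, best_actions).squeeze(1)
        return rewards + gamma * (1.0 - dones) * next_q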

Relation to other methods: In SARSA, the target is y = r + gamma * Q(s', a'), using the actual next action taken instead of the maximum, which changes the target dynamics and learning behavior.
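
A small sketch contrasting the two targets (the tabular Q and discount factor are assumed for illustration):

    import numpy as np

    # Contrast of on-policy (SARSA) and off-policy (Q-learning) targets for a
    # tabular Q of shape (n_states, n_actions); gamma is an assumed example value.
    gamma = 0.99

    def sarsa_target(Q, r, s_next, a_next, done):
        # Uses the action a_next actually taken in s' (on-policy).
        return r if done else r + gamma * Q[s_next, a_next]

    def q_learning_target(Q, r, s_next, done):
        # Uses the maximum over all actions in s' (off-policy).
        return r if done else r + gamma * np.max(Q[s_next])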

Applications of Q-targets span game playing, robotics, and other domains involving sequential decision making, where stable and accurate value estimates are essential for effective policy learning.
