shapingcontrol
Shaping control is a concept in reinforcement learning that involves guiding an agent's learning process by providing intermediate rewards or shaping functions. Instead of solely relying on the final reward for achieving a goal, shaping control introduces a series of smaller rewards that encourage desirable intermediate behaviors. This can significantly speed up the learning process, especially in tasks where the final goal is rare or difficult to reach.
The core idea is to define a shaping function that assigns a reward based on how close
A key consideration in shaping control is the potential for "reward hacking," where an agent might exploit