Linear Functions in RL, State-Action Features, and Eligibility Traces
Sutton and Barto’s standard textbook on reinforcement learning covers how state feature vectors may be constructed for linear state value functions. However, it gives little explanation of how to extend this construction to state-action feature vectors. In this post, I aim to fill that gap.