{wiki=State–action–reward–state–action}

State–action–reward–state–action (SARSA) is an algorithm used in reinforcement learning for training agents to make decisions in environments modeled as Markov Decision Processes (MDPs). SARSA is an on-policy method, meaning that it learns the value of the policy being followed by the agent. The components of SARSA can be broken down as follows:

1. **State (S)**: This represents the current state of the environment in which the agent operates.
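As a rough illustration of the on-policy idea, the sketch below applies the standard tabular SARSA update, Q(S, A) ← Q(S, A) + α[R + γ·Q(S′, A′) − Q(S, A)], where A′ is the action the agent's own policy actually selects in the next state. The function name `sarsa_update`, the state and action labels, and the values of `alpha` (learning rate) and `gamma` (discount factor) are assumptions chosen for the example, not details taken from this article.

```python
from collections import defaultdict


def sarsa_update(q, state, action, reward, next_state, next_action,
                 alpha=0.5, gamma=0.9):
    """Apply one tabular SARSA update in place and return the new Q(state, action).

    alpha and gamma are illustrative defaults, not recommended settings.
    """
    # The target uses the action actually chosen in the next state (on-policy),
    # rather than the maximum over all next actions.
    td_target = reward + gamma * q[(next_state, next_action)]
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical two-state example: Q values default to 0.0.
q = defaultdict(float)
q[("s1", "right")] = 2.0

# The agent took "right" in s0, received reward 1.0, landed in s1,
# and its policy picked "right" again there.
new_value = sarsa_update(q, "s0", "right", 1.0, "s1", "right")
print(new_value)
```

In a full training loop, `next_action` would typically come from the same exploratory policy (for example, epsilon-greedy over `q`) that the agent uses to act, which is what makes the method on-policy.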


State–action–reward–state–action
