TensorFlow Reinforcement Learning Quick Start Guide
(3).Defining the actions of the agent
The agent performs actions to explore the environment.
Obtaining this action vector is the primary goal in RL. Ideally,
you need to strive to obtain optimal actions.
An action is the decision an agent takes in a certain state, s
t .
Typically, it is represented as a t
, where, as before, the subscript t denotes the time instant. The actions that are
available to an agent depends on the problem. For instance, an
agent in a maze can decide to take a step north, or south, or
east, or west. These are called discrete actions, as there are
a fixed number of possibilities. On the other hand, for an
autonomous car, actions can be the steering angle, throttle
value, brake value, and so on, which are called continuous
actions as they can take real number values in a bounded
range. For example, the steering angle can be 40 degrees from
the north-south line, and the throttle can be 60% down, and
so on.
Thus, actions a t
can be either discrete or continuous,
depending on the problem at hand. Some RL approaches
handle discrete actions, while others are suited for continuous
actions.
A schematic of the agent and its interaction with the
environment is shown in the following diagram:
评论
发表评论