| reward |
A scalar value which represents the degree to which a state or action is desirable. Reward functions can be used to specify a wide range of planning goals (eg by penalizing every non-goal state, an agent can be guided towards learning the fastest route to the final state).
Ãâó: www-anw.cs.umass.edu/rlr/terms.html
|
|---|