| reward |
A scalar value which represents the degree to which a state or action is desirable. Reward functions can be used to specify a wide range of planning goals (eg by penalizing every non-goal state, an agent can be guided towards learning the fastest route to the final state).
Ãâó: www-anw.cs.umass.edu/rlr/terms.html
|
|---|
| reward | providing personal satisfaction |
|---|---|
| reward | in a rewarding manner |
Á¦Ç°¸í |
ÆÇ¸Å»ç |
º¸ÇèÄÚµå | ¼ººÐ/ÇÔ·® | ±¸ºÐ/º¸Çè±Þ¿© |
|---|
Á¦Ç°¸í |
ÆÇ¸Å»ç |
º¸ÇèÄÚµå | ¼ººÐ/ÇÔ·® | ±¸ºÐ/º¸Çè±Þ¿© |
|---|