¼±Åà - È­»ìǥŰ/¿£ÅÍŰ ´Ý±â - ESC

 
"reward"¿¡ ´ëÇÑ ¼¼ºÎ °Ë»ö °á°úÀÔ´Ï´Ù
KMLE À¥ ¿ë¾î ¸ÂÃã °Ë»ö °á°ú : 1 ÆäÀÌÁö: 2
reward A scalar value which represents the degree to which a state or action is desirable. Reward functions can be used to specify a wide range of planning goals (eg by penalizing every non-goal state, an agent can be guided towards learning the fastest route to the final state).
Ãâó: www-anw.cs.umass.edu/rlr/terms.html
ÀÌ ¾Æ·¡ ºÎÅÍ´Â °á°ú°¡ ¾ø½À´Ï´Ù.
KMLE À¥ ¿ë¾î À¯»ç °Ë»ö °á°ú : 0 ÆäÀÌÁö: 2
ÅëÇÕ°Ë»ö ¿Ï·á