midterm2.pdf
$k$
$k = 1$, $\lambda = 1$
$\lambda$   $P(a_i) \propto \exp(\lambda U_i(a_i, s_{-i}))$
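A minimal sketch of how a response distribution of the form P(a_i) ∝ exp(λ U_i(a_i, s_-i)) can be computed, assuming the lost symbol is a precision parameter λ. The function name `softmax_response` and the utility values are hypothetical and are not taken from the exam.

```python
import math

def softmax_response(utilities, lam=1.0):
    """Return probabilities proportional to exp(lam * u) for each utility u.

    `utilities` holds one value U_i(a_i, s_-i) per available action a_i,
    with the other agents' strategies s_-i held fixed (assumed inputs).
    """
    # Subtract the largest exponent before exponentiating for numerical stability.
    m = max(lam * u for u in utilities)
    weights = [math.exp(lam * u - m) for u in utilities]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical utilities for two actions, with lam = 1.
probs = softmax_response([-1.0, 0.0], lam=1.0)
```

With `lam = 0` the distribution is uniform, and as `lam` grows the probability mass concentrates on the highest-utility action.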
$100 \times 100$
$U(s) \approx \vec{\theta}^\top \vec{\beta}(s)$   $\vec{\theta}$
$\vec{\beta}(s)$   $s$
0.5
� = 1
-10, 0     -1, -1
-5, -5     0, -10
-5, -5, -5, 0, 0, 0, -10, -1
$T(s' \mid s, a)$
$a$   $s$   $T(s' \mid s, a) = 0$   $s'$