From: Developing a weighted reward criterion for the Markov-based decision of road maintenance
s 1
s 2
s 3
s 4
s 5
a 1
0
a 2
0.139
a 3
0.149
0.709
0.003