From: Developing a weighted reward criterion for the Markov-based decision of road maintenance
Current state | a1: s1 | a1: s2 | a1: s3 | a1: s4 | a1: s5 | a2: s1 | a2: s2 | a2: s3 | a2: s4 | a2: s5 | a3: s1 | a3: s2 | a3: s3 | a3: s4 | a3: s5
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---
s1 | 0.74 | 0.26 | 0 | 0 | 0 | 0.74 | 0.26 | 0 | 0 | 0 | 0.74 | 0.26 | 0 | 0 | 0
s2 | 0.74 | 0.26 | 0 | 0 | 0 | 0.74 | 0.26 | 0 | 0 | 0 | 0 | 0.82 | 0.18 | 0 | 0
s3 | 0.74 | 0.26 | 0 | 0 | 0 | 0.28 | 0.61 | 0.11 | 0 | 0 | 0 | 0 | 0.86 | 0.14 | 0
s4 | 0.74 | 0.26 | 0 | 0 | 0 | 0 | 0.19 | 0.7 | 0.11 | 0 | 0 | 0 | 0 | 0.87 | 0.13
s5 | 0.74 | 0.26 | 0 | 0 | 0 | 0 | 0 | 0.09 | 0.8 | 0.11 | 0 | 0 | 0 | 0 | 1
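The table above can be loaded into code and sanity-checked before it is used in a decision model. The following is a minimal sketch, not the authors' implementation. It assumes that each row is the current state and each column is the next state under the given action, which is consistent with every row summing to 1 within each action block. The names `P`, `is_row_stochastic`, and `step_distribution` are hypothetical and chosen only for illustration.

```python
# Transition probabilities copied from the table above.
# Assumption: P[a][i][j] is the probability of moving from state s(i+1)
# to state s(j+1) under action a. The table does not label its axes,
# but this reading makes every row sum to 1.
P = {
    "a1": [
        [0.74, 0.26, 0, 0, 0],
        [0.74, 0.26, 0, 0, 0],
        [0.74, 0.26, 0, 0, 0],
        [0.74, 0.26, 0, 0, 0],
        [0.74, 0.26, 0, 0, 0],
    ],
    "a2": [
        [0.74, 0.26, 0, 0, 0],
        [0.74, 0.26, 0, 0, 0],
        [0.28, 0.61, 0.11, 0, 0],
        [0, 0.19, 0.7, 0.11, 0],
        [0, 0, 0.09, 0.8, 0.11],
    ],
    "a3": [
        [0.74, 0.26, 0, 0, 0],
        [0, 0.82, 0.18, 0, 0],
        [0, 0, 0.86, 0.14, 0],
        [0, 0, 0, 0.87, 0.13],
        [0, 0, 0, 0, 1],
    ],
}


def is_row_stochastic(matrix, tol=1e-9):
    """Return True if every entry lies in [0, 1] and every row sums to 1."""
    return all(
        all(0 <= p <= 1 for p in row) and abs(sum(row) - 1.0) <= tol
        for row in matrix
    )


def step_distribution(dist, matrix):
    """Propagate a state distribution one step: new[j] = sum_i dist[i] * matrix[i][j]."""
    n = len(matrix)
    return [sum(dist[i] * matrix[i][j] for i in range(n)) for j in range(n)]


if __name__ == "__main__":
    for action, matrix in P.items():
        print(action, "row-stochastic:", is_row_stochastic(matrix))
    # One-step distribution when starting in s5 and applying a2.
    print(step_distribution([0, 0, 0, 0, 1], P["a2"]))
```

Under this reading, two features of the table are easy to confirm in code: all five rows under a1 are identical, and s5 maps to itself with probability 1 under a3.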