Non-stationary MDP Average Model - The Existence of Persistently Optimal (G, B)-Generated Policies
Xian Ping GUO
Acta Mathematica Sinica, Chinese Series, 2000, Vol. 43, Issue 2: 269-274.