作成者 |
|
|
本文言語 |
|
出版者 |
|
|
発行日 |
|
収録物名 |
|
巻 |
|
号 |
|
開始ページ |
|
終了ページ |
|
出版タイプ |
|
アクセス権 |
|
Crossref DOI |
|
関連DOI |
|
|
関連URI |
|
|
関連情報 |
|
|
概要 |
There are various procedures to compute the optimal threshold probability in discounted Markov decision processes. In the actual numerical computation of an approximate optimal solution, the estimatio...n of the discrepancy between the approximate solution and the exact solution is important. White(1993 b) derived such an error estimation for the value iteration method, however, this estimation is not actually in the computable form. In this paper, we present a numerical enclosure method to compute the optimal threshold probability, that guarantees a rigorous a posteriori error bound.続きを見る
|