Reversed dynamic programming on general state spaces - 九大コレクション

＜紀要論文＞
Reversed dynamic programming on general state spaces

作成者	作成者名 Iwamoto, Seiichi 岩本, 誠一
本文言語	英語
出版者	Faculty of Science, Kyushu University
出版者	九州大学理学部
発行日	1978-09-30
収録物名	九州大学理学部紀要 : Series A, Mathematics
巻	32
号	2
開始ページ	267
終了ページ	276
出版タイプ	Version of Record
アクセス権	restricted access
関連DOI	以下と同一 https://doi.org/10.2206/kyushumfs.32.267
関連DOI	Memoirs of the Faculty of Science, Kyushu University. Series A, Mathematics \|\| 32(2) \|\| p267-276
	九州大学理学部紀要 \|\| 32(2) \|\| p267-276
	http://www.math.kyushu-u.ac.jp/
関連URI	Memoirs of the Faculty of Science, Kyushu University. Series A, Mathematics \|\| 32(2) \|\| p267-276
	九州大学理学部紀要 \|\| 32(2) \|\| p267-276
	http://www.math.kyushu-u.ac.jp/
関連URI
関連HDL
関連情報	Memoirs of the Faculty of Science, Kyushu University. Series A, Mathematics \|\| 32(2) \|\| p267-276
	九州大学理学部紀要 \|\| 32(2) \|\| p267-276
	http://www.math.kyushu-u.ac.jp/
概要	This paper studies a class of finite-stage deterministic dynamic programmings (DP's) with general state and action spaces from an algebraic viewpoint. The author has introduced inverse, reversal, comp...osition, maximum and minimum operations on a class of DP's with one-dimensional state space [5]. For general state case we can introduce the reversal and composition operations, because of an automaton-like treatment for DP. Specifying finite-stage deterministic DP in terms of set-theoretical notation (§ 2), we state three versions of Bellman's Principle of Optimality (§ 3). Our main and new result is REVERSE THEOREM in dynamic programming (§ 4): A pair of optimal reward functions and an optimal policy for a given DP is characterized by the pair of those for its reversed DP in a reverse sense. The REVERSE THEOREM has been motivated by the author's INVERSE THEOREM [2], [3], [4]. The latter is valid for one-dimensional state space. But the former for general state space. Another result is DECOMPOSITION THEOREM in dynamic programming: An $ N $-stage DP is decomposed into $ N $ one-stage DP's. This theorem is a variant of the corresponding theorems in Mitten [6] and Nemhauser [7].続きを見る

詳細

レコードID	11149
査読有無	査読有
ISSN	0373-6385
DOI	10.2206/kyushumfs.32.267
NCID	AA00732864
タイプ	紀要論文
登録日	2009.09.24
更新日	2024.01.10

この情報を出力する

このページのリンク

他の検索サイト

利用統計

＜紀要論文＞ Reversed dynamic programming on general state spaces

詳細

この資料を見た人はこんな資料も見ています

＜紀要論文＞
Reversed dynamic programming on general state spaces