Reversed dynamic programming on general state spaces - Collections

＜departmental bulletin paper＞
Reversed dynamic programming on general state spaces

Creator	Creator Name Iwamoto, Seiichi 岩本, 誠一
Language	English
Publisher	Faculty of Science, Kyushu University
Publisher	九州大学理学部
Date	1978-09-30
Source Title	Memoirs of the Faculty of Science, Kyushu University. Series A, Mathematics
Vol	32
Issue	2
First Page	267
Last Page	276
Publication Type	Version of Record
Access Rights	restricted access
Related DOI	isIdenticalTo https://doi.org/10.2206/kyushumfs.32.267
Related DOI	Memoirs of the Faculty of Science, Kyushu University. Series A, Mathematics \|\| 32(2) \|\| p267-276
	九州大学理学部紀要 \|\| 32(2) \|\| p267-276
	http://www.math.kyushu-u.ac.jp/
Related URI	Memoirs of the Faculty of Science, Kyushu University. Series A, Mathematics \|\| 32(2) \|\| p267-276
	九州大学理学部紀要 \|\| 32(2) \|\| p267-276
	http://www.math.kyushu-u.ac.jp/
Related URI
Related HDL
Relation	Memoirs of the Faculty of Science, Kyushu University. Series A, Mathematics \|\| 32(2) \|\| p267-276
	九州大学理学部紀要 \|\| 32(2) \|\| p267-276
	http://www.math.kyushu-u.ac.jp/
Abstract	This paper studies a class of finite-stage deterministic dynamic programmings (DP's) with general state and action spaces from an algebraic viewpoint. The author has introduced inverse, reversal, comp...osition, maximum and minimum operations on a class of DP's with one-dimensional state space [5]. For general state case we can introduce the reversal and composition operations, because of an automaton-like treatment for DP. Specifying finite-stage deterministic DP in terms of set-theoretical notation (§ 2), we state three versions of Bellman's Principle of Optimality (§ 3). Our main and new result is REVERSE THEOREM in dynamic programming (§ 4): A pair of optimal reward functions and an optimal policy for a given DP is characterized by the pair of those for its reversed DP in a reverse sense. The REVERSE THEOREM has been motivated by the author's INVERSE THEOREM [2], [3], [4]. The latter is valid for one-dimensional state space. But the former for general state space. Another result is DECOMPOSITION THEOREM in dynamic programming: An $ N $-stage DP is decomposed into $ N $ one-stage DP's. This theorem is a variant of the corresponding theorems in Mitten [6] and Nemhauser [7].show more

Details

Record ID	11149
Peer-Reviewed	Refereed
ISSN	0373-6385
DOI	10.2206/kyushumfs.32.267
NCID	AA00732864
Type	紀要論文
Created Date	2009.09.24
Modified Date	2024.01.10

Export

Link to this page

Search Other Services

Statistics

＜departmental bulletin paper＞ Reversed dynamic programming on general state spaces

Details

People who viewed this item also viewed

＜departmental bulletin paper＞
Reversed dynamic programming on general state spaces