ON THE LEARNING ALGORITHM OF 2-PERSON ZERO-SUM MARKOV GAME WITH EXPECTED AVERAGE REWARD CRITERION - Collections

＜journal article＞
ON THE LEARNING ALGORITHM OF 2-PERSON ZERO-SUM MARKOV GAME WITH EXPECTED AVERAGE REWARD CRITERION

Creator	Creator Name Tanaka, Kensuke 田中, 謙輔 Affiliation Affiliation Name Department of Mathematics, Faculty of Science, Niigata, University 新潟大学理学部数学科
Language	English
Publisher	Research Association of Statistical Sciences
Publisher	統計科学研究会
Date	1985-03
Source Title	Bulletin of informatics and cybernetics
Vol	21
Issue	3/4
First Page	1
Last Page	17
Publication Type	Version of Record
Access Rights	open access
Crossref DOI	https://doi.org/10.5109/13364
Related DOI	Bulletin of informatics and cybernetics \|\| 21(3/4) \|\| p1-17
Related DOI	http://bic.math.kyushu-u.ac.jp/
Related URI	Bulletin of informatics and cybernetics \|\| 21(3/4) \|\| p1-17
Related URI	http://bic.math.kyushu-u.ac.jp/
Relation	Bulletin of informatics and cybernetics \|\| 21(3/4) \|\| p1-17
Relation	http://bic.math.kyushu-u.ac.jp/
Abstract	We develop a method for learning the optimal strategies of 2-person zero-sum Markov game with expected average reward criterion. To do this, at each stage the players play a modified matrix game with ...relation to each state, and then receive an information about the result of the game from a teacher. Using the information, the players generate a pair of mixed strategies with relation to each state used at next stage. Then, such a pair of mixed strategies generated by the players converges with probability one and in mean square to a pair of the optimal stationary strategies. Further, when the learning is stopped at some stage by the teacher, the probability of error is estimated.show more

Hide fulltext details.

File	FileType	Size	Views	Description
p001	pdf	734 KB	371

Details

PISSN	0286-522X
EISSN	2435-743X
NCID	AA10634475
Record ID	13364
Peer-Reviewed	Refereed
Type	学術雑誌論文
Created Date	2009.04.22
Modified Date	2020.10.22

Export

Link to this page

Search Other Services

Statistics

＜journal article＞ ON THE LEARNING ALGORITHM OF 2-PERSON ZERO-SUM MARKOV GAME WITH EXPECTED AVERAGE REWARD CRITERION

Hide fulltext details.

Details

People who viewed this item also viewed

＜journal article＞
ON THE LEARNING ALGORITHM OF 2-PERSON ZERO-SUM MARKOV GAME WITH EXPECTED AVERAGE REWARD CRITERION