ON THE LEARNING ALGORITHM OF 2-PERSON ZERO-SUM MARKOV GAME WITH EXPECTED AVERAGE REWARD CRITERION - 九大コレクション

＜学術雑誌論文＞
ON THE LEARNING ALGORITHM OF 2-PERSON ZERO-SUM MARKOV GAME WITH EXPECTED AVERAGE REWARD CRITERION

作成者	作成者名 Tanaka, Kensuke 田中, 謙輔所属機関所属機関名 Department of Mathematics, Faculty of Science, Niigata, University 新潟大学理学部数学科
本文言語	英語
出版者	Research Association of Statistical Sciences
出版者	統計科学研究会
発行日	1985-03
収録物名	Bulletin of informatics and cybernetics
巻	21
号	3/4
開始ページ	1
終了ページ	17
出版タイプ	Version of Record
アクセス権	open access
Crossref DOI	https://doi.org/10.5109/13364
関連DOI	Bulletin of informatics and cybernetics \|\| 21(3/4) \|\| p1-17
関連DOI	http://bic.math.kyushu-u.ac.jp/
関連URI	Bulletin of informatics and cybernetics \|\| 21(3/4) \|\| p1-17
関連URI	http://bic.math.kyushu-u.ac.jp/
関連情報	Bulletin of informatics and cybernetics \|\| 21(3/4) \|\| p1-17
関連情報	http://bic.math.kyushu-u.ac.jp/
概要	We develop a method for learning the optimal strategies of 2-person zero-sum Markov game with expected average reward criterion. To do this, at each stage the players play a modified matrix game with ...relation to each state, and then receive an information about the result of the game from a teacher. Using the information, the players generate a pair of mixed strategies with relation to each state used at next stage. Then, such a pair of mixed strategies generated by the players converges with probability one and in mean square to a pair of the optimal stationary strategies. Further, when the learning is stopped at some stage by the teacher, the probability of error is estimated.続きを見る

本文ファイル

ファイル	ファイルタイプ	サイズ	閲覧回数	説明
p001	pdf	734 KB	448

詳細

PISSN	0286-522X
EISSN	2435-743X
NCID	AA10634475
レコードID	13364
査読有無	査読有
タイプ	学術雑誌論文
登録日	2009.04.22
更新日	2020.10.22

この情報を出力する

このページのリンク

他の検索サイト

利用統計

＜学術雑誌論文＞ ON THE LEARNING ALGORITHM OF 2-PERSON ZERO-SUM MARKOV GAME WITH EXPECTED AVERAGE REWARD CRITERION

本文ファイル

詳細

この資料を見た人はこんな資料も見ています

＜学術雑誌論文＞
ON THE LEARNING ALGORITHM OF 2-PERSON ZERO-SUM MARKOV GAME WITH EXPECTED AVERAGE REWARD CRITERION