Algorithms for estimation of comic speakers considering reading order of frames and texts - 九大コレクション | 九州大学附属図書館

＜会議発表論文＞
Algorithms for estimation of comic speakers considering reading order of frames and texts

作成者	作成者名 Omori, Yuga 大森, 優雅オオモリ, ユウガ所属機関所属機関名 Graduate School of Information Science and Electrical Engineering, Kyushu University 九州大学大学院システム情報科学府
	作成者名 Nagamizo, Kota 永溝, 孝太ナガミゾ, コウタ所属機関所属機関名 ForeVision Inc.
	著者識別子 100021285 00294992 作成者名 Ikeda, Daisuke 池田, 大輔イケダ, ダイスケ所属機関所属機関名 Faculty of Information Science and Electrical Engineering, Kyushu University 九州大学大学院システム情報科学研究院
本文言語	英語
出版者	IEEE
発行日	2022-07
収録物名	2022 12th International Congress on Advanced Applied Informatics IIAI-AAI 2022 Proceedings
開始ページ	367
終了ページ	372
会議情報	会議名 International Congress on Advanced Applied Informatics (IIAI-AAI) 回次 12 主催機関 IIAI - International Institute of Applied Informatics 開催期間 July 2-7, 2022 開催地 Kanazawa, Ishikawa 開催国日本
会議情報	会議名 International Conference on Smart Computing and Artificial Intelligence (SCAI) 回次 11 主催機関 IIAI - International Institute of Applied Informatics 開催期間 July 2- 7, 2022 開催地 Kanazawa, Ishikawa 開催国日本
出版タイプ	Accepted Manuscript
アクセス権	open access
関連DOI	以下の異版 https://doi.org/10.1109/IIAIAAI55812.2022.00080
関連DOI
概要	Machine learning methods in recent years have focused on multimodal input and cross-modal tasks, and they are used as approaches to problems in various domains. Associating comic texts and characters ...using these approaches is informative for commercial activities such as speech synthesis and automatic translation of texts. In this study, we address the task of associating a text with a speaker in comics. It is challenging to correspond between them because these are not self-evidently attached, and few studies have attempted. These previous studies have less considered the continuity of comics such as narrative flow or contextual information. We assume that considering the continuity of comics is effective for speaker estimation. This paper proposes algorithms for estimating the reading order of frames or texts, and it also proposes methods for estimating speakers based on these orders. As a result, our proposed method improves accuracy compared to previous methods. Consideration of the frame order is an effective clue to the comic speaker estimation.続きを見る

本文ファイル

ファイル	ファイルタイプ	サイズ	閲覧回数	説明
6781038	pdf	1.77 MB	37

詳細

PISSN	2472-0070
レコードID	6781038
主題	Speaker estimation
	Comic
	Multimodal
登録日	2023.04.10
更新日	2024.07.01