Abstract
Conditional image generation, which aims to generate images consistent with a user's input, is one of the critical problems in computer vision. Text-to-image models have succeeded in generating realistic images for simple situations in which only a few objects are present, yet they often fail to generate consistent images for texts representing complex situations. Scene-graph-to-image models have the advantage of generating images for complex situations based on the structure of a scene graph. We previously extended a scene-graph-to-image model into a model that generates images from hyper scene graphs with trinomial hyperedges. That model, termed hsg2im, improved the consistency of the generated images; however, it has difficulty generating natural and consistent images for hyper scene graphs with many objects, because its graph convolutional network struggles to capture relations between distant objects. In this paper, we propose a novel image generation model that addresses this shortcoming by introducing object attention layers. We also use a layout-to-image model as an auxiliary component to generate higher-resolution images. Experimental validation on the COCO-Stuff and Visual Genome datasets shows that the proposed model generates images that are more natural and more consistent with users' inputs than the cutting-edge hyper scene-graph-to-image model.
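The abstract's key point is that attention over object features lets every object attend to every other object in a single layer, whereas a graph convolution only propagates information along edges, so distant objects need many layers to interact. The following is a minimal sketch of that idea as generic scaled dot-product self-attention over per-object embeddings; the function name `object_attention` and the NumPy formulation are illustrative assumptions, not the paper's actual layer.

```python
import numpy as np

def object_attention(obj_feats, w_q, w_k, w_v):
    """Scaled dot-product self-attention over per-object feature vectors.

    obj_feats: (n, d) array of object embeddings, e.g. the output of a
    graph convolutional layer. Returns (n, d) attended features in which
    every object aggregates information from all other objects at once.
    """
    q = obj_feats @ w_q                      # queries,  (n, d)
    k = obj_feats @ w_k                      # keys,     (n, d)
    v = obj_feats @ w_v                      # values,   (n, d)
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)          # (n, n) pairwise object affinities
    scores -= scores.max(axis=-1, keepdims=True)   # softmax, numerically stable
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v                       # each row mixes all objects' values

# Toy usage: 5 objects with 8-dimensional embeddings.
rng = np.random.default_rng(0)
n, d = 5, 8
feats = rng.standard_normal((n, d))
w_q, w_k, w_v = (rng.standard_normal((d, d)) for _ in range(3))
out = object_attention(feats, w_q, w_k, w_v)
print(out.shape)  # (5, 8)
```

Because the (n, n) affinity matrix is dense, object pairs that share no edge in the scene graph can still exchange information directly, which is the property the graph convolutional network in hsg2im lacks for distant objects.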