Domain Bias in Fake News Datasets Consisting of Fake and Real News Pairs - 九大コレクション | 九州大学附属図書館

＜会議発表論文＞
Domain Bias in Fake News Datasets Consisting of Fake and Real News Pairs

作成者	作成者名 Kato, Shingo 加藤, 真吾カトウ, シンゴ所属機関所属機関名 Graduate School of Information Science and Electrical Engineering, Kyushu University 九州大学大学院システム情報科学府
	作成者名 Yang, Linshuo 所属機関所属機関名 Graduate School of Information Science and Electrical Engineering, Kyushu University 九州大学大学院システム情報科学府
	著者識別子 K000021 00294992 作成者名 Ikeda, Daisuke 池田, 大輔イケダ, ダイスケ所属機関所属機関名 Faculty of Information Science and Electrical Engineering, Kyushu University 九州大学大学院システム情報科学研究院
本文言語	英語
出版者	IEEE
発行日	2022-07
収録物名	2022 12th International Congress on Advanced Applied Informatics IIAI-AAI 2022 Proceedings
開始ページ	101
終了ページ	106
会議情報	会議名 International Congress on Advanced Applied Informatics (IIAI-AAI) 回次 12 主催機関 IIAI - International Institute of Applied Informatics 開催期間 July 2-7, 2022 開催地 Kanazawa, Ishikawa 開催国日本
会議情報	会議名 International Conference on E-Service and Knowledge Management (ESKM) 回次 14 主催機関 IIAI - International Institute of Applied Informatics 開催期間 July 2- 7, 2022 開催地 Kanazawa, Ishikawa 開催国日本
出版タイプ	Accepted Manuscript
アクセス権	open access
関連DOI	以下の異版 https://doi.org/10.1109/IIAIAAI55812.2022.00029
関連DOI
概要	News intentionally containing false information–known as "fake news"–is common on the Internet and often causes social disruption. In order to solve it, research on automatic detection of fake news us...ing supervised learning has been active. Although the accuracy is improving, a major challenge for practical application remains: models can not work well for news in unknown fields (domains) due to domain biases. The goal of this study is to mitigate these domain biases and improve the accuracy of cross-domain fake news detection, which tests news from unknown domains. We firstly try to mitigate the bias by masking noun phrases which are considered a major source of domain bias. However, masking has not improved accuracy. Therefore, we point out that the dataset in this study has the property that it always contains pairs of fake and real news on the exact same topic. In this paper, we focus on this property of dataset and examine how it may affect domain bias and accuracy. Comparative experiments show that accuracy is higher when trained on a dataset with the property shown in this study. We suggest that a fake news dataset consisting of paired news could be effective for cross-domain detection.続きを見る

本文ファイル

ファイル	ファイルタイプ	サイズ	閲覧回数	説明
6779689	pdf	223 KB	29

詳細

PISSN	2472-0070
レコードID	6779689
主題	fake news detection
	cross-domain
	BERT
登録日	2023.04.05
更新日	2024.07.01