Contextualized Word Representations for Multi-Sense Embedding - Kyudai Collection (九大コレクション)

＜Conference Paper＞
Contextualized Word Representations for Multi-Sense Embedding

Creator	Name: Ashihara, Kazuki (芦原, 和樹; アシハラ, カズキ); Affiliation: Osaka University (大阪大学)
	Author identifier: 70824960; Name: Kajiwara, Tomoyuki (梶原, 智之; カジワラ, トモユキ); Affiliation: Osaka University (大阪大学)
	Author identifier: 00747165; Name: Arase, Yuki (荒瀬, 由紀; アラセ, ユキ); Affiliation: Osaka University, Graduate School of Information Science and Technology (大阪大学大学院情報科学研究科)
	Author identifiers: K005428, 0000-0002-7779-0085; Name: Uchida, Satoru (内田, 諭; ウチダ, サトル); Affiliation: Kyushu University (九州大学)
Language	English
Publisher	Association for Computational Linguistics
Date of issue	2018-12-02
Source title	Proceedings of Pacific Asia Conference on Language, Information and Computation
Volume	32
Issue	Y18-1004
First page	1
Last page	9
Conference	Name: Pacific Asia Conference on Language, Information and Computation; Edition: 32; Venue: Hung Hom, Kowloon; Country: Hong Kong
Publication type	Version of Record
Access rights	open access
Rights	Creative Commons Attribution 4.0 International License
Related DOI
Related URI	Same as https://www.aclweb.org/anthology/Y18-1004
Related HDL
Abstract	Distributed word representations are used in many natural language processing tasks. When dealing with ambiguous words, it is desired to generate multi-sense embeddings, i.e., multiple representations per word. Therefore, several methods have been proposed to generate different word representations based on parts of speech or topic, but these methods tend to be too coarse to deal with ambiguity. In this paper, we propose methods to generate multiple word representations for each word based on dependency structure relations. In order to deal with the data sparseness problem due to the increase in the size of vocabulary, the initial value for each word representation is determined using pre-trained word representations. It is expected that the representations of low frequency words will remain in the vicinity of the initial value, which will in turn reduce the negative effects of data sparseness. Extensive evaluation results confirm the effectiveness of our methods that significantly outperformed state-of-the-art methods for multi-sense embeddings. Detailed analysis of our method shows that the data sparseness problem is resolved due to the pre-training.

Full-text file

File	File type	Size	Views	Description
2244110	pdf	460 KB	306

Details

EISSN	2619-7782
Record ID	2244110
Related URI	https://www.aclweb.org/anthology/Y18-1004
Date registered	2019.06.06
Date updated	2023.08.03
