PCFGによる派生語処理手法の比較と検討 - 九大コレクション | 九州大学附属図書館

＜紀要論文＞
PCFGによる派生語処理手法の比較と検討

作成者	作成者名市丸, 夏樹 Ichimaru, Natsuki イチマル, ナツキ所属機関所属機関名九州大学大学院システム情報科学研究科知能システム学専攻 Department of Intelligent Systems, Graduate School of Information Science and Electrical Engineering, Kyushu University
	作成者名中村, 貞吾 Nakamura, Teigo ナカムラ, テイゴ所属機関所属機関名九州工業大学情報工学部 School of Computer Science and Systems Engineering, Kyushu Institute of Technology
	作成者名日高, 達 Hitaka, Toru ヒタカ, トオル所属機関所属機関名九州大学大学院システム情報科学研究科知能システム学専攻 Department of Intelligent Systems, Graduate School of Information Science and Electrical Engineering, Kyushu University
本文言語	日本語
出版者	九州大学大学院システム情報科学研究院
出版者	Faculty of Information Science and Electrical Engineering, Kyushu University
発行日	1999-03-26
収録物名	九州大学大学院システム情報科学紀要
巻	4
号	1
開始ページ	63
終了ページ	68
出版タイプ	Version of Record
アクセス権	open access
JaLC DOI	https://doi.org/10.15017/1500395
関連DOI	https://portal.isee.kyushu-u.ac.jp/
関連URI	https://portal.isee.kyushu-u.ac.jp/
関連情報	https://portal.isee.kyushu-u.ac.jp/
概要	In Japanese language, derivative word consists of a noun and some following suffixes, and those components are concatenated into a sequence without separators. This construction often cause ambiguity ...in parsing or Kana-Kanji conversion. Some methods to treat derivatives have been developed; 1) recognizing arbitrary combination of any noun and any suffix, 2) registering collected derivative words directly into the word dictionary, and 3) using semantic category to enable selectional restriction. However, these methods have too simple mechanism to derive correct analysis. In our previous paper, we proposed 4) an example-based method in which collected sample words are generalized for wider coverage of derivative words. In this paper, we compare these methods through experiments. To realize fair comparison, all methods were represented in Probabilistic Context Free Grammar (PCFG) and equally tuned with the same training method Maximum Likelihood Estimate. The results show that our method is superior to the methods 1) and 3), under the condition that the grammar learned more than 80,000 generalized examples.続きを見る

本文ファイル

ファイル	ファイルタイプ	サイズ	閲覧回数	説明
p063	pdf	545 KB	140

詳細

PISSN	1342-3819
EISSN	2188-0891
NCID	AN10569524
レコードID	1500395
査読有無	査読有
主題	シソーラス
	Thesaurus
	意味カテゴリ
	Semantic Category
	選択制限
	Selectional Restriction
	Example-Based Method
	用例方式
	Probabilistic Context Free Grammar
	確率文脈自由文法
	トレーニング
	Training
登録日	2015.04.21
更新日	2020.11.02