Pattern Matching Machine for Text Compressed Using Finite State Model - 九大コレクション | 九州大学附属図書館

＜テクニカルレポート＞
Pattern Matching Machine for Text Compressed Using Finite State Model

作成者	著者識別子 K000172 作成者名 Takeda, Masayuki 竹田, 正幸所属機関所属機関名 Department of Informatics Kyushu University 九州大学大学院システム情報科学研究院情報理学部門
本文言語	英語
出版者	Department of Informatics, Kyushu University
出版者	九州大学大学院システム情報科学研究院情報理学部門
発行日	1997-10
収録物名	DOI Technical Report
巻	142
出版タイプ	Accepted Manuscript
アクセス権	open access
関連DOI	DOI Technical Report \|\| 142
関連DOI	http://www.i.kyushu-u.ac.jp/research/report.html
関連URI	DOI Technical Report \|\| 142
関連URI	http://www.i.kyushu-u.ac.jp/research/report.html
関連情報	DOI Technical Report \|\| 142
関連情報	http://www.i.kyushu-u.ac.jp/research/report.html
概要	The classical pattern matching problem is to find all occurrences of patterns in a text. In many practical cases, since the text is very large and stored in the secondary storage, most of the time for... the pattern matching is dominated by data transmission of the text. Therefore the text compression can speed-up the pattern matching. In this framework it is required to develop an efficient pattern matching algorithm for searching the compressed text directly without decoding. In 1992, Fukamachi et al. proposed a method of constructing pattern matching machine that runs on Huffman coded text, based on the Aho-Coracick algorithm. However, since the Huffman code is optimal only under the assumption of the memoryless source model, the compression ratio is not very high. On the other hand, it is known that English text can be highly compressed by the compression method based on the Markov model. In this paper, we focus our attention on the finite-state model, which subsumes the Markov model as an important special case, and show an algorithm for constructing pattern matching machine for text compressed under the assumption of this model. We also give a proof of the correctness of the algorithm.続きを見る

本文ファイル

ファイル	ファイルタイプ	サイズ	閲覧回数	説明
trcs142.ps	gz	138 KB	226
trcs142	pdf	217 KB	203

詳細

レコードID	3011
査読有無	査読無
タイプ	テクニカルレポート
登録日	2009.04.22
更新日	2018.08.31