<technical report>
Pattern Matching Machines for Japanese Texts

Creator
Language
Publisher
Date
Source Title
Vol
Publication Type
Access Rights
Related DOI
Related URI
Relation
Abstract Texts in Japanese use many characters, Japanese alphabet kana and Chinese letter kanji, unlike texts in European languages. For that reason, Japanese characters are represented by 2-byte code in most ...computer systems. In many cases, the usual 1-byte characters are used together with 2-byte characters. In this paper, we discuss pattern matching algorithms for Japanese texts, in which 1-byte characters and 2-byte characters are mixed. We have already succeeded to realize run-time efficient pattern matching machines for texts of 1-byte characters by dividing character codes. We show that the method of dividing character codes is also applicable to pattern matching machines for Japanese texts.show more

Hide fulltext details.

pdf RR_110 pdf 1.17 MB 360  

Details

Record ID
Peer-Reviewed
Subject Terms
Type
Created Date 2009.04.22
Modified Date 2020.10.13

People who viewed this item also viewed