PATTERN DISCOVERY OF GENOME SEQUENCES BY SUBSTRING AMPLIFICATION - Kyudai Collection | Kyushu University Library

＜Conference Paper＞
PATTERN DISCOVERY OF GENOME SEQUENCES BY SUBSTRING AMPLIFICATION

Creator	Author ID: 100021285; Name: Ikeda, Daisuke (池田, 大輔); Affiliation: Computing and Communications Center, Kyushu University (九州大学情報基盤センター)
	Author ID: K000008; Name: Hirokawa, Sachio (廣川, 佐千男); Affiliation: Computing and Communications Center, Kyushu University (九州大学情報基盤センター)
	Author ID: L002646; Name: Yamada, Yasuhiro (山田, 泰寛); Affiliation: Department of Informatics, Kyushu University (九州大学大学院システム情報科学府)
Language	English
Date of issue	2003-11
Source title	Proceedings of International Symposium on Information Science and Electrical Engineering
Volume	2003
Start page	637
End page	640
Publication type	Accepted Manuscript
Access rights	open access
Related information	Proceedings of International Symposium on Information Science and Electrical Engineering || 2003 || p637-640
Related information	http://matu.cc.kyushu-u.ac.jp/
Abstract	In this paper, we study the problem of finding common subsequences in a given set of genome sequences. We assume that the sequences are generated by some fixed but unknown pattern. The authors developed a method, called “substring amplification,” to find the template part of a pattern from semi-structured documents, such as HTML files, generated by the pattern. Substring amplification exploits the disparity of frequency distributions between the template and background parts, and so requires only positive data. Compared with genome sequences, HTML files use many distinct characters, and each contiguous part of a template is sufficiently long. In this paper, we examine the applicability of the method to genome sequences in which a constant sequence is embedded. Through a series of experiments in which the length and alphabet size of the embedded sequences are varied, we show the effectiveness and limits of our method on genome sequences.
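
The frequency disparity that the abstract describes can be illustrated with a minimal Python sketch. This is not the authors' substring amplification algorithm; it is a simplified stand-in that counts, for each n-gram, how many sequences contain it, and treats n-grams present in every sequence as template candidates. The function names and the toy sequences (with the constant sequence "GATTACA" embedded) are hypothetical and chosen only for illustration.

```python
from collections import Counter


def ngram_document_frequency(sequences, n):
    """Count, for each n-gram, how many sequences contain it at least once."""
    df = Counter()
    for seq in sequences:
        # A set ensures each n-gram is counted once per sequence.
        df.update({seq[i:i + n] for i in range(len(seq) - n + 1)})
    return df


def template_candidates(sequences, n):
    """Return the n-grams that occur in every sequence, sorted alphabetically."""
    df = ngram_document_frequency(sequences, n)
    return sorted(g for g, count in df.items() if count == len(sequences))


# Hypothetical toy data: the constant sequence "GATTACA" is embedded
# in otherwise unrelated background characters.
sequences = [
    "CCGATTACATT",
    "AGATTACAGG",
    "TTTGATTACAC",
]
candidates = template_candidates(sequences, 7)
print(candidates)
```

With a four-letter alphabet, short n-grams tend to appear in every sequence by chance, so a count-based approach like this one becomes less informative as the embedded sequence gets shorter. That is consistent with the abstract's remark that the embedded length and alphabet size affect how well the method works.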

Full-text file

File	File type	Size	Views	Description
2003_c_1	pdf	80.8 KB	361

Details

Record ID	2967
Peer review	Peer-reviewed
Subject	Pattern discovery by n-gram frequency
Type	Conference paper
Registration date	2009.04.22
Last updated	2018.08.31