Hyper Column Model vs. Fast DCT for Feature Extraction in Visual Arabic Speech Recognition - 九大コレクション

＜会議発表論文＞
Hyper Column Model vs. Fast DCT for Feature Extraction in Visual Arabic Speech Recognition

作成者	著者識別子 L002717 作成者名 Sagheer, Alaa アラー, サギール所属機関所属機関名 Department of Intelligent Systems, Kyushu University
	作成者名 Tsuruta, Naoyuki 鶴田, 直之ツルタ, ナオユキ所属機関所属機関名 Department of Electronics Engineering and Computer Science, Fukuoka University 福岡大学工学部電子情報工学科
	著者識別子 100017052 作成者名 Taniguchi, Rin-ichiro 谷口, 倫一郎タニグチ, リンイチロウ所属機関所属機関名 Department of Intelligent Systems, Kyushu University 九州大学システム情報科学研究院知能システム学部門
	作成者名 Maeda, Sakashi 前田, 佐嘉志マエダ, サカシ所属機関所属機関名 Department of Electronics Engineering and Computer Science, Fukuoka University 福岡大学工学部電子情報工学科
本文言語	英語
発行日	2005-12
収録物名	Proceedings of 5th International IEEE Symposium on Signal Processing and Information Technology
開始ページ	761
終了ページ	766
出版タイプ	Accepted Manuscript
アクセス権	open access
関連DOI	Proceedings of 5th International IEEE Symposium on Signal Processing and Information Technology \|\| \|\| p761-766
	http://limu.is.kyushu-u.ac.jp/~rin/
	http://www.tl.fukuoka-u.ac.jp/~tsuruta/index.html
関連URI	Proceedings of 5th International IEEE Symposium on Signal Processing and Information Technology \|\| \|\| p761-766
	http://limu.is.kyushu-u.ac.jp/~rin/
	http://www.tl.fukuoka-u.ac.jp/~tsuruta/index.html
関連情報	Proceedings of 5th International IEEE Symposium on Signal Processing and Information Technology \|\| \|\| p761-766
	http://limu.is.kyushu-u.ac.jp/~rin/
	http://www.tl.fukuoka-u.ac.jp/~tsuruta/index.html
概要	Recently, the multimedia signal processing community has shown increasing interest for research development on visual speech recognition domain. In this paper we present a novel visual speech recognit...ion approach based on our model hyper column model (HCM). HCM is used for feature extraction task. The extracted features are modeled by Gaussian distributions through using hidden Markov model (HMM). The proposed system, HCM and HMM, can be used for any visual recognition task. We use it here to comprise a complete lip-reading system and evaluate its performance using Arabic database set. According to our knowledge, this is the first time that visual speech recognition is applied for Arabic language. Toward fair evaluation we compare our accuracy results with those using fast discrete cosine transform (FDCT) approach, in a separate experiment and using same data set and conditions of HCM experiment. Comparison turns out that HCM shows higher recognition accuracy than FDCT for Arabic sentences and words. HCM does not provide higher accuracy only but also it capable to achieve shift invariant recognition whereas FDCT can not.続きを見る

本文ファイル

ファイル	ファイルタイプ	サイズ	閲覧回数	説明
AlaaISSPIT05	pdf	395 KB	500

詳細

レコードID	5860
査読有無	査読有
主題	LIMU
	neuro
	Visual speech recognition
	feature extraction
	self organizing map
	hyper-column model
	discrete cosine transform
タイプ	会議発表論文
登録日	2009.04.22
更新日	2020.11.17

この情報を出力する

このページのリンク

他の検索サイト

利用統計

＜会議発表論文＞ Hyper Column Model vs. Fast DCT for Feature Extraction in Visual Arabic Speech Recognition

本文ファイル

詳細

この資料を見た人はこんな資料も見ています

＜会議発表論文＞
Hyper Column Model vs. Fast DCT for Feature Extraction in Visual Arabic Speech Recognition