振幅変調に基づく音響信号への情報秘匿とその応用 - 九大コレクション | 九州大学附属図書館

＜博士論文＞
振幅変調に基づく音響信号への情報秘匿とその応用

	Audio data hiding based on amplitude modulation and its application
作成者	作成者姓名西村, 明 Nishimura, Akira ニシムラ, アキラ所属機関所属機関名 Faculty of Informatics, Tokyo University of Information Sciences 東京情報大学総合情報学部 : 准教授 : 音響情報処理
論文調査委員	岩宮, 眞一郎
本文言語	日本語
学位授与年度	2010
学位授与大学	九州大学
学位授与大学	Kyushu University
学位	博士（芸術工学）
学位種別	論文博士
出版タイプ	Version of Record
アクセス権	open access
JaLC DOI	https://doi.org/10.15017/18879
概要	音響信号に何らかの形で情報を秘匿し，必要な時に秘匿情報を取り出して利用する技術を音響情報秘匿(audio data hiding)とよぶ．本論文は，埋め込み前の信号を検出時に必要としないブラインド検出を可能とする音響情報秘匿技術を提案して，その性能を検証し，技術の新しい利用法を示し評価することを目的とした．はじめに音響情報秘匿技術について概観し，主に著作権管理を目的としたデータを埋め込む電子透か...しと，その目的は様々だが埋め込んだデータを活用するステガノグラフィに大きく分類されることを示した．そして用途に応じた技術への要求点，すなわちブラインド検出，低音質劣化，秘匿性，耐性(頑強性)，適用性，秘匿容量，空間伝搬耐性を挙げ，従来の研究においては，対象とした音響信号の種類が少なく，評価が十分でないことを示した．また，音質劣化を評価する主観評価実験に問題のある従来研究が多く，適切な客観音質劣化評価法も確立されていないことを述べた．本研究では，隣接する帯域信号ペアに与える逆相の振幅変調を秘匿情報のキャリアとする音響情報秘匿手法を提案した．最初に，電子透かし用途を前提として，原信号に含まれる振幅変動成分の強さを元にして変調強度を逐一設定することで，様々なジャンルの音楽信号に適用でき，知覚符号化，残響，雑音などを加えても十分に透かし情報が検出できることをシミュレーション実験により明らかにした．また，音質変化検知訓練を積んだ被験者4名に対して秘匿情報の埋め込み強度検知限を明らかにし，ITU-R BS.1116-1 に準拠した主観評価実験によって，情報秘匿に伴う音質劣化度合は``違いが分かるが気にならない''程度であり，MP3 128 kbps の符号化を経た音質より良いことが分かった．さらに，音質劣化の主観評価と，ITU-R BS.1387 PEAQを用いた情報秘匿に伴う音質劣化の客観評価の間には有意な相関が認められ，PEAQは情報秘匿に起因する音質劣化の客観評価法としてある程度有効であることが分かった．　つぎに，振幅変調に基づく情報秘匿の，空間伝搬条件における利用を検討した．すなわち，スピーカ拡声されるアナウンス音声に実時間でデータを埋め込み，利用者の手元の機器のマイクロホンで受音し復号化と表示を行う利用を前提とし，残響と雑音が重畳する環境をシミュレーションした．その結果，一定の振幅変調度0.4で 48 bps の秘匿情報を埋め込んだ場合，音声の明瞭度を保ったままで，SNR 10 dB のときに 90%の条件で84%以上の検出率が得られることが分かった．さらに，秘匿情報の検出を受音した端末で行うのではなく，携帯電話の音声通話により接続するサーバコンピュータにおいて実行するため，携帯電話の音声符号化を経ても情報検出が可能かどうかをシミュレーションと実環境で調べた．その結果，残響や背景雑音が存在しても，AMR音声符号化ビットレートが高ければ，品質を大きく劣化させない情報秘匿が可能であることが分かった．音響情報秘匿技術の性能比較のため，振幅変調法と従来から提案されているエコー拡散法に対して，電子透かし用途として 4.8 bps のデータを音楽信号に埋め込んだときの耐性と，空間伝搬用途として 48 bps のデータを音声信号に埋め込んだときの耐性をシミュレーション実験により調べた．その結果，多くの条件においてエコー拡散法は秘匿情報検出が困難となる楽曲が存在し，振幅変調法の方があらゆる楽曲に対して適用できることが分かった．また，空間伝搬条件でも振幅変調法の方がより高い検出率を示すことが明らかになった．さらに，スピーカから再生される音に同期して情報を呈示するシステムを提案し，カラオケの伴奏音楽に歌詞を同期呈示するための情報を埋め込み，スピーカ再生される伴奏音楽に同期してリアルタイムに歌詞を表示させるシステムを構築した．シミュレーションによるシステムの評価実験の結果，伴奏音楽に対して歌唱音，背景雑音，振幅クリッピング歪あるいは残響の重畳を経ても，埋め込まれたデータは十分検出可能であることが分かった．また，歌詞表示の時間制御の基となるデータフレーム境界時刻の検出精度も十分であることが分かった．PEAQを用いた客観音質劣化評価の結果，頑強性を重視して埋め込みを行った伴奏音楽の音質劣化は MP3 48 kbps/ch の符号化を経た音質劣化より少ないことが分かった．　振幅変調に基づく音響情報秘匿技術は，電子透かしから空間伝搬利用のステガノフラフィまで，従来技術にない様々な場面において利用可能であることを本論文は示した．実用に繋がるいくつかの課題を解決することによって，さらに利用可能な範囲が広がると期待される． Audio data hiding is usually associated with watermarking, which enables copyright management of digital music, and steganography, which is the embedding of useful auxiliary data in an audio signal. An audio data hiding technique is typically evaluated on the basis of criteria such as blind detection, sound quality degradation, concealment, robustness, availability, capacity, and suitability for aerial transmission, depending its purpose and application. The limitations of previous studies on audio data hiding are that few audio signals were used to evaluate the robustness of hidden data detection and a precise method for the subjective evaluation of audio quality degradation was lacking. This paper proposes a new audio data hiding technique based on amplitude modulation (AM). This technique was evaluated in terms of robustness and sound quality degradation with the use of audio watermarking and aerial data transmission, where the stego signal was emitted from a loudspeaker and received by a microphone in a closed or open space. The robustness was evaluated by a computer simulation using 100 pieces of music from a database containing various genres and using speech sounds of 22 speakers from a continuous speech database. The audio quality degradation was carefully measured by using subjective listening tests and an objective evaluation program. The results of both the subjective listening tests and the computer simulation showed that the quality degradation caused by watermarking with moderate embedding intensity was ‘perceptible but not annoying', while sufficient detectability of the watermarks was maintained. For aerial data transmission, reverberant speech signals with various background noises could transmit more than 90% of embedded data at 48 bps, with only a small deterioration in the syllable identification score. These signals were also able to transmit at least 80% of the embedded data at 8 bps after transcoding to the AMR speech codec at a bitrate of 12.2 kbps, which simulates reception of the stego signal by a cellular phone. The results of the computer simulation also revealed that the AM-based method was superior to the spread-echo method in terms of detection performance with the use of watermarking and aerial data transmission. The AM-based audio data hiding technique was applied to a karaoke system that displays hidden information (song lyrics) synchronously with a stego signal (music) broadcast from a loudspeaker. The robustness of the system was evaluated by simulating reverberant and noisy conditions, amplitude clipping, and singing voices as disturbances. The results showed that the performance of hidden data detection were sufficiently high. Objective measurements of the stego audio quality revealed that the degradation was less than ‘slightly annoying'続きを見る
目次	第1章序論第2章音響信号への情報秘匿技第3章振幅変調に基づく情報秘匿技術第4章情報秘匿信号の空間伝搬および携帯電話音声符号化耐性第5章既存音響情報秘匿技術との性能比較第6章情報秘匿に基づく音響信号と同期した情報呈示第7章結論謝辞参考文献

本文ファイル

ファイル	ファイルタイプ	サイズ	閲覧回数	説明
11_references	pdf	75.5 KB	935	参考文献・引用文献
10_acknowledgement	pdf	21.9 KB	1,501	謝辞
09_chapter7	pdf	35.1 KB	379	第7章
08_chapter6	pdf	119 KB	358	第6章
07_chapter5	pdf	95.7 KB	538	第5章
06_chapter4	pdf	253 KB	738	第4章
05_chapter3	pdf	422 KB	629	第3章
04_chapter2	pdf	124 KB	1,014	第2章
03_chapter1	pdf	37.7 KB	454	第1章
02_contents	pdf	30.1 KB	209	目次
01_cover	pdf	10.0 KB	256	表紙

詳細

レコードID	18879
査読有無	査読有
報告番号	乙第8389号
学位記番号	芸博乙第0019号
授与日(学位/助成/特許)	2011-01-31
受理日	2010-10-12
部局	芸博
登録日	2013.07.10
更新日	2023.11.21