Abstracts of Doctor Thesis 2013

Search system users have to find information they need out by themselves, because most of existing search systems return a list of documents as search results. It takes a large effort to find it out from long-length documents. In addition, there is a possibility that users cannot find useful information nevertheless they spent long time on the documents. This information seeking process is much cost in information retrieval. On the other hand, users need not to find information they need out by themselves because XML element search systems return a list of elements which satisfy users' information needs. Therefore, the framework of an XML element search system can reduce the cost in information retrieval, which is the reason why XML element retrieval techniques are useful and worth working on.

There are two main streams for researches of XML element retrieval techniques, i.e., 1) attaining effective search for satisfying accurate information retrieval, and 2) attaining efficient search for fast query processing. In order to satisfy 1), we proposed a scoring method to identify informative XML elements and a reconstruction method of search results to identify the most appropriate granularity of XML elements as search results. Our experimental evaluations showed our proposed methods overwhelmed existing methods in search accuracy.

Then, we also try to handle document updates of an XML search system. This is because document updates need to be managed when we come to think of a practical use of search systems. If document updates are not handled in a search system, users cannot obtain appropriate search results, which reduces the usefulness of the search system. We propose to extend a function of incremental updates of indices to general XML element retrieval systems, with filters to reduce the update cost by eliminating unimportant elements and terms. Moreover, we apply a method for integrating path expression which estimates accurate global weights in term calculation. We confirmed the proposed methods update indices in short time without a drop in search accuracy.

As an output of these researches, we developed a practical XML element search system which achieves accurate search and fast query processing with satisfying immediate reflection of document updates. However, there still is a gap between a practical XML element search system and a practical Web search system. To fill the gap, we try to apapt (XML) element retrieval techniques to HTML documents.

1261027 Tanvir Ahmed
Techniques to Reduce the Overhead and to Improve the Robustness in a Fault Tolerant Reconfigurable Architecture

Nowadays, fault tolerance has been playing a progressively important role in covering increasing soft/hard error rates in electronic devices that accompany the advances of process technologies. Research shows that wear-out faults have a gradual onset, starting with a temporal/transient fault and then eventually leading to a permanent fault. Error detection is thus a required function to maintain execution correctness. Currently, however, many highly dependable methods to cover permanent faults are commonly over-designed by using very frequent checking, due to lack of awareness of the fault possibility in circuits used for the pending executions. In this dissertation, to address those issues, a technique has been proposed to add check instructions selectively on the data-path, where a metric has been introduced for permanent defects, as operation defective probability (ODP), to quantitatively instruct the check operations being placed only at critical positions. By using this selective checking approach, I can achieve a near-100% dependability by having about 53% less check operations, as compared to the ideal reliable method, which performs exhaustive checks to guarantee a zero-error propagation. By this means, I am able to reduce 21.7% power consumption by avoiding the non-critical checking inside the over-designed approach. Further, by additionally taking the data importance into account, extra energy savings is possible from the current over-designed fault tolerable system. Partial redundancy is a well used method to cover single event effects~(SEEs) on critical data while leaving less important data unprotected. Under a low SEE rate, the method can provide a good cost-effective fault tolerance, while many silent data corruptions~(SDCs) may occur under a high fault rate due to incomplete fault coverage. Thus, a system-level approach is proposed to additionally cover SDCs in a partial redundancy by a light-weighted error prediction. Simulation results under a stress radiation test condition show that with an average 8% cost in energy consumption, which can reduce the SDC rate from 12% to 0.37%, for the work loads those have been studied.

0761204 塚本悟司
アンテナ指向性の周期的な可変による単一 RF回路でのダイバーシチ受信の研究

国内のスマートフォンや携帯電話の普及率は今や 90%を超え，無線通信の利用が一般化するにつれて利用者は常に通信可能である事を当然と考えるようになった．また，省エネルギーや利便性向上のために無線センサや RFタグなどの超小型無線デバイスや無線通信機能を有する機器も年々増加している．しかし，無線通信では常時安定した通信を行うのは難しく，マルチパスによる干渉や受信信号強度の低下によって伝送速度や接続性が大きく低下することがある．マルチパスに起因する干渉は信号処理による等化技術の進歩により，その影響を軽減することが可能となったが，受信信号強度自体の低下による影響は避けられない．昀大比合成ダイバーシチはこれに対して有効な手段であるが，小型無線端末では高周波回路（ RF回路）の規模増加によるコストや消費電力の増加が問題となる．一方，単一 RFでのダイバーシチが実現できる選択ダイバーシチは回路規模の点では有利だが，受信ブランチの不適切な選択により劣化が生じる懸念がある．また，近年多くの無線システムに採用されている OFDM（orthogonal frequency division multiplexing）方式は，マルチパスによって生じる周波数選択性のフェージングに対しての耐性は高いが，周波数選択性が低いフェージング下での受信信号強度の低下ではやはり劣化が生じてしまう．しかも，OFDMシステムに選択ダイバーシチを適用しようとするとサブキャリアによって昀適ブランチが異なる場合があり，効果が薄くなるという懸念がある．そこで，この受信信号強度低下を軽減する技術として，受信アンテナの指向性を周期的に切り替えることによる単一 RF回路でのダイバーシチ受信方式の検討を行った．簡単な構造で電気的に指向性を可変できるアンテナとして ESPARアンテナに着目し，その指向性を受信シンボル周期と同じ周期で高速に切り替えることで，ダイバーシチ受信を可能にする技術を考案した．小型無線端末に採用の多いシングルキャリア方式の ZigBeeや OFDM方式を採用している無線 LANへの適用を念頭に理論検討やコンピュータシミュレーションを実施し，高速フェージング時にもダイバーシチゲインが得られる事を確認した．

0761030 和田眞昌
シロイヌナズナゲノムにおける隣接遺伝子の共発現データに基づいたオペロン様遺伝子群の推定に関する研究

生物の遺伝子発現の制御機構は、生命維持を担う一次代謝やそれぞれの生物固有の物質生産を担う二次代謝を調節する要因の一つである。すなわち生存のうえで重要なシステムであるタンパク質の生成の初期段階にあり、外部環境などに適応するための生体の制御機構でもある。植物の二次代謝物は人間社会においても薬品や健康食品に利用されており、この経路に関わる遺伝子の発現制御を理解することは最重要課題である。そのため、植物の二次代謝に関わる制御機構は長年にわたり研究対象とされてきた。近年の研究で、微生物の発現制御機構であるオペロンと類似した遺伝子発現機構が、植物の二次代謝経路に関わる遺伝子について報告されている。ゲノムプロジェクトによりモデル植物シロイヌナズナのゲノム解析が完了したことによって、世界中の研究者により採取されたシロイヌナズナ全遺伝子の膨大な発現データがビッグデータとして公開されている。そこで、本研究ではシロイヌナズナの全遺伝子を対象として、公開されている遺伝子発現データを活用し、オペロン様遺伝子群を推定する方法を提案した。まず遺伝子発現群を特定するために全遺伝子の遺伝子対遺伝子の発現相関データベースを作った。次に、このデータベースにおける遺伝子発現が、過去の研究に報告される発現傾向と同様であることを確認した。さらに統計手法により、遺伝子群の大きさの閾値を決定して、オペロン様遺伝子群の推定を行う方法を開発した。これにより、シロイヌナズナには、統計的に有意なオペロン様遺伝子群の個数を100 個と推定した。この方法により予測された遺伝子群について、現在報告されている植物遺伝子の機能アノテーションを用い、これらが機能的にオペロン様遺伝子群と同様の機能を持つことが確認された。

1161005 金城　健
線形化マルコフゲーム理論によるロバスト制御

自動制御ロボットが自身の環境の中で最適に振る舞う制御則を求める計算論的枠組みとして，最適制御やモデルベース強化学習が一般的に用いられる．しかしながら，連続行動空間において最適制御則を求める際には，非線形な Hamilton-Jacobi-Bellman (HJB) 方程式を解くことが障害となっている．　近年，Linearly solvable Markov decision process(LMDP) と呼ばれる，コスト関数と制御入力がダイナミクスに及ぼす影響に対して制約を置くことで，非線形なBellman 方程式を線形に変換する新たなマルコフ決定過程の枠組みが提案された．LMDPを実機のロボットの制御に適用するためには，精密な制御対象の環境ダイナミクスが既知でなければならず，事前にそれを保持することは容易ではない．本研究の先行研究ではこの問題点を解決するために状態と行動の時系列データから制御対象の環境ダイナミクスを推定し，推定されたダイナミクスをLMDPに用いる手法を提案した．先行研究ではシミュレーションによる実験でのみの評価であったことから，本研究では車輪型ロボットのSpring Dogを用いた実験を行い提案手法の実機における実現性を確かめた．　また，提案手法は推定精度が悪化するにつれて獲得した制御則の性能が劣化することが先行研究により明らかとなっている．この問題の一つの解決策として，推定された環境ダイナミクスの予測誤差が存在しても制御性を余り損なわないロバストな制御則の導入が考えられる．ロバストな制御則は連続時間系ではHamilton-Jacobi-Issacs(HJI)方程式を，離散時間系ではBellman-Issacs方程式を解くことで得られる．近年，LMDPの拡張としてLinearly Markov Games (LMG)と呼ばれる枠組みが提案された．LMG において非線形なHJI方程式およびBellman-Issacs方程式は厳密に線形化される． LMGはロバストな制御則の獲得手法として好ましく，先行研究の問題点の解決策として有効であると期待される．しかしながら，LMG のロバスト性については十分に議論されていないことから，本研究では予測誤差を持つようなモデルに対して LMG を適用し，ロバスト性について研究し先行研究の改善を行った．

1161030 Noppawat Chaisamran
Protecting IP Telephony against SPIT and SIP Flooding Attacks

The global communication market is rapidly moving toward IP (Internet Protocol) telephony. Like other IP-based applications, it is vulnerable to several attacks. Then, security concerns become more important for users and service providers. In this dissertation, I propose a real-time attack detection system that protects an IP telephony against Spam over Internet Telephony (SPIT) and Session Initiation Protocol (SIP) flooding attacks. It consists of three main contributions. First, I propose a trust-based SPIT detection based on calling behavior and human relationships. A call duration and its direction as well as a calling ratio of each user are used to calculate a trust value. This trust value is automatically adjustable according to the call characteristics in order to keep track of a current user's behavior and avoid a bias in trust value assignment. Second, I present an anomaly-based SIP flooding attack detection system. Three statistical algorithms are proposed to analyze an incoming traffic to a SIP server: an application of Tanimoto Distance, an adaptive threshold, and a Momentum Oscillation Indicator. Due to a stateless and a low computation cost of these algorithms, the proposed system can classify traffic in nearly real-time that is suitable for a IP telephony system. Lastly, I reduce false positive alarms of the flooding attack detection by using a trust filtering. A reliable trust value is calculated through the call activities and the human behavior of each user. Trust value of suspicious callers will be checked before raising any alarm. I use the comprehensive synthetic datasets containing various malicious traffic patterns to validate the effectiveness of the proposed system. The results show that it accurately identified attacks and has the flexibility to deal with many types of attack patterns with a low false positive rate.

1161013 水本　旭洋
効率のよいコンテキストアウェアシステム実現ための最適化アルゴリズムに関する研究

センサやモバイルデバイスから収集した情報を基に実世界の現在の状況（コンテキスト）を認識し，コンテキストの変化に適応するように動作するコンテキストアウェアシステムは，ユーザの快適さ，消費される時間やエネルギーなどの点で最適にサービスを提供することが望まれる．しかしながら，既存研究ではコンテキストアウェアシステムにおけるサービスの効率や最適性の実現を目的としたものはほとんど存在しない．本研究では，災害時の医療支援や家庭での省エネ支援という近年関心が高まっている２つの分野において効率の良いコンテキストアウェアシステムを実現するために，それぞれのシステムで提供されるサービスを最適化問題として定式化し，それを解くアルゴリズムを提案する．災害時の医療支援に対して，効率的なコンテキストアウェアシステムを実現するために，生体センサにより実時間で傷病者の容態が認識可能な電子トリアージタグを利用し，傷病者の容態変化や病床数の変化などの被災地のコンテキストに自動適応して搬送計画を行う手法を提案する．本手法は，対象とする問題がNP困難問題であることから，実時間で搬送計画が行えるように，傷病者が命を取りとめることが可能な搬送限界時刻が早い順番で救急車を割り当てるようなヒューリスティックなアルゴリズムを用いる．また，家庭での省エネ支援に対して，効率的なコンテキストアウェアシステムを実現するために，最小の消費電力量でスマートスペースをユーザの好みのコンテキストに遷移するような家電を自動で操作する手法を提案する．本手法では，目的のコンテキストに遷移させる問題を最短経路問題として定式化し，A*アルゴリズムを用いて最適化を行う．しかしながら，対象とする問題は各エッジとコストがあらかじめ与えられていないため，そのままA*アルゴリズムを適用できない．そこで，シミュレーションにより各エッジの存在とコストを動的に探索し，最短コストの経路を求めるような手法により解決を行う．両分野に対してそれぞれ提案した最適化アルゴリズムが既存の手法と比べて，医療支援では救命率の向上を，家庭での省エネ支援では消費電力量の削減を行えることを確認した．

0961009 笠井則充
判別モデルに目視評価を組合せたfault-prone モジュール判別手法

本論文では，ソフトウェアモジュールに対するfault-prone判別を，従来の判別モデルに目視評価を組み合わせて行う方法を提案する．fault-prone判別とは，対象とするソフトウェアモジュールに不具合（fault）が含まれる可能性が，ある基準以上であるか（fault-prone であるか）どうかを推定する手法である．判別モデルでは，ソースコードやその更新履歴の特性値群が，判別のための入力データとなる．ただし，ソースコードに記載されたコメントの不備や例外処理の不備，プラットフォームに依存した実装といった不具合の存在や兆候をそれら特性値で捉えることは困難であり，モデルだけで判別精度を高めることには限界があるとの指摘がある．一方，目視評価には，多くの時間・工数が必要であり，実用規模のソフトウェアにおいて全モジュールを評価することは現実的ではない．そこで，提案法では，受け入れ検査工程を対象に，従来の判別モデルによる評価結果に基づいて目視評価すべきモジュールとその評価順序を決定することで，目視評価のコストを抑えつつ，より高精度なfault-prone判別を目指すこととした．まず，従来の判別モデルの得点と目視による評価得点の和をfault-prone判別得点とすることで，fault-prone判別の精度が向上することを実験により確かめた．具体的には，判別モデルをサポートベクタマシン，ソースコード行数，ランダムとし，判別モデルにより得られた得点と次の4つの基準により選んだモジュール(全モジュールのα%)の目視評価得点に荷重βを乗じた値の和をfault-prone判別得点とする．すなわち判別モデルによって得られた得点の，(1)昇順，(2)降順，(3)中間値から順に一つ大きい値，一つ小さい値，二つ大きい値，…の順，(4)最大値，最小値，2番目に大きい値，2番目に小さい値， 3番目に大きい値…の順で，全体のモジュールのα%を選ぶ．いずれの組合せにおいても，適切なα，βを与えることにより，目視評価を行うことによって判別モデル単体よりも高精度でfault-prone モジュールを判別することが確認された．次に，目視評価のコストを抑えることを目的として目視評価得点のスコアリング方法を開発した．具体的には，規模の小さいモジュールにおいて目視評価の観点を省略することにより，判別精度を維持しつつ目視評価のコストを小さくできるよう，目視評価得点のスコアリング方法を考案し，試行した．目視評価するモジュールの割合であるα%を事前に決定せずに評価した場合でも，判別精度が大きくなることがわかった．また，αが100%の場合と75%の場合とで同程度の判別精度が得られることがわかった．

1261203 塚本英邦
学習者のモチベーション解析に基づくプログラミング教材の改善

1161002 池田　俊
ゲノム配列に基づく生物多様性に関するバイオインフォマティクス

　生物は構造や機能で多くの共通性を持ちながら、地球上のあらゆる環境に適応し生息している。生物の多様性は生物が獲得してきた多彩な特徴であるといえる。生物の多様性が構築されていく経緯を明らかにすることは生物進化の解明だけでなく、生物利用としても意義を持つ。本論文では、バイオインフォマティクスの技術を用いた生物の代謝とゲノムの多様性の原因となるメカニズムの解析を行った。まず、酵素タンパク質の多様性として、生物が保持する酵素の代謝反応クラスに関するデータベースを構築し、酵素タンパク質のアミノ酸配列のパターンを一括学習型自己組織化マップによってクラスタリングした。その酵素タンパク質配列のクラスタリング結果が二次代謝物グループに基づいて分類されるだけでなく、各酵素グループの機能や各酵素グループの祖先関係を反映することを示した。次に、ゲノムの多様性が生まれるメカニズムについて環境圧の影響を基に統計解析を行った。環境が生物に対して与える影響を明らかにするために、各原核生物のゲノム指標として、ゲノムの遺伝暗号であるコドンを用いた指標を開発した。そして、その指標を基に主成分分析を行った結果がゲノムの特徴を反映することを示し、原核生物において、新規コドン指標と生育環境条件を用いた統計検定を行った。その結果、ゲノムの多様性の原因にについて、第一にATP合成効率、第二に温度、第三に生物のコドンを使用したシステムの影響があることを結論づけた。

1161010 福嶋　誠
State-Space Methods for Reconstructing Neuronal Current Sources （状態空間法による脳内電流源の推定）

Elucidating mechanisms of how functionally specialized brain regions dynamically interact has recently received attention in the neuroimaging community. Such dynamic integration of functional brain regions can be investigated by Magnetoencephalography (MEG) and Electroencephalography (EEG). To discover functional brain networks from MEG/EEG sensor measurements, it is indispensable to properly reconstruct neuronal current sources from these data and identify directed interactions (i.e., effective connectivity) between the current sources. State-space approaches for MEG/EEG source reconstruction potentially provide ways to solve the above estimation problems. The state-space framework can incorporate a priori knowledge on neuronal current dynamics into the dynamic model of current sources. Imposing realistic priors on the source dynamics allows reconstructing current sources from MEG/EEG data more accurately. The richness of the prior assumptions also contributes to identification of functional brain networks. This can be achieved by first introducing model parameters of the source interactions based on prior knowledge, and then estimating these parameters from the measurements. In this thesis, to realize accurate source reconstruction and discovery of functional brain networks, two novel extensions on state-space methods are applied. First, a limitation of previous state-space methods in reconstructing spatially focal current sources has been resolved. By replacing spatially homogeneous dynamic source model in existing methods to spatially inhomogeneous one, focal current sources are successfully reconstructed under the state-space framework for the first time. Second, inference of functional brain networks has became available by incorporating long-range directed interactions into the dynamic source model, under prior knowledge on anatomical brain connectivity. The new state-space method extends previous dynamic models in which spatially local (or self) source interactions are only assumed and from which the functional networks cannot be identified.

1061019 布江田友理
血液透析における血管内容積変動のモデルを用いた要観察な患者抽出法

腎臓は浸透圧調節や体液調節、ホルモン分泌調節など生体にとって重要な臓器である。代謝産物の排泄は腎臓の糸球体で行われ、糸球体毛細血管の内皮細胞、基底膜、上皮細胞が濾過膜として働いている。そこで血管内皮細胞培養系を使用し、物質の透過性について調査した。この結果、動脈硬化や高血圧などに効果がある長鎖脂肪酸が血管内皮細胞の透過性を促進することが分かった。また、腎機能不全では体液調節や代謝産物の排泄ができないために、透析療法が必要不可欠である。血液透析療法は拡散によって老廃物の除去と電解質調節を行い、限外濾過によって過剰容積を除去して、体液調節を行う。生体は限外濾過に伴い血管内容積が減少すると、血圧を維持するために血管外から血管内へ水分を引き込む。このバランスに不均衡が生じると、血液透析患者では透析性低血圧などの合併症が生じ、これは合併症の約30% を占めていると言われている。そのため、血液透析療法中、血管内容積変動の把握は重要である。血液透析療法中に生じる血管内容積変動は限外濾過と血管外から血管内への容積移動が関係していることから、容積変化に着目して数理モデルを考えた。血管外から血管内へ移行する容積は血管外に蓄積された容積変動に関係するため、血管外に蓄積された容積変動をロジスティック方程式を用いて表現した。限外濾過量は、単位時間当たりの限外濾過量を使用して求めた。この結果、提案したモデルと血管内容積計測値との相関は高く、血管内容積変動をモデル化でき、血管外に蓄積された容積変動はロジスティック方程式が適合することが分かった。さらに、提案したモデルと血管内容積計測値との相関係数を指標として、透析性低血圧のために薬剤投与を行った症例と薬剤投与や補液などの外乱のない症例との関係を調べた。血液透析療法開始30 分間の容積変動とモデルとの相関係数が 0.95 を境界域として、透析開始後初期時に観察が必要な患者を抽出できる可能性を示した。また、提案したモデルから血管外最大過剰容積が推定できた。現在、透析患者は基準体重を設定し、基準体重からの増加量を過剰容積として除水する。推定した血管外最大過剰容積と基準体重からの増加量が大きく異なる場合、透析条件の再設定が必要であると思われ、モデルのパラメータである血管外最大過剰容積を基準体重設定の新しい指標として、提案した。

1161014 IGOR DE SOUZA ALMEIDA
Co-located Augmented Reality Mediated Communication

Augmented Reality (AR) has recently grown out being a new way to interact with virtual contents to become a way to enhance communication. This new found niche of AR, referred in this work as Augmented Reality Mediated Communication (ARMC), can be defined as any form of active communication between two or more persons that benefits from the assistance of virtual imagery displayed in their real world view. Two particularities of ARMC serve as motivation for this thesis: human factors are often overlooked in the conception of ARMC systems, and the fact that there are considerably fewer works focusing on co-located ARMC than on the remote case. In this thesis, two prototypes systems were developed. The first is an intermediate experimental work, named HANDY, targeting remote ARMC. It proposes an AR video conferencing in which a user is able to virtually “reach out” to another’s real world by using a two cameras setup. This system evaluated the effect of ARMC on the human factor Social Presence. This work represents a stepping stone towards the co-located ARMC research through which it was possible to empirically analyze the trivial case (remote) while experimenting with the evaluation of human factors. The second prototype, and the main work in this thesis, is a new ARMC approach to co-located meeting support for small audiences. The system, named Meetsu, consists of virtual icons and text annotations (containing meeting participants' comments) displayed on a live video feed of the meeting room. It was targeted as a method to promote willingness to communicate (WTC) among meeting participants, arguably the first work to attempt it. The experiments with Meetsu measured the levels of WTC in two distinct groups for a period of time, before and after using the system, and compared the use of AR and Non-AR views as display method.

1261017 宮﨑　亮一
Musical-Noise-Free Speech Enhancement Based on Higher-Order Statistics Pursuit

In this dissertation, I propose a new speech enhancement theory for hearing aid and video conference systems, where the output speech quality of nonlinear signal processing is controlled using higher-order statistics. In these systems, since interference signals and noise deteriorate the quality of a user’s input speech, it is desirable to develop a digital signal processing technique to clean the microphone signal before it is stored. In order to remove background noise, there have been many studies on noise reduction methods that have high noise reduction performance. However, the reduction of noise spectra often introduces an artificial distortion in the residual noise, which is a well-known phenomenon so-called musical noise, leading to a serious deterioration of sound quality. In this study, first, I theoretically clarify that iterative spectral subtraction with a specific parameter generates almost no musical noise even with high noise reduction performance. Based on the fact, I propose a musical-noise-free theory for single-channel speech enhancement using iterative nonlinear signal processing. In the proposed theory, the fixed point in kurtosis yields the no-musical-noise state; we call this ``musical-noise-free condition.’’ In addition, I mathematically derive the optimal internal parameter settings to satisfy the musical-noise-free condition based on higher-order statistics pursuit. Next, I propose a new iterative blind signal extraction method integrating blind noise estimation and iterative noise reduction for addressing reduction of nonstationary noise. This method includes a dynamic estimation of the noise power spectral density based on independent component analysis and multi-channel Wiener filtering, which can provide effective reduction even in the case that noise has time-varying properties. From the experimental evaluation, it is asserted that the proposed methods are advantageous to the conventional speech enhancement methods in terms of total sound quality.

1161028 GEMALYN DACILLO ABRAJANO
Rainfall Attenuation in Microwave Mesh Networks

Microwave mesh networks with frequencies from 10 GHz are affected mainly by rainfall attenuation. This effect of rain on the propagation of the microwave signals has been extensively studied in the past to minimize the effects on the communication links. This research studied how route diversity can be used to lessen the effect of rain attenuation on the delivery of information of mesh networks. Route diversity is the availability of more than one physical path from the source of information to the destination. This study looks into the links' orientation and angular separation from each other in implementing the route diversity. Network configurations where simulated using real rainfall data to see the differences in attenuation for links oriented in different directions. The results showed significant gain for the route diversity.

The next aim of this study is to determine the location and intensity of the rainfall field from the attenuation of a mesh network. Because of the links' susceptibility to rain, it has been suggested in the past that a rainfall sensing system can be built using microwave networks. The information on rainfall attenuation can be used to detect and reconstruct the rainfall field without using other weather sensors. However, the resolution of the link attenuation data is not enough for detecting the sporadic rain. In this study, we propose an efficient use of compressed sensing algorithm for improving the resolution. The proposed method can identify intense rainfall rates like that of "guerilla rain", which are of interest because they can cause disasters like landslides and flash floods. The links' rainfall attenuation can be used as the input to a rain sensing system. This system can complement existing weather sensing systems and act as standalone rainfall sensor in areas where traditional weather sensors are not available.

1161012 間島慶
Neural Decoding of Electrocorticographic signals

Over the last decade, neural decoding technology based on machine learning has been developed and enabled us to extract fine information on visual experiences and motor commands from measured neural signals, which is becoming a powerful tool for revealing neural representations and BMI application. To extract information with high predictive performance, neural recording methods that provide high spatiotemporal resolution and signal stability are required. A promising candidate is electrocorticogram (ECoG), which measures population activity of neurons with electrodes placed on the surface of the brain.

Here, toward high-performance decoding with ECoG, we tested the utility of ECoG systems in animal and human studies, and improved techniques to extract information from ECoG data. In chapter 2, the signal stability of ECoG responses recorded via a newly developed high-density mesh electrode array was tested. Collaborators applied it to the visual cortex in rats and this thesis demonstrates above-chance, generalized decoding performance for simple visual stimulation, using six hours of continuous data. In chapter 3, by applying decoding analysis to simultaneously recorded ECoG, LFP, and MUA signals from the monkey IT cortex, extractable information on visually presented objects was compared. The resultant decoding performance with ECoG was high and comparable with LFP and MUA. In chapter 4, ECoG was used to investigate how face-selective regions and written word- selective regions are distributed on the human cortex, which is considered a challenging task with fMRI. Results reveal that there exist multiple, separate face- and written word-selective regions in the human cortex. In chapter 5, using ECoG responses from human patients when they viewed objects, efficient input signal features for decoding analysis were explored. Spectral powers, phases and temporal correlations of ECoG signals were used as input features, and the decoding performances were compared. Results show the performance using temporal correlations between ECoG electrodes is higher than using spectral powers and phases in individual electrodes.

Those results suggest that the combination of ECoG recordings and neural decoding techniques is a powerful approach for extracting neural information, and we can considerably improve a decoder’s predictive performance by using signal features that take into account fine temporal patterns in ECoG signals.

1161018 小木曽智信
日本語通時コーパスのための形態論情報アノテーションの研究

近年，コーパスを用いた日本語研究が盛んになり，国立国語研究所においては，日本語史研究のための通時コーパスを構築する準備が進められている．通時コーパスには，現代語のコーパスと同様の形態論情報を付与することが必要とされているが，従来は歴史的な日本語資料に十分な精度で形態素解析を施すことができず，形態論情報のアノテーションは困難であった．

このような中，本研究は，日本語通時コーパスのための形態論情報アノテーションを実現するために自然言語処理技術を応用して，次の貢献を行った．

古文の形態素解析を実現するための言語資源として，新たに辞書と学習用のコーパスを整備し，統計的機械学習にもとづく形態素解析技術を用いて，中古和文と近代文語文について実用的な精度（見出し語認定のF値で0.96以上）が得られる形態素解析システムを実現した．
上記の言語資源と通時コーパス自体の整備のために，辞書の見出し語とコーパスの出現形とを関連付けながら形態論情報の修正作業を行うことのできるデータベースシステム（国語研究所「形態論情報データベース」）を構築し，通時コーパス整備の基盤を整えた．
通時コーパスに収録される多様なテキストに対して高い精度で形態論情報のアノテーションを行う方法を検討し，近世口語文，和漢混淆文，旧仮名遣いの口語文について，実際に形態論情報のアノテーションを行った．
上記の形態素解析技術や形態論情報付きの通時コーパスを人文科学系の研究者に使いやすい形で提供するために，新たなツールの作成・既存のツールの適用を行った．

以上により，通時コーパス構築の基盤を構築し，通時コーパスを用いた日本語史研究のための環境を整備した．

A study on Morphological Annotation for the Japanese Diachronic Corpus

Toshinobu OGISO (1161018)

Recently, corpus-based study of Japanese language has become popular, and a diachronic corpus of Japanese is being developed at the National Institute for Japanese Language and Linguistics (NINJAL) to study history of Japanese language. In order to construct a richly annotated diachronic corpus of Japanese, morphological analysis of historical Japanese text is required. However, morphological analysis of old Japanese texts with adequate accuracy was impossible by conventional means, and annotation of diachronic corpora with morphological information was difficult using existing technology.

Given this situation, this study applied natural language processing technology to carry out the annotation of morphological information for the diachronic corpus of Japanese, and made contributions as below.

Dictionaries and corpora of historical Japanese text were newly created as language resources to carry out the morphological analysis of historical Japanese. Using these resources and a morphological analyzer based on statistical machine learning, morphological analyses of historical Japanese texts in the literary style of the Meiji era and morphological analysis of literature of the Heian era were achieved with high accuracy (over 96% at lemmatization level).
For compilation and maintenance of the language resources mentioned above and the diachronic corpus itself, a database system (NINJAL morphological information database) was developed. The database system makes it possible to modify annotations in the diachronic corpus and in related dictionary entries, while maintaining consistency between the two.
For a variety of texts included in a diachronic corpus, methods for performing annotation of morphological information with high accuracy were studied, and actual annotation was conducted.
For researchers of Japanese language and scholars of humanities, some newly created tools and existing software were applied to the diachronic corpus in order to make it possible to use the morphological analysis system and the annotated diachronic corpora easily.

In this way, a basis for the compilation of diachronic corpora was constructed and an environment for studying the history of the Japanese language using diachronic corpora was provided.

1161009 林部祐太

Japanese Predicate Argument Structure Analysis Based on Positional Relations between Predicates and Arguments(述語と項の位置関係に基づく日本語述語項構造解析)

述語項構造解析の目的は,述語とそれらの項を文の意味的な構成単位として, 文章から「誰が何をどうした」という意味的な関係を抽出することである.これは,機械翻訳や自動要約などの自然言語処理の応用において重要なタスクの 1 つである.

一般に,項は述語に近いところにあるという特性がある.そのため,従来の述語項構造解析の研究では,候補を述語との位置関係でグループ分けし,あらかじめ求めておいたグループ間の優先順序に従って正解項を探索してきた.しかしながら,その方法には異なるグループに属する候補同士の比較ができないという問題がある.

そこで,異なるグループごとに最尤候補を選出し,それらの中から最終的な出力を決めるモデルを提案する.このモデルは優先度の高いグループに属する候補以外も参照することによって最終的な決定を行うことができ,全体的な最適化が可能である.

実験では,提案手法は優先順序に従う解析よりも精度が向上することを確認した.そして,述語項構造解析の精度を向上させるために必要な今後の課題について,述語の種類に応じて分析し議論する.

情報科学研究科副専攻長

平成25年度 情報科学研究科 博士学位論文発表梗概

A study on Morphological Annotation for the Japanese Diachronic Corpus

Toshinobu OGISO (1161018)

Japanese Predicate Argument Structure Analysis Based on Positional Relations between Predicates and Arguments(述語と項の位置関係に基づく日本語述語項構造解析)

平成25年度情報科学研究科博士学位論文発表梗概