コロキアムB発表

日時: 12月19日(月)3限（13:30-15:00）

会場: L1

司会: 嶋利一真

阪上　遥	M, 1回目発表	ユビキタスコンピューティングシステム	安本　慶一, 中村　哲, 諏訪　博彦, 松田　裕貴
title: A study of the digital twin of people flow combining congestion sensing and people flow simulation abstract: Before the spread of the coronavirus pandemic, overtourism was a problem in Japan's tourist destinations, which were concentrated in a small number of locations. Post-pandemic tourism requires a mechanism for optimizing people flow that solves this problem and encourages behavioral change, such as moving tourists from crowded places to quieter places, and dispersing them. In this study, we focus on the prediction of future people flow, and aim to predict future congestion and construct a digital twin of people flow by combining congestion sensing, which captures current human behavior, and people flow simulation, which can take into account local conditions and social situations. To achieve this, we take the following three approaches. First, the past and current congestion (number of people staying) at each Point of Interest (PoI) is obtained from the GPS trajectory data calculated using the "point-type flow population data" provided by Agoop, Inc. Then, the probability of movement between PoIs and transition probability matrices are obtained for each hour. Next, we simulate the movement of all people in the target area by creating movement routes that take into account the attribute information of people. Synthetic population data is used to obtain information on the attributes of people in the target area. Synthetic population data is a virtual composite data based on publicly available statistical data such as census results, and includes household information (address and composition), member age, gender, income, and so on. In order to minimize the error between the predicted and actual congestion levels, a transition probability matrix is used to iteratively modify a set of travel routes to construct a more accurate and realistic people flow simulation mechanism. language of the presentation: Japanese 発表題目: 混雑センシングと人流シミュレーションを組み合わせた人流のデジタルツインに関する研究発表概要: 新型コロナウイルス流行以前の日本の観光地では，一極集中のオーバーツーリズムが問題となっていた．ポストコロナの観光においては，この問題を解決し，観光客を混雑する場所から閑散としてる場所へ移動させるといった行動変容を促し，分散させる人流最適化の仕組みが必要である．そこで本研究では，人流の最適化の中でも特に，未来の人流予測に焦点を当て，現在の人々の行動を捉える混雑度センシングと，地域ごとの事情や社会的情勢を考慮可能な人流シミュレーションを組み合わせることで，未来の混雑度の予測および人流のデジタルツインの構築を目指す．この実現のために，次の３つのアプローチを行う．まず，株式会社 Agoop によって提供されている「ポイント型流動人口データ」を使用し算出したGPSの軌跡データから，各場所（PoI：Point of Interest）の過去・現在の混雑状況（滞在人数データ）を取得し，時間ごとのPoI間の移動確率及び遷移確率行列を求める．次に，人の属性情報を考慮した移動経路の作成を行い，対象エリアの全ての人の移動のシミュレーションを行う．対象エリアの人の属性情報の取得には，合成人口データを利用する．合成人口データとは，国勢調査の結果など公開されている統計データに基づいて，仮想的に合成されたデータであり，世帯情報（住所・構成）・構成員の年齢，性別，所得などを含んでいる．さらに，予測した混雑度と実際の混雑度との誤差を最小化するために，遷移確率行列を用い，移動経路群を繰り返し修正を行うことで，より確度の高いリアルな人流シミュレーション機構を構築する．

杉本　浩之	M, 1回目発表	知能コミュニケーション	中村　哲, 安本　慶一, 田中　宏季
title: Analyzing the speaking style of clinical psychologists in terms of entrainment abstract: In many studies, acoustic/prosodic entrainment is defined as that acoustic/prosodic features in one speaker’s utterances are getting similar to ones in his or her interlocutor’s utterances in a conversation. Generally, entrainment contributes to establishing good relationships between humans or between humans and robots. However, many studies on entrainment are limited, and some studies have reported that dis-entrainment, which is an antonym of entrainment, also contributes to establishing good relationships in some cases.When we achieve robots or virtual agents for cognitive behavioral therapy, it seems to be good to reproduce the speaking style of clinical psychologists. Entrainment is a factor of the speaking style, however it has not been revealed how they entrain their voice. Therefore it is needed to analyze how they entrain their voice as a first step. In this study, I focused on entrainment in their utterances in conversations with elderly people to reveal how they entrain their voice in comparison with care-takers. language of the presentation: Japanese

大西　一誉	M, 1回目発表	知能コミュニケーション	中村　哲, 安本　慶一, 田中　宏季
title: Multimodal Turn-taking Prediction without Explicit Labels abstract: In this study, we propose a multimodal and continuous turn-taking model for Social Skill Training agents. Previous research has focused on natural human-agent interaction and the importance of turn-taking in dialogue. Recently, methods for continuously predicting speech activity and mapping it to turn-taking events have been proposed. However, current turn-taking models have not sufficiently explored multimodal and continuous approaches. To address this gap, we implement a Transformer-based turn-taking model and compare four different output windows. Additionally, we define and evaluate four zero-shot tasks related to turn-shift and back-channel prediction. Through our proposed model and evaluations, we aim to advance the development of ocial Skill Training agents. language of the presentation: Japanese 発表題目: 明示的なラベルを用いないマルチモーダルturn-taking予測発表概要: 本研究では，Social Skill Trainingのためのエージェントの開発を行っている．過去の研究では，人対エージェントにおける自然な対話の実現に取り組んでおり，特に対話におけるターンテイクの重要性が指摘されている．そこで近年では，連続的に音声活動を予測し，ターンテイクイベントにマッピングする手法が提案されている．しかし，現在のターンテイクモデルは，マルチモーダルかつ連続的なアプローチが十分に検討されていない．そこで本研究では，エージェントに適応したマルチモーダルかつ連続的なターンテイクモデルを提案する．その第一段階として，Transformerを用いたターンテイクモデルの実装を行い，4つの異なるタイプの出力窓を比較する．評価には，ターンシフトとバックチャネルの予測に関連する4つのゼロショットタスクを定義し，評価を行う．

LEE SANGMYEONG	M, 1回目発表	知能コミュニケーション	中村　哲, 吉野　幸一郎（客員教授）, 品川　政太朗
title: Prompt Naturalization for Diffusion-Model-based Text-to-Image Editing abstract: Text-to-Image Synthesis technology is entering its golden age as paradigm-altering generative models are coming into being. However, relatively little attention is paid to how natural our text-prompt should be when using the system. Current editing research based on state-of-the-art diffusion model is preassumed that the editing prompt contains information of the previous image. Not only does this harm naturalness but also confuses the user if he/she intends to undergo several editing processess to build up a complex image. In this research, we seek to alleviate this situation by setting up a compiler-bridge between users' prompts and system's inputs, so that both sides can enjoy comfort to the utmost level without greatly modifying the model structure. language of the presentation: Japanese 発表題目: ディフュージョンモデル基盤のテキストー画像編集の為の入力の自然化発表概要: テキストー画像合成技術は、生成モデルの画期的な発展に伴い黄金期を迎えている。しかし、このシステムを使う時のテキストの入力がどれほど自然性を持つかについては相対的に注目が少ない。最新のディフュージョンモデルに基盤した現在の編集の研究は、入力のテキストが直前の画像の情報を反映することが前提としてされている。これは自然性を下げるだけでなく、複雑な画像を作るために数回編集過程を繰り返すときユーザを混乱させることになる。当研究は、ユーザからの入力とシステムに入る入力の間に橋を作り、モデルを過度に変えずに両側を最上の状況で臨めさせることで現状の改善を図る。

渡邉　寛大	M, 1回目発表	知能コミュニケーション/ロボット対話知能	中村　哲, 吉野　幸一郎（客員教授）, 湯口　彰重（客員助教）
title: Corpus Collection for Building a Dialogue System for Experience-based Knowledge Recall abstract: In order for dialogue systems to be used in human society, they need to be highly reliable and have the ability to remember and learn in order to build long-term relationships. For example, in the case of dialogue between humans, it is possible to remember and use previous conversations in multiple sessions, and to deepen one's own knowledge based on keywords that appear in the conversations, and to use this knowledge in future conversations. This study aims to construct such a model by collecting and analyzing a corpus of chat dialogues, considering the dialog histories of past sessions and the external databases associated with them, using the WoZ method. language of the presentation: Japanese 発表題目: 経験に基づく知識の想起を行う対話システムの構築に向けたコーパス収集発表概要: 対話システムが人間社会で活用されるには，高い信頼性と長期的な関係性を築くための記憶・学習能力が必要である．例えば人間同士の対話であれば，複数回行われる対話の中で以前の会話を記憶・利用し，またそこに出現するキーワードなどから自身の知識を深めて次回以降の会話に活用することができる．本研究ではこうしたモデルを構築することを指向して，過去セッションの対話履歴とそれに紐づけられた外部データベースを考慮した雑談対話コーパスをWoZ法により収集し，分析を行う．

ZHENG HAOZHI	M, 1回目発表	ディペンダブルシステム学	井上　美智子, 笠原　正治, 大下　福仁（客員教授）, 江口　僚太
title: Synchronization Algorithm for Swarm Robots abstract: Swarm robotics is a rapidly growing area of research due to its potential for tackling challenging tasks through collaboration. However, the implementation of formal distributed computing techniques with swarm robots has received less attention. One of the challenges in using these techniques with robots is the assumption of full-synchronization in many discrete models, which can be difficult to recreate on physical robots. In this work, we aim to address this issue by proposing a fault-tolerant synchronization algorithm for swarm robots, enabling them to directly execute discrete algorithms. language of the presentation: Japanese 発表題目: 群ロボットに対する同期アルゴリズム発表概要: 群ロボットは、協調的に難問を解決する可能性があるため、最近注目されている。しかし、形式的な分散コンピューティング技術を群ロボットで実装することに焦点を当てた研究は少ない。これらの技術をロボットで使用する上での課題の1つは、多くの離散モデルが完全な同期を前提としているため、実際のロボットで再現することが困難である。本研究では、故障耐性の同期アルゴリズムを提案することで、この問題を解決し、群ロボットが直接離散計算モデルを実行できるようにすることを目指す。

会場: L3

司会: 日永田智絵

坂本　州	M, 1回目発表	数理情報学/コミュニケーション学	池田　和司, 岩田　具治（客員教授）, 田中　佑典（客員准教授）
title: Identifying Physical Law of Hamiltonian Systems via Meta-Learning considering physical conditions abstract: There has been a lot of research on tasks related to physics using machine learning, and there has been a lot of research on incorporating prior knowledge of physics common among tasks into learning. In particular, the Hamiltonian neural network (HNN), which uses a descriptive form of dynamical systems based on the Hamiltonian, shows excellent accuracy as a neural network that can learn dynamical behavior while satisfying conservation laws. Furthermore, by applying meta-learning to HNN, the behavior of dynamical systems can be learned efficiently from a small amount of observed data. In this study, we propose a model that can learn HNN using meta-learning by adding physical conditions as data. language of the presentation:Japanese 発表題目: 物理条件を考慮したハミルトニアンニューラルネットワークのメタ学習発表概要:物理学に関係する機械学習を用いたタスクの研究は多く、タスク間に共通する物理学の事前知識を学習に組み込む研究が進んでいる。特にハミルトニアンを用いた力学系の記述形式を利用したハミルトニアンニューラルネットワーク（HNN）は保存則を満たしつつ力学の動作を学習できるニューラルネットワークとして優れた精度を示す。さらに、HNNにメタ学習を適用することで少ない観測データから効率的に力学系の動作を学習できる。本研究では、メタ学習を用いたHNNの学習に物理的な条件をデータとして加えて学習できるモデルを提案する。