コロキアムB発表

日時: 9月22日(水)1限（9:20~10:50）

会場: L1

司会: 福嶋誠

渡邊　剛	M, 2回目発表	知能システム制御	杉本　謙二,	和田　隆広,	松原　崇充
title: Estimation of Garbage Weight from Camera Images in Bucket of Garbage Crane abstract: In recent years, efforts to automate garbage crane operations have been attracting attention. In order to determine whether the crane bucket has grabbed a large amount of garbage, it is necessary to measure the weight of the garbage after lifting the bucket, because the weight cannot be estimated during the grabbing operation. In this research, a camera is attached to the inside of the bucket, and the weight of the garbage that can be grabbed is estimated from the image information inside the bucket during the grabbing operation, aiming for time-efficient operation. A bucket with a camera embedded inside is attached to the garbage crane used in the simulated environment. Then, we estimated the weight using Deep Kernel Learning from the image data during the operation of grabbing the garbage. language of the presentation: Japanese 発表題目: ゴミクレーンにおけるバケット内カメラ画像からのゴミ重量推定発表概要: 近年ではごみクレーン操作を自動化する取り組みが注目されている。クレーンのバケットでごみを沢山掴めたかを判断する際、ごみの掴み動作中に重さを推定できないので、バケットを持ち上げてから重さを測定する必要がある。本研究ではバケット内部にカメラを取り付けて、掴み動作中におけるバケット内の画像情報から、掴めるごみの重さを推定することで時間効率の良い操作を目指す。模擬環境で使用するごみクレーンにカメラが内部に埋め込まれたバケットを取り付けた。そして、ごみを掴む動作中の画像データから、Deep Kernel Learning を用いた重さの推定を行なった。

権　裕煥	D, 中間発表	知能システム制御	杉本　謙二,	和田　隆広,	松原　崇充
title: Preferential Bayesian Cost-function Optimization for Food Arrangement abstract: Preference-based methods can estimate hidden evaluation functions or optimal points from the preference relation of the queries. However, it is difficult to naively use these methods to estimate food arrangement preferences because the search space becomes higher-dimensional when the number of ingredients increases. Our Preferential Bayesian Cost-function Optimization allows estimating preferable arrangement by expressing the evaluation function with lower-dimensional weights and searching for the optimal weight. We conducted experiments focusing on tempura arrangement to investigate the effectiveness of our framework. language of the presentation: Japanese

柴田　一騎	D, 中間発表	知能システム制御	杉本　謙二,	和田　隆広,	松原　崇充
title: Deep reinforcement learning of event-triggered communication and control for multi-agent cooperative transport abstract: Most multi-agent cooperative transport studies used analytical approach and derived suitable communication topologies for simple tasks or tasks with strong assumptions. However, there are few principled approaches for deriving minimum-communication strategies in general setups. This may be because the minimum-communication design problem cannot be separated from the design problem of the control strategy for multi-agent systems. Therefore, these problems need to be addressed simultaneously. In this work, we propose a multi-agent reinforcement learning approach to address the simultaneous design problem of communication and control strategies.　We demonstrated that our approach could balance the transport performance and communication savings through numerical simulations. language of the presentation: Japanese 発表題目: 深層強化学習によるマルチエージェント協調輸送のコミュニケーションと制御の獲得発表概要: マルチエージェント協調輸送の先行研究では、シンプルなタスクや強い仮定の下で、エージェント間の通信を最小にするコミュニケーション戦略を解析的に導出する。しかしながら、一般的な設定では、コミュニケーション戦略を導出する枠組みはほとんど存在しない。この理由として、コミュニケーションの設計問題は、マルチエージェントシステムの制御戦略の設計問題と切り離せないことが考えられる。したがって、これらの設計問題は同時に考える必要がある。本研究では、深層強化学習を用いて、マルチエージェント協調輸送のためのコミュニケーション戦略と制御戦略の両方を同時に最適化する枠組みを提案する。固定レート通信と比較してスパースな通信のみで良好な制御性能を達成できることを数値実験的に示す。