Augmented Human Communication

Research Staff

  • Prof. Satoshi Nakamura

    Prof.
    Satoshi Nakamura

  • Assoc.Prof. Katsuhito Sudoh

    Assoc.Prof.
    Katsuhito Sudoh

  • Assoc.Prof. Yu Suzuki

    Assoc.Prof.
    Yu Suzuki

  • Assist.Prof. Sakriani Sakti

    Assoc.Prof.
    Sakriani Sakti

  • Assist.Prof. Keiji Yasuda

    Assoc.Prof.
    Keiji Yasuda

  • Assist.Prof. Koichiro Yoshino

    Assist.Prof.
    Koichiro Yoshino

  • Assist.Prof. Hiroki Tanaka

    Assist.Prof.
    Hiroki Tanaka

  • Assist.Prof. Graham Neubig

    Affiliate Assoc.Prof.
    Graham Neubig

E-mail { s-nakamura, sudoh, ysuzuki, ssakti, koichiro, hiroki-tan, neubig }[at] is.naist.jp, ke-yasuda [at] dsc.naist.jp

Go Beyond the Communication Barrier

The AHC Laboratory pursues research to solve problems related to human communication based on speech and language, paralanguage, and non-verbal information. By applying various artificial intelligence technologies including deep learning, our lab is pursuing tasks that were previously not able to be solved. Additionally, we seek knowledge related to human cognitive functions, as well as new information through brain measurement, and use it to perform research. Especially in research activities, we focus not only on theoretical aspects, but also on the applicability of technology, and aim at building prototype systems and validation. Below you can find our research areas.

NAIST launched the NAIST big data analytics project in April 2014, and subsequently the NAIST Data Science Center (NAIST DSC) in 2017. NAIST DSC focuses on material informatics, chemo-informatics, and social informatics by applying machine learning and artificial intelligence methodologies. The project also encourages close collaboration with industry. (For details, please see http://bigdata.naist.jp/, http://www-dsc.naist.jp/dsc_en/ )

Research Area

Real-time simultaneous speech-to-speech translation

Our current research project focuses on human-like simultaneous speech interpretation of complex utterances such as news and lectures, interpretation support technology for conferences attended by multiple speakers who speak multiple languages, and multimodal interpretation technology. (Fig. 1)

Natural Language Processing

Our research into natural language processing focuses on deep learning machine translation and natural language interfaces between humans and computers, thus allowing computers to understand natural language queries and commands so that they may answer questions and follow directions.

Multi-lingual statistical speech processing

Speech recognition and synthesis are fundamental technologies for realizing natural human-computer interaction. We study statistical methodologies such as hidden Markov models, Gaussian mixture models, deep neural networks, and recurrent neural networks. We are extending these models for emotional, conversational spontaneous, and multilingual speech.

Goal-oriented and Chatbot-type Spoken Dialog System

We focus on new statistical dialogue models for natural dialogue using individuality modeling, verbal information, intonation, emotion, face and gesture information. (Fig. 2)

Brain Analysis for Verbal and Non-verbal Communication

Our research on cognitive communication analyzes brain activity to detect real-time communication difficulty using Electroencephalograms (EEG). We also perform research on support for communication disabilities such as autism and dementia. (Fig.3)

Information Distillation

Research to summarize information that comes from a variety of complex data sources and to inform people of the summarized results in an understandable manner.

Knowledge Acquisition

Research on knowledge acquisition and understanding of objects in the real world to support the human-machine communication, in addition to available knowledge from a variety of information sources such as the Web.

Fig.1  Speech-to-speech translation

Fig.1 Speech-to-speech translation

Fig.2  Spoken dialogue system

Fig.2 A spoken dialogue system

Fig.3 EEG measurement system

Fig.3 A EEG measurement system