KAWAHARA Tatsuya

J-GLOBAL         Last updated: Sep 3, 2019 at 01:15
 
Name
KAWAHARA Tatsuya
URL
http://sap.ist.i.kyoto-u.ac.jp/EN/
Affiliation
Kyoto University
Job title
Professor
Degree
Master of Engineering (Kyoto University), Doctor of Engineering (Kyoto University)
ORCID ID
0000-0002-2686-2296

Profile

Tatsuya Kawahara received the B.E. degree in 1987, the M.E. degree in 1989, and the Ph.D. degree in 1995, all in information science, from Kyoto University, Kyoto, Japan. From 1995 to 1996, he was a Visiting Researcher at Bell Laboratories, Murray Hill, NJ, USA. He is currently a Professor in the School of Informatics, Kyoto University. He has also been an Invited Researcher at ATR and NICT.
Dr. Kawahara is a board member of APSIPA and ISCA, and a Fellow of IEEE.

Academic & Professional Experience

 
Apr 2003
 - 
Today
Professor, School of Informatics, Kyoto University
 
Apr 2003
 - 
Mar 2016
Professor, Academic Center for Computing & Media Studies, Kyoto University
 
Apr 1998
 - 
Mar 2003
Associate Professor, School of Informatics, Kyoto University
 
Jun 1995
 - 
Apr 1998
Associate Professor, Faculty of Engineering, Kyoto University
 
Sep 1995
 - 
Aug 1996
Visiting Researcher, Bell Laboratories
 
Nov 1990
 - 
May 1995
Research Associate, Faculty of Engineering, Kyoto University
 

Education

 
Mar 1987
 - 
Apr 1989
Department of Information Science, School of Engineering, Kyoto University
 
Apr 1983
 - 
Mar 1987
Department of Information Science, Faculty of Engineering, Kyoto University
 

Committee Memberships

 
Jan 2018
 - 
Today
APSIPA  BoG member
 
Jan 2018
 - 
Today
APSIPA Transactions on Signal and Information Processing  Editor in Chief
 
Sep 2017
 - 
Today
ISCA  Board member
 
Jan 2017
 - 
Dec 2018
IEEE SPS Kansai Chapter  Chair
 
Jan 2014
 - 
Dec 2015
APSIPA  VP-Publications (BoG member)
 

Published Papers

 
End-to-End Modeling for Selection of Utterance Constructional Units via System Internal States
K.Tanaka, K.Inoue, S.Nakamura, K.Takanashi, T.Kawahara
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)      2019   [Refereed]
Engagement-based Adaptive Behaviors for Laboratory Guide in Human-Robot Dialogue
K.Inoue, D.Lala, K.Yamamoto, K.Takanashi, T.Kawahara
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)      2019   [Refereed]
A Job Interview Dialogue System with Autonomous Android ERICA
K.Inoue, K.Hara, D.Lala, S.Nakamura, K.Takanashi, T.Kawahara
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)      2019   [Refereed]
Transfer Learning of Language-Independent End-to-End ASR with Language Model Fusion
H.Inaguma, J.Cho, M.K.Baskar, T.Kawahara, S.Watanabe
Proc. IEEE-ICASSP   6096-6100   2019   [Refereed]
Multi-speaker Sequence-to-Sequence Speech Synthesis for Data Augmentation in Acoustic-to-Word Speech Recognition
S.Ueno, M.Mimura, S.Sakai, T.Kawahara
Proc. IEEE-ICASSP   6161-6165   2019   [Refereed]
Prosodic Characteristics of Japanese Newscaster Speech for Different Speaking Situations
S.Nakamura, C.T.Ishi, T.Kawahara
Proc. Int'l Congress Phonetic Sciences (ICPhS)      2019   [Refereed]
ERICA and WikiTalk
D.Lala, G.Wilcock, K.Jokinen, T.Kawahara
Proc. IJCAI   Demo. Paper    2019   [Refereed]
Improving Transformer-based Speech Recognition Systems with Compressed Structure and Speech Attributes Augmentation
S.Li, R.Dabre, X.Lu, P.Shen, T.Kawahara, H.Kawai
Proc. INTERSPEECH      2019   [Refereed]
Investigating Radical-based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese
S.Li, X.Lu, C.Ding, P.Shen, T.Kawahara, H.Kawai
Proc. INTERSPEECH      2019   [Refereed]
End-to-End Articulatory Attribute Modeling for Low-resource Multilingual Speech Recognition
S.Li, C.Ding, X.Lu, P.Shen, T.Kawahara, H.Kawai
Proc. INTERSPEECH      2019   [Refereed]
Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning
Y.Li, T.Zhao, T.Kawahara
Proc. INTERSPEECH      2019   [Refereed]
Analysis of effect and timing of fillers in natural turn-taking
D.Lala, S.Nakamura, T.Kawahara
Proc. INTERSPEECH      2019   [Refereed]
Turn-taking Prediction Based on Detection of Transition Relevance Place
K.Hara, K.Inoue, K.Takanashi, T.Kawahara
Proc. INTERSPEECH      2019   [Refereed]
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
K.Shimada, Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, T.Kawahara
IEEE/ACM Trans. Audio Speech & Language Process.   27 108-127   2019   [Refereed]
Joint Dialog Act Segmentation and Recognition in Human Conversations Using Attention to Dialog Context
T.Zhao, T.Kawahara
Computer Speech and Language   50 108-127   2019   [Refereed]
Voice Input Tutoring System for Older Adults using Input Stumble Detection
T.Hagiya, K.Hoashi, T.Kawahara
Proc. ACM Int'l Conf. Intelligent User Interfaces (IUI)   415-419   2018   [Refereed]
Audio-Visual Conversation Analysis by Smart Posterboard and Humanoid Robot
T.Kawahara, K.Inoue, D.Lala, K.Takanashi
Proc. IEEE-ICASSP   6573-6577   2018   [Refereed]
An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition
H.Inaguma, M.Mimura, K.Inoue, K.Yoshii, T.Kawahara
Proc. IEEE-ICASSP   6214-6218   2018   [Refereed]
Efficient Learning of Articulatory Models based on Multi-label Training and Label Correction for Pronunciation Learning
R.Duan, T.Kawahara, M.Dantsuji, H.Nanjo
Proc. IEEE-ICASSP   6239-6243   2018   [Refereed]
Unsupervised Beamforming based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition
K.Shimada, Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, T.Kawahara
Proc. IEEE-ICASSP   5734-5738   2018   [Refereed]
Statistical Speech Enhancement based on Probabilistic Integration of Variational Autoencoder and Non-negative Matrix Factorization
Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, T.Kawahara
Proc. IEEE-ICASSP   716-720   2018   [Refereed]
Acoustic-to-Word Attention-based Model Complemented with Character-level CTC-based Model
S.Ueno, H.Inaguma, M.Mimura, T.Kawahara
Proc. IEEE-ICASSP   5804-5808   2018   [Refereed]
Spoken Dialogue System for a Human-like Conversational Robot ERICA
T.Kawahara
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)   201-208   2018   [Invited]
Generating Fillers based on Dialog Act Pairs for Smooth Turn-Taking by Humanoid Robot
R.Nakanishi, K.Inoue, S.Nakamura, K.Takanashi, T.Kawahara
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)      2018   [Refereed]
Latent Character Model for Engagement Recognition Based on Multimodal Behaviors
K.Inoue, D.Lala, K.Takanashi, T.Kawahara
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)      2018   [Refereed]
A Unified Neural Architecture for Joint Dialog Act Segmentation and Recognition in Spoken Dialog System
T.Zhao, T.Kawahara
Proc. SIGdial Meeting Discourse & Dialogue   201-208   2018   [Refereed]
Engagement Recognition in Spoken Dialogue via Neural Network by Aggregating Different Annotators' Models
K.Inoue, D.Lala, K.Takanashi, T.Kawahara
Proc. INTERSPEECH   616-626   2018   [Refereed]
Prediction of Turn-taking Using Multitask Learning with Prediction of Backchannels and Fillers
K.Hara, K.Inoue, K.Takanashi, T.Kawahara
Proc. INTERSPEECH   991-995   2018   [Refereed]
Forward-Backward Attention Decoder
M.Mimura, S.Sakai, T.Kawahara
Proc. INTERSPEECH   2232-2236   2018   [Refereed]
Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition
S.Ueno, T.Moriya, M.Mimura, S.Sakai, Y.Yamaguchi, Y.Aono, T.Kawahara
Proc. INTERSPEECH   2424-2428   2018   [Refereed]
Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks
S.Li, X.Lu, R.Takashima, P.Shen, T.Kawahara, H.Kawai
Proc. INTERSPEECH   3708-3712   2018   [Refereed]
Independent Low-Rank Tensor Analysis for Audio Source Separation
K.Yoshii, K.Kitamura, Y.Bando, E.Nakamura, T.Kawahara
Proc. EUSIPCO   1671-1675   2018   [Refereed]
Evaluation of real-time deep learning turn-taking models for multiple dialogue scenarios
D.Lala, K.Inoue, T.Kawahara
Proc. ICMI   78-86   2018   [Refereed]
Human-like Conversational Robot
T.Kawahara
Proc. APSIPA ASC   1233-1239   2018   [Invited]
Bayesian Multichannel Speech Enhancement with a Deep Speech Prior
K.Sekiguchi, Y.Bando, K.Yoshii, T.Kawahara
Proc. APSIPA ASC   1233-1239   2018   [Refereed]
Dialogue Behavior Control Model for Expressing a Character of Humanoid Robots
K.Yamamoto, K.Inoue, S.Nakamura, K.Takanashi, T.Kawahara
Proc. APSIPA ASC   1732-1737   2018   [Refereed]
Improving Very Deep Time-Delay Neural Network with Vertical-Attention for Effectively Training CTC-based ASR Systems
S.Li, X.Lu, R.Takashima, P.Shen, T.Kawahara, H.Kawai
Proc. IEEE Spoken Language Technology Workshop (SLT)   77-83   2018   [Refereed]
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR
H.Inaguma, M.Mimura, S.Sakai, T.Kawahara
Proc. IEEE Spoken Language Technology Workshop (SLT)   212-218   2018   [Refereed]
Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition
M.Mimura, S.Ueno, H.Inaguma, S.Sakai, T.Kawahara
Proc. IEEE Spoken Language Technology Workshop (SLT)   477-484   2018   [Refereed]
Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms
Y.Bando, K.Itoyama, M.Konyo, S.Tadokoro, K.Nakadai, K.Yoshii, T.Kawahara, H.G.Okuno
IEEE/ACM Trans. Audio Speech & Language Process.   26 17-36   2018   [Refereed]
Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models
K.Itakura, Y.Bando, E.Nakamura, K.Itoyama, K.Yoshii, T.Kawahara
IEEE/ACM Trans. Audio Speech & Language Process.   26 17-36   2018   [Refereed]
Typing Tutor: Individualized Tutoring in Text Entry for Older Adults Based on Statistical Input Stumble Detection
T.Hagiya, T.Horiuchi, T.Yazaki, T.Kawahara
J. Information Processing   26    2018   [Refereed]
Exploiting Automatic Speech Recognition Errors to Enhance Partial and Synchronized Caption for Facilitating Second Language Listening
M.Mirzaei, K.Meshgi, T.Kawahara
Computer Speech and Language   49 17-36   2018   [Refereed]
Engagement recognition by a latent character model based on multimodal listener behaviors in spoken dialogue
K.Inoue, D.Lala, K.Takanashi, T.Kawahara
APSIPA Trans. Signal & Information Process.   7(e9) 1-16   2018   [Refereed]
Utterance behavior of users while playing basketball with a virtual teammate
D.Lala, Y.Li, T.Kawahara
Proc. ICAART   28-38   2017   [Refereed]
Bayesian Multichannel Nonnegative Matrix Factorization for Audio Source Separation and Localization
K.Itakura, Y.Bando, E.Nakamura, K.Itoyama, K.Yoshii, T.Kawahara
Proc. IEEE-ICASSP   551-555   2017   [Refereed]
Semi-supervised Ensemble DNN Acoustic Model Training
S.Li, X.Lu, S.Sakai, M.Mimura, T.Kawahara
Proc. IEEE-ICASSP   5270-5274   2017   [Refereed]
Effective Articulatory Modeling for Pronunciation Error Detection of L2 Learner without Non-native Training Data
R.Duan, T.Kawahara, M.Dantsuji, J.Zhang
Proc. IEEE-ICASSP   5815-5819   2017   [Refereed]
A Conversational Dialogue Manager for the Humanoid Robot ERICA
P.Milhorat, D.Lala, K.Inoue, T.Zhao, M.Ishida, K.Takanashi, S.Nakamura, T.Kawahara
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)      2017   [Refereed]
Attentive Listening System with Backchanneling, Response Generation and Flexible Turn-taking
D.Lala, P.Milhorat, K.Inoue, M.Ishida, K.Takanashi, T.Kawahara
Proc. SIGdial Meeting Discourse & Dialogue   127-136   2017   [Refereed]
Social Signal Detection in Spontaneous Dialogue Using Bidirectional LSTM-CTC
H.Inaguma, K.Inoue, M.Mimura, T.Kawahara
Proc. INTERSPEECH   1691-1695   2017   [Refereed]
Analysis of the Relationship between Prosodic Features of Fillers and Its Forms or Occurrence Positions
S.Nakamura, R.Nakanishi, K.Takanashi, T.Kawahara
Proc. INTERSPEECH   1726-1730   2017   [Refereed]
Combined Multi-channel NMF-based Robust Beamforming for Noisy Speech Recognition
M.Mimura, Y.Bando, K.Shimada, S.Sakai, K.Yoshii, T.Kawahara
Proc. INTERSPEECH   2451-2455   2017   [Refereed]
Listening Difficulty Detection to Foster Second Language Listening with the Partial and Synchronized Caption System
M.Mirzaei, K.Meshgi, T.Kawahara
Proc. EUROCALL   211-216   2017   [Refereed]
Transfer Learning based Non-native Acoustic Modeling for Pronunciation Error Detection
R.Duan, T.Kawahara, M.Dantsuji, H.Nanjo
Proc. Workshop Speech & Language Technology for Education (SLaTE)   50-54   2017   [Refereed]
Detecting listening difficulty for second language learners using Automatic Speech Recognition errors
M.Mirzaei, K.Meshgi, T.Kawahara
Proc. Workshop Speech & Language Technology for Education (SLaTE)   164-168   2017   [Refereed]
Semi-Blind Speech Enhancement Based On Recurrent Neural Network For Source Separation And Dereverberation
M.Wake, Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, T.Kawahara
Proc. IEEE Machine Learning for Signal Processing Workshop (MLSP)      2017   [Refereed]
Detection of Social Signals for Recognizing Engagement in Human-Robot Interaction
D.Lala, K.Inoue, P.Milhorat, T.Kawahara
Proc. AAAI Fall Sympo. Natural Communication for Human-Robot Collaboration      2017   [Refereed]
Modeling Difficulties of Second Language Learners using Speech Technology
T.Kawahara
Proc. Seoul International Conference on Speech Sciences (SICSS)   704-712   2017   [Invited]
Joint Learning of Dialog Act Segmentation and Recognition in Spoken Dialog Using Neural Networks
T.Zhao, T.Kawahara
Proc. IJCNLP   704-712   2017   [Refereed]
Automatic Meeting Transcription System for the Japanese Parliament (Diet)
T.Kawahara
Proc. APSIPA ASC   134-140   2017   [Refereed][Invited]
Emotion Recognition by Combining Prosody and Sentiment Analysis for Expressing Reactive Emotion by Humanoid Robot
Y.Li, C.T.Ishi, N.Ward, K.Inoue, S.Nakamura, K.Takanashi, T.Kawahara
Proc. APSIPA ASC      2017   [Refereed]
Cross-Domain Speech Recognition using Nonparallel Corpora with Cycle-consistent Adversarial Networks
M.Mimura, S.Sakai, T.Kawahara
Proc. IEEE Workshop Automatic Speech Recognition & Understanding (ASRU)   134-140   2017   [Refereed]
Incremental Training and Constructing the Very Deep Convolutional Residual Network Acoustic Models
S.Li, X.Lu, P.Shen, R.Takashima, T.Kawahara, H.Kawai
Proc. IEEE Workshop Automatic Speech Recognition & Understanding (ASRU)   222-227   2017   [Refereed]
Partial and Synchronized Captioning: A new tool to assist learners in developing second language listening skill
M.Mirzaei, K.Meshgi, Y.Akita, T.Kawahara
ReCALL Journal   29(2) 2174-2182   2017   [Refereed]
Assistive Typing Application for Older Adults Based on Input Stumble Detection
T.Hagiya, T.Horiuchi, T.Yazaki, T.Kato, T.Kawahara
J. Information Processing   25    2017   [Refereed]
Articulatory Modeling for Pronunciation Error Detection without Non-native Training Data based on DNN Transfer Learning
R.Duan, T.Kawahara, M.Dantsuji, J.Zhang
IEICE Trans.   E100-D(9) 2174-2182   2017   [Refereed]
Analysis and Prediction of Morphological Patterns of Backchannels for Attentive Listening Agents
T.Yamaguchi, K.Inoue, K.Yoshino, K.Takanashi, N.Ward, T.Kawahara
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)      2016   [Refereed]
Data Selection from Multiple ASR Systems' Hypotheses for Unsupervised Acoustic Model Training
S.Li, Y.Akita, T.Kawahara
Proc. IEEE-ICASSP   5875-5879   2016   [Refereed]
Interactional and Pragmatics-related Prosodic Patterns in Mandarin Dialog
N.Ward, Y.Li, T.Zhao, T.Kawahara
Proc. Int'l Conf. Speech Prosody      2016   [Refereed]
Leveraging Automatic Speech Recognition Errors to Detect Challenging Speech Segments in TED Talks
M.Mirzaei, K.Meshgi, T.Kawahara
Proc. EUROCALL   313-318   2016   [Refereed]
ERICA: The ERATO Intelligent Conversational Android
D.F.Glas, T.Minato, C.T.Ishi, T.Kawahara, H.Ishiguro
Proc. RO-MAN   22-29   2016   [Refereed]
Prediction and Generation of Backchannel Form for Attentive Listening Systems
T.Kawahara, T.Yamaguchi, K.Inoue, K.Takanashi, N.Ward
Proc. INTERSPEECH   2890-2894   2016   [Refereed]
Joint Optimization of Denoising Autoencoder and DNN Acoustic Model Based on Multi-target Learning for Noisy Speech Recognition
M.Mimura, S.Sakai, T.Kawahara
Proc. INTERSPEECH   3803-3807   2016   [Refereed]
Talking with ERICA, an autonomous android
K.Inoue, P.Milhorat, D.Lala, T.Zhao, T.Kawahara
Proc. SIGdial Meeting Discourse & Dialogue   Demo. Paper 212-215   2016   [Refereed]
Managing Dialog and Joint Actions for Virtual Basketball Teammates
D.Lala, T.Kawahara
Proc. IVA   Poster    2016   [Refereed]
Confidence Estimation for Speech Recognition Systems using Conditional Random Fields Trained with Partially Annotated Data
S.Li, X.Lu, S.Mori, Y.Akita, T.Kawahara
Proc. Int'l Sympo. Chinese Spoken Language Processing (ISCSLP)      2016   [Refereed]
Pronunciation Error Detection using DNN Articulatory Model based on Multi-lingual and Multi-task Learning
R.Duan, T.Kawahara, M.Dantsuji, J.Zhang
Proc. Int'l Sympo. Chinese Spoken Language Processing (ISCSLP)      2016   [Refereed]
Multimodal interaction with the autonomous android ERICA
D.Lala, P.Milhorat, K.Inoue, T.Zhao, T.Kawahara
Proc. ICMI   Demo. Paper 417-418   2016   [Refereed]
Prediction of Ice-breaking Between Participants Using Prosodic Features in the First Meeting Dialogue
H.Inaguma, K.Inoue, S.Nakamura, K.Takanashi, T.Kawahara
Proc. ICMI Workshop on Advancements in Social Signal Processing for Multimodal Interaction (ASSP4MI)      2016   [Refereed]
Annotation and analysis of listener's engagement based on multi-modal behaviors
K.Inoue, D.Lala, S.Nakamura, K.Takanashi, T.Kawahara
Proc. ICMI Workshop on Multimodal Analyses enabling Artificial Agents in Human-Machine Interaction (MA3HMI)      2016   [Refereed]
ASR errors as predictor of L2 listening difficulties and PSC enhancement
M.Mirzaei, K.Meshgi, T.Kawahara
Proc. Coling Workshop on Computational Linguistics for Linguistic Complexity (CL4LC)   192-201   2016   [Refereed]
Multi-lingual and Multi-task DNN Learning for Articulatory Error Detection
R.Duan, T.Kawahara, M.Dantsuji, J.Zhang
Proc. APSIPA ASC      2016   [Refereed]
Multi-modal Sensing and Analysis of Poster Conversations with Smart Posterboard
T.Kawahara, T.Iwatate, K.Inoue, S.Hayashi, H.Yoshimoto, K.Takanashi
APSIPA Trans. Signal & Information Process.   5(e2) 1-12   2016   [Refereed]
Semi-supervised Acoustic Model Training by Discriminative Data Selection from Multiple ASR Systems' Hypotheses
S.Li, Y.Akita, T.Kawahara
IEEE/ACM Trans. Audio Speech & Language Process.   24(9) 2174-2182   2016   [Refereed]
News Navigation System based on Proactive Dialogue Strategy
K.Yoshino, T.Kawahara
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)      2015   [Refereed]
Toward Adaptive Generation of Backchannels for Attentive Listening Agents
T.Kawahara, M.Uesato, K.Yoshino, K.Takanashi
Proc. Int'l Workshop Spoken Dialogue Systems (IWSDS)      2015   [Refereed]
Deep Autoencoders Augmented with Phone-class Feature for Reverberant Speech Recognition
M.Mimura, S.Sakai, T.Kawahara
Proc. IEEE-ICASSP   4356-4369   2015   [Refereed]
Language Model Adaptation for Academic Lectures using Character Recognition Result of Presentation Slides
Y.Akita, Y.Tong, T.Kawahara
Proc. IEEE-ICASSP   5431-5435   2015   [Refereed]
Named Entity Recognizer Trainable from Partially Annotated Data
T.Sasada, S.Mori, T.Kawahara, Y.Yamakata
Proc. PACLING   10-17   2015   [Refereed]
Errors in Automatic Speech Recognition versus Difficulties in Second Language Listening
M.Mirzaei, K.Meshgi, Y.Akita, T.Kawahara
Proc. EUROCALL   410-415   2015   [Refereed]
ASR Technology to Empower Partial and Synchronized Caption for L2 Listening Development
M.Mirzaei, T.Kawahara
Proc. Workshop Speech & Language Technology for Education (SLaTE)   65-70   2015   [Refereed]
Speech Dereverberation Using Long Short-Term Memory
M.Mimura, S.Sakai, T.Kawahara
Proc. INTERSPEECH   2435-2439   2015   [Refereed]
Ensemble Speaker Modeling using Speaker Adaptive Training Deep Neural Network for Speaker Adaptation
S.Li, X.Lu, Y.Akita, T.Kawahara
Proc. INTERSPEECH   2892-2896   2015   [Refereed]
Enhanced Speaker Diarization with Detection of Backchannels using Eye-gaze Information in Poster Conversations
K.Inoue, Y.Wakabayashi, H.Yoshimoto, K.Takanashi, T.Kawahara
Proc. INTERSPEECH   3086-3090   2015   [Refereed]
Discriminative Data Selection for Lightly Supervised Training of Acoustic Model using Closed Caption Texts
S.Li, Y.Akita, T.Kawahara
Proc. INTERSPEECH   3526-3530   2015   [Refereed]
Automatic Classification of Usability of ASR Result for Real-time Captioning of Lectures
Y.Akita, N.Kuwahara, T.Kawahara
Proc. APSIPA ASC   19-22   2015   [Refereed]
Synchrony in Prosodic and Linguistic Features between Backchannels and Preceding Utterances in Attentive Listening
T.Kawahara, T.Yamaguchi, M.Uesato, K.Yoshino, K.Takanashi
Proc. APSIPA ASC   392-395   2015   [Refereed]
Reverberant Speech Recognition Combining Deep Neural Networks and Deep Autoencoders Augmented with Phone-class Feature
M.Mimura, S.Sakai, T.Kawahara
EURASIP J. Advances in Signal Processing   2015(62) 1-13   2015   [Refereed]
Optimized Wavelet-domain Filtering Under Noisy and Reverberant Conditions
R.Gomez, T.Kawahara, K.Nakadai
APSIPA Trans. Signal & Information Process.   4(e3) 1-12   2015   [Refereed]

Misc

 
Improving articulatory attribute modeling based on multi-label training and label correction
Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji, Hiroaki Nanjo
日本音響学会研究発表会講演論文集   2-9-8    2018
Effective Articulatory Modeling for Pronunciation Error Detection
Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji
日本音響学会研究発表会講演論文集   2-P-30    2017
Language Independent Non-native Articulatory Modeling for Pronunciation Error Detection
Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji, Hiroaki Nanjo
日本音響学会研究発表会講演論文集   2-11-8    2017
Diversity-driven Semi-supervised Ensemble DNN Acoustic Model Training
Sheng Li, Xugang Lu, Shinsuke Sakai, Tatsuya Kawahara
電子情報通信学会技術研究報告   SP2016-40    2016
Emotion Recognition by Combining Prosody with Text Information and Assessment Selection for Human-Robot Interaction
Yuanchao Li, Koji Inoue, Shizuka Nakamura, Katsuya Takanashi, Carlos Toshinori Ishi, Tatsuya Kawahara
人工知能学会研究会資料   SLUD-B506-09    2017
Joint Learning of Dialog Act Segmentation and Recognition Using Neural Networks
Tianyu Zhao, Tatsuya Kawahara
情報処理学会研究報告   SLP-119-12    2017
Pronunciation Error Detection using DNN Articulatory Model based on Multi-lingual and Multi-task Learning
Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji
日本音響学会研究発表会講演論文集   3-Q-23    2016
Pronunciation Error Detection using DNN Articulatory Model based on Transfer Learning
Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji
電子情報通信学会技術研究報告   SP2016-39    2016
Incorporating divergences from hypotheses of multiple ASR systems to improve unsupervised acoustic model training
Sheng Li, Yuya Akita, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-P-23    2015
Effective Combination of Multiple ASR Hypotheses with CRF-based Classifiers
Sheng Li, Yuya Akita, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-Q-14    2015
Discriminative Data Selection from Multiple ASR Systems' Hypotheses for Unsupervised Acoustic Model Training
Sheng Li, Yuya Akita, Tatsuya Kawahara
情報処理学会研究報告   SLP-109-8    2015
Data Selection Assisted by Caption to Improve Acoustic Modeling for Lecture Transcription
Sheng Li, Yuya Akita, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   2-4-4    2014
Unsupervised Training of Deep Neural Network Acoustic Models for Lecture Transcription
Sheng Li, Yuya Akita, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-R-4    2014
Partial and Synchronized Caption Generation to Enhance the Listening Comprehension Skills of Second Language Learners
Maryam Sadat Mirzaei, Tatsuya Kawahara
情報処理学会研究報告   SLP-101-15    2014
Classifier-based Data Selection for Lightly-Supervised Training of Acoustic Model for Lecture Transcription
Sheng Li, Yuya Akita, Tatsuya Kawahara
情報処理学会研究報告   SLP-102-4    2014
Automatic transcription of Chinese spoken lectures
Sheng Li, Masato Mimura, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   2-P-31    2013
Wavelet Packet Decomposition-based Dereverberation for ASR
Randy Gomez, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-P-16    2012
Automatic Speech Recognition for TED Talks
Welly Naptali, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   3-P-4    2012
Automatic transcription of TED Talks
Welly Naptali, Tatsuya Kawahara
音声ドキュメント処理ワークショップ      2012
Wavelet Packet Decomposition Approach to Reverberant Speech Recognition
Randy Gomez, Tatsuya Kawahara
情報処理学会研究報告   SLP-92-11    2012
Comparison of Discriminative Models for Lexicon Optimization for ASR of Agglutinative Language
Mijit Ablimit, Tatsuya Kawahara, Askar Hamdulla
情報処理学会研究報告   SLP-92-13    2012
Wavelet Optimization using Noise Profiles for Noise-robust Speech Recognition
Randy Gomez, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   2-P-17    2011
Robust Speech Recognition in Noisy and Reverberant Conditions Using Wiener Filtering in the Wavelet Domain
Randy Gomez, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   2-Q-21    2011
Combining Slot-based Vector Space Models for Voice Book Search
Cheongjae Lee, Alexander Rudnicky, Tatsuya Kawahara
情報処理学会研究報告   SLP-85-5    2011
Robust Speech Recognition Using Optimized Wavelet Denoising with Noise Profiles
Randy Gomez, Tatsuya Kawahara
情報処理学会研究報告   SLP-85-12    2011
Collecting Speech Data using Amazon's Mechanical Turk for Evaluating Voice Search System
Cheongjae Lee, Tatsuya Kawahara, Alexander Rudnicky
情報処理学会研究報告   SLP-87-9    2011
Robust Speech Recognition in Noisy and Reverberant Environments Using Wavelet-based Wiener Filtering
Randy Gomez, Tatsuya Kawahara
情報処理学会研究報告   SLP-87-14    2011
Evaluation of Lexicon Optimization based on Discriminative Learning
Mijit Ablimit, Tatsuya Kawahara, Askar Hamdulla
情報処理学会研究報告   SLP-89-2    2011
Wavelet Filtering in ASR Robust to Noisy and Reverberant Environments
Randy Gomez, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-Q-2    2010
Wavelet Optimization for Robust Dereverberation in Automatic Speech Recognition
Randy Gomez, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-Q-8    2010
Robust Speech Recognition using Optimized Wavelet-based Dereverberation
Randy Gomez, Tatsuya Kawahara
情報処理学会研究報告   SLP-82-5    2010
Robust Speech Recognition Using Optimized Wavelet Filtering in Reverberant Conditions
Randy Gomez, Tatsuya Kawahara
人工知能学会研究会資料   Challenge-B002-4    2010
Unsupervised Optimization of Dereverberation Parameters based on the Likelihood of Speech Recognizer
Randy Gomez, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-P-15    2009
Using Online Free Energy for Model Comparison with Application to Voice Activity Detection
David Cournapeau, Shinji Watanabe, Atsushi Nakamura, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   2-5-14    2009
Robust Dereverberation Using Synthetically Generated Impulse Response for Speech Recognition
Randy Gomez, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-R-1    2009
Using Online Model Comparison in the Variational Bayes Framework - an Application to Voice Activity Detection
David Cournapeau, Shinji Watanabe, Atsushi Nakamura, Tatsuya Kawahara
情報処理学会研究報告   SLP-75-3    2009
Unsupervised Optimization of Dereverberation Parameters using Likelihood of Speech Recognizer
Randy Gomez, Tatsuya Kawahara
情報処理学会研究報告   SLP-75-4    2009
Speech Enhancement Optimization based on Acoustic Model Likelihood for Noisy and Reverberant Environment
Randy Gomez, Tatsuya Kawahara
人工知能学会研究会資料   Challenge-A902-9    2009
A Japanese CALL System for Practicing Sentence Patterns based on Dynamic Question Generation
Hongcui Wang, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-10-4    2008
A VAD Method using Online Variational Free Energy for Model Adaptation
David Cournapeau, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-10-7    2008
Speech Recognizer-based Optimization for Dereverberation Technique Using Multi-band Spectral Subtraction
Randy Gomez, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   3-Q-3    2008
An Application of Online VB-EM Algorithm to Voice Activity Detection
David Cournapeau, Tatsuya Kawahara, Shinji Watanabe, Atsushi Nakamura
日本音響学会研究発表会講演論文集   3-Q-11    2008
Optimizing Scoring System for a Japanese Tutor System
Hongcui Wang, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   3-Q-28    2008
Robust Speech Recognition in Reverberant Environment by Optimizing Multi-band Spectral Subtraction
Randy Gomez, Tatsuya Kawahara
人工知能学会研究会資料   Challenge-A802-4    2008
Real-time VAD Algorithm based on Enhanced Cumulant and On-line EM: Results on CENSREC-1-C
David Cournapeau, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   3-9-13    2007
Dynamic Situation Based Sentence Generation Used in Creating Questions for Students of Japanese
Christopher Waple, Yasushi Tsubota, Masatake Dantsuji, Tatsuya Kawahara
言語処理学会年次大会発表論文集   S2-7 799-802   2007
Using Bayesian Prior for Real-Time Voice Activity Detection
David Cournapeau, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   1-P-30    2007
An Approach to Error Analysis for Effective Prediction in ASR for a Japanese Tutor System
Hongcui Wang, Tatsuya Kawahara
日本音響学会研究発表会講演論文集   2-8-5    2007
Decision Tree based Error Analysis for Effective Prediction in ASR for Japanese CALL system
Hongcui Wang, Tatsuya Kawahara
電子情報通信学会技術研究報告   SP2007-121    2007
Using Variational Bayes Free Energy for Noise Robust Online Voice Activity Detection
David Cournapeau, Tatsuya Kawahara
電子情報通信学会技術研究報告   SP2007-131    2007
Robust Voice Activity Detection Based on Enhanced Cumulant of LPC Residual and On-line EM Algorithm
David Cournapeau, Tatsuya Kawahara
情報処理学会研究報告   SLP-62-3    2006
Trigger-Based Language Model Adaptation for Automatic Transcription of Panel Discussions
C.Troncoso, T.Kawahara
日本音響学会研究発表会講演論文集   1-5-22    2005
Effect of Dialogue Context and Topic Clustering on Out-of-Domain Detection
I.R.Lane, T.Kawahara, S.Nakamura
日本音響学会研究発表会講演論文集   2-5-1    2005
Enhancement to Initial Transcription-Based Trigger Language Model Adaptation
C.Troncoso, T.Kawahara
日本音響学会研究発表会講演論文集   2-1-2    2005
Detection of Speech Recognition Errors using In-domain Confidence and Discourse Coherence Measures
I.R.Lane, T.Kawahara
日本音響学会研究発表会講演論文集   2-1-10    2005
Automatic Transcription of Panel Discussions using Trigger-based Language Model Adaptation
Carlos Troncoso, Tatsuya Kawahara
情報処理学会研究報告   SLP-57-3    2005
Incorporating In-domain Confidence and Discourse Coherence Measures in Utterance Verification
Ian Lane, Tatsuya Kawahara
情報処理学会研究報告   SLP-57-7    2005
Investigation of Classification Modeling for Out-Of-Domain Utterance Detection
I.R.Lane, T.Kawahara, T.Matsui, S.Nakamura
日本音響学会研究発表会講演論文集   3-8-2    2004
Out-Of-Domain Utterance Detection in Dialogue via Speech-to-Speech Translation
I.R.Lane, T.Kawahara, T.Matsui, S.Nakamura
日本音響学会研究発表会講演論文集   3-1-23    2004
Trigger-Based Language Model Construction by Combining Different Corpora
Carlos Troncoso, Tatsuya Kawahara, Hirofumi Yamamoto, Genichiro Kikui
電子情報通信学会技術研究報告   SP2004-100    2004
Out-of-Domain Detection Incorporating Dialogue Context and Topic Clustering
Ian Lane, Tatsuya Kawahara, Satoshi Nakamura
電子情報通信学会技術研究報告   SP2004-130    2004
Out-Of-Domain Detection based on Verification for Multi-Domain Dialogue Speech
I.R.Lane, T.Kawahara, T.Matsui, S.Nakamura
日本音響学会研究発表会講演論文集   2-6-8    2003
Out-of-Domain Utterance Detection based on Confidence Measures from Multiple Topic Classification
Ian Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura
電子情報通信学会技術研究報告   SP2003-159    2003
Intelligibility Assessment and Pronunciation Error Diagnosis for a CALL System
Antoine Raux, Tatsuya Kawahara
電子情報通信学会技術研究報告   SP2001-134    2002
Language model switching based on topic detection for dialog speech recognition
Ian Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura
電子情報通信学会技術研究報告   SP2002-145    2002
Speaking-Style Dependent Lexicalized Filler Model for Key-Phrase Detection and Verification
Tatsuya Kawahara, Shuji Doshita, Chin-Hui Lee
電子情報通信学会技術研究報告   SP97-78    1997
Prosodic Analysis of Various Disfluencies in Japanese
Felix Quimbo, Tatsuya Kawahara, Shuji Doshita
人工知能学会研究会資料   SIG-SLUD-9503-9    1996
On Integration of Multiple Knowledge Sources for Spoken Language Understanding
T.Kawahara
RWC Workshop on Information Integration   208-216   1995
Unsupervised Speaker Normalization by Speaker Markov Model Converter for Speaker-Independent Speech Recognition Systems
P.Fung, T.Kawahara, S.Doshita
Proceedings of the IPSJ National Convention   6D-1    1991

Books etc

 
Springer Handbook on Speech Processing and Speech Communication
Sadaoki Furui and Tatsuya Kawahara (Part: Contributor, Chapter 32: Transcription and distillation of spontaneous speech)
Springer   2008   
Spoken Language Systems
Seiichi Nakagawa, Michio Okada, and Tatsuya Kawahara, editors
Ohmsha/IOS Press   2005   

Conference Activities & Talks

 
Captioning Software using Automatic Speech Recognition (ASR)
KAWAHARA Tatsuya
Intersteno Conference   16 Jul 2019   
Dialogue Models and Systems: From Research Labs, to the Cloud to Your Living Room --Human-like Dialogue with Robot [Invited]
KAWAHARA Tatsuya
IEEE Int'l Workshop on Spoken Language Technology (SLT)   19 Dec 2018   
Human-like conversational robot [Invited]
KAWAHARA Tatsuya
APSIPA ASC   14 Nov 2018   
Spoken dialogue for a human-like conversational robot ERICA [Invited]
KAWAHARA Tatsuya
International Workshop on Spoken Dialogue Systems Technology (IWSDS 2018)   14 May 2018   
Automatic Meeting Transcription System for the Japanese Parliament (Diet) [Invited]
KAWAHARA Tatsuya
APSIPA ASC   14 Dec 2017   
Computer-Assisted Language Learning (CALL) using speech technology [Invited]
KAWAHARA Tatsuya
Seoul International Conference on Speech Sciences (SICSS2017)   11 Nov 2017   
Modeling difficulties of second language learners using speech technology [Invited]
KAWAHARA Tatsuya
Seoul International Conference on Speech Sciences (SICSS2017)   10 Nov 2017   
What makes a quality transcript in Parliamentary reporting
KAWAHARA Tatsuya
Intersteno Conference   25 Jul 2017   
Speech Understanding for Intelligent Conversational Agent [Invited]
KAWAHARA Tatsuya
Microsoft Research Asia Faculty Summit   4 Nov 2016   
Captioning Lectures with Automatic Speech Recognition (ASR)
KAWAHARA Tatsuya
Intersteno Conference   21 Jul 2015   
Recent Paradigm Shift in Speech Recognition [Invited]
KAWAHARA Tatsuya
Kyoto Univ - Inamori Foundation Joint Kyoto Prize Symposium   13 Jul 2014   
Smart Posterboard: Multi-modal Sensing and Analysis of Poster Conversations [Invited]
KAWAHARA Tatsuya
APSIPA ASC   30 Oct 2013   
Subtitling Lecture Videos with Automatic Speech Recognition
KAWAHARA Tatsuya
Intersteno Conference   16 Jul 2013   
Transcription System using Automatic Speech Recognition for the Japanese Parliament (Diet)
KAWAHARA Tatsuya
AAAI/IAAI   26 Jul 2012   
Multi-modal Sensing and Analysis of Poster Conversations toward Smart Posterboard [Invited]
KAWAHARA Tatsuya
SIGdial Meeting Discourse & Dialogue   20 Jul 2012   
New Transcription System using Automatic Speech Recognition (ASR) in the Japanese Parliament (Diet)
KAWAHARA Tatsuya
Intersteno Conference   14 Jul 2011   
Automatic Transcription of Parliamentary Meetings and Classroom Lectures -- A Sustainable Approach and Real System Evaluations -- [Invited]
KAWAHARA Tatsuya
Int'l Sympo. Chinese Spoken Language Processing (ISCSLP)   3 Dec 2010   
New Perspectives on Spoken Language Understanding: Does Machine Need to Fully Understand Speech? [Invited]
KAWAHARA Tatsuya
IEEE Workshop Automatic Speech Recognition & Understanding (ASRU)   16 Dec 2009   
Transcription System using Automatic Speech Recognition (ASR) for the Japanese Parliament (Diet)
KAWAHARA Tatsuya
Intersteno Conference   19 Aug 2009   

Research Grants & Projects

 
Ministry of Education, Culture, Sports, Science and Technology: Grants-in-Aid for Scientific Research (Scientific Research (A))
Project Year: 2004 - 2006    Investigator(s): Tatsuya KAWAHARA
We investigated automatic speech recognition and post-processing of the transcripts of oral presentations at academic meetings, lectures at universities, and discussions on TV programs and in parliaments. In these kinds of spontaneous speech, there is...
Ministry of Education, Culture, Sports, Science and Technology: Grants-in-Aid for Scientific Research (Scientific Research (B))
Project Year: 2000 - 2002    Investigator(s): Tatsuya KAWAHARA
Automatic transcription of lectures is addressed using the corpus of spontaneous Japanese collected under the priority research project in Japan. First, we investigate the effect of speaking style and data amount for acoustic modeling. Then, to co...
Ministry of Education, Culture, Sports, Science and Technology: Grants-in-Aid for Scientific Research (Scientific Research (B))
Project Year: 1999 - 2001    Investigator(s): Tatsuya KAWAHARA
A Computer-Assisted Language Learning (CALL) system focusing on pronunciation training is studied for English learning by Japanese students. First, we model typical English pronunciation errors of Japanese learners and design a system that detects pro...