ITO Akinori

J-GLOBAL         Last updated: Sep 19, 2019 at 12:41
 
Avatar
Name
ITO Akinori
E-mail
aitospcom.ecei.tohoku.ac.jp
URL
http://db.tohoku.ac.jp/whois/e_detail/12e971e9540a73accb8e746dcce9d0c2.html
Affiliation
Tohoku University
Section
Graduate School of Engineering Department of Communications Engineering Intelligent Communication Network Engineering Intelligent Communication Network
Job title
Professor
Degree
Dr. Eng.(Tohoku University)
Research funding number
70232428
Twitter ID
akinori_ito
ORCID ID
0000-0002-8835-7877

Research Areas

 
 

Academic & Professional Experience

 
Apr 2010
 - 
Today
Professor, Graduate School of Engineering, Tohoku University
 
Apr 2002
 - 
Mar 2010
Associate Professor, Graduate School of Engineering, Tohoku University
 
Oct 1999
 - 
Mar 2002
Associate Professor, Faculty of Engineering, Yamagata University
 
May 1998
 - 
Apr 1999
Visiting Scholar, College of Engineering, Boston University
 
Apr 1995
 - 
Sep 1999
Lecturer, Faculty of Engineering, Yamagata University
 

Education

 
 
 - 
Mar 1991
Department of Information Engineering, Graduate School, Division of Engineering, Tohoku University
 
 
 - 
Mar 1986
Department of Commumication Engineering, Faculty of Engineering, Tohoku University
 

Committee Memberships

 
Jun 2009
 - 
Today
日本音響学会  理事
 
May 2009
 - 
Today
情報処理学会 音楽情報科学研究会  運営幹事
 
Apr 2009
 - 
Today
Journal of Information Hiding and Multimedia Signal Processing  Associate Editor
 
May 2007
 - 
Today
日本音響学会  評議員
 
Sep 2005
 - 
Today
日本音響学会 電子化推進委員会  委員
 

Awards & Honors

 
Oct 2008
Best Paper Award of International Conference on Natural Language Processing and Knowledge Engineering, Organizing Committee of International Conference on Natural Language Processing and Knowledge Engineering
Winner: Tomoaki Konno, Masashi Ito, Motoyuki Suzuki, Akinori Ito, Shozo Makino
 
Nov 2007
Best Paper Award of International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Organizing Committee of International Conference on Intelligent Information Hiding and Multimedia Signal Processing
Winner: Akinori Ito, Shozo Makino
 
Jul 2007
Best Paper Award of The 5th International Conference on Education and Information Systems, Technologies and Applications, Organizing Committee of The 5th International Conference on Education and Information Systems, Technologies and Applications
Winner: Motoyuki Suzuki, Tatsuki Konno, Akinori Ito, Shozo Makino
 
Nov 2003
音声言語処理に関する研究, 石田(實)記念財団研究奨励賞, 石田(實)記念財団
 
Jun 2000
ソフトウェア“w3m”の開発, Open Software Prize, 電子ネットワーク協議会
 

Published Papers

 
Hafiyan Prafianto, Takashi Nose, Yuya Chiba, Akinori Ito
Speech Communication   111 14   Aug 2019   [Refereed]
Hiroto Aoyama, Takashi Nose, Yuya Chiba, Akinori Ito
Smart Innovation, Systems and Technologies   110 140-148   Jan 2019   [Refereed]
© Springer Nature Switzerland AG 2019. In order to synthesize more natural speech with Japanese text-to-speech systems, we improve accent sandhi rules. The conventional Japanese accent sandhi rules lack rules related to numerals and counter words ...
Akinori Ito
Smart Innovation, Systems and Technologies   110 82-89   Jan 2019   [Refereed]
Akinori Ito
Smart Innovation, Systems and Technologies   110 74-81   Jan 2019   [Refereed]
A Study on a Spoken Dialogue System with Cooperative Emotional Speech Synthesis Using Acoustic and Linguistic Information
Mai Yamanaka,Yuya Chiba,Takashi Nose,Akinori Ito
Smart Innovation, Systems and Technologies   110 101-108   Jan 2019   [Refereed]
DNN-Based Talking Movie Generation with Face Direction Consideration
Toru Ishikawa, Takashi Nose, Akinori Ito
Smart Innovation, Systems and Technologies   110 157-164   Jan 2019   [Refereed]
Two-Stage Sequence-to-Sequence Neural Voice Conversion with Low-to-High Definition Spectrogram Mapping
Sou Miyamoto, Takashi Nose, Kazuyuki Hiroshiba, Yuri Odagiri, Akinori Ito
Smart Innovation, Systems and Technologies   110 132-139   Jan 2019   [Refereed]
Melody Completion Based on Convolutional Neural Networks and Generative Adversarial Learning
Kosuke Nakamura, Takashi Nose, Yuya Chiba, Akinori Ito
Smart Innovation, Systems and Technologies   110 116-123   Jan 2019   [Refereed]
Segmental Pitch Control Using Speech Input Based on Differential Contexts and Features for Customizable Neural Speech Synthesis
Shinya Hanabusa, Takashi Nose, Akinori Ito
Smart Innovation, Systems and Technologies   110 124-131   Jan 2019   [Refereed]
Comparison of Speech Recognition Performance Between Kaldi and Google Cloud Speech API
Takashi Kimura, Takashi Nose, Shinji Hirooka, Yuya Chiba, Akinori Ito
Smart Innovation, Systems and Technologies   110 109-115   Jan 2019   [Refereed]
Evaluation of English Speech Recognition for Japanese Learners Using DNN-Based Acoustic Models
Jiang Fu, Yuya Chiba, Takashi Nose, Akinori Ito
Smart Innovation, Systems and Technologies   110 93-100   Jan 2019   [Refereed]
Yuko Nakamori, Yutaka Hiroi, Akinori Ito
ROBOMECH Journal   5 25   Dec 2018   [Refereed]
© 2018, The Author(s). We are developing a robot that can play an outdoor game with children. In realizing such a robot, the person detection and tracking methods play an important role. In this paper, we propose methods for improving person detec...
Effect of mutual self-disclosure in spoken dialog system on user impression
Shunsuke Tada, Yuya Chiba, Takashi Nose, Akinori Ito
Proceedings of 2018 APSIPA-ASC   806-810   Nov 2018   [Refereed]
Yuya Chiba,Takashi Nose,Taketo Kase,Mai Yamanaka,Akinori Ito
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, Melbourne, Australia, July 12-14, 2018   371-375   Jul 2018   [Refereed]
Yukiko Kageyama,Yuya Chiba,Takashi Nose,Akinori Ito
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, Melbourne, Australia, July 12-14, 2018   235-240   Jul 2018   [Refereed]
Ryo Masumura, Taichi Asami, Takanobu Oba, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
IEICE Transactions on Information and Systems   E101D(6) 1581-1590   Jun 2018   [Refereed]
Copyright © 2018 The Institute of Electronics, Information and Communication Engineers. This paper proposes a novel domain adaptation method that can utilize out-of-domain text resources and partially domain matched text resources in language mode...
Yukiko Kageyama, Yuya Chiba, Takashi Nose, and Akinori Ito
IAENG International Journal of Computer Science   45(2) 285-293   May 2018   [Refereed]
Yuya Chiba, Takashi Nose, Akinori Ito
Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017   2018-February 428-431   Feb 2018   [Refereed]
© 2017 IEEE. A dialog system can select a more favorable action to a user by estimating the user's internal state. In this paper, we introduce the user's willingness to talk, whether the user wants to talk about a topic or to answer a question pos...
Yuko Nakamori, Yutaka Hiroi, Akinori Ito
SII 2017 - 2017 IEEE/SICE International Symposium on System Integration   2018-January 494-499   Feb 2018   [Refereed]
© 2017 IEEE. We are developing a robot that can play Darumasan-ga-koronda game (similar to "Red light, green light" game) with human players. We have developed a method to detect and track the players, to determine whether the players are moving a...
Akinori Ito
Proceedings of the 2018 International Conference on Intelligent Information Technology   45-49   Feb 2018   [Refereed]
Haoran Wu, Yuya Chiba, Takashi Nose, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2018-September 1746-1750   Jan 2018   [Refereed]
© 2018 International Speech Communication Association. All rights reserved. English proficiency is important for communication in English. Computer-Assisted Language Learning (CALL) systems are introduced to provide a convenient and low-cost langu...
Hafiyan Prafiyanto, Takashi Nose, Yuya Chiba, Akinori Ito
Acoustical Science and Technology   39 92-100   Jan 2018   [Refereed]
© 2018 The Acoustical Society of Japan. We investigate the effect of speaking rate and pauses on the perception of spoken Easy Japanese, which is Japanese language with mostly easy words to facilitate understanding by non-native speakers. In this ...
Ryo Masumura,Taichi Asami,Takanobu Oba,Hirokazu Masataki,Sumitaka Sakauchi,Akinori Ito
IEICE Transactions   101-D(6) 1581-1590   2018   [Refereed]
Haoran Wu,Yuya Chiba,Takashi Nose,Akinori Ito
Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, Hyderabad, India, 2-6 September 2018.   1746-1750   2018   [Refereed]
Akinori Ito
IEICE Transactions   101-D(1) 1   2018   [Refereed]
Sato, K. and Nose, T. and Ito, A. and Chiba, Y. and Ito, A. and Shinozaki, T.
Smart Innovation, Systems and Technologies   82 113-118   2018   [Refereed]
Miyamoto, S. and Nose, T. and Ito, S. and Koike, H. and Chiba, Y. and Ito, A. and Shinozaki, T.
Smart Innovation, Systems and Technologies   82 97-103   2018   [Refereed]
Yamada, Y. and Nose, T. and Chiba, Y. and Ito, A. and Shinozaki, T.
Smart Innovation, Systems and Technologies   82 91-96   2018   [Refereed]
Nakamura, K. and Chiba, Y. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   82 104-111   2018   [Refereed]
Miyagawa, I. and Chiba, Y. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   82 130-136   2018   [Refereed]
Tada, S. and Chiba, Y. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   82 84-90   2018   [Refereed]
Mori, H. and Chiba, Y. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   82 77-83   2018   [Refereed]
Akinori Ito
Journal of Information Hiding and Multimedia Signal Processing   8 1325-1334   Nov 2017   [Refereed][Invited]
© 2017, Ubiquitous International. All rights reserved. This paper describes methods that add values to audio signals using side information. Many acoustic signal processing methods have been proposed for estimating the lost information from the or...
Akinori Ito, Yuto Sasaki
Journal of Information Hiding and Multimedia Signal Processing   8 1372-1381   Nov 2017   [Refereed]
© 2017, Ubiquitous International. All rights reserved. We propose a system that enables a listener of streaming audio to control the volume (magnitude of the signal) of independent part (specifically the vocal signal) in a mixed audio signal in re...
Kazuki Sato, Takashi Nose, Akinori Ito
Journal of Computer and Communications   5(10) 55-65   Aug 2017   [Refereed]
Kohei Morishita, Yutaka Hiroi, Akinori Ito
Journal of Robotics   2017 1   Jan 2017   [Refereed]
© 2017 Kohei Morishita et al. A life-support service robot must avoid both static and dynamic obstacles for working in a real environment. Here, a static obstacle means an obstacle that does not move, and a dynamic obstacle is the one that moves. ...
Yuko Nakamori,Yutaka Hiroi,Akinori Ito
IEEE/SICE International Symposium on System Integration, SII 2017, Taipei, Taiwan, December 11-14, 2017   494-499   2017   [Refereed]
Kageyama, Y. and Chiba, Y. and Nose, T. and Ito, A.
Communications in Computer and Information Science   713 458-464   2017   [Refereed]
Yuya Chiba,Takashi Nose,Akinori Ito
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017   428-431   2017   [Refereed]
Akinori Ito
IEICE Transactions   100-D(1) 1   2017   [Refereed]
Chiba, Y. and Nose, T. and Ito, A.
Journal on Multimodal User Interfaces   11(2) 185-196   2017   [Refereed]
Yamada, S. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   63 159-166   2017   [Refereed]
Nagano, T. and Prafianto, H. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   64 221-228   2017   [Refereed]
Sato, K. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   64 29-36   2017   [Refereed]
Chiba, Y. and Ito, A.
Lecture Notes in Electrical Engineering   999 LNEE 411-419   2017   [Refereed]
Ito, A.
Smart Innovation, Systems and Technologies   63 3-10   2017   [Refereed]
Ito, A.
European Signal Processing Conference   106-109   Nov 2016   [Refereed]
Takeishi, E. and Nose, T. and Chiba, Y. and Ito, A.
2016 Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2016   16-21   Oct 2016   [Refereed]
Ito, A.
2016 IEEE International Conference on Signal and Image Processing, ICSIP 2016   726-730   Aug 2016   [Refereed]
Prafianto, H. and Nose, T. and Ito, A.
ICALIP 2016 - 2016 International Conference on Audio, Language and Image Processing - Proceedings   208-212   Jul 2016   [Refereed]
発話状態推定に基づく協調的感情音声合成による音声対話システムの評価
加瀬嵩人,能勢隆,千葉祐弥,伊藤彰則
電子情報通信学会誌A   J199-A(1) 25-35   Jan 2016   [Refereed]
Yuya Chiba,Akinori Ito
Dialogues with Social Robots - Enablements, Analyses, and Evaluation, Seventh International Workshop on Spoken Dialogue Systems, IWSDS 2016, Saariselkä, Finland, January 13-16, 2016   411-419   2016   [Refereed]
Masumura, R. and Asami, T. and Oba, T. and Masataki, H. and Sakauchi, S. and Ito, A.
IEICE Transactions on Information and Systems   E99D(10) 2452-2461   2016   [Refereed]
Array,Yosuke Nakamura,Akinori Ito,Motonobu Kawashima,Taichi Watanabe,Yoshihiro Kishimoto,Kunio Kondo
Computers & Graphics   61 1-10   2016   [Refereed]
Influence of the height of a robot on comfortableness of verbal interaction
Hiroi, Y. and Ito, A.
IAENG International Journal of Computer Science   43(4) 447-455   2016   [Refereed]
Estimating the user's state before exchanging utterances using intermediate acoustic features for spoken dialog systems
Chiba, Y. and Nose, T. and Ito, M. and Ito, A.
IAENG International Journal of Computer Science   43(1) 1-9   2016   [Refereed]
Automatic Generation of Proper Noun Entries in a Speech Recognizer for Local Information Recognition
Kenta Shiga, Takashi Nose, Akinori Ito, Ryo Masumura, Hirokazu Masataki
Proceedings of 12th Western Pacific Acoustics Conference      Dec 2015   [Refereed]
Investigation of Pause Insertion Effect in Spoken Easy Japanese for Non-Native Listeners
Hafiyan Prafianto, Takeshi Nagano, Takashi Nose, Akinori Ito
Proceedings of 12th Western Pacific Acoustics Conference   507-511   Dec 2015   [Refereed]
YANSIS: An “Easy Japanese” writing support system
Takeshi Nagano, Akinori Ito
Proceedings of 8th International Conference ICT for Language Learning      Nov 2015   [Refereed]
A Computer-Assisted English Conversation Training System for Response-Timing-Aware Oral Conversation Exercise
Naoto Suzuki, Yutaka Hiroi, Yuya Chiba, Takashi Nose, Akinori Ito
情報処理学会論文誌   56(11) 2177-2189   Nov 2015   [Refereed]
Sakai, K. and Hiroi, Y. and Ito, A.
Proceedings - 2015 3rd International Conference on Robot, Vision and Signal Processing, RVSP 2015   1-4   Nov 2015   [Refereed]
Saito, Y. and Nose, T. and Shinozaki, T. and Ito, A.
Proceedings - 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015   433-436   Sep 2015   [Refereed]
Nishino, T. and Nose, T. and Ito, A.
Proceedings - 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015   146-149   Sep 2015   [Refereed]
Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts
Takashi Nose, Yusuke Arao, Takao Kobayashi, Komei Sugiura, Yoshinori Shiga, Akinori Ito
Proceedings of 16th Annual Conference of the International Speech Communication Association   3491-3495   Sep 2015   [Refereed]
Yuma Fujiwara, Yutaka Hiroi, Yuki Tanaka, Akinori Ito
Proceedings of IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)   413-418   Aug 2015   [Refereed]
Investigation of Precision of Human Perception of Pointing Gesture and a Method for Precision Improvement
廣井 富,伊藤 彰則
情報処理学会論文誌   56(8) 1634-1645   Aug 2015   [Refereed]
Taketo Kase, Takashi Nose, Akinori Ito
Proceedings of HCI International 2015   588-593   Aug 2015   [Refereed]
Taketo Kase, Takashi Nose, Akinori Ito
Communications in Computer and Information Science   528 747-752   Jan 2015   [Refereed]
© Springer International Publishing Switzerland 2015. Paralinguistic features such as emotion of an utterance is as important as its linguistic content for generating better response utterances in spoken dialog systems. In this research, we carrie...
Ryo Masumura, Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2015-January 2380-2384   Jan 2015   [Refereed]
Copyright © 2015 ISCA. This paper proposes a novel language modeling approach called latent word recurrent neural network language model, which solves the problems present in both recurrent neural network language models (RNNLMs) and latent word l...
Ryo Masumura, Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2015-January 463-467   Jan 2015   [Refereed]
Copyright © 2015 ISCA. This paper demonstrates combinations of various language model (LM) technologies simultaneously, not only modeling techniques but also those for training data expansion based on external language resources and unsupervised a...
Ryo Masumura, Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing   1896-1901   Jan 2015   [Refereed]
© 2015 Association for Computational Linguistics. This paper focuses on language modeling with adequate robustness to support different domain tasks. To this end, we propose a hierarchical latent word language model (h-LWLM). The proposed model ca...
Takashi Nose, Yusuke Arao, Takao Kobayashi, Komei Sugiura, Yoshinori Shiga, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2015-January 3491-3495   Jan 2015   [Refereed]
Copyright © 2015 ISCA. This paper proposes a sentence selection method using a maxi- mum entropy criterion to construct recording scripts for speech synthesis. In the conventional corpus design of speech syn- thesis, a greedy algorithm that maximi...
Ryo Masumura,Taichi Asami,Takanobu Oba,Hirokazu Masataki,Sumitaka Sakauchi,Akinori Ito
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, Lisbon, Portugal, September 17-21, 2015   1896-1901   2015   [Refereed]
Taketo Kase,Takashi Nose,Akinori Ito
HCI International 2015 - Posters' Extended Abstracts - International Conference, HCI International 2015, Los Angeles, CA, USA, August 2-7, 2015. Proceedings, Part I   747-752   2015   [Refereed]
Tsukasa Nishino,Takashi Nose,Akinori Ito
2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015, Adelaide, Australia, September 23-25, 2015   146-149   2015   [Refereed]
Yuki Saito,Takashi Nose,Takahiro Shinozaki,Akinori Ito
2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015, Adelaide, Australia, September 23-25, 2015   433-436   2015   [Refereed]
Akinori Ito,Kengo Watanabe,Genki Kuroda,Kenichiro Ito
Looking Back, Looking Forward: Proceedings of the 41st International Computer Music Conference, ICMC 2015, Denton, TX, USA, September 25 - October 1, 2015      2015   [Refereed]
Kohei Machida, Takashi Nose, Akinori Ito
Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)      Dec 2014   [Refereed]
Akinori Ito, Yuto Sasaki
Proceedings of International Conference on Signal Processing   605-609   Oct 2014   [Refereed]
Analysis of spectral enhancement using global variance in HMM-based speech synthesis
Takashi Nose, Akinori Ito
Proceedings of Interspeech      Sep 2014   [Refereed]
Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling
Tomoki Koriyama, Hiroshi Suzuki, Takashi Nose, Takahiro Shinozaki, Akinori Ito
Proceedings of Interspeech      Sep 2014   [Refereed]
Kazumichi Yoshida, Takashi Nose, Akinori Ito
Proceedings of International Conference on Intelligent Information Hiding and Multimedia Signal Processing      Aug 2014   [Refereed]
Akinori Ito
Proceedings of International Conference on Intelligent Information Hiding and Multimedia Signal Processing   2014   Aug 2014   [Refereed]
Keisuke Sakai, Yutaka Hiroi, Akinori Ito
Proceedings of International Symposium on Robotics and Application      Aug 2014   [Refereed]
Yuya Chiba, Masashi Ito, Takashi Nose, Akinori Ito
Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue      Jul 2014   [Refereed]
TEMPO MODIFICATION OF MUSIC SIGNAL USING SINUSOIDAL MODEL AND LPC-BASED RESIDUE MODEL
Akinori Ito, Yuki Igarashi, Masashi Ito, Takashi Nose
Proceedings of International Congress on Sound and Vibration      Jul 2014   [Refereed]
Hafiyan Prafianto, Takashi Nose, Yuya Chiba, Akinori Ito, Kazuyuki Sato
International Conference on Audio, Language and Image Processing      Jul 2014   [Refereed]
Masahito Okamoto, Takashi Nose, Akinori Ito, Takeshi Nagano
International Conference on Audio, Language and Image Processing      Jul 2014   [Refereed]
Noriko Totsuka, Yuya Chiba, Takashi Nose, Akinori Ito
International Conference on Audio, Language and Image Processing      Jul 2014   [Refereed]
User Modeling by Using Bag-of-Behaviors for Building a Dialog System Sensitive to the Interlocutor’s Internal State
Yuya Chiba, Takashi Nose, Akinori Ito, Masashi Ito
Proceedings of 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue   74   Jun 2014   [Refereed]
Yutaka Hiroi, Akinori Ito
Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction   141-142   Mar 2014   [Refereed]
Modeling User's State During Dialog Turn Using HMM For Multi-modal Spoken Dialog System
Yuya Chiba, Masashi Ito, Akinori Ito
Proceedings of The Seventh International Conference on Advances in Computer-Human Interactions   343-346   Mar 2014   [Refereed]
Ryunosuke Daido, Masashi Ito, Shozo Makino, Akinori Ito
Computer Speech and Language   28(2) 501-517   Mar 2014   [Refereed]
Evaluation of singing skill is a popular function of karaoke machines. Here, we introduce a different aspect of evaluating the singing voice of an amateur singer: “singing enthusiasm”. First, we investigated whether human listeners can evaluate si...
Takashi Nose, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2917-2921   Jan 2014   [Refereed]
Copyright © 2014 ISCA. This paper analyzes the problem of the spectral enhancement technique using global variance (GV) in HMM-based speech synthesis. In the conventional GV-based parameter generation, spectral enhancement with variance compensati...
Yuya Chiba, Takashi Nose, Akinori Ito, Masashi Ito
SIGDIAL 2014 - 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference   74-78   Jan 2014   [Refereed]
© 2014 Association for Computational Linguistics. When using spoken dialog systems in actual environments, users sometimes abandon the dialog without making any input utterance. To help these users before they give up, the system should know why t...
Akinori Ito, Yuki Igarashi, Masashi Ito, Takashi Nose
21st International Congress on Sound and Vibration 2014, ICSV 2014   1 928-935   Jan 2014   [Refereed]
Changing tempo of the music signal is one of the most basic signal processing applied to music signals. Traditional algorithms such as phase vocoder and PSOLA uniformly stretch and shrink the input signal. Therefore, those methods change not only ...
Akinori Ito
Proceedings - 2014 10th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2014   558-561   Jan 2014   [Refereed]
© 2014 IEEE. Singing enthusiasm is a new concept to evaluate singing voice, and the perceived enthusiasm have been shown to be able to be estimated accurately. However, the intended enthusiasm is still difficult to estimate. In this paper, methods...
Yuya Chiba, Akinori Ito, Masashi Ito
ACHI 2014 - 7th International Conference on Advances in Computer-Human Interactions   343-346   Jan 2014   [Refereed]
Copyright © IARIA, 2014. Conventional spoken dialog systems cannot estimate the user's state while waiting for an input from the user because the estimation process is triggered by observing the user's utterance. This is a problem when, for some r...
Takeshi Nagano, Akinori Ito
Journal of Information Hiding and Multimedia Signal Processing   5(2) 285-294   2014   [Refereed]

Misc

 
Akinori Ito
The Journal of the Acoustical Society of Japan   66(1) 32-35   Jan 2010
Akinori Ito, Takuya Kuraishi, Masashi Ito, Shozo Makino
APSIPA ASC 2009 - Asia-Pacific Signal and Information Processing Association 2009 Annual Summit and Conference   453-456   Dec 2009
In this paper, we propose a method for multiple description coding (MDC) of Flash Video stream (FLV). Our target codec of FLV is Sorenson H.263. Conventional MDC methods had disadvantages that they required large redundancy. We proposed a method t...
Akinori Ito, Tomoaki Konno, Masashi Ito, Shozo Makino
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   596-599   Nov 2009
In this paper, we proposed a novel method for evaluating intonation of an English utterance spoken by a learner for intonation learning by a CALL system. The proposed method is based on an intonation evaluation method proposed by Suzuki et al., wh...
Masashi Ito, Keiji Ohara, Akinori Ito, Masafumi Yano
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   124-127   Nov 2009
Three psycho-acoustical experiments were carried out to investigate relative importance of formant frequency and whole spectral shape as cues for vowel perception. Four types of vowel-like signals were presented to eight listeners. The mean respon...
Motoyuki Suzuki, Daisuke Honma, Akinori Ito, Shozo Makino
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   1399-1402   Nov 2009
The triphone model is frequently used as an acoustic model. It is effective for modeling phonetic variations caused by coarticulation. However, it is known that acoustic features of phonemes are also affected by other factors such as speaking styl...
Masaharu Kato, Tetsuo Kosaka, Akinori Ito, Shozo Makino
IAENG International Journal of Computer Science   36    Nov 2009
Topic-based stochastic models such as the probabilistic latent semantic analysis (PLSA) are good tools for adapting a language model into a specific domain using a constraint of global context. A probability given by a topic model is combined with...
この曲、何だっけ? 歌で音楽を探す「歌声検索」
伊藤彰則,鈴木基之,牧野正三
DTM Magazine   16(11) 100-101   Nov 2009
Motoyuki Suzuki, Takuto Ichikawa, Akinori Ito, Shozo Makino
Journal of Information Processing   17 95-105   Jan 2009
© 2009 Information Processing Society of Japan. This paper describes a query-by-humming (QbH) music information retrieval (MIR) system based on a novel tonal feature and statistical modeling. Most QbH-MIR systems use a pitch extraction method in o...
Seongjun Hahm, Akinori Ito, Shozo Makino, Motoyuki Suzuki
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   1221-1224   Dec 2008
We propose a fast speaker adaptation method using an aspect model. The performance of speaker independent (SI) model is very sensitive to environments such as microphones, speakers, and noises. Speaker adaptation techniques try to obtain near spea...
Akinori Ito, Shozo Makino
Journal of Digital Information Management   6 189-195   Dec 2008
In this paper, we discuss a method of splitting one audio stream into two equal-quality streams and recover the original audio stream from only one of the split streams. From a mathematical consideration, it is found that the sum of errors of two ...
Akinori Ito, Ryohei Tsutsui, Shozo Makino, Motoyuki Suzuki
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2819-2822   Dec 2008
Our goal is to develop a voice-interactive CALL system which enables language learners to practice words, phrases, and grammars interactively. Such a system must be able to recognize learner's utterances correctly. To enable the recognition of utt...
Akinori Ito, Toyomi Meguro, Shozo Makino, Motoyuki Suzuki
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   207-210   Dec 2008
This paper describes a method used to determine if a specific word is related to a certain spoken dialog task. In most ordinary spoken dialog systems, only the words that are actually used to achieve the task are included in the vocabulary. Theref...
Motoyuki Suzuki, Naoto Kuriyama, Akinori Ito, Shozo Makino
2008 International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2008      Dec 2008
PLSA is one of the most powerful language models for adaptation to a target speech. The vocabulary divided PLSA language model (VD-PLSA) shows higher performance than the conventional PLSA model because it can be adapted to the target topic and th...
Yutaka Hiroi, Akinori Ito
IEEE/ASME International Conference on Advanced Intelligent Mechatronics, AIM   546-551   Sep 2008
Human symbiosis service robots of various sizes have already been developed. However, few quantitative investigations have been made concerning the influence of the size of a robot on a user's impression. We focused on the height of a robot (robot...
Motoyuki Suzuki, Tatsuki Konno, Akinori Ito, Shozo Makino
IMSCI 2007 - International Multi-Conference on Society, Cybernetics and Informatics, Proceedings   1 48-53   Jan 2007
Prosody plays an important role in speech communication between humans. Several computer-assisted language learning (CALL) systems with utterance evaluation have been developed so far; however, accuracy of their prosody evaluation is still poor. I...
T. Nishizawa, A. Ito
Journal of Horticultural Science and Biotechnology   82 227-234   Jan 2007
Changes in cell wall polysaccharides associated with fruit softening under storage conditions at 20°C were compared between 'Wasada-uri' (a "five-carpel-type" melon accession) and 'Prince' (a "three-carpel-type" melon cultivar). Ethylene productio...
Motoyuki Suzuki, Toru Hosoya, Akinori Ito, Shozo Makino
ISMIR 2006 - 7th International Conference on Music Information Retrieval   168-171   Dec 2006
Several music information retrieval (MIR) systems have been developed which retrieve musical pieces by the user's singing voice. All of these systems use only melody information for retrieval, although lyrics information is also useful for retriev...
Motoyuki Suzuki, Yasutomo Kajiura, Akinori Ito, Shozo Makino
INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP   5 2202-2205   Jan 2006
An n-gram trained by a general corpus gives high performance. However, it is well known that a topic-specialized n-gram gives higher performance than that of the general n-gram. In order to make a topic specialized n-gram, several adaptation metho...
Akinori Ito, Keisuke Shimada, Motoyuki Suzuki, Shozo Makino
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2 1045-1048   Jan 2006
This paper describes a user simulator based on analysis of VoiceXML description. A user simulator is a method to evaluate a spoken dialog system without the use of human evaluators. The new feature of our simulator is that it uses a VoiceXML descr...
Toru Hosoya, Motoyuki Suzuki, Akinori Ito, Shozo Makino
ISMIR 2005 - 6th International Conference on Music Information Retrieval   532-535   Dec 2005
Recently, several music information retrieval (MIR) systems have been developed which retrieve musical pieces by the user's singing voice. All of these systems use only the melody information for retrieval. Although the lyrics information is usefu...
Motoyuki Suzuki, Yusuke Kato, Akinori Ito, Shozo Makino
9th European Conference on Speech Communication and Technology   973-976   Dec 2005
Background noise is one of the biggest problem for speech recognition systems in real environments. In order to achieve high recognition performance for corrupted speech, we proposed a new construction method of HMMs dealing with various kinds of ...
Akinori Ito, Yen Ling Lim, Motoyuki Suzuki, Shozo Makino
9th European Conference on Speech Communication and Technology   173-176   Dec 2005
We are developing a CALL system to train English pronunciation for Japanese native speakers. However, the precision of the error detection was not very high because the threshold for the detection was not optimum. To improve the detection accuracy...
Akinori Ito, Takashi Kanayama, Motoyuki Suzuki, Shozo Makino
9th European Conference on Speech Communication and Technology   2685-2688   Dec 2005
Speech recognition by a small robot is difficult because the robot makes noise itself. In this paper, two new methods are proposed that suppresses internal noise of the small robots. These methods are based on spectral subtraction (SS). The differ...
Sung Phil Heo, Motoyuki Suzuki, Akinori Ito, Shozo Makino, Hyun Yeol Chung
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)   3094 212-227   Dec 2004
This paper describes a music information retrieval system which uses humming as the key for retrieval. Humming is an easy way for the user to input a melody. However, there are several problems with humming that degrade the retrieval of informatio...
Recent Topics on Speech Recognition
Akinori Ito
IEICE Information and Systems Society Journal   9(1) 14-21   May 2004
Akinori Ito, Takanobu Oba, Takashi Konashi, Motoyuki Suzuki, Shozo Makino
8th International Conference on Spoken Language Processing, ICSLP 2004   193-196   Jan 2004
Speech recognition under noisy environment is one of the hottest topic in the speech recognition research. In this paper, we propose a method to improve accuracy of spoken dialog system from a dialog strategy point of view. In the proposed method,...
Oh Pyo Kweon, Akinori Ito, Motoyuki Suzuki, Shozo Makino
8th International Conference on Spoken Language Processing, ICSLP 2004   1833-1836   Jan 2004
This paper describes a dialogue-based CALL (Computer Assisted Language Learning) system. One of the major problems in CALL systems is that learners are usually assigned a passive role. Learners have no practices in composing their own utterances. ...
Takashi Konashi, Motoyuki Suzuki, Akinori Ito, Shozo Makino
8th International Conference on Spoken Language Processing, ICSLP 2004   189-192   Jan 2004
We have been developing a spoken dialog system. Conventional spoken dialog systems need grammar descriptions and scripts of a dialog, that are difficult to develop. The system proposed in this paper is based on semantic frames, and the system gene...
Motoyuki Suzuki, Hirokazu Ogasawara, Akinori Ito, Yuichi Ohkawa, Shozo Makino
8th International Conference on Spoken Language Processing, ICSLP 2004   2929-2932   Jan 2004
Several CALL systems have two acoustic models to evaluate a learner's pronunciation. In order to achieve high performance for evaluation, speaker adaptation method is introduced in CALL system. It requires adaptation data of a target language, how...
Yuichi Ohkawa, Akihiro Yoshida, Motoyuki Suzuki, Akinori Ito, Shozo Makino
EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology   485-488   Jan 2003
In spontaneous speech, various speech style and speed changes can be observed, which are known to degrade speech recognition accuracy. In this paper, we describe an optimized multi-duration HMM (OMD). An OMD is a kind of multi-path HMM with at mos...
Akinori Ito, Akinori Ito, Chiori Hori, Chiori Hori, Masaharu Katoh, Masaharu Katoh, Masaki Kohda, Masaki Kohda
Systems and Computers in Japan   33 74   Mar 2002
Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Masato Mimura, Atsushi Yamada, Akinori Ito, Katsunobu Itou, Kiyohiro Shikano
Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002   1438-1441   Jan 2002
Continuous Speech Recognition Consortium (CSRC) was founded on 2000 to promote sharable high-quality platform for research and development of speech recognition. It is a continued work of the former Japanese Dictation Toolkit project from 1997 to ...
Se Jin Oh, Hyun Yeol Chung, Cheol Jun Hwang, Bum Koog Kim, Akinori Ito
2001 IEEE Fourth Workshop on Multimedia Signal Processing   39-44   Dec 2001
In this paper, we adopted the Korean phonological rules to state clustering of contextual domain for representing the unknown contexts and tying the model parameters of new states in state clustering of SSS (Successive State Splitting). We used th...
Akinori Ito, Chiori Hori, Masaharu Katoh, Masaki Kohda
Systems and Computers in Japan   32 1-9   Nov 2001
As a novel elastic image matching technique, piecewise linear 2D warping (PL2DW) is investigated. In PL2DW, the mapping of each row of one image to another image is given by the linear interpolation of the mapping of several points, called pivots,...
Japanese Dictation Toolkit -1999 version-
Tatsuya Kawahara,Akinobu Lee,Tetsunori Kobayashi,Kazuya Takeda,Nobuaki Minematsu,Shigeki Sagayama,Katsunobu Itoh,Akinori Ito,Mikio Yamamoto,Atsushi Yamada,Takehito Utsuro,Kiyohiro Shikano
J. Acoustical Society of Japan   57(3) 210-214   Mar 2001
T. Nishizawa, A. Ito, Y. Motomura, M. Ito, M. Togashi
Journal of the Japanese Society for Horticultural Science   69 563-569   Oct 2000
Biochemical changes in ripening netted melon fruits (Cucumis melo L. 'Andesu' and 'Luster') as influenced by shading were determined. Shading resulted in a rapid loss of flesh firmness in both cultivars which was positively correlated with ethylen...
w3m: a pager/text-based WWW browser
Akinori Ito
bit   32(9) 28-33   Sep 2000
Akinori Ito, Chiori Hori, Masaharu Kotow, Masaki Kohda
6th International Conference on Spoken Language Processing, ICSLP 2000      Jan 2000
This paper describes a language modeling technique using a kind of stochastic context free grammar (stochastic dependency grammar, SDG). In this work, two improvements are done upon the general CFG based SCFG model. The first improvement is to use...
Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Itou, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee
2nd International Conference on Language Resources and Evaluation, LREC 2000      Jan 2000
Large vocabulary continuous speech recognition (LVCSR) is an important basis for the application development of speech recognition technology. We had constructed Japanese common LVCSR speech database and have been developing sharable Japanese LVCS...
Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Shigeki Sagayama, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano
6th International Conference on Spoken Language Processing, ICSLP 2000      Jan 2000
A sharable software repository for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) is introduced. It is designed as a baseline platform for research and developed by researchers of different academic institutes under a governmental...
Tatsuya Kawahara,Akinobu Lee,Tetsunori Kobayashi,Kazuya Takeda,Nobuaki Minematsu,Katsunobu Itoh,Akinori Ito,Mikio Yamamoto,Atsushi Yamada,Takehito Utsuro,Kiyohiro Shikano
J. Acoustical Society of Japan   55(3) 175-180   Mar 1999
Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi)   20 233-239   Jan 1999
The Japanese Dictation Toolkit has been designed and developed as a baseline platform for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition). The platform consists of a standard recognition engine, Japanese phone models and Japanese s...
Akinori Ito, Masaki Kohda
International Conference on Spoken Language Processing, ICSLP, Proceedings   1 490-493   Dec 1996
This paper describes a new powerful statistical language model based on N-gram model for Japanese speech recognition. In English, a sentence is written word-by-word. On the other hand, a sentence in Japanese has no word boundary character. Therefo...
Takashi Otsuki, Takashi Otsuki, Akinori Ito, Shozo Makino, Teruhiko Ohtomo
IEICE Transactions on Information and Systems   E79-D 47-52   Jan 1996
This paper presents the performance prediction method on sentence recognition system which uses a finite state word automaton. When each word is uttered separately, the relationship between word recognition score and sentence recognition score can...
Motoyuki Suzuki, Shozo Makino, Akinori Ito, Hirotomo Aso, Hiroshi Shimodaira
IEICE Transactions on Information and Systems   E78-D 662-668   Jun 1995
Many methods have been proposed for constructing context-dependent phoneme models using Hidden Markov Models (HMMs) to improve performance. These conventional methods require previously defined contextual factors. If these factors are deficient, t...
SuperTAINS: Tohoku University Network realizes multimedia applications through sub-giga network
Yukiyoshi Kameyama,Akinori Ito,Hiroaki Kobayashi
Computer and Network LAN   13(6) 114-120   Jun 1995
Takashi Otsuki, Teruhiko Otomo, Akinori Ito, Shozo Makino
Electronics and Communications in Japan (Part III: Fundamental Electronic Science)   78 10-19   Jan 1995
The words in natural language have different occurrence probabilities. Consequently, the information obtained from the event, i.e., the occurrence of a word, is larger than in the case of the occurrence with uniform probability. In other words, it...
Takashi Otsuki, Shozo Makino, Akinori Ito, Toshio Sone
Systems and Computers in Japan   25 72-81   Jan 1994
This paper considers word recognition based on the existence of the transition between phonemes and characters with complete segmentation between phonemes and characters. A method is proposed which estimates theoretically the relation between the ...
Akinori Ito, Shozo Makino
Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing   2    Jan 1993
In this paper, a new word pre-selection method called 'extended redundant hash addressing method' is proposed. This method extends the redundant hash addressing method to word spotting from continuous speech. Moreover, the improvement of the trigr...
Shozo Makino, Akinori Ito, Mitsuru Endo, Ken'iti Kido
Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing   1 273-276   Dec 1991
A prototype of a Japanese text dictation system has been developed. It is composed of an acoustic processor, a Bunsetsu-unit spotting processor, and a syntactic processor with semantic constraints. The acoustic processor is constructed using the m...

Books etc

 
音響学入門
鈴木陽一, 赤木正人, 伊藤彰則, 佐藤洋, 苣木禎史, 中村健太郎 (Part:Joint Work)
Feb 2010   
IT Text Speech Recognition System
Kiyohiro Shikano,Katsunobu Itoh,Tatsuya Kawahara,Kazuya Takeda,Mikio Yamamoto (Part:Editor, 第4章、第5章)
Ohmsha   May 2001   
Recent Research towards Advanced Man-Machine Interface through Spoken Language
Shozo Makino,Akinori Ito,Mitsuru Endo,Ken'iti Kido (Part:Joint Work, Chapter 4, pp. 193-204)
Elsevier   Jan 1996   
Spoken Language Systems
Seiichi Nakagawa, Michio Okada, Tatsuya Kawahara (Eds.) (Part:Joint Work)
Ohmsha/IOS Press   Sep 2005   

Conference Activities & Talks

 
Toru Ishikawa, Takashi Nose, Akinori Ito
Smart Innovation, Systems and Technologies   1 Jan 2019   
© Springer Nature Switzerland AG 2019. In this paper, we propose a method to generate a talking head animation considering the direction of the face. The proposed method parametrizes a facial image using the active appearance model (AAM) and model...
Sou Miyamoto, Takashi Nose, Kazuyuki Hiroshiba, Yuri Odagiri, Akinori Ito
Smart Innovation, Systems and Technologies   1 Jan 2019   
© Springer Nature Switzerland AG 2019. In this study, we propose a voice conversion technique with two-stage conversion, which is realized by using two models consisting of U-Net and pix2pix. Using U-Net, we tried to reproduce intonation of a targ...
Jiang Fu, Yuya Chiba, Takashi Nose, Akinori Ito
Smart Innovation, Systems and Technologies   1 Jan 2019   
© Springer Nature Switzerland AG 2019. Regarding the assistance of computer-assisted language learning (CALL) systems to make foreign language learning easier, it is necessary to recognize the utterances of the learner with high accuracy. The qual...
Takashi Kimura, Takashi Nose, Shinji Hirooka, Shinji Hirooka, Yuya Chiba, Akinori Ito
Smart Innovation, Systems and Technologies   1 Jan 2019   
© Springer Nature Switzerland AG 2019. In recent years, many systems having a speech interface have grown. The speech interface includes spoken dialogue function and high performance of a spoken dialogue system has been required. The spoken dialog...
Shinya Hanabusa, Takashi Nose, Akinori Ito
Smart Innovation, Systems and Technologies   1 Jan 2019   
© Springer Nature Switzerland AG 2019. This paper proposes a technique for controlling the pitch of synthetic speech at a segmental level using user input speech within a framework of speech synthesis based on deep neural networks (DNNs). In a pre...

Works

 
palmkit: a toolkit for statistical language modeling
Software   Nov 2001
w3m: a web browser
Software   Jan 1999

Research Grants & Projects

 
Development of spoken dialog systems
Project Year: Apr 2002 - Today
Development of a CALL system using speech recognition technology
Grant-in-Aid for Scientific Research
Project Year: Apr 2004 - Today
Development of Speech Recognition System
Ordinary Research
Project Year: Apr 2002 - Today
Music Information Processing
Project Year: Apr 2004 - Today

Patents

 
特許第5805474号 : 音声評価装置,音声評価方法,及びプログラム
特許第5780516号 : モデル縮減装置とその方法とプログラム
大庭 隆伸,堀 貴明,中村 篤,伊藤 彰則
特許第5700566号 : スコアリングモデル生成装置、学習データ生成装置、検索システム、スコアリングモデル生成方法、学習データ生成方法、検索方法及びそのプログラム
特許第5610304号 : モデルパラメータ配列装置とその方法とプログラム
大庭 隆伸,堀 貴明,,中村 篤,伊藤 彰則
特許第4911385号 : データ通信方法、データ通信システムおよびデータ通信プログラム
鈴木 陽一,伊藤 彰則,阿部 俊一郎,須藤 裕史,吉木 伸二,染谷 大

Social Contribution

 
サイエンスカフェ
[Others]  28 Jun 2013
「スマホやロボットとどうやって会話できるのか?」と題して、おんせい認識・合成・対話技術について公開の公演を行った。
出前講義
[Others]  4 Dec 2008
宮城県仙台第二高校において,「ロボットとの対話」という題目で,高校生を対象に出前講義を行った.
出前講義
[Others]  18 Oct 2008
群馬県立太田高校において,「ロボットとの対話」という題目で,高校生を対象に出前講義を行った.
ネット障害時 円滑送信
[Informant]  日本経済新聞  23 Mar 2007

Others

 
Apr 1997   日本語ディクテーション基本ソフトウェアの開発
日本語の大語彙連続音声認識の研究・開発・実用化を促進する
ため、誰でも利用でき、高精度な音声認識システムを開発する。
このため、不特定話者に対して利用できる高精度な音響モデル、
大量の言語データを用いて学習した言語モデル、および高速・
高精度な音声認識エンジンの開発を行う。