Akinori Ito (伊藤 彰則)

Last updated: 19/02/18 15:10

Researcher name
Akinori Ito (伊藤 彰則, イトウ アキノリ)
Email
aitospcom.ecei.tohoku.ac.jp
URL
http://db.tohoku.ac.jp/whois/detail/12e971e9540a73accb8e746dcce9d0c2.html
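Affiliation
Tohoku University
Department
Graduate School of Engineering / School of Engineering, Department of Communications Engineering, Intelligent Communication Network Engineering, Human Interface
Position
Professor
Degree
Doctor of Engineering (Tohoku University)
KAKENHI researcher number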
70232428
Twitter ID
akinori_ito
ORCID ID
0000-0002-8835-7877

Research fields

 
 

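Career

April 2010 - present: Tohoku University, Graduate School of Engineering, Professor
April 2002 - March 2010: Tohoku University, Graduate School of Engineering, Associate Professor
October 1999 - March 2002: Yamagata University, Faculty of Engineering, Associate Professor
May 1998 - April 1999: Boston University, College of Engineering, Visiting Researcher
April 1995 - September 1999: Yamagata University, Faculty of Engineering, Lecturer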

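Education

Until March 1991: Tohoku University, Graduate School of Engineering, Department of Information Engineering
Until March 1986: Tohoku University, Faculty of Engineering, Department of Communications Engineering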

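Committee memberships

June 2009 - present: Acoustical Society of Japan, Board Member
May 2009 - present: Information Processing Society of Japan, Special Interest Group on Music and Computer (音楽情報科学研究会), Steering Committee Secretary
April 2009 - present: Journal of Information Hiding and Multimedia Signal Processing, Editorial Board Member
May 2007 - present: Acoustical Society of Japan, Councilor
September 2005 - present: Acoustical Society of Japan, Digitization Promotion Committee (電子化推進委員会), Member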

Awards

October 2008
Organizing Committee of International Conference on Natural Language Processing and Knowledge Engineering Best Paper Award of International Conference on Natural Language Processing and Knowledge Engineering
Recipients: Tomoaki Konno, Masashi Ito, Motoyuki Suzuki, Akinori Ito, Shozo Makino

November 2007
Organizing Committee of International Conference on Intelligent Information Hiding and Multimedia Signal Processing Best Paper Award of International Conference on Intelligent Information Hiding and Multimedia Signal Processing
Recipients: Akinori Ito, Shozo Makino

July 2007
Organizing Committee of The 5th International Conference on Education and Information Systems, Technologies and Applications Best Paper Award of The 5th International Conference on Education and Information Systems, Technologies and Applications
Recipients: Motoyuki Suzuki, Tatsuki Konno, Akinori Ito, Shozo Makino

November 2003
Ishida Memorial Foundation (石田(實)記念財団), Research Encouragement Award, for research on spoken language processing

June 2000
Electronic Network Consortium (電子ネットワーク協議会), Open Software Grand Prize, prize winner, for development of the software "w3m"
 

Papers

 
Hiroto Aoyama, Takashi Nose, Yuya Chiba, Akinori Ito
Smart Innovation, Systems and Technologies   110 140-148   January 2019
© Springer Nature Switzerland AG 2019. In order to synthesize more natural speech with Japanese text-to-speech systems, we improve accent sandhi rules. The conventional Japanese accent sandhi rules lack rules related to numerals and counter words ...
Yuko Nakamori, Yutaka Hiroi, Akinori Ito
ROBOMECH Journal   5    December 2018
© 2018, The Author(s). We are developing a robot that can play an outdoor game with children. In realizing such a robot, the person detection and tracking methods play an important role. In this paper, we propose methods for improving person detec...
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
IEICE Transactions on Information and Systems   E101D 1581-1590   June 2018
Copyright © 2018 The Institute of Electronics, Information and Communication Engineers. This paper proposes a novel domain adaptation method that can utilize out-of-domain text resources and partially domain matched text resources in language mode...
Analyses of Example Sentences Collected by Conversation for Example-Based Non-Task-Oriented Dialog System
IAENG International Journal of Computer Science   45(2) 285-293   May 2018   [Peer-reviewed]
Yuya Chiba, Takashi Nose, Akinori Ito
Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017   2018-February 428-431   February 2018
© 2017 IEEE. A dialog system can select a more favorable action to a user by estimating the user's internal state. In this paper, we introduce the user's willingness to talk, whether the user wants to talk about a topic or to answer a question pos...
Yuko Nakamori, Yutaka Hiroi, Akinori Ito
SII 2017 - 2017 IEEE/SICE International Symposium on System Integration   2018-January 494-499   February 2018
© 2017 IEEE. We are developing a robot that can play Darumasan-ga-koronda game (similar to "Red light, green light" game) with human players. We have developed a method to detect and track the players, to determine whether the players are moving a...
Akinori Ito
Proceedings of the 2018 International Conference on Intelligent Information Technology   45-49   February 2018   [Peer-reviewed]
Haoran Wu, Yuya Chiba, Takashi Nose, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2018-September 1746-1750   January 2018
© 2018 International Speech Communication Association. All rights reserved. English proficiency is important for communication in English. Computer-Assisted Language Learning (CALL) systems are introduced to provide a convenient and low-cost langu...
Hafiyan Prafianto, Takashi Nose, Yuya Chiba, Akinori Ito
Acoustical Science and Technology   39 92-100   January 2018
© 2018 The Acoustical Society of Japan. We investigate the effect of speaking rate and pauses on the perception of spoken Easy Japanese, which is Japanese language with mostly easy words to facilitate understanding by non-native speakers. In this ...
Yuya Chiba, Takashi Nose, Taketo Kase, Mai Yamanaka, Akinori Ito
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, Melbourne, Australia, July 12-14, 2018   371-375   2018   [Peer-reviewed]
Yukiko Kageyama, Yuya Chiba, Takashi Nose, Akinori Ito
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, Melbourne, Australia, July 12-14, 2018   235-240   2018   [Peer-reviewed]
Haoran Wu, Yuya Chiba, Takashi Nose, Akinori Ito
Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, Hyderabad, India, 2-6 September 2018.   1746-1750   2018   [Peer-reviewed]
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
IEICE Transactions   101-D(6) 1581-1590   2018   [Peer-reviewed]
Akinori Ito
IEICE Transactions   101-D(1) 1   2018   [Peer-reviewed]
Sato, K. and Nose, T. and Ito, A. and Chiba, Y. and Ito, A. and Shinozaki, T.
Smart Innovation, Systems and Technologies   82 113-118   2018   [Peer-reviewed]
Miyamoto, S. and Nose, T. and Ito, S. and Koike, H. and Chiba, Y. and Ito, A. and Shinozaki, T.
Smart Innovation, Systems and Technologies   82 97-103   2018   [Peer-reviewed]
Yamada, Y. and Nose, T. and Chiba, Y. and Ito, A. and Shinozaki, T.
Smart Innovation, Systems and Technologies   82 91-96   2018   [Peer-reviewed]
Nakamura, K. and Chiba, Y. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   82 104-111   2018   [Peer-reviewed]
Miyagawa, I. and Chiba, Y. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   82 130-136   2018   [Peer-reviewed]
Tada, S. and Chiba, Y. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   82 84-90   2018   [Peer-reviewed]
Mori, H. and Chiba, Y. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   82 77-83   2018   [Peer-reviewed]
Akinori Ito
Journal of Information Hiding and Multimedia Signal Processing   8 1325-1334   November 2017
© 2017, Ubiquitous International. All rights reserved. This paper describes methods that add values to audio signals using side information. Many acoustic signal processing methods have been proposed for estimating the lost information from the or...
Akinori Ito, Yuto Sasaki
Journal of Information Hiding and Multimedia Signal Processing   8 1372-1381   November 2017
© 2017, Ubiquitous International. All rights reserved. We propose a system that enables a listener of streaming audio to control the volume (magnitude of the signal) of independent part (specifically the vocal signal) in a mixed audio signal in re...
Yuko Nakamori, Yutaka Hiroi, Akinori Ito
IEEE/SICE International Symposium on System Integration, SII 2017, Taipei, Taiwan, December 11-14, 2017   494-499   2017   [Peer-reviewed]
Kageyama, Y. and Chiba, Y. and Nose, T. and Ito, A.
Communications in Computer and Information Science   713 458-464   2017   [Peer-reviewed]
Journal of Computer and Communications   5(10) 55-65   August 2017   [Peer-reviewed]
Kohei Morishita, Yutaka Hiroi, Akinori Ito
Journal of Robotics   2017    January 2017
© 2017 Kohei Morishita et al. A life-support service robot must avoid both static and dynamic obstacles for working in a real environment. Here, a static obstacle means an obstacle that does not move, and a dynamic obstacle is the one that moves. ...
Yuya Chiba, Takashi Nose, Akinori Ito
2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017   428-431   2017   [Peer-reviewed]
Kohei Morishita, Yutaka Hiroi, Akinori Ito
J. Robotics   2017 3148202:1-3148202:10   2017   [Peer-reviewed]
Akinori Ito
IEICE Transactions   100-D(1) 1   2017   [Peer-reviewed]
Yamada, S. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   63 159-166   2017   [Peer-reviewed]
Chiba, Y. and Nose, T. and Ito, A.
Journal on Multimodal User Interfaces   11(2) 185-196   2017   [Peer-reviewed]
Nagano, T. and Prafianto, H. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   64 221-228   2017   [Peer-reviewed]
Sato, K. and Nose, T. and Ito, A.
Smart Innovation, Systems and Technologies   64 29-36   2017   [Peer-reviewed]
Chiba, Y. and Ito, A.
Lecture Notes in Electrical Engineering   999 LNEE 411-419   2017   [Peer-reviewed]
Yuya Chiba, Akinori Ito
Dialogues with Social Robots - Enablements, Analyses, and Evaluation, Seventh International Workshop on Spoken Dialogue Systems, IWSDS 2016, Saariselkä, Finland, January 13-16, 2016   411-419   2016   [Peer-reviewed]
Akinori Ito
24th European Signal Processing Conference, EUSIPCO 2016, Budapest, Hungary, August 29 - September 2, 2016   106-109   2016   [Peer-reviewed]
Masumura, R. and Asami, T. and Oba, T. and Masataki, H. and Sakauchi, S. and Ito, A.
IEICE Transactions on Information and Systems   E99D(10) 2452-2461   2016   [Peer-reviewed]
Ito, A.
Smart Innovation, Systems and Technologies   63 3-10   2017   [Peer-reviewed]
Ito, A.
European Signal Processing Conference   106-109   November 2016   [Peer-reviewed]
Takeishi, E. and Nose, T. and Chiba, Y. and Ito, A.
2016 Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2016   16-21   October 2016   [Peer-reviewed]
Ito, A.
2016 IEEE International Conference on Signal and Image Processing, ICSIP 2016   726-730   August 2016   [Peer-reviewed]
Prafianto, H. and Nose, T. and Ito, A.
ICALIP 2016 - 2016 International Conference on Audio, Language and Image Processing - Proceedings   208-212   July 2016   [Peer-reviewed]
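Evaluation of a spoken dialog system using cooperative emotional speech synthesis based on utterance state estimation (発話状態推定に基づく協調的感情音声合成による音声対話システムの評価)
Taketo Kase, Takashi Nose, Yuya Chiba, Akinori Ito
IEICE Journal A (電子情報通信学会誌A)   J199-A(1) 25-35   January 2016   [Peer-reviewed]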
Array,Yosuke Nakamura,Akinori Ito,Motonobu Kawashima,Taichi Watanabe,Yoshihiro Kishimoto,Kunio Kondo
Computers & Graphics   61 1-10   2016   [Peer-reviewed]
Influence of the height of a robot on comfortableness of verbal interaction
Hiroi, Y. and Ito, A.
IAENG International Journal of Computer Science   43(4) 447-455   2016   [Peer-reviewed]
Estimating the user's state before exchanging utterances using intermediate acoustic features for spoken dialog systems
Chiba, Y. and Nose, T. and Ito, M. and Ito, A.
IAENG International Journal of Computer Science   43(1) 1-9   2016   [Peer-reviewed]
Automatic Generation of Proper Noun Entries in a Speech Recognizer for Local Information Recognition
Kenta Shiga, Takashi Nose, Akinori Ito, Ryo Masumura, Hirokazu Masataki
Proceedings of 12th Western Pacific Acoustics Conference      December 2015   [Peer-reviewed]
Investigation of Pause Insertion Effect in Spoken Easy Japanese for Non-Native Listeners
Hafiyan Prafianto, Takeshi Nagano, Takashi Nose, Akinori Ito
Proceedings of 12th Western Pacific Acoustics Conference   507-511   December 2015   [Peer-reviewed]
YANSIS: An “Easy Japanese” writing support system
Takeshi Nagano, Akinori Ito
Proceedings of 8th International Conference ICT for Language Learning      November 2015   [Peer-reviewed]
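A spoken-dialog English learning system for English conversation practice that takes response timing into account (応答タイミングを考慮した英会話練習のための音声対話型英語学習システム)
Naoto Suzuki, Yutaka Hiroi, Yuya Chiba, Takashi Nose, Akinori Ito
IPSJ Journal (情報処理学会論文誌)   56(11) 2177-2189   November 2015   [Peer-reviewed]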
Sakai, K. and Hiroi, Y. and Ito, A.
Proceedings - 2015 3rd International Conference on Robot, Vision and Signal Processing, RVSP 2015   1-4   November 2015   [Peer-reviewed]
Saito, Y. and Nose, T. and Shinozaki, T. and Ito, A.
Proceedings - 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015   433-436   September 2015   [Peer-reviewed]
Nishino, T. and Nose, T. and Ito, A.
Proceedings - 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015   146-149   September 2015   [Peer-reviewed]
Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts
Takashi Nose, Yusuke Arao, Takao Kobayashi, Komei Sugiura, Yoshinori Shiga, Akinori Ito
Proceedings of 16th Annual Conference of the International Speech Communication Association   3491-3495   September 2015   [Peer-reviewed]
Yuma Fujiwara, Yutaka Hiroi, Yuki Tanaka, Akinori Ito
Proceedings of IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)   413-418   August 2015   [Peer-reviewed]
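Investigation of the accuracy of indicating positions to humans by pointing, and a method for improving that accuracy (指差しによる人間への位置提示精度調査とその精度向上手法)
Yutaka Hiroi, Akinori Ito
IPSJ Journal (情報処理学会論文誌)   56(8) 1634-1645   August 2015   [Peer-reviewed]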
On Appropriateness and Estimation of the Emotion of Synthesized Response Speech in a Spoken Dialogue System
Taketo Kase, Takashi Nose, Akinori Ito
Proceedings of HCI International 2015   588-593   August 2015   [Peer-reviewed]
Taketo Kase, Takashi Nose, Akinori Ito
Communications in Computer and Information Science   528 747-752   January 2015
© Springer International Publishing Switzerland 2015. Paralinguistic features such as emotion of an utterance is as important as its linguistic content for generating better response utterances in spoken dialog systems. In this research, we carrie...
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2015-January 2380-2384   January 2015
Copyright © 2015 ISCA. This paper proposes a novel language modeling approach called latent word recurrent neural network language model, which solves the problems present in both recurrent neural network language models (RNNLMs) and latent word l...
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2015-January 463-467   January 2015
Copyright © 2015 ISCA. This paper demonstrates combinations of various language model (LM) technologies simultaneously, not only modeling techniques but also those for training data expansion based on external language resources and unsupervised a...
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing   1896-1901   January 2015
© 2015 Association for Computational Linguistics. This paper focuses on language modeling with adequate robustness to support different domain tasks. To this end, we propose a hierarchical latent word language model (h-LWLM). The proposed model ca...
Takashi Nose, Yusuke Arao, Takao Kobayashi, Komei Sugiura, Yoshinori Shiga, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2015-January 3491-3495   January 2015
Copyright © 2015 ISCA. This paper proposes a sentence selection method using a maximum entropy criterion to construct recording scripts for speech synthesis. In the conventional corpus design of speech synthesis, a greedy algorithm that maximi...
Yuki Saito, Takashi Nose, Takahiro Shinozaki, Akinori Ito
2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015, Adelaide, Australia, September 23-25, 2015   433-436   2015   [Peer-reviewed]
Tsukasa Nishino, Takashi Nose, Akinori Ito
2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015, Adelaide, Australia, September 23-25, 2015   146-149   2015   [Peer-reviewed]
Akinori Ito, Kengo Watanabe, Genki Kuroda, Kenichiro Ito
Looking Back, Looking Forward: Proceedings of the 41st International Computer Music Conference, ICMC 2015, Denton, TX, USA, September 25 - October 1, 2015      2015   [Peer-reviewed]
Taketo Kase, Takashi Nose, Akinori Ito
HCI International 2015 - Posters' Extended Abstracts - International Conference, HCI International 2015, Los Angeles, CA, USA, August 2-7, 2015. Proceedings, Part I   747-752   2015   [Peer-reviewed]
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, Lisbon, Portugal, September 17-21, 2015   1896-1901   2015   [Peer-reviewed]
Kohei Machida, Takashi Nose, Akinori Ito
Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)      December 2014   [Peer-reviewed]
Akinori Ito, Yuto Sasaki
Proceedings of International Conference on Signal Processing   605-609   October 2014   [Peer-reviewed]
Analysis of spectral enhancement using global variance in HMM-based speech synthesis
Takashi Nose, Akinori Ito
Proceedings of Interspeech      September 2014   [Peer-reviewed]
Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling
Tomoki Koriyama, Hiroshi Suzuki, Takashi Nose, Takahiro Shinozaki, Akinori Ito
Proceedings of Interspeech      September 2014   [Peer-reviewed]
Kazumichi Yoshida, Takashi Nose, Akinori Ito
Proceedings of International Conference on Intelligent Information Hiding and Multimedia Signal Processing      August 2014   [Peer-reviewed]
Akinori Ito
Proceedings of International Conference on Intelligent Information Hiding and Multimedia Signal Processing   2014   August 2014   [Peer-reviewed]
Keisuke Sakai, Yutaka Hiroi, Akinori Ito
Proceedings of International Symposium on Robotics and Application      August 2014   [Peer-reviewed]
Yuya Chiba, Masashi Ito, Takashi Nose, Akinori Ito
Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue      July 2014   [Peer-reviewed]
Tempo Modification of Music Signal Using Sinusoidal Model and LPC-Based Residue Model
Akinori Ito, Yuki Igarashi, Masashi Ito, Takashi Nose
Proceedings of International Congress on Sound and Vibration      July 2014   [Peer-reviewed]
Hafiyan Prafianto, Takashi Nose, Yuya Chiba, Akinori Ito, Kazuyuki Sato
International Conference on Audio, Language and Image Processing      July 2014   [Peer-reviewed]
Masahito Okamoto, Takashi Nose, Akinori Ito, Takeshi Nagano
International Conference on Audio, Language and Image Processing      July 2014   [Peer-reviewed]
Noriko Totsuka, Yuya Chiba, Takashi Nose, Akinori Ito
International Conference on Audio, Language and Image Processing      July 2014   [Peer-reviewed]
User Modeling by Using Bag-of-Behaviors for Building a Dialog System Sensitive to the Interlocutor’s Internal State
Yuya Chiba, Takashi Nose, Akinori Ito, Masashi Ito
Proceedings of 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue   74   June 2014   [Peer-reviewed]
Yutaka Hiroi, Akinori Ito
Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction   141-142   March 2014   [Peer-reviewed]
Modeling User's State During Dialog Turn Using HMM For Multi-modal Spoken Dialog System
Yuya Chiba, Masashi Ito, Akinori Ito
Proceedings of The Seventh International Conference on Advances in Computer-Human Interactions   343-346   March 2014   [Peer-reviewed]
Ryunosuke Daido, Masashi Ito, Shozo Makino, Akinori Ito
Computer Speech and Language   28(2) 501-517   March 2014   [Peer-reviewed]
Evaluation of singing skill is a popular function of karaoke machines. Here, we introduce a different aspect of evaluating the singing voice of an amateur singer: “singing enthusiasm”. First, we investigated whether human listeners can evaluate si...
Takashi Nose, Akinori Ito
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2917-2921   January 2014
Copyright © 2014 ISCA. This paper analyzes the problem of the spectral enhancement technique using global variance (GV) in HMM-based speech synthesis. In the conventional GV-based parameter generation, spectral enhancement with variance compensati...
Yuya Chiba, Takashi Nose, Akinori Ito, Masashi Ito
SIGDIAL 2014 - 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference   74-78   January 2014
© 2014 Association for Computational Linguistics. When using spoken dialog systems in actual environments, users sometimes abandon the dialog without making any input utterance. To help these users before they give up, the system should know why t...
Akinori Ito, Yuki Igarashi, Masashi Ito, Takashi Nose
21st International Congress on Sound and Vibration 2014, ICSV 2014   1 928-935   January 2014
Changing tempo of the music signal is one of the most basic signal processing applied to music signals. Traditional algorithms such as phase vocoder and PSOLA uniformly stretch and shrink the input signal. Therefore, those methods change not only ...
Akinori Ito
Proceedings - 2014 10th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2014   558-561   January 2014
© 2014 IEEE. Singing enthusiasm is a new concept to evaluate singing voice, and the perceived enthusiasm have been shown to be able to be estimated accurately. However, the intended enthusiasm is still difficult to estimate. In this paper, methods...
Yuya Chiba, Akinori Ito, Masashi Ito
ACHI 2014 - 7th International Conference on Advances in Computer-Human Interactions   343-346   January 2014
Copyright © IARIA, 2014. Conventional spoken dialog systems cannot estimate the user's state while waiting for an input from the user because the estimation process is triggered by observing the user's utterance. This is a problem when, for some r...
Takeshi Nagano, Akinori Ito
Journal of Information Hiding and Multimedia Signal Processing   5(2) 285-294   2014   [Peer-reviewed]
Naoto Suzuki, Takashi Nose, Yutaka Hiroi, Akinori Ito
Proceedings of HCI International 2014-Poster's Extended Abstracts   588-593   2014   [Peer-reviewed]
Kohei Machida, Akinori Ito
Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)   1-4   October 2013   [Peer-reviewed]
Keizo Kato, Akinori Ito
Proceedings of International Conference on Intelligent Information Hiding and Multimedia Signal Processing   460-463   October 2013   [Peer-reviewed]
Yohei Abe, Akinori Ito
Proceedings of International Conference on Intelligent Information Hiding and Multimedia Signal Processing   271-274   October 2013   [Peer-reviewed]
Yuya Chiba, Masashi Ito, Akinori Ito
Proceedings of International Conference on Human-Computer Interaction      July 2013   [Peer-reviewed]
Takeshi Nagano, Akinori Ito
Proceedings - 2013 9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013   267-270   January 2013
In a VoIP application, packet losses degrade speech quality. Especially, IP network under a large-scale disaster should cause severe packet losses. We have investigated the relationship between parameter loss and speech quality for G.729 codec. In...
Yuki Igarashi, Masashi Ito, Akinori Ito
Proceedings - 2013 9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013   464-467   January 2013
There are various kinds of sound signal analysis methods. Sinusoidal modeling, one of those signal analysis method, is based on the idea that all sound signal can be expressed as the sum of sinusoidal components of which instantaneous frequency an...
Yutaka Hiroi, Akinori Ito
Journal of the Virtual Reality Society of Japan (日本バーチャルリアリティ学会誌)   18(2) 161-170   2013   [Peer-reviewed]
Meng Zhang, Akinori Ito, Kazuyuki Sato
ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings   303-307   December 2012
In this paper, we developed a method to assess easiness of a Japanese sentence for a non-native speaker of Japanese. This method is intended to be used as a writing aid of Easy Japanese (EJ), which is used as a language to convey information to fo...
Akinori Ito, Takeshi Nagano
International Symposium on Wireless Personal Multimedia Communications, WPMC   489-490   December 2012
This paper describes an outline of a project for developing a VoIP codec that can be used under a very severe communication environment where half of the packets drop. The codec is based on G.729 CS-ACELP, and a packet loss concealment (PLC) metho...

Misc

 
Akinori Ito
Journal of the Acoustical Society of Japan (日本音響学会誌)   66(1) 32-35   January 2010
Akinori Ito, Takuya Kuraishi, Masashi Ito, Shozo Makino
APSIPA ASC 2009 - Asia-Pacific Signal and Information Processing Association 2009 Annual Summit and Conference   453-456   December 2009
In this paper, we propose a method for multiple description coding (MDC) of Flash Video stream (FLV). Our target codec of FLV is Sorenson H.263. Conventional MDC methods had disadvantages that they required large redundancy. We proposed a method t...
Akinori Ito, Tomoaki Konno, Masashi Ito, Shozo Makino
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   596-599   November 2009
In this paper, we proposed a novel method for evaluating intonation of an English utterance spoken by a learner for intonation learning by a CALL system. The proposed method is based on an intonation evaluation method proposed by Suzuki et al., wh...
Masashi Ito, Keiji Ohara, Akinori Ito, Masafumi Yano
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   124-127   November 2009
Three psycho-acoustical experiments were carried out to investigate relative importance of formant frequency and whole spectral shape as cues for vowel perception. Four types of vowel-like signals were presented to eight listeners. The mean respon...
Motoyuki Suzuki, Daisuke Honma, Akinori Ito, Shozo Makino
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   1399-1402   November 2009
The triphone model is frequently used as an acoustic model. It is effective for modeling phonetic variations caused by coarticulation. However, it is known that acoustic features of phonemes are also affected by other factors such as speaking styl...
Masaharu Kato, Tetsuo Kosaka, Akinori Ito, Shozo Makino
IAENG International Journal of Computer Science   36    November 2009
Topic-based stochastic models such as the probabilistic latent semantic analysis (PLSA) are good tools for adapting a language model into a specific domain using a constraint of global context. A probability given by a topic model is combined with...
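What was that song again? "Singing-voice search": finding music by singing (この曲、何だっけ? 歌で音楽を探す「歌声検索」)
Akinori Ito, Motoyuki Suzuki, Shozo Makino
DTM Magazine   16(11) 100-101   November 2009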
Motoyuki Suzuki, Takuto Ichikawa, Akinori Ito, Shozo Makino
Journal of Information Processing   17 95-105   January 2009
© 2009 Information Processing Society of Japan. This paper describes a query-by-humming (QbH) music information retrieval (MIR) system based on a novel tonal feature and statistical modeling. Most QbH-MIR systems use a pitch extraction method in o...
Seongjun Hahm, Akinori Ito, Shozo Makino, Motoyuki Suzuki
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   1221-1224   December 2008
We propose a fast speaker adaptation method using an aspect model. The performance of speaker independent (SI) model is very sensitive to environments such as microphones, speakers, and noises. Speaker adaptation techniques try to obtain near spea...
Akinori Ito, Shozo Makino
Journal of Digital Information Management   6 189-195   December 2008
In this paper, we discuss a method of splitting one audio stream into two equal-quality streams and recover the original audio stream from only one of the split streams. From a mathematical consideration, it is found that the sum of errors of two ...
Akinori Ito, Ryohei Tsutsui, Shozo Makino, Motoyuki Suzuki
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2819-2822   December 2008
Our goal is to develop a voice-interactive CALL system which enables language learners to practice words, phrases, and grammars interactively. Such a system must be able to recognize learner's utterances correctly. To enable the recognition of utt...
Akinori Ito, Toyomi Meguro, Shozo Makino, Motoyuki Suzuki
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   207-210   December 2008
This paper describes a method used to determine if a specific word is related to a certain spoken dialog task. In most ordinary spoken dialog systems, only the words that are actually used to achieve the task are included in the vocabulary. Theref...
Motoyuki Suzuki, Naoto Kuriyama, Akinori Ito, Shozo Makino
2008 International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2008      December 2008
PLSA is one of the most powerful language models for adaptation to a target speech. The vocabulary divided PLSA language model (VD-PLSA) shows higher performance than the conventional PLSA model because it can be adapted to the target topic and th...
Yutaka Hiroi, Akinori Ito
IEEE/ASME International Conference on Advanced Intelligent Mechatronics, AIM   546-551   September 2008
Human symbiosis service robots of various sizes have already been developed. However, few quantitative investigations have been made concerning the influence of the size of a robot on a user's impression. We focused on the height of a robot (robot...
Motoyuki Suzuki, Tatsuki Konno, Akinori Ito, Shozo Makino
IMSCI 2007 - International Multi-Conference on Society, Cybernetics and Informatics, Proceedings   1 48-53   January 2007
Prosody plays an important role in speech communication between humans. Several computer-assisted language learning (CALL) systems with utterance evaluation have been developed so far; however, accuracy of their prosody evaluation is still poor. I...
T. Nishizawa, A. Ito
Journal of Horticultural Science and Biotechnology   82 227-234   January 2007
Changes in cell wall polysaccharides associated with fruit softening under storage conditions at 20°C were compared between 'Wasada-uri' (a "five-carpel-type" melon accession) and 'Prince' (a "three-carpel-type" melon cultivar). Ethylene productio...
Motoyuki Suzuki, Toru Hosoya, Akinori Ito, Shozo Makino
ISMIR 2006 - 7th International Conference on Music Information Retrieval   168-171   December 2006
Several music information retrieval (MIR) systems have been developed which retrieve musical pieces by the user's singing voice. All of these systems use only melody information for retrieval, although lyrics information is also useful for retriev...
Motoyuki Suzuki, Yasutomo Kajiura, Akinori Ito, Shozo Makino
INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP   5 2202-2205   January 2006
An n-gram trained by a general corpus gives high performance. However, it is well known that a topic-specialized n-gram gives higher performance than that of the general n-gram. In order to make a topic specialized n-gram, several adaptation metho...
Akinori Ito, Keisuke Shimada, Motoyuki Suzuki, Shozo Makino
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   2 1045-1048   January 2006
This paper describes a user simulator based on analysis of VoiceXML description. A user simulator is a method to evaluate a spoken dialog system without the use of human evaluators. The new feature of our simulator is that it uses a VoiceXML descr...
Toru Hosoya, Motoyuki Suzuki, Akinori Ito, Shozo Makino
ISMIR 2005 - 6th International Conference on Music Information Retrieval   532-535   December 2005
Recently, several music information retrieval (MIR) systems have been developed which retrieve musical pieces by the user's singing voice. All of these systems use only the melody information for retrieval. Although the lyrics information is usefu...
Motoyuki Suzuki, Yusuke Kato, Akinori Ito, Shozo Makino
9th European Conference on Speech Communication and Technology   973-976   December 2005
Background noise is one of the biggest problem for speech recognition systems in real environments. In order to achieve high recognition performance for corrupted speech, we proposed a new construction method of HMMs dealing with various kinds of ...
Akinori Ito, Yen Ling Lim, Motoyuki Suzuki, Shozo Makino
9th European Conference on Speech Communication and Technology   173-176   2005年12月
We are developing a CALL system to train English pronunciation for Japanese native speakers. However, the precision of the error detection was not very high because the threshold for the detection was not optimum. To improve the detection accuracy...
Akinori Ito, Takashi Kanayama, Motoyuki Suzuki, Shozo Makino
9th European Conference on Speech Communication and Technology   2685-2688   2005年12月
Speech recognition by a small robot is difficult because the robot itself makes noise. In this paper, two new methods are proposed that suppress the internal noise of small robots. These methods are based on spectral subtraction (SS). The differ...
Sung Phil Heo, Motoyuki Suzuki, Akinori Ito, Shozo Makino, Hyun Yeol Chung
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)   3094 212-227   2004年12月
This paper describes a music information retrieval system which uses humming as the key for retrieval. Humming is an easy way for the user to input a melody. However, there are several problems with humming that degrade the retrieval of informatio...
音声認識に関する最近の話題
伊藤彰則
情報・システムソサイエティ誌   9(1) 14-21   2004年5月
Akinori Ito, Takanobu Oba, Takashi Konashi, Motoyuki Suzuki, Shozo Makino
8th International Conference on Spoken Language Processing, ICSLP 2004   193-196   2004年1月
Speech recognition in noisy environments is one of the hottest topics in speech recognition research. In this paper, we propose a method to improve the accuracy of a spoken dialog system from a dialog strategy point of view. In the proposed method,...
Oh Pyo Kweon, Akinori Ito, Motoyuki Suzuki, Shozo Makino
8th International Conference on Spoken Language Processing, ICSLP 2004   1833-1836   2004年1月
This paper describes a dialogue-based CALL (Computer Assisted Language Learning) system. One of the major problems in CALL systems is that learners are usually assigned a passive role. Learners get no practice in composing their own utterances. ...
Takashi Konashi, Motoyuki Suzuki, Akinori Ito, Shozo Makino
8th International Conference on Spoken Language Processing, ICSLP 2004   189-192   2004年1月
We have been developing a spoken dialog system. Conventional spoken dialog systems need grammar descriptions and scripts of a dialog, that are difficult to develop. The system proposed in this paper is based on semantic frames, and the system gene...
Motoyuki Suzuki, Hirokazu Ogasawara, Akinori Ito, Yuichi Ohkawa, Shozo Makino
8th International Conference on Spoken Language Processing, ICSLP 2004   2929-2932   2004年1月
Several CALL systems have two acoustic models to evaluate a learner's pronunciation. In order to achieve high performance for evaluation, speaker adaptation method is introduced in CALL system. It requires adaptation data of a target language, how...
Yuichi Ohkawa, Akihiro Yoshida, Motoyuki Suzuki, Akinori Ito, Shozo Makino
EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology   485-488   2003年1月
In spontaneous speech, various speech style and speed changes can be observed, which are known to degrade speech recognition accuracy. In this paper, we describe an optimized multi-duration HMM (OMD). An OMD is a kind of multi-path HMM with at mos...
Akinori Ito, Chiori Hori, Masaharu Katoh, Masaki Kohda
Systems and Computers in Japan   33 74   2002年3月
Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Masato Mimura, Atsushi Yamada, Akinori Ito, Katsunobu Itou, Kiyohiro Shikano
Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002   1438-1441   2002年1月
The Continuous Speech Recognition Consortium (CSRC) was founded in 2000 to promote a sharable high-quality platform for research and development of speech recognition. It is a continuation of the former Japanese Dictation Toolkit project from 1997 to ...
Se Jin Oh, Hyun Yeol Chung, Cheol Jun Hwang, Bum Koog Kim, Akinori Ito
2001 IEEE Fourth Workshop on Multimedia Signal Processing   39-44   2001年12月
In this paper, we adopted the Korean phonological rules to state clustering of contextual domain for representing the unknown contexts and tying the model parameters of new states in state clustering of SSS (Successive State Splitting). We used th...
Akinori Ito, Chiori Hori, Masaharu Katoh, Masaki Kohda
Systems and Computers in Japan   32 1-9   2001年11月
As a novel elastic image matching technique, piecewise linear 2D warping (PL2DW) is investigated. In PL2DW, the mapping of each row of one image to another image is given by the linear interpolation of the mapping of several points, called pivots,...
河原達也,李晃伸,小林哲則,武田一哉,峯松信明,嵯峨山茂樹,伊藤克亘,伊藤彰則,山本幹雄,山田篤,宇津呂武仁,鹿野清宏
日本音響学会誌   57(3) 210-214   2001年3月
T. Nishizawa, A. Ito, Y. Motomura, M. Ito, M. Togashi
Journal of the Japanese Society for Horticultural Science   69 563-569   2000年10月
Biochemical changes in ripening netted melon fruits (Cucumis melo L. 'Andesu' and 'Luster') as influenced by shading were determined. Shading resulted in a rapid loss of flesh firmness in both cultivars which was positively correlated with ethylen...
ページャ兼テキストベースWWWブラウザ“w3m”
伊藤彰則
bit   32(9) 28-33   2000年9月
Akinori Ito, Chiori Hori, Masaharu Kotow, Masaki Kohda
6th International Conference on Spoken Language Processing, ICSLP 2000      2000年1月
This paper describes a language modeling technique using a kind of stochastic context free grammar (stochastic dependency grammar, SDG). In this work, two improvements are done upon the general CFG based SCFG model. The first improvement is to use...
Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Itou, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee
2nd International Conference on Language Resources and Evaluation, LREC 2000      2000年1月
Large vocabulary continuous speech recognition (LVCSR) is an important basis for the application development of speech recognition technology. We had constructed Japanese common LVCSR speech database and have been developing sharable Japanese LVCS...
Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Shigeki Sagayama, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano
6th International Conference on Spoken Language Processing, ICSLP 2000      2000年1月
A sharable software repository for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) is introduced. It is designed as a baseline platform for research and developed by researchers of different academic institutes under a governmental...
河原達也,李晃伸,小林哲則,武田一哉,峯松信明,伊藤克亘,伊藤彰則,山本幹雄,山田篤,宇津呂武仁,鹿野清宏
日本音響学会誌   55(3) 175-180   1999年3月
Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi)   20 233-239   1999年1月
The Japanese Dictation Toolkit has been designed and developed as a baseline platform for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition). The platform consists of a standard recognition engine, Japanese phone models and Japanese s...
Akinori Ito, Masaki Kohda
International Conference on Spoken Language Processing, ICSLP, Proceedings   1 490-493   1996年12月
This paper describes a new powerful statistical language model based on N-gram model for Japanese speech recognition. In English, a sentence is written word-by-word. On the other hand, a sentence in Japanese has no word boundary character. Therefo...
Takashi Otsuki, Akinori Ito, Shozo Makino, Teruhiko Ohtomo
IEICE Transactions on Information and Systems   E79-D 47-52   1996年1月
This paper presents the performance prediction method on sentence recognition system which uses a finite state word automaton. When each word is uttered separately, the relationship between word recognition score and sentence recognition score can...
Motoyuki Suzuki, Shozo Makino, Akinori Ito, Hirotomo Aso, Hiroshi Shimodaira
IEICE Transactions on Information and Systems   E78-D 662-668   1995年6月
Many methods have been proposed for constructing context-dependent phoneme models using Hidden Markov Models (HMMs) to improve performance. These conventional methods require previously defined contextual factors. If these factors are deficient, t...
サブギガネットワークでマルチメディア・アプリケーションを実現する東北大学「SuperTAINS」
亀山幸義,伊藤彰則,小林広明
コンピュータ&ネットワークLAN   13(6) 114-120   1995年6月
Takashi Otsuki, Teruhiko Otomo, Akinori Ito, Shozo Makino
Electronics and Communications in Japan (Part III: Fundamental Electronic Science)   78 10-19   1995年1月
The words in natural language have different occurrence probabilities. Consequently, the information obtained from the event, i.e., the occurrence of a word, is larger than in the case of the occurrence with uniform probability. In other words, it...
Takashi Otsuki, Shozo Makino, Akinori Ito, Toshio Sone
Systems and Computers in Japan   25 72-81   1994年1月
This paper considers word recognition based on the existence of the transition between phonemes and characters with complete segmentation between phonemes and characters. A method is proposed which estimates theoretically the relation between the ...
Akinori Ito, Shozo Makino
Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing   2    1993年1月
In this paper, a new word pre-selection method called 'extended redundant hash addressing method' is proposed. This method extends the redundant hash addressing method to word spotting from continuous speech. Moreover, the improvement of the trigr...
Shozo Makino, Akinori Ito, Mitsuru Endo, Ken'iti Kido
Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing   1 273-276   1991年12月
A prototype of a Japanese text dictation system has been developed. It is composed of an acoustic processor, a Bunsetsu-unit spotting processor, and a syntactic processor with semantic constraints. The acoustic processor is constructed using the m...

Books and Other Publications

 
音響学入門
鈴木陽一, 赤木正人, 伊藤彰則, 佐藤洋, 苣木禎史, 中村健太郎 (Role: co-author)
February 2010
IT Text 音声認識システム
鹿野清宏,伊藤克亘,河原達也,武田一哉,山本幹雄 (Role: editor; scope: Chapters 4 and 5)
Ohmsha   May 2001
Recent Research towards Advanced Man-Machine Interface through Spoken Language
Shozo Makino, Akinori Ito, Mitsuru Endo, Ken'iti Kido (Role: co-author; scope: Chapter 4, pp. 193-204)
Elsevier   January 1996
Spoken Language Systems
Seiichi Nakagawa, Michio Okada, Tatsuya Kawahara (Eds.) (Role: co-author)
Ohmsha/IOS Press   September 2005

Lectures and Oral Presentations

 
Toru Ishikawa, Takashi Nose, Akinori Ito
Smart Innovation, Systems and Technologies   2019年1月1日   
© Springer Nature Switzerland AG 2019. In this paper, we propose a method to generate a talking head animation considering the direction of the face. The proposed method parametrizes a facial image using the active appearance model (AAM) and model...
Sou Miyamoto, Takashi Nose, Kazuyuki Hiroshiba, Yuri Odagiri, Akinori Ito
Smart Innovation, Systems and Technologies   2019年1月1日   
© Springer Nature Switzerland AG 2019. In this study, we propose a voice conversion technique with two-stage conversion, which is realized by using two models consisting of U-Net and pix2pix. Using U-Net, we tried to reproduce intonation of a targ...
Jiang Fu, Yuya Chiba, Takashi Nose, Akinori Ito
Smart Innovation, Systems and Technologies   2019年1月1日   
© Springer Nature Switzerland AG 2019. Regarding the assistance of computer-assisted language learning (CALL) systems to make foreign language learning easier, it is necessary to recognize the utterances of the learner with high accuracy. The qual...
Takashi Kimura, Takashi Nose, Shinji Hirooka, Yuya Chiba, Akinori Ito
Smart Innovation, Systems and Technologies   2019年1月1日   
© Springer Nature Switzerland AG 2019. In recent years, many systems having a speech interface have grown. The speech interface includes spoken dialogue function and high performance of a spoken dialogue system has been required. The spoken dialog...
Shinya Hanabusa, Takashi Nose, Akinori Ito
Smart Innovation, Systems and Technologies   2019年1月1日   
© Springer Nature Switzerland AG 2019. This paper proposes a technique for controlling the pitch of synthetic speech at a segmental level using user input speech within a framework of speech synthesis based on deep neural networks (DNNs). In a pre...

Works

 
Statistical language model toolkit palmkit
Computer software   November 2001
Web browser w3m
Computer software   January 1999

Research Projects Funded by Competitive Grants

 
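Development of spoken dialogue systems
Research period: April 2002 - present
Development of CALL systems using speech recognition
Grants-in-Aid for Scientific Research
Research period: April 2004 - present
Development of speech recognition systems
Regular research
Research period: April 2002 - present
Music information processing
Research period: April 2004 - present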

Patents

 
Patent No. 5805474: Speech evaluation device, speech evaluation method, and program
Patent No. 5780516: Model reduction device, method therefor, and program
大庭 隆伸,堀 貴明,中村 篤,伊藤 彰則
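Patent No. 5700566: Scoring model generation device, training data generation device, search system, scoring model generation method, training data generation method, search method, and program therefor
Patent No. 5610304: Model parameter arrangement device, method therefor, and program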
大庭 隆伸,堀 貴明,中村 篤,伊藤 彰則
Patent No. 4911385: Data communication method, data communication system, and data communication program
鈴木 陽一,伊藤 彰則,阿部 俊一郎,須藤 裕史,吉木 伸二,染谷 大

Social Contribution Activities

 
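Science Café
[Other]  June 28, 2013
Gave a public lecture on speech recognition, speech synthesis, and spoken dialogue technology, titled "How can we talk with smartphones and robots?"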
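Outreach lecture
[Other]  December 4, 2008
Gave an outreach lecture titled "Dialogue with Robots" to high school students at Sendai Daini High School, Miyagi Prefecture.
Outreach lecture
[Other]  October 18, 2008
Gave an outreach lecture titled "Dialogue with Robots" to high school students at Gunma Prefectural Ota High School.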
Smooth transmission during network failures
[Information provided]  Nihon Keizai Shimbun  March 23, 2007

Other

 
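April 1997   Development of basic software for Japanese dictation
To promote research, development, and practical use of Japanese large-vocabulary continuous speech recognition, this project develops a high-accuracy speech recognition system that anyone can use. To this end, it develops high-accuracy speaker-independent acoustic models, language models trained on large amounts of language data, and a fast, high-accuracy speech recognition engine.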