Paper

Peer-reviewed
2015

Compilation and Evaluation of Paraphrase Representation List of Compound Verbs Toward Development of "Control Language for Action"

2015 2ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS ICAICTA
  • Tomoya Shirai
  • Hirofumi Yabumoto
  • Kyoko Kanzaki
  • Hitoshi Isahara

Language
English
Publication type
Research paper (international conference proceedings)
Publisher
IEEE

To realize friendly man-machine communication, machines must understand not only the surface expressions of human utterances but also the deeper meanings of human behavior. We began compiling a "paraphrase representation list of compound verbs" as a first step toward investigating and standardizing the lexical items that form part of a "control language for action". We processed a corpus and vectorized the data using Word2Vec. Using the resulting vectors, we computed the cosine similarity between the compound verbs and the verbs in the corpus, and from these scores created a paraphrase representation list. We obtained paraphrase expressions for 1899 of the 3289 compound verbs (including orthographic variants) stored in the compound verb lexicon. This method also found words that do not exist in the Japanese WordNet. Investigating the words that appear only in the automatic extraction results, we found 213 unknown words and 227 new synonymous relationships. Notably, the difference of 14 between these two counts means that we found 14 words that are stored in the Japanese WordNet but are not recorded there as synonyms of the corresponding word. We conclude that the proposed method is useful for expanding paraphrase relationships that have been listed by human intuition.
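The similarity step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the romanized words, the three-dimensional toy vectors, the `top_n` cutoff, and the `threshold` value are all hypothetical, chosen only to show how cosine similarity over word vectors could yield a paraphrase list. In practice the vectors would come from a Word2Vec model trained on the corpus.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as having no similarity
    return dot / (norm_u * norm_v)


def paraphrase_candidates(compound_vec, verb_vectors, top_n=3, threshold=0.5):
    """Rank verbs by similarity to one compound verb.

    top_n and threshold are assumptions made for this sketch; the
    abstract does not state how candidates were selected.
    """
    scored = [
        (verb, cosine_similarity(compound_vec, vec))
        for verb, vec in verb_vectors.items()
    ]
    kept = [(verb, score) for verb, score in scored if score >= threshold]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return kept[:top_n]


# Hypothetical toy vectors standing in for Word2Vec output.
compound_vectors = {"tobi-dasu": [0.9, 0.1, 0.3]}
verb_vectors = {
    "deru": [0.8, 0.2, 0.3],
    "taberu": [0.0, 1.0, 0.1],
    "hashiru": [0.7, 0.0, 0.6],
}

# Paraphrase list: compound verb -> ranked (verb, similarity) pairs.
paraphrase_list = {
    compound: paraphrase_candidates(vec, verb_vectors)
    for compound, vec in compound_vectors.items()
}

for compound, candidates in paraphrase_list.items():
    for verb, score in candidates:
        print(f"{compound} -> {verb} ({score:.3f})")
```

With these toy vectors, the two verbs pointing in nearly the same direction as the compound verb are kept and the dissimilar one falls below the threshold. A compound verb for which no candidate clears the threshold would end up with an empty list.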

Links
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000380390500005&DestApp=WOS_CPL
ID information
  • Web of Science ID : WOS:000380390500005
