Research Projects

2005 - 2007

Building resources and a model for computing paraphrase based on lexical semantics

Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (B)  Grant-in-Aid for Scientific Research (B)

Grant number
17300047
Japan Grant Number (JGN)
JP17300047
Grant amount
(Total)
15,860,000 Japanese Yen
(Direct funding)
14,600,000 Japanese Yen
(Indirect funding)
1,260,000 Japanese Yen

Aiming at building a computational model and computational recourses for computing paraphrase at the level of predicate-argument structure, this research project gained the following results:
(i) For paraphrase knowledge, a large-scale hierarchical lexicon of predicate-argument structure was built. The lexicon organizes about 4,000 Japanese basic verbs (about 7,000 senses in total) with predicate-argument structure information in a fine-grained semantic hierarchy so that lexical entries in a semantic class can be regarded as near synonyms. For augmenting this knowledge base, additional knowledge about event relations are extracted from glosses found in a human-use dictionary of Japanese. Over 35,000 relations are extracted and classified into 8 relation types, all of which are considered useful for recognizing paraphrase or textual entailment.
(ii) For scaling the basic paraphrase knowledge above, automatic acquisition of semantic relations between events from a large corpus was also explored. We proposed several extensions to a state-of-the-art method originally designed for entity relation extraction, reporting on the present results of our experiments on a Japanese Web corpus. The results show that (a) there are indeed specific cooccurrence patterns useful for event relation acquisition, (b) the use of cooccurrence samples involving verbal nouns has positive impacts on both re-call and precision, and (c) over five thousand relation instances are acquired from a 500M-sentence Web corpus with a precision of about 66% for action-effect relations.
(iii) For building a computational model of paraphrase, we explore the regularity underlying these classes of paraphrases, focusing on the paraphrasing of Japanese light-verb constructions (LVCs). We propose a paraphrasing model for LVCs that is based on transforming the Lexical Conceptual Structures (LCSs) of verbal elements. We also propose a refinement of an existing LCS dictionary. Experimental results show that our LCS-based paraphrasing model characterizes some of the semantic features of those verbs required for generating paraphrases, such as the direction of an action and the relationship between arguments and surface cases.

Link information
KAKEN
https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-17300047
ID information
  • Grant number : 17300047
  • Japan Grant Number (JGN) : JP17300047