Research Projects

Apr, 2018 - Mar, 2021

Research on the effectiveness of using RNN in topic models

Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (C)  Grant-in-Aid for Scientific Research (C)

Grant number
18K11440
Japan Grant Number (JGN)
JP18K11440
Grant amount
(Total)
4,420,000 Japanese Yen
(Direct funding)
3,400,000 Japanese Yen
(Indirect funding)
1,020,000 Japanese Yen

Topic models, including LDA (latent Dirichlet allocation), can automatically extract semantically meaningful themes from a large corpus. However, text analysis using topic models often only considers word frequencies in a document and does not consider the way words are arranged. This work aims to improve topic models with RNN (recurrent neural network) for modeling word order. Several previous studies propose a method for combining RNN with topic models. Therefore, we have tried to propose a new method. As a result, we have proposed a new topic model using NNs (neural networks), where we perform no VAE (variational autoencoder) inference. We instead maximize the target given in the original LDA paper by training NNs in an amortized manner and obtaining posterior parameters as output of NNs. However, we currently only use MLP (multilayer perceptron) and thus have not achieved our goals yet. We now have a plan to replace MLP with RNN or other more recent NN architectures in near future.

Link information
KAKEN
https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-18K11440
ID information
  • Grant number : 18K11440
  • Japan Grant Number (JGN) : JP18K11440