論文

査読有り 本文へのリンクあり
2023年1月

データ分類タスクにおけるCompression-based Dissimilarity Measureの精度と速度の改良

人工知能学会論文誌
  • 高本綺架
  • ,
  • 小原佑斗
  • ,
  • 吉田光男
  • ,
  • 梅村恭司

38
1
記述言語
日本語
掲載種別
研究論文(学術雑誌)
DOI
10.1527/tjsai.38-1_A-M71

Compression-based Dissimilarity Measure (CDM) is reported to work well in classifying strings without clues. However, CDM depends on the compression program, and its theoretical background is unclear. In this paper, we propose to replace CDM with the computation of information quantity. Since CDM only uses compressed size, our approach uses the value of information quantity of maximum probability partitioning of string instead of file size. We find this approach is more effective. Then, CDM and the proposed method were applied to publicly available time series data. In addition to the careful implementation of computation using suffix arrays, we also find this approach more efficient.

リンク情報
DOI
https://doi.org/10.1527/tjsai.38-1_A-M71 本文へのリンクあり
ID情報
  • DOI : 10.1527/tjsai.38-1_A-M71

エクスポート
BibTeX RIS