論文

査読有り
2017年

Exploring identical users on GitHub and stack overflow

Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE
  • Takahiro Komamizu
  • ,
  • Yasuhiro Hayase
  • ,
  • Toshiyuki Amagasa
  • ,
  • Hiroyuki Kitagawa

開始ページ
584
終了ページ
589
記述言語
英語
掲載種別
研究論文(国際会議プロシーディングス)
DOI
10.18293/SEKE2017-109
出版者・発行元
Knowledge Systems Institute Graduate School

Analyzing behaviours of developers in different platforms (in particular, GitHub and Stack Overflow in this paper) can reveal interesting facts related to development activities. There are only few datasets for analysing crossplatform user behaviours, especially across GitHub and Stack Overflow. Users on GitHub and Stack Overflow are identifiable by equivalences of email addresses. In order to increase the number of identifiable users on these datasets, this paper retrieves potentially identifiable users between GitHub and Stack Overflow not relying only on email addresses. This paper employs a classification-based link prediction, which design the user identification problem as a link prediction problem on the bipartite graph consisting of users of GitHub and those of Stack Overflow. With the identification method, this paper generates a probabilistic dataset containing pairs of users with probabilities (or confidences). This paper, as well, publishes the identification tool in order to enable further data generation on appearing datasets of GitHub, Stack Overflow and others. The generated dataset and tool are highly helpful to accelerate researches on mining software repositories.

リンク情報
DOI
https://doi.org/10.18293/SEKE2017-109
DBLP
https://dblp.uni-trier.de/rec/conf/seke/KomamizuHAK17
URL
http://dblp.uni-trier.de/db/conf/seke/seke2017.html#conf/seke/KomamizuHAK17
ID情報
  • DOI : 10.18293/SEKE2017-109
  • ISSN : 2325-9086
  • ISSN : 2325-9000
  • DBLP ID : conf/seke/KomamizuHAK17
  • SCOPUS ID : 85029525601

エクスポート
BibTeX RIS