Tell Me Who Are You Talking to and I Will Tell You What Issues Need Your Skills

Fabio Santos, Jacob Penney, Joao Felipe Pimentel, Igor Wiese, Igor Steinmacher, Marco A. Gerosa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

Selecting an appropriate task is challenging for newcomers to Open Source Software (OSS) projects. To facilitate task selection, researchers and OSS projects have leveraged machine learning techniques, historical information, and textual analysis to label tasks (a.k.a. issues) with information such as the issue type and domain. These approaches are still far from mainstream adoption, possibly because of a lack of good predictors. Inspired by previous research, we advocate that label prediction might benefit from leveraging metrics derived from communication data and social network analysis (SNA) for issues in which social interaction occurs. Thus, we study how these "social metrics"can improve the automatic labeling of open issues with API domains - categories of APIs used in the source code that solves the issue - which the literature shows that newcomers to the project consider relevant for task selection. We mined data from OSS projects' repositories and organized it in periods to reflect the seasonality of the contributors' project participation. We replicated metrics from previous work and added social metrics to the corpus to predict API-domain labels. Social metrics improved the performance of the classifiers compared to using only the issue description text in terms of precision, recall, and F-measure. Precision (0.922) increased by 15.82% and F-measure (0.942) by 15.89% for a project with high social activity. These results indicate that social metrics can help capture the patterns of social interactions in a software project and improve the labeling of issues in an issue tracker.

Original languageEnglish (US)
Title of host publicationProceedings - 2023 IEEE/ACM 20th International Conference on Mining Software Repositories, MSR 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages611-623
Number of pages13
ISBN (Electronic)9798350311846
DOIs
StatePublished - 2023
Event20th IEEE/ACM International Conference on Mining Software Repositories, MSR 2023 - Melbourne, Australia
Duration: May 15 2023May 16 2023

Publication series

NameProceedings - 2023 IEEE/ACM 20th International Conference on Mining Software Repositories, MSR 2023

Conference

Conference20th IEEE/ACM International Conference on Mining Software Repositories, MSR 2023
Country/TerritoryAustralia
CityMelbourne
Period5/15/235/16/23

Keywords

  • Human Factors
  • Labels
  • Machine Learning
  • Mining Software Repositories
  • Open Source Software
  • Skills
  • Social Network Analysis
  • Tags

ASJC Scopus subject areas

  • Software
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'Tell Me Who Are You Talking to and I Will Tell You What Issues Need Your Skills'. Together they form a unique fingerprint.

Cite this