Abstract
GitHub repository recommendation is a research hotspot in the field of open-source software. The current problems with the repository recommendation system are the insufficient utilization of open-source community information and the fact that the scoring metrics used to calculate the matching degree between developers and repositories are developed manually and rely too much on human experience, leading to poor recommendation results. To address these problems, we design a questionnaire to investigate which repository information developers focus on and propose a graph convolutional network-based repository recommendation system (GCNRec). First, to solve insufficient information utilization in open-source communities, we construct a Developer-Repository network using four types of behavioral data that best reflect developers’ programming preferences and extract features of developers and repositories from the repository content that developers focus on. Then, we design a repository recommendation model based on a multi-layer graph convolutional network to avoid the manual formulation of scoring metrics. This model takes the Developer-Repository network, developer features and repository features as inputs, and recommends the top-k repositories that developers are most likely to be interested in by learning their preferences. We have verified the proposed GCNRec on the dataset, and by comparing it with other open-source repository recommendation methods, GCNRec achieves higher precision and hit rate.
Original language | English |
---|---|
Pages (from-to) | 175-196 |
Number of pages | 22 |
Journal | Computer Modeling in Engineering & Sciences |
Volume | 137 |
Issue number | 1 |
Early online date | 23 Apr 2023 |
DOIs | |
Publication status | Published - 2023 |
Keywords
- GitHub
- graph convolutional network
- open-source software
- Repository recommendation
ASJC Scopus subject areas
- Software
- Modelling and Simulation
- Computer Science Applications