{"id":1812,"date":"2021-01-08T16:42:45","date_gmt":"2021-01-08T07:42:45","guid":{"rendered":"http:\/\/floss-lab.org\/?p=1812"},"modified":"2021-02-18T00:19:29","modified_gmt":"2021-02-17T15:19:29","slug":"onboarding-analysis-dataset","status":"publish","type":"post","link":"https:\/\/floss-lab.org\/?p=1812","title":{"rendered":"SANER2021 ERA Dataset"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">About<\/h2>\n\n\n\n<p>On this page, we publish the dataset used in the our paper \u201cOnboarding to Open Source Projects with Good First Issues: A Preliminary Analysis (Hyuga Horiguchi, Itsuki Omori and Masao Ohira)\u201d has been accepted for inclusion in the Early Research Achievements (ERA) track of the 28th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER \u201921). <\/p>\n\n\n\n<h2 class=\"wp-block-heading\">File list<\/h2>\n\n\n\n<ol class=\"wp-block-list\"><li><a href=\"https:\/\/floss-lab.org\/public_dataset\/saner2021\/prs_nums_before_resolving_issue.csv\">prs_nums_before_resolving_issue.csv<\/a><\/li><li><a href=\"https:\/\/floss-lab.org\/public_dataset\/saner2021\/resolved_issues_percenrage.csv\">resolved_issues_percenrage.csv<\/a><\/li><li><a href=\"https:\/\/floss-lab.org\/public_dataset\/saner2021\/prs_nums_after_resolving_issues.csv\">prs_nums_after_resolving_issues.csv<\/a><\/li><\/ol>\n\n\n\n<h2 class=\"wp-block-heading\">Description<\/h2>\n\n\n\n<p>The first file was used for the analysis of RQ1.<br>The violin plot in Fig. 1 shows the distribution of the <code>prs_num<\/code> in the 4th column of the file for each <code>issue_type<\/code> in the 3rd column.<br>The 1st column, <code>dev_id<\/code> is the ID to identify the developer. It is used to anonymize the account information of GitHub.<br>The 2nd column, <code>issue_url<\/code> is the URL of the issue resolved by the developer.<br>The 3rd column, <code>issue_type<\/code> shows whether the issue is a Regular Issue or a Good First Issue.<br>The 4th column, <code>prs_num<\/code> is the number of PRs that the developer with the <code>dev_id<\/code> has posted on GitHub before resolving the issue with the <code>issue_url<\/code>.<\/p>\n\n\n\n<p>The second file was used for the analysis of RQ2.<br>Table II shows the 1st, 4th, and 7th columns of the file as shown below.<br>The 1st column, <code>repo_url<\/code> is the URL of the repository.<br>The 2nd column, <code>issues_num<\/code> is the number of Regular Issues that the repository has.<br>The 3rd column, <code>resolved_issues_num<\/code> is the number of resolved Regular Issues.<br>The 4th column, <code>resolved_issues_percentage<\/code> is the value of the <code>resolved_issues_num<\/code> (3rd column) divided by the <code>issues_num<\/code> (2nd column).<br>The 5th column, <code>good_first_issues_num<\/code> is the number of Good First Issues that the repository has.<br>The 6th column, <code>resolved_good_first_issues_num<\/code> is the number of resolved Good First Issues.<br>The 7th column, <code>resolved_good_first_issues_percentage<\/code> is the value of the <code>resolved_good_first_issues_num<\/code> (6th column) divided by the <code>good_first_issues_num<\/code> (5th column).<br>The 8th column, <code>resolved_ratio<\/code> is the ratio of the <code>resolved_good_first_issues_percentage<\/code> (7th column) divided by the <code>resolved_issues_percentage<\/code> (4th column).<\/p>\n\n\n\n<p>The third file was used for the analysis of RQ3.<br>Table III shows the percentage of developers for each repository whose the <code>prs_num<\/code>(4th column) is 1 or higher among the Good First Issue of the <code>issue_type<\/code>(3rd column).<br>The 1st column, <code>dev_id<\/code> is the ID to identify the developer.<br>The 2nd column, <code>issue_url<\/code> is the URL of the issue resolved by the developer.<br>The 3rd column, <code>issue_type<\/code> shown whether the issue is a Regular Issue or a Good First Issue.<br>The 4th column, <code>prs_num<\/code> is the number of PRs that the developer with the <code>dev_id<\/code> has posted to the same repository as the <code>issue_url<\/code> after resolving the issue with the <code>issue_url<\/code>.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Contact<\/h2>\n\n\n\n<p>Hyuga Horiguchi (hhyuga201515<span class=\"has-inline-color has-white-color\">xx<\/span>@<span class=\"has-inline-color has-white-color\">xx<\/span>gmail.com)<br>Masao Ohira (masao<span class=\"has-inline-color has-white-color\">xx<\/span>@<span class=\"has-inline-color has-white-color\">xx<\/span>wakayama-u.ac.jp)<\/p>\n","protected":false},"excerpt":{"rendered":"<p>About On this page, we publish the datas &hellip; <a href=\"https:\/\/floss-lab.org\/?p=1812\">\u7d9a\u304d\u3092\u8aad\u3080 <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-1812","post","type-post","status-publish","format-standard","hentry","category-1"],"_links":{"self":[{"href":"https:\/\/floss-lab.org\/index.php?rest_route=\/wp\/v2\/posts\/1812","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/floss-lab.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/floss-lab.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/floss-lab.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/floss-lab.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1812"}],"version-history":[{"count":16,"href":"https:\/\/floss-lab.org\/index.php?rest_route=\/wp\/v2\/posts\/1812\/revisions"}],"predecessor-version":[{"id":1831,"href":"https:\/\/floss-lab.org\/index.php?rest_route=\/wp\/v2\/posts\/1812\/revisions\/1831"}],"wp:attachment":[{"href":"https:\/\/floss-lab.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1812"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/floss-lab.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1812"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/floss-lab.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1812"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}