Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tiers of Authorship-ness #7

Open
frobnitzem opened this issue Aug 3, 2020 · 1 comment
Open

Tiers of Authorship-ness #7

frobnitzem opened this issue Aug 3, 2020 · 1 comment
Assignees

Comments

@frobnitzem
Copy link
Member

Classify Authors Into Tiers:

  • who's funded and who's not
  • who's only made commits
  • who's both committed and interacted through an issue / discussion / review / other Q&A

For "Funded" - How to Corroborate Contribution:

  • We'll check against ECP rosters and poster author lists
  • some projects have lots of connections / applications outside of ECP
  • some projects exist because they are solely funded by ECP
@elaineraybourn
Copy link
Member

elaineraybourn commented Aug 4, 2020

For "author" we use "contributor" in our research study document. In developing our contributor or "author" classification scheme for data analysis we need to be clear of the scope of the analysis, and how we are defining terms. I propose the following definitions to guide data analysis:
Phase 1 (Tier 1 -- lowest level of analysis)
Repo is defined as a GitHub repository. For the purposes of this Phase, we are interested primarily in repos that are associated with ECP projects.
Commit is defined as a save (of the current state, or snapshot) of the repository.
Contributor is defined as a unique user ID with 1 or more commits to 1 or more repos attributed to an ECP project.
Contributor ranking is defined as the number of commits. The greater the number of commits, the greater the ranking of the contributor.
Contribution is defined as a commit generated by a human, and potentially, a contribution by a bot created by a human.
Cross-repo contribution is defined as one or more commits by a unique contributor to two or more different repos
Cross-project contribution is defined as one or more commits to one or more ECP project repos. A project is defined as a formal collection of repos (e.g. ADTM, ALPINE). Only one commit to any ECP project repo is necessary to be considered a contribution to the ECP project, if multiple repos exist in the project.
Contributor network ranking is defined as the number of commits in repos that are attributed to a number of different ECP project repos. The greater the number of commits and greater the number of ECP project repos the higher the ranking of the individual contributor.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants