Define the submission checklist used by the review committee #77

Open
deanwampler opened this issue Jan 8, 2025 · 0 comments
Labels
contribution process Steps for contributing datasets and validating contributions. dataset requirements All aspects of the specification for acceptable datasets.
@deanwampler

What checklist items should reviewers use?

Some possibilities:

  • Data Dictionary/Schema: Is the format of the data documented correctly, including data types, uniqueness constraints, and any keyed relationships?
  • Data Quality Assessment: Score the dataset's quality using an established, accepted metric or process.
  • Compliance and Regulatory Adherence: Verify that the data does not contain PII, PCI data, or similar sensitive information.
  • Licensing: Are the allowed uses of the data documented clearly, and is the license compatible with OTDI goals?
  • Data Lineage: Where did the data originate? Is it a transform of an existing public dataset? Is the code that generated the dataset available for review? Who generated the dataset?
  • Data Consumption: Is there a reference implementation for consuming the data?
  • Issues: Identify and clearly document any known issues with the data, including any expected improvements.

We should automate as many of these as we can, over time.
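
As a starting point for that automation, here is a minimal sketch of how a few of the items above might be checked mechanically. Everything in it is an assumption for illustration, not an agreed OTDI design: the metadata keys (`schema`, `license`, `source`, `creator`), the `ALLOWED_LICENSES` allow-list, and the e-mail regex standing in for a real PII scanner are all hypothetical.

```python
import re
from dataclasses import dataclass

# Hypothetical allow-list: the issue does not say which licenses OTDI accepts.
ALLOWED_LICENSES = {"CDLA-Permissive-2.0", "CC-BY-4.0"}

# Crude e-mail pattern, used only as a stand-in for a real PII scanner.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


@dataclass
class CheckResult:
    item: str       # checklist item name, taken from the list above
    passed: bool
    note: str = ""  # short explanation for the reviewer


def check_schema(meta, rows):
    """Pass if a schema exists and documents every field seen in the rows."""
    item = "Data Dictionary/Schema"
    schema = meta.get("schema") or {}
    if not schema:
        return CheckResult(item, False, "no schema provided")
    undocumented = sorted({key for row in rows for key in row} - set(schema))
    if undocumented:
        return CheckResult(item, False, f"undocumented fields: {undocumented}")
    return CheckResult(item, True)


def check_pii(meta, rows):
    """Fail if any string value looks like an e-mail address (heuristic only)."""
    item = "Compliance and Regulatory Adherence"
    hits = sum(
        1
        for row in rows
        for value in row.values()
        if isinstance(value, str) and EMAIL_RE.search(value)
    )
    note = f"{hits} value(s) look like e-mail addresses" if hits else ""
    return CheckResult(item, hits == 0, note)


def check_license(meta, rows):
    """Pass if the declared license is on the (hypothetical) allow-list."""
    license_id = meta.get("license")
    ok = license_id in ALLOWED_LICENSES
    note = "" if ok else f"license {license_id!r} is not on the allow-list"
    return CheckResult("Licensing", ok, note)


def check_lineage(meta, rows):
    """Pass if the metadata names both an origin and a creator."""
    missing = [key for key in ("source", "creator") if not meta.get(key)]
    note = f"missing metadata: {missing}" if missing else ""
    return CheckResult("Data Lineage", not missing, note)


# Items such as Data Quality Assessment still need human review in this sketch.
CHECKS = [check_schema, check_pii, check_license, check_lineage]


def run_checklist(meta, rows):
    """Run every automated check and return one CheckResult per item."""
    return [check(meta, rows) for check in CHECKS]


if __name__ == "__main__":
    example_meta = {
        "schema": {"id": "int", "contact": "str"},
        "license": "CC-BY-4.0",
        "source": "https://example.org/raw",
        "creator": "example-team",
    }
    example_rows = [{"id": 1, "contact": "someone@example.org"}]
    for result in run_checklist(example_meta, example_rows):
        status = "PASS" if result.passed else "FAIL"
        print(f"{status}  {result.item}  {result.note}")
```

Keeping each check as a plain function in a list means new items can be automated one at a time, while the remaining items stay manual until a reliable check exists.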

@deanwampler added the "contribution process" and "dataset requirements" labels on Jan 8, 2025
@deanwampler moved this to Todo in FA5: OTDI Tasks on Jan 8, 2025
@deanwampler added this to the 2025-01-31 milestone on Jan 8, 2025
@deanwampler changed the title from "Define the submission checklist used by review committee" to "Define the submission checklist used by the review committee" on Feb 12, 2025