Issues: The-AI-Alliance/open-trusted-data-initiative
Issues list
Decide if and how DPK will be used
data pipelines
Defining and implementing data processing pipelines
#109
opened Mar 12, 2025 by
deanwampler
Implement data processing of Dataset cards across datasets using DPK
#106
opened Feb 17, 2025 by
blublinsky
Leverage DataPerf?
data pipelines
Defining and implementing data processing pipelines
#99
opened Feb 5, 2025 by
deanwampler
Add a section to the website catalog page that reports on the HF summary statistics
data pipelines
Defining and implementing data processing pipelines
dataset catalog
All aspects of managing the catalog and its use
"Semi-automate" periodic gathering of HF dataset statistics
data pipelines
Defining and implementing data processing pipelines
Create a schedule of AIA blog posts for OTDI
evangelism
Anything related to public exposure
#95
opened Jan 31, 2025 by
deanwampler
Explore ideas for crowd-sourcing tools and processes for building datasets
contribution process
Steps for contributing datasets and validating contributions.
#90
opened Jan 23, 2025 by
deanwampler
Replace the form submission process that sends an email with an actual web service invocation.
#89
opened Jan 21, 2025 by
deanwampler
Explore possible connection to IETF initiative for "AI prefs"
#81
opened Jan 10, 2025 by
deanwampler
Define the takedown process
dataset catalog
All aspects of managing the catalog and its use
dataset requirements
All aspects of the specification for acceptable datasets.
#79
opened Jan 8, 2025 by
deanwampler
Define the submission checklist used by the review committee
contribution process
Steps for contributing datasets and validating contributions.
dataset requirements
All aspects of the specification for acceptable datasets.
Evaluate LinkedIn Data Hub as a catalog system
dataset catalog
All aspects of managing the catalog and its use
dataset requirements
All aspects of the specification for acceptable datasets.
help wanted
Extra attention is needed
Investigate using Databricks-sponsored Unity Catalog for metadata management
data pipelines
Defining and implementing data processing pipelines
dataset catalog
All aspects of managing the catalog and its use
#71
opened Dec 12, 2024 by
deanwampler
There are needs to support hidden/restricted data. Investigate what we might do
dataset catalog
All aspects of managing the catalog and its use
dataset requirements
All aspects of the specification for acceptable datasets.
#70
opened Dec 12, 2024 by
deanwampler
Create requirements doc for the catalog viewer
dataset catalog
All aspects of managing the catalog and its use
Form a committee to review submissions
administration
Misc. admin. tasks, like organizing the work, recruiting participants, etc.
data pipelines
Defining and implementing data processing pipelines
dataset catalog
All aspects of managing the catalog and its use
dataset requirements
All aspects of the specification for acceptable datasets.
Define the tasks and epics for processing
contribution process
Steps for contributing datasets and validating contributions.
data pipelines
Defining and implementing data processing pipelines
Recruit maintainers for the pipeline processing
data pipelines
Defining and implementing data processing pipelines