Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clarify "canonical" labels to support #108

Open
deanwampler opened this issue Mar 4, 2025 · 0 comments
Open

Clarify "canonical" labels to support #108

deanwampler opened this issue Mar 4, 2025 · 0 comments

Comments

@deanwampler
Copy link
Contributor

deanwampler commented Mar 4, 2025

I added two links in the references to general categories used in advertising and NLP. They are hierarchical with the lowest layers probably too fine-grained, but the higher layers could be good categories to provide canned labels for users searching for domain-specific datasets.

Other important labels include: synthetic, training, tuning, materials, healthcare, time series, science, math, tech, audio, video, multimedia, language, particular languages, etc.

@deanwampler deanwampler converted this from a draft issue Mar 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Todo
Development

No branches or pull requests

1 participant