Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Simple Preprocessor #13

Open
amc-corey-cox opened this issue Jan 24, 2025 · 0 comments
Open

Simple Preprocessor #13

amc-corey-cox opened this issue Jan 24, 2025 · 0 comments

Comments

@amc-corey-cox
Copy link
Collaborator

We need to lay a framework for a simple per-processor. The purpose of this tool is simply to remediate know data quality issues, such as using nonsense values for missing data or enum columns with different enum batters (case, abbreviations, etc.)

This tool should be as simple and straightforward as possible. It should have a matrix of column X value = value that we can use to convert every value in a column to a different value. Eventually, this tool should be run by the data submitter before submission occurs but for now we'll use it upstream.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant