Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minor changes to csvs #9

Open
4 tasks
dlebauer opened this issue Jun 20, 2022 · 0 comments
Open
4 tasks

Minor changes to csvs #9

dlebauer opened this issue Jun 20, 2022 · 0 comments

Comments

@dlebauer
Copy link
Contributor

dlebauer commented Jun 20, 2022

Task to do

I think we should use names rather than integers for the keys, as you have done with germplasmName;

It is possible / likely that people will prefer these CSV files over the JSON files ... as you have already done the hard work making these tables ready for analysis!

  • add studyName and observationName fields; add these to tables that currently have studyDbId and observationDbId.
  • events_table doesn't need studyDbId (or study name) since it can be joined via the observations table; its redundancy adds a circular reference and may cause confusion (e.g. people wondering why it is there, or thinking they should join events and studies together on studyDbId).
  • please add treatments$factor as treatment or treatmentName to observations because it is required to interpret the data
  • update this table relationships figure on google docs and replace one currently in repository

Reason

Consistency, clarity, providing useful information

Result

Hopefully this only updates the tables and doesn't change the downstream steps (e.g. ask vmapper to ignore these extra fields)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant