You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
These should be very simple ~10 column x 10 row datasets that we can simply use to make sure we can run the tools. We might continue to use them later as test cases or something else but that is a later concern. Right now, we just want something right away that we can use for testing.
The text was updated successfully, but these errors were encountered:
Okay, a few things in creating a quick 'toy data' set to get something going to start work on.
Toy data needs to be ~100 rows, some of our tools need that many to use their more valuable inference (i.e. Schema-Automator enums).
I'd like to have at least the basic things we need for the BDC model to build it's required objects (study/participant required info)
Ideally, we'll be growing this limited toy data set in two directions
First, a better toy data set with coverage of all the main features of our tool-chain but not much else.
Also, a true synthetic data set that reflects the kind of data that we actually expect to see - this would then be our synthetic data set, no longer a toy.
Our most reasonable starting point for anything at all is to use the BDC synthetic data set made available to us on BDC and prune it down/add what we need.
amc-corey-cox
changed the title
Create Toy Datasets for each step of the ingest
Create Toy Datasets
Feb 21, 2025
These should be very simple ~10 column x 10 row datasets that we can simply use to make sure we can run the tools. We might continue to use them later as test cases or something else but that is a later concern. Right now, we just want something right away that we can use for testing.
The text was updated successfully, but these errors were encountered: