New Task: For raw ATN **trajectory** data sets, finalize and implement automated NCEI data archive pathway #41
Comments
What does "and archive administrative tools" mean in the title? |
@iamchrisser @mmckinzie where are we with this automation? Are the file structures settled? I'd like to release v1.0 of the template, update the website, and stick a fork in the automation task so we can move on to mobilizing these data to OBIS and then to the profile stream. |
Getting close. Have a manual process built for now but working on automating. Ran into some issues with seaname mappings. |
Still trying to resolve issue with :seanames and update :acknowledgement attribute. Files are still being manually generated. Automation process still needs to be developed and tested. |
pipeline is being documented in ioos/ioos-atn-data#80 |
Pipeline is documented at https://github.com/ioos/ioos-atn-data/blob/main/README-ArchiveSOP.md |
I have ported the documentation over to the website https://ioos.github.io/ioos-atn-data/atn-archive.html |
Still waiting on the next batch of test files from Axiom to confirm :seanames and :acknowledgement issue has been appropriately resolved. |
@lmshowalter3 any updates on a new batch of test files? |
The next batch of test files is now available. :seanames and :acknowledgement appear to be updated correctly. Will send to NCEI for review. I also noticed a new minor issue in the :citation. It should follow this format: [creator_last_name, creator_first_name]; [contributor_last_name(s), contributor_first_name(s)]. [year]. [animal_common_name] ([animal_scientific_name]) location data from a [instrument] (ptt id [ptt_id]) deployed in the [sea_name] from [deployment_start_datetime] to [deployment_end_datetime], deployment id [deployment_id]. [Dataset]. [publisher_name]. Presently the files list :publisher_institution instead of :publisher_name. |
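For anyone checking generated files against that :citation format, here is a minimal Python sketch of how the string could be assembled. It is not the code Axiom uses. The function name `build_citation`, the dictionary keys, the representation of contributors as (last, first) pairs, the trailing period after the publisher, and every value in `example` are made-up assumptions for illustration only.

```python
def build_citation(meta: dict) -> str:
    """Assemble a :citation string following the format quoted in the thread.

    `meta` is a hypothetical dictionary; its keys mirror the bracketed
    placeholders in the format, with contributors given as (last, first) pairs.
    """
    creator = f"{meta['creator_last_name']}, {meta['creator_first_name']}"
    contributors = "; ".join(
        f"{last}, {first}" for last, first in meta.get("contributors", [])
    )
    # Join creator and contributors with "; ", skipping an empty contributor list.
    people = "; ".join(part for part in (creator, contributors) if part)
    return (
        f"{people}. {meta['year']}. "
        f"{meta['animal_common_name']} ({meta['animal_scientific_name']}) "
        f"location data from a {meta['instrument']} (ptt id {meta['ptt_id']}) "
        f"deployed in the {meta['sea_name']} "
        f"from {meta['deployment_start_datetime']} "
        f"to {meta['deployment_end_datetime']}, "
        f"deployment id {meta['deployment_id']}. "
        # Uses publisher_name, not publisher_institution; the final period is an assumption.
        f"[Dataset]. {meta['publisher_name']}."
    )


# Placeholder values only; none of these describe a real deployment.
example = {
    "creator_last_name": "Doe",
    "creator_first_name": "Jane",
    "contributors": [("Roe", "Richard")],
    "year": 2020,
    "animal_common_name": "example seal",
    "animal_scientific_name": "Examplus sealus",
    "instrument": "satellite tag",
    "ptt_id": "12345",
    "sea_name": "Example Sea",
    "deployment_start_datetime": "2020-01-01T00:00:00Z",
    "deployment_end_datetime": "2020-06-01T00:00:00Z",
    "deployment_id": "abc123",
    "publisher_name": "Example Publisher",
}
citation = build_citation(example)
print(citation)
```

Keeping the format in one function makes it easy to compare a file's :citation attribute against the expected string when reviewing a new batch.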
NCEI just tested the new files. Expect feedback from them soon. |
NCEI tested/reviewed the files. A typo is still present in select :seanames. Axiom has since resolved this issue and will generate a new batch of test files to verify. It was also confirmed that the Axiom-derived :seanames now match the NCEI-derived list. Both are now using all positions to determine sea names regardless of quality. This approach may need to be revisited in the future, as we may want to consider at least removing the 'Z' locations prior to generating the list of sea names to prevent erroneous seas from being included in dataset titles, abstracts and citations. NCEI also recommended that we add :place as a global attribute to the template to capture more localized or specific locations or bodies of water not captured in the seaname list. These would become searchable keywords at NCEI. It is relatively easy to add :place to the template, but it would require additional development work by Axiom to harvest the appropriate metadata from the ADR (deployment_location field) and RW (Arbitrary Keywords, type listed as 'place'). More generalized locations listed in project titles (e.g., Mid-Atlantic, US West Coast, Mexico) would get added to the RW arbitrary keywords by ATN DC, and the more specific deployment-by-deployment locations (e.g., Virginia Beach, VA; Guadalupe Island; LA-Long Beach Harbor) provided by the PI would be added to the ADR metadata. We'll need to decide if this recommendation is a priority and let Axiom know how we want to proceed: either simply add :place to the template but leave it blank for now, or fully develop it so the attribute can be properly populated. |
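To make the 'Z' location question concrete, here is a rough Python sketch of deriving the sea name list with and without low-quality positions. It is not the Axiom or NCEI implementation. The `Position` type, the `sea_names` function, the `toy_lookup` stand-in for a real point-to-sea lookup, and the sample coordinates are all hypothetical.

```python
from typing import Callable, FrozenSet, Iterable, List, NamedTuple, Optional


class Position(NamedTuple):
    lat: float
    lon: float
    quality: str  # location quality class; "Z" is the class discussed in the thread


def sea_names(
    positions: Iterable[Position],
    lookup: Callable[[float, float], Optional[str]],
    exclude: FrozenSet[str] = frozenset({"Z"}),
) -> List[str]:
    """Return unique sea names in first-seen order, skipping excluded quality classes."""
    names: List[str] = []
    for pos in positions:
        if pos.quality in exclude:
            continue  # drop positions whose quality class is excluded
        name = lookup(pos.lat, pos.lon)
        if name and name not in names:
            names.append(name)
    return names


def toy_lookup(lat: float, lon: float) -> str:
    # Hypothetical stand-in for a real point-in-polygon sea name lookup.
    return "Sea A" if lon < 0 else "Sea B"


track = [
    Position(10.0, -50.0, "3"),
    Position(10.5, -50.5, "B"),
    Position(10.0, 50.0, "Z"),  # implausible fix that lands in a different sea
]

all_positions = sea_names(track, toy_lookup, exclude=frozenset())  # current approach
without_z = sea_names(track, toy_lookup)  # possible future approach
print(all_positions, without_z)
```

With the toy data, passing an empty `exclude` set reproduces the current "all positions" behavior, while the default drops the sea that only the 'Z' position touches.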
As the So, add it to the list of things to be addressed in the next go around and celebrate victory. @mmckinzie please create an issue at https://github.com/ioos/ioos-atn-data/issues for the |
An issue ticket has been created regarding the :place attribute as a v2 enhancement. NCEI found or flagged no further issues within the submitted v1 test files. The recent test files will be pushed to the production folder. We are ready to move on to fully automating and operationalizing the pipeline to NCEI. Future generated NCEI files can go straight into the prod folder, with no need to stage them in the test folder. |
Notes from ATN DAC Data Coordinator call:
|
@conniekot thanks for the update. I don't think you intended to link to issue #5, but rather to reference some request. ATN should continue progress on this while NCEI is on hold. NCEI would prefer to pick up a batch of packages anyway, to fully test for all the variability in the files. It looks like there is a good list of items to continue work on, so let's keep pushing forward. |
@mmckinzie test files are now in prod for NCEI to pick up whenever they are back up and running https://ncei.axiomdatascience.com/atn/prod/ (let me know if that looks good) |
@iwensu0313 Great, thanks for letting me know. I will send an email to NCEI notifying them that new files are available in the production folder. |
The next step towards full automation of the NCEI pipeline by Axiom, to ensure it's fully operational, will be to resolve the issue with, or devise a more appropriate approach for, populating the creator/contributor global attributes within the satellite trajectory NetCDF files, rather than harvesting these metadata attributes from the RW project-level metadata → contacts → citation folks. The current approach conflicts with the DAC's data management/submission guidelines, which allow data contributors to have distinct 'project', 'deployment' and 'dataset' level metadata, and it has implications for metadata presently contained in the portal project pages. |
Three more packages have been archived! 🎉
For reference, here are the others that have been archived:
I believe the search below should always return the appropriate records from NCEI: Note that NOAA OneStop indexing from NCEI records might not be very frequent. So, records might be at NCEI but not appearing in OneStop just yet. @laurabrenskelle might know more about that. @mmckinzie In response to your comment above: can you outline what the current approach is vs. what you are proposing needs to be changed? I don't quite follow the details above. What is the conflict? What needs to be fixed? Can you summarize what the current "manual" process is for generating these packages? edit: edited to add a third package archived. |
@MathewBiddle WAFs are set up to be indexed on I believe a daily, weekly, monthly, or on demand frequency at NCEI. It looks like the IOOS WAF is currently set to process weekly. It does say it is currently processing, but it also says the last time this WAF was processed was in August. I will reach out to the appropriate NCEI team to ask why this might be. Also, if weekly is not enough, we can change the indexing frequency, especially if we anticipate more updates on a regular basis soon. |
Can we say that ATN is now "begin operationally submitting non-embargoed deployments to NCEI"? I think so. |
Draft IOOS News Story can be found at https://docs.google.com/document/d/1te-6nj5VP1G6b5yPtmuKjy9plmukKb09_MzY5fonL88/edit#heading=h.gglfy4jadzfe |
Thanks for writing up that news story. I agree that we can say that we're operational. Nice work, everyone! |
I also agree pathway is operational although not yet fully automated. @MathewBiddle thanks for drafting the news story, I have not yet had a chance to review it or provide edits/comments, but I will get to it tomorrow 10/29. |
@MathewBiddle Overall I think the news story looks good. I made a few minor suggestions and edits as well as added a few comments. |
Who is requesting this?
@ioos/marine-life, and the ATN community
What is being requested?
What is the requested deadline and why?
No response
What is the current status quo (i.e., what happens if this does not get done)?
ATN trajectory data are not archived at NCEI and PARR requirements are not met.
What indicates this is done (i.e., how do we know this is complete)?
Provide a description or any other important information.
xref: