Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Project Proposal]: DwC archive spreadsheet-style editor #31

Open
7yl4r opened this issue Jan 16, 2024 · 5 comments
Open

[Project Proposal]: DwC archive spreadsheet-style editor #31

7yl4r opened this issue Jan 16, 2024 · 5 comments
Assignees
Labels
2024 Topic to be executed during 2024 event code sprint topic Proposed topic for a code sprint activity metadata pairable project

Comments

@7yl4r
Copy link
Contributor

7yl4r commented Jan 16, 2024

Project Description

A familiar spreadsheet view like excel or gsheet but with column headers specified web3-style for Darwin Core. Setting a column heading would have a different UI than editing a cell normally; whatever the user enters is checked against the DwC (and other?) metadata ontologies. In the csv file, the header will be set to an RDF URI, allowing the application to look up a machine-readable definition of the column, apply special formatting, automated data checking, and enable cross-compatibility with other semantic web applications. AI-assisted fuzzy-matching of user input to "valid" column names could also be explored here.

Use cases:

  • As a student I want guidance on how to name column headers so that my data collection is harvestable.
  • As a data collector I want to record my taxa-occurrence data in something other than excel so that *the data is automatically DwC-archive compatible.
  • As a data manager I want to do manual inspections of DwC archive data so that I can check the validity.

Expected Outcomes

The code sprint should focus on lo-fi prototyping and identifying possible technologies to build upon.

Skills required

spreadsheet, Darwin Core, web3

Expertise

Intermediate

Topic Lead(s)

????

Relevant links

possible starting points

@7yl4r 7yl4r added the code sprint topic Proposed topic for a code sprint activity label Jan 16, 2024
@MathewBiddle MathewBiddle added the 2024 Topic to be executed during 2024 event label Jan 19, 2024
@iwensu0313
Copy link

iwensu0313 commented Apr 4, 2024

@7yl4r curious about this topic! We have a data management and coordination project at Axiom over the next year that involves working with PIs to help them meet Darwin Core requirements for archival. We have some folks at Axiom with familiarity/expertise in DwC (but not coding). Would that be helpful?

@7yl4r
Copy link
Contributor Author

7yl4r commented Apr 5, 2024

In my opinion the limiting factor here is programmer-hours. If we can find a technical collaborator then we will definitely include yall in the discussions.

@7yl4r
Copy link
Contributor Author

7yl4r commented Apr 10, 2024

Thanks to Stace Beaulieu for finding & sharing this gsheet add-on (pdf), which demonstrates the concept well.

Unfortunately, I can't find the source code or the app and the linked website (https://dwcaassistant.com/) is down.

@MathewBiddle MathewBiddle assigned 7yl4r and unassigned mwengren and MathewBiddle May 3, 2024
@MathewBiddle
Copy link
Contributor

Thank you for taking the time to propose this topic! From the Code Sprint topic survey, this has garnered a lot of interest.

Following the contributing guidelines on selecting a code sprint topic I have assigned this topic to @7yl4r. Unless indicated otherwise, the assignee will be responsible for identifying a plan for the code sprint topic, establishing a team, and taking the lead on executing said plan. The first action for the lead is to:

@vijaybarve
Copy link

vijaybarve commented May 3, 2024

@7yl4r Darwin Core Archive Assistant Add-on is here https://workspace.google.com/marketplace/app/darwin_core_archive_assistant/567341081140 and the source is here https://github.com/zedomel/dwca-gsheet-assistant

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2024 Topic to be executed during 2024 event code sprint topic Proposed topic for a code sprint activity metadata pairable project
Projects
None yet
Development

No branches or pull requests

6 participants