pensoft pub DwC-A missing Biodiversity Data Journal. A new crane fly species of the genus Libnotes Westwood, 1876 (Diptera, Limoniidae) from Jilin, China. Checklist dataset https://doi.org/10.3897/bdj.10.e87316 #321

Closed
jhpoelen opened this issue Mar 17, 2025 · 5 comments

Comments

@jhpoelen
Member

As seen on 2025-03-02T23:52:12.688Z in the GBIF / iDigBio / BioCASe corpus https://linker.bio/hash://sha256/f7f5b8f598bf83bd9de258614799049f1e83fb414ef3d06349f6810b0b3a2d27 (see below), the "source" archive associated with GBIF dataset registration https://gbif.org/dataset/84011592-688a-4583-a311-699a4e86c393 was not available. The registration points to:

Biodiversity Data Journal. A new crane fly species of the genus Libnotes Westwood, 1876 (Diptera, Limoniidae) from Jilin, China. Checklist dataset https://doi.org/10.3897/bdj.10.e87316 accessed via GBIF.org on 2025-03-01.

{
  "key": "84011592-688a-4583-a311-699a4e86c393",
  "installationKey": "d5b61ace-f25c-43bd-9dd0-03486850f90b",
  "publishingOrganizationKey": "750a8724-fa66-4c27-b645-bd58ac5ee010",
  "networkKeys": [],
  "doi": "10.3897/bdj.10.e87316",
  "external": false,
  "numConstituents": 0,
  "type": "CHECKLIST",
  "title": "A new crane fly species of the genus Libnotes Westwood, 1876 (Diptera, Limoniidae) from Jilin, China",
  "language": "eng",
  "citation": {
    "text": "Biodiversity Data Journal. A new crane fly species of the genus Libnotes Westwood, 1876 (Diptera, Limoniidae) from Jilin, China. Checklist dataset https://doi.org/10.3897/bdj.10.e87316 accessed via GBIF.org on 2025-03-01.",
    "citationProvidedBySource": false
  },
  "contactsCitation": [],
  "lockedForAutoUpdate": false,
  "createdBy": "pensoft",
  "modifiedBy": "pensoft",
  "created": "2022-10-26T06:30:04.513+00:00",
  "modified": "2022-10-26T06:30:04.513+00:00",
  "contacts": [],
  "endpoints": [
    {
      "key": 786282,
      "type": "DWC_ARCHIVE",
      "url": "https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=87316",
      "createdBy": "pensoft",
      "modifiedBy": "pensoft",
      "created": "2022-10-26T06:30:04.876+00:00",
      "modified": "2022-10-26T06:30:04.876+00:00",
      "machineTags": []
    }
  ],
  "machineTags": [
    {
      "key": 24021920,
      "namespace": "crawler.gbif.org",
      "name": "crawl_attempt",
      "value": "119",
      "createdBy": "crawler.gbif.org",
      "created": "2025-03-01T11:21:55.286+00:00"
    }
  ],
  "tags": [],
  "identifiers": [
    {
      "key": 393210,
      "type": "CLB_DATASET_KEY",
      "identifier": "172421",
      "createdBy": "clb-bot",
      "created": "2025-02-21T09:13:27.245+00:00",
      "primary": false
    }
  ],
  "comments": [],
  "bibliographicCitations": [],
  "curatorialUnits": [],
  "taxonomicCoverages": [],
  "geographicCoverages": [],
  "temporalCoverages": [],
  "keywordCollections": [],
  "countryCoverage": [],
  "collections": [],
  "dataDescriptions": [],
  "license": "http://creativecommons.org/licenses/by/4.0/legalcode"
}
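
The archive location is carried in the `endpoints` array of the registration record above. As a minimal sketch (the helper name `dwca_endpoints` is hypothetical, and the record is trimmed to the fields used), the Darwin Core Archive URL could be pulled out like this:

```python
import json


def dwca_endpoints(dataset: dict) -> list:
    """Return the URLs of all endpoints whose type is DWC_ARCHIVE."""
    return [
        endpoint["url"]
        for endpoint in dataset.get("endpoints", [])
        if endpoint.get("type") == "DWC_ARCHIVE" and "url" in endpoint
    ]


# Trimmed copy of the registration record shown above.
record = json.loads("""
{
  "key": "84011592-688a-4583-a311-699a4e86c393",
  "type": "CHECKLIST",
  "endpoints": [
    {
      "key": 786282,
      "type": "DWC_ARCHIVE",
      "url": "https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=87316"
    }
  ]
}
""")

for url in dwca_endpoints(record):
    print(url)
```

A record without any `endpoints` entry simply yields an empty list, so the helper can be run over many registrations without special-casing.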
<hash://sha256/45a0d289e5d25a1a562c9f1d3c859dcdd4cc6b19c95960c79b50226107882162> <http://www.w3.org/ns/prov#hadMember> <84011592-688a-4583-a311-699a4e86c393> <urn:uuid:e2b6fc7e-513b-4757-afba-191409722889> .
<84011592-688a-4583-a311-699a4e86c393> <http://www.w3.org/1999/02/22-rdf-syntax-ns#seeAlso> <https://doi.org/10.3897/bdj.10.e87316> <urn:uuid:e2b6fc7e-513b-4757-afba-191409722889> .
<84011592-688a-4583-a311-699a4e86c393> <http://www.w3.org/ns/prov#hadMember> <https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=87316> <urn:uuid:e2b6fc7e-513b-4757-afba-191409722889> .
<https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=87316> <http://purl.org/dc/elements/1.1/format> "application/dwca" <urn:uuid:e2b6fc7e-513b-4757-afba-191409722889> .
<https://deeplinker.bio/.well-known/genid/aed1b295-bffe-3be2-936e-0c0d6fc99953> <http://www.w3.org/ns/prov#wasGeneratedBy> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> .
<https://deeplinker.bio/.well-known/genid/aed1b295-bffe-3be2-936e-0c0d6fc99953> <http://www.w3.org/ns/prov#qualifiedGeneration> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> .
<urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> <http://www.w3.org/ns/prov#generatedAtTime> "2025-03-02T23:52:12.688Z"^^<http://www.w3.org/2001/XMLSchema#dateTime> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> .
<urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/ns/prov#Generation> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> .
<urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> <http://www.w3.org/ns/prov#wasInformedBy> <urn:uuid:e2b6fc7e-513b-4757-afba-191409722889> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> .
<urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> <http://www.w3.org/ns/prov#used> <https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=87316> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> .
<https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=87316> <http://purl.org/pav/hasVersion> <https://deeplinker.bio/.well-known/genid/aed1b295-bffe-3be2-936e-0c0d6fc99953> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> .
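
In the statements above, the `hasVersion` object is a `.well-known/genid/` node rather than a `hash://sha256/` content identifier. The sketch below assumes that this pattern is what marks a source as unavailable in this kind of provenance log; the function name, the `example.org` URL, and the all-zero graph UUID are hypothetical and only illustrate the contrast with a retrieved archive.

```python
import re

HAS_VERSION = "http://purl.org/pav/hasVersion"
# Assumption: a skolemized blank node stands in for content that was not retrieved.
SKOLEM_PREFIX = "https://deeplinker.bio/.well-known/genid/"

# Matches statements whose subject, predicate, object and graph are all IRIs.
IRI_QUAD = re.compile(r"^<([^>]*)> <([^>]*)> <([^>]*)> <([^>]*)> \.$")


def unavailable_sources(nquads: str) -> list:
    """List subjects whose hasVersion object is a skolemized blank node."""
    missing = []
    for line in nquads.splitlines():
        match = IRI_QUAD.match(line.strip())
        if match is None:
            continue  # statements with literal objects are not relevant here
        subject, predicate, obj, _graph = match.groups()
        if predicate == HAS_VERSION and obj.startswith(SKOLEM_PREFIX):
            missing.append(subject)
    return missing


SAMPLE = """\
<https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=87316> <http://purl.org/pav/hasVersion> <https://deeplinker.bio/.well-known/genid/aed1b295-bffe-3be2-936e-0c0d6fc99953> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> .
<urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> <http://www.w3.org/ns/prov#generatedAtTime> "2025-03-02T23:52:12.688Z"^^<http://www.w3.org/2001/XMLSchema#dateTime> <urn:uuid:aba22552-3571-419d-b855-2c0453c2a89b> .
<https://example.org/archive.zip> <http://purl.org/pav/hasVersion> <hash://sha256/e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855> <urn:uuid:00000000-0000-0000-0000-000000000000> .
"""

for source in unavailable_sources(SAMPLE):
    print("no content recorded for", source)
```

The regular expression is deliberately narrow: it is not a general N-Quads parser and skips any statement that has a literal object.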
@jhpoelen
Member Author

@teodorgeorgiev

Thanks! I have fixed that

@jhpoelen
Member Author

Thanks for your prompt reply. I can confirm that I can now access the DwC-A. Just curious: what was the root cause? Also, how do you check the integrity of the DwC-As associated with your publications? Perhaps the method I developed could help.
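
As a rough illustration of what such an integrity check could involve (a generic sketch, not necessarily the method referred to above), the downloaded bytes can be given a `hash://sha256/` identifier and inspected as a zip file. The function names are hypothetical, and the test for a `meta.xml` entry is a heuristic assumption about what a Darwin Core Archive usually contains.

```python
import hashlib
import io
import zipfile


def content_id(data: bytes) -> str:
    """Build a hash://sha256/ identifier from the raw bytes."""
    return "hash://sha256/" + hashlib.sha256(data).hexdigest()


def looks_like_dwca(data: bytes) -> bool:
    """Heuristic: a readable zip with intact entries and a meta.xml descriptor."""
    buffer = io.BytesIO(data)
    if not zipfile.is_zipfile(buffer):
        return False
    buffer.seek(0)
    with zipfile.ZipFile(buffer) as archive:
        # testzip() returns None when every entry passes its CRC check.
        return archive.testzip() is None and "meta.xml" in archive.namelist()


# Build a tiny stand-in archive in memory; a real check would use the
# bytes returned by the archive_download.php endpoint instead.
scratch = io.BytesIO()
with zipfile.ZipFile(scratch, "w") as archive:
    archive.writestr("meta.xml", "<archive/>")
    archive.writestr("taxa.csv", "taxonID\n139087-sp1\n")
payload = scratch.getvalue()

print(content_id(payload), looks_like_dwca(payload))
print(content_id(b""), looks_like_dwca(b""))
```

An empty response still gets a content identifier, but it fails the archive check, which is the situation this issue describes.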

Another pub appears to have a similar issue:

Fortini L, Ruzzier E, Mei M, Di Giulio A (2024) The wild bees (Hymenoptera, Apoidea, Anthophila) of the urban nature reserves of Rome (Italy, Latium): a preliminary survey. Biodiversity Data Journal 12: e139087. https://doi.org/10.3897/BDJ.12.e139087

as found in https://linker.bio/line:hash://sha256/f7f5b8f598bf83bd9de258614799049f1e83fb414ef3d06349f6810b0b3a2d27!/L2462948-L2462957

<230fe79e-4ae4-4748-aeb4-d76339ca164a> <http://www.w3.org/1999/02/22-rdf-syntax-ns#seeAlso> <https://doi.org/10.3897/bdj.12.e139087> <urn:uuid:002a604f-2cd5-4c82-beca-a739b0fa4abe> .
<230fe79e-4ae4-4748-aeb4-d76339ca164a> <http://www.w3.org/ns/prov#hadMember> <https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=139087> <urn:uuid:002a604f-2cd5-4c82-beca-a739b0fa4abe> .
<https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=139087> <http://purl.org/dc/elements/1.1/format> "application/dwca" <urn:uuid:002a604f-2cd5-4c82-beca-a739b0fa4abe> .
<https://deeplinker.bio/.well-known/genid/f19c4986-8898-37ee-ab0b-45dc98c80b52> <http://www.w3.org/ns/prov#wasGeneratedBy> <urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> <urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> .
<https://deeplinker.bio/.well-known/genid/f19c4986-8898-37ee-ab0b-45dc98c80b52> <http://www.w3.org/ns/prov#qualifiedGeneration> <urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> <urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> .
<urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> <http://www.w3.org/ns/prov#generatedAtTime> "2025-03-02T22:59:47.845Z"^^<http://www.w3.org/2001/XMLSchema#dateTime> <urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> .
<urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/ns/prov#Generation> <urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> .
<urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> <http://www.w3.org/ns/prov#wasInformedBy> <urn:uuid:002a604f-2cd5-4c82-beca-a739b0fa4abe> <urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> .
<urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> <http://www.w3.org/ns/prov#used> <https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=139087> <urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> .
<https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=139087> <http://purl.org/pav/hasVersion> <https://deeplinker.bio/.well-known/genid/f19c4986-8898-37ee-ab0b-45dc98c80b52> <urn:uuid:e6b5ce11-5eae-4a0e-a689-9ce32395b601> .

@teodorgeorgiev

Hi @jhpoelen, following the GBIF recommendations, we have to regenerate all DwC-A. During this process, some failed. We are aware of that and will fix all of them.

@jhpoelen
Member Author

@teodorgeorgiev thanks for your prompt reply.

After producing the results shown below, I can confirm that https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=139087 now produces content, where previously it did not.

preston track "https://bdj.pensoft.net/lib/ajax_srv/archive_download.php?archive_type=2&document_id=139087"\
 | preston dwc-stream\
 | head -1\
 | jq .
{
  "http://www.w3.org/ns/prov#wasDerivedFrom": "line:zip:hash://sha256/cbe83142ef6220965572222f9b56fe6add435ced274dcfca967350ac3665c65f!/taxa.csv!/L2",
  "http://www.w3.org/1999/02/22-rdf-syntax-ns#type": "http://rs.tdwg.org/dwc/terms/Taxon",
  "http://rs.tdwg.org/dwc/text/id": "139087-sp1",
  "http://rs.tdwg.org/dwc/terms/scientificName": "Andrena aeneiventris Morawitz, 1872",
  "http://rs.tdwg.org/dwc/terms/genus": "Andrena",
  "http://rs.tdwg.org/dwc/terms/family": "Andrenidae",
  "http://rs.tdwg.org/dwc/terms/scientificNameID": null,
  "http://eol.org/schema/reference/referenceID": null,
  "http://rs.tdwg.org/dwc/terms/order": "Hymenoptera",
  "http://rs.tdwg.org/ac/terms/furtherInformationURL": "https://doi.org/10.3897/BDJ.12.e139087",
  "http://rs.tdwg.org/dwc/terms/subgenus": null,
  "http://rs.tdwg.org/dwc/terms/taxonomicStatus": null,
  "http://rs.tdwg.org/dwc/terms/specificEpithet": "aeneiventris",
  "http://purl.org/dc/terms/references": "https://doi.org/10.3897/BDJ.12.e139087",
  "http://rs.tdwg.org/dwc/terms/namePublishedIn": null,
  "http://rs.tdwg.org/dwc/terms/infraspecificEpithet": null,
  "http://rs.tdwg.org/dwc/terms/parentNameUsageID": null,
  "http://rs.tdwg.org/dwc/terms/phylum": "Arthropoda",
  "http://rs.tdwg.org/dwc/terms/class": "Insecta",
  "http://rs.tdwg.org/dwc/terms/taxonRank": "species",
  "http://rs.tdwg.org/dwc/terms/taxonRemarks": null,
  "http://rs.tdwg.org/dwc/terms/nomenclaturalStatus": null,
  "http://rs.tdwg.org/dwc/terms/taxonID": "139087-sp1",
  "http://rs.tdwg.org/dwc/terms/kingdom": "Animalia"
}
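
The `wasDerivedFrom` value in the record above ties the taxon back to a specific line of a specific file inside the archive. The sketch below splits that reference into its parts and shortens term IRIs for display; the names `LineRef`, `parse_line_ref`, and `local_name` are hypothetical, and the pattern only covers the `line:zip:hash://sha256/...!/<entry>!/L<n>` shape seen here.

```python
import re
from typing import NamedTuple


class LineRef(NamedTuple):
    content_id: str  # hash://sha256/... identifier of the zip archive
    entry: str       # file inside the archive
    line: int        # line number within that file


LINE_REF = re.compile(r"^line:zip:(hash://sha256/[0-9a-f]+)!/(.+)!/L(\d+)$")


def parse_line_ref(ref: str) -> LineRef:
    """Split a line:zip:hash://... reference into archive, entry and line."""
    match = LINE_REF.match(ref)
    if match is None:
        raise ValueError("unsupported reference: " + ref)
    return LineRef(match.group(1), match.group(2), int(match.group(3)))


def local_name(term_iri: str) -> str:
    """Return the part of a term IRI after its last '/' or '#'."""
    return re.split(r"[/#]", term_iri)[-1]


ref = parse_line_ref(
    "line:zip:hash://sha256/"
    "cbe83142ef6220965572222f9b56fe6add435ced274dcfca967350ac3665c65f"
    "!/taxa.csv!/L2"
)
print(ref.entry, ref.line)
print(local_name("http://rs.tdwg.org/dwc/terms/scientificName"))
```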
