Description
From Daniel Letson at Turnitin
https://api.crossref.org/works?filter=full-text.application:similarity-checking&facet=category-name:*
The result that comes back include the facets in a list, with the counts of works in each facet, as well as a 'total-results' count, and a paginated list of the works themselves, like this:
"facets": {
"category-name": {
"value-count": 326,
"values": {
"General Medicine": 613792,
"Linguistics and Language": 330949,
"Language and Linguistics": 326747,
...
}
},
"total-results": 49270539,
The issue is, when we export this data to a spreadsheet and total the counts within each facet, the number ends up being nowhere near the 'total-results' count (in a recently-run query, the facets summed to 5,927,779, while the total results were 49,270,539. Are most works just not categorized? Or is there a better way we should be counting works by subject matter?