Skip to content

Issue with counts in subject category facet #407

Open
@jenniferlin15

Description

@jenniferlin15

From Daniel Letson at Turnitin

https://api.crossref.org/works?filter=full-text.application:similarity-checking&facet=category-name:*

The result that comes back include the facets in a list, with the counts of works in each facet, as well as a 'total-results' count, and a paginated list of the works themselves, like this:

"facets": {
"category-name": {
"value-count": 326,
"values": {
"General Medicine": 613792,
"Linguistics and Language": 330949,
"Language and Linguistics": 326747,
...
}
},
"total-results": 49270539,

The issue is, when we export this data to a spreadsheet and total the counts within each facet, the number ends up being nowhere near the 'total-results' count (in a recently-run query, the facets summed to 5,927,779, while the total results were 49,270,539. Are most works just not categorized? Or is there a better way we should be counting works by subject matter?

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions