Issue with counts in subject category facet

From Daniel Letson at Turnitin

https://api.crossref.org/works?filter=full-text.application:similarity-checking&facet=category-name:*

The result that comes back include the facets in a list, with the counts of works in each facet, as well as a 'total-results' count, and a paginated list of the works themselves, like this:

"facets": {
            "category-name": {
                "value-count": 326,
                "values": {
                    "General Medicine": 613792,
                    "Linguistics and Language": 330949,
                    "Language and Linguistics": 326747,
                    ...
                }
           },
"total-results": 49270539,

The issue is, when we export this data to a spreadsheet and total the counts within each facet, the number ends up being nowhere near the 'total-results' count (in a recently-run query, the facets summed to 5,927,779, while the total results were 49,270,539. Are most works just not categorized? Or is there a better way we should be counting works by subject matter?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Issue with counts in subject category facet #407

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue with counts in subject category facet #407

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions