Named entity recognition #113

@kenalba

A useful feature for analysis that relies less on gendered pronouns would be to use proper names. This would let a user see the adjectives used to describe a particular character, or a particular family of characters.

A naïve solution might simply search for words whose first letter is capitalized and that don't appear in a dictionary, though I suspect we'll need a more robust algorithm to make this usable. Our POS tagger might also get us part of the way there. There are open source approaches to the problem; spaCy looks like it might be able to do what we want here.
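
To make the naïve idea concrete, here is a minimal sketch of the capitalization-plus-dictionary heuristic. The function name `candidate_names` and the tiny `COMMON_WORDS` set are hypothetical stand-ins for illustration only; a real version would need a proper dictionary word list and is not tied to any existing code in this project.

```python
import re
from collections import Counter

# Hypothetical stand-in for a real dictionary word list (assumption for illustration).
COMMON_WORDS = {"the", "a", "and", "but", "then", "she", "he", "it", "was", "said", "to"}


def candidate_names(text, dictionary=COMMON_WORDS):
    """Count capitalized tokens whose lowercase form is not in the dictionary."""
    # Split on runs of letters, so "Elizabeth's" yields "Elizabeth" and "s".
    tokens = re.findall(r"[A-Za-z]+", text)
    counts = Counter()
    for token in tokens:
        if token[0].isupper() and token.lower() not in dictionary:
            counts[token] += 1
    return counts


sample = "Elizabeth smiled. The room was quiet, but Darcy said nothing. Then Elizabeth left."
print(candidate_names(sample))
```

With this toy dictionary, the sample yields `Elizabeth` twice and `Darcy` once, while `The` and `Then` are filtered out. The weakness is easy to see as well: any sentence-initial word missing from the dictionary (for example `Suddenly`) is reported as a name, which is one reason a more robust approach is probably needed.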

Labels: enhancement (New feature or request)