Enhancement: support for pretrained word embeddings

Implement a new `Extractor` subtype, called `WordEmbeddingExtractor`, for extracting NLP words using their embeddings (using [`Embeddings.jl`](https://github.com/JuliaText/Embeddings.jl) and [`WordTokenizers.jl`](https://github.com/JuliaText/WordTokenizers.jl)?)

Rough sketch of possible implementation can be found [here](https://github.com/CTUAvastLab/JsonGrinder.jl/issues/114), but this is for the old version of `JsonGrinder`.

A good starting point is [`NGramExtractor` implementation](https://github.com/CTUAvastLab/JsonGrinder.jl/blob/master/src/extractors/ngram.jl), the design should be very similar.

We might also want to update `suggestextractor` with a new kwarg governing when `String`s are extracted as ngrams and when they are tokenized

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enhancement: support for pretrained word embeddings #136

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Enhancement: support for pretrained word embeddings #136

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions