
URLs without www are not handled by extractors whose domains include www in the URL #744

@hwo411

Description


Hi everyone, we recently discovered an issue in our system: some URLs were parsed without www, and therefore the custom extractor for that source wasn't used. To fix this, we either need to submit all custom extractors without www, or allow searching for www extractors in addition to bare-hostname extractors.

Expected Behavior

Commands

postlight-parser https://www.newyorker.com/culture/annals-of-inquiry/the-case-for-free-range-lab-mice

and

postlight-parser https://newyorker.com/culture/annals-of-inquiry/the-case-for-free-range-lab-mice

to produce the same result.

Current Behavior

In case of

postlight-parser https://newyorker.com/culture/annals-of-inquiry/the-case-for-free-range-lab-mice

the custom extractor is not used and the body has only 1949 words instead of 3950.

Steps to Reproduce

Run postlight-parser https://newyorker.com/culture/annals-of-inquiry/the-case-for-free-range-lab-mice and inspect the content and word_count fields.

Detailed Description

Because the custom extractor is not used, the parser returns an incomplete body.

Possible Solution

Either rename all extractor folders to drop the www. prefix and set their domains without www., or allow getExtractor to also check extractors under www. + hostname and www. + base hostname.

I'm not sure which option is better for the parser. I'd lean toward the first, though it might be error-prone; the second is less error-prone.
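The second option could be sketched roughly like this. Note this is a hypothetical illustration, not the parser's actual internals: the Extractors map and lookupExtractor name are assumptions; only getExtractor is the real function mentioned above.

```javascript
// Hypothetical registry keyed by domain, as in the parser's custom
// extractors (names and structure assumed for illustration).
const Extractors = {
  'www.newyorker.com': { domain: 'www.newyorker.com' /* , selectors, etc. */ },
};

// Look up an extractor by hostname; if there is no direct hit,
// toggle the "www." prefix and try again before falling back.
function lookupExtractor(hostname) {
  if (Extractors[hostname]) return Extractors[hostname];
  const alternate = hostname.startsWith('www.')
    ? hostname.slice(4)
    : `www.${hostname}`;
  return Extractors[alternate] || null;
}
```

With this fallback, lookupExtractor('newyorker.com') and lookupExtractor('www.newyorker.com') would both resolve to the same custom extractor, so the two commands above would produce the same result.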
