-
Notifications
You must be signed in to change notification settings - Fork 116
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Very slow extraction for specific string #193
Comments
@Schwankenson I didn't check the string yet but what might help is restricting the supported dialects by passing a custom |
@lopuhin Great, thank you! Limiting it to json-ld and microdata shortens time to below one second! |
Glad it helped, and thanks for checking 👍 |
For one html string, I waited 10 hours. Finally found out that the problem is just in 'microformat'. After skipping that format, it takes just 1 second. |
Super helpful thank you - this was the case for me too. |
In my case it was |
I have one site with HTML strings, where I have really slow extraction times (~60 seconds). I just call
extruct.extract
with this string:https://pastebin.com/QJbUdaA6
Other strings work in times like 1-5 seconds. Does somebody have an idea what`s wrong with this string? Is there something I can do?
Thank you all for working on this great python package!
The text was updated successfully, but these errors were encountered: