Version 0.0.20 has been released #243
JorjMcKie announced in Announcements
This fixes some more bugs.
Here is an overview of other changes:
- `ignore_images` and `ignore_graphics` can be used to omit these objects when determining the page layout. They will also be omitted from the output. This can be helpful with overly crowded pages.
- `table_strategy` can now be `None` to suppress execution of PyMuPDF's table finder. This can significantly increase conversion speed.
- `max_levels` (an integer between 1 and 6) limits the number of detected header levels. For example, with `max_levels=3` the output will only have headers up to level 3 ("###"). All text with a smaller font size than that of level 3 will be treated as body text.
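The options above could be combined roughly as follows. This is a minimal sketch, not the snippet from the original announcement: it assumes that `ignore_images`, `ignore_graphics` and `table_strategy` are keyword arguments of `pymupdf4llm.to_markdown()`, and that `max_levels` is passed to `pymupdf4llm.IdentifyHeaders()`, whose result is handed over via `hdr_info`. The helper names `fast_options` and `convert` are hypothetical.

```python
def fast_options():
    """Keyword arguments (assumed to_markdown() parameters) that skip
    images, vector graphics and table detection."""
    return {
        "ignore_images": True,     # leave images out of layout analysis and output
        "ignore_graphics": True,   # same for vector graphics
        "table_strategy": None,    # do not run PyMuPDF's table finder
    }


def convert(path, max_levels=3):
    """Convert a PDF to Markdown with at most `max_levels` header levels."""
    if not 1 <= max_levels <= 6:
        raise ValueError("max_levels must be an integer between 1 and 6")

    # Imported here so the helpers above can be used without the libraries.
    import pymupdf
    import pymupdf4llm

    doc = pymupdf.open(path)
    # Assumption: IdentifyHeaders accepts max_levels and its result is
    # passed to to_markdown() through hdr_info.
    headers = pymupdf4llm.IdentifyHeaders(doc, max_levels=max_levels)
    return pymupdf4llm.to_markdown(doc, hdr_info=headers, **fast_options())
```

A call such as `md_text = convert("input.pdf")`, where `input.pdf` is a placeholder file name, would then produce Markdown with headers no deeper than "###".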
The complete changes are here.