You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There is a regression in 84a4e1a: When passing multiple pages for an image-only input fileGrp, e.g. -g phys_0001,phys_0007 -I OCR-D-IMG, now the logic that tries to prevent mixing derived images with original images is falsely triggered:
There is a regression in 84a4e1a: When passing multiple pages for an image-only input fileGrp, e.g.
-g phys_0001,phys_0007 -I OCR-D-IMG
, now the logic that tries to prevent mixing derived images with original images is falsely triggered:core/ocrd/ocrd/processor/base.py
Lines 118 to 125 in edf31fa
The problem is that
self.page_id
here is actually a list (formatted in comma-join notation).So the correct way of ensuring that no single page gets multiple image file results is by
find_all_files
to aggregate them like this (which is probably valid in other contexts, though)ret
and checking whether any of itspageId
s repeat:The text was updated successfully, but these errors were encountered: