Inconsistent OCR #350
Open
Labels
bug (Something isn't working)
Description
Example:
- Search document text for `meme`. http://127.0.0.1:8000/archives/doc/3_19_pmm_memo_re_709_1960_04_29_1_19 is the first result.
- Looking at the PDF preview online, there is no `meme` in the text, only `memo`. Highlighting the sentence `Status of programming memo and revision of machine shut-down date to late July.` and copy-pasting it elsewhere gives the correct text.
- Check the OCR text in the `data/processed_pdfs` folder. It says `Status of programming meme`, probably due to an OCR error.
Seems like PDF preview and search have different opinions on the OCR?
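To check how widespread this is, one option is to diff the text copied from the PDF preview against the OCR text that search uses. The following is only a rough sketch: `word_mismatches` is a hypothetical helper, not something in this repo, and the two strings are the sentences quoted above rather than text loaded from the actual PDF or from `data/processed_pdfs`.

```python
import difflib
import re


def word_mismatches(pdf_layer_text: str, ocr_text: str) -> list[tuple[str, str]]:
    """Return (pdf_layer_words, ocr_words) pairs where the two texts disagree."""
    # Compare lowercase word tokens so punctuation and case do not count as differences.
    a = re.findall(r"\w+", pdf_layer_text.lower())
    b = re.findall(r"\w+", ocr_text.lower())
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    mismatches = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            mismatches.append((" ".join(a[i1:i2]), " ".join(b[j1:j2])))
    return mismatches


# Sentences taken from this report; loading the real texts is left out here.
pdf_layer = "Status of programming memo and revision of machine shut-down date to late July."
ocr_file = "Status of programming meme and revision of machine shut-down date to late July."
print(word_mismatches(pdf_layer, ocr_file))
```

Running something like this over every document would show whether `memo`/`meme` is a one-off or whether the preview and the search index disagree more broadly.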