-
Notifications
You must be signed in to change notification settings - Fork 16
V3 api #128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
V3 api #128
Conversation
TBH It would really prefer to have this in a separate PR... |
I'll look into it and update this PR accordingly. |
|
Thanks a lot :) I'll dissect this PR first and then think about that change! |
Downgrading to uniseg 0.9.0 fixed the issues with master, and 0.9.1 breaks ... I've opened #130. Do you remember what the problem with rapidfuzz 3 was? I don't see it here with RapidFuzz 3.13.0. |
Breakage with the newest uniseg API was fixed in master. Can't see any issue with rapidfuzz, so removing that pin, too.
I've fixed the uniseg issues in #132 and rebased this PR accordingly. Can't see any issue with RapidFuzz, so I removed that pin. |
TBH I'm not terribly happy with this PR. Besides updating to OCR-D's V3 API, it a. does all kind of things with the Dockerfile and the Makefile. Mind you, "(y)our standards" do not necessarily apply to non-OCR-D projects) b. as a drive-by shooting, changes the definition of CER and just redefines These changes may very well be great and awesome, but are definitely 1. not part of an update to V3 API and 2. definitely worth some separate discussion. I'll cut out part b and review it in #129. I'll review (and fix) part a and may merge it with this PR. |
Co-authored-by: Robert Sachunsky <[email protected]>
Co-authored-by: Robert Sachunsky <[email protected]>
Co-authored-by: Robert Sachunsky <[email protected]>
I've rebased to remove the changes in #129, that needs more discussion. |
With #129 separate, this is ready to merge. Not a big fan of a. the indirection of Makefile (I mostly prefer a direct I have not tested the Docker changes (also unrelated to V3 API AFAICT) at all. |
If our Dockerfile conventions are too much of a stretch, I recommend we split off The Makefile is just for convenience, and things (like testdata dependencies) that are easier to maintain than in pyproject.toml alone. It's up to you to use other solutions. |
Ports the OCR-D processor to the v3 API and updates Dockerfile and Makefile with our current best practices.
I had to pin
uniseg
andrapidfuzz
to older versions because of breaking changes inrapidfuzz>=3
anduniseg>=0.9
. Is there a development branch that updates those APIs I could merge/cherry-pick?Also includes https://github.com/bertsky/dinglehopper/tree/normalized-cer which uses
normalized_distance
and gets rid of the potentialInfinity
results AFAICT.