Clarify language support quality status #83
Comments
See https://tesseract-ocr.github.io/tessdoc/Data-Files-in-different-versions.html. All work on Tesseract is currently done by volunteers, so you are invited to find the answers to your questions and document them.
@stweil : Can you linkify the "100 languages" sentence in the README.md to point to that page?
@eyalroz I went ahead and proposed the change in the tesseract repo: tesseract-ocr/tesseract#4027. I also think it would be very helpful, even though the list itself has no information on languages in v5 yet.
There was no update for v5. All the v4 data files should work with Tesseract 5.x.
That's at least not obvious from the table. The information can be found in other parts of the docs, true. Users can easily miss it though.
The README.md says Tesseract "supports over 100 languages out of the box". But which languages? And how good is the out-of-the-box support known to be for each of them?
It would be helpful if a separate file (or wiki page) detailed this information, to the extent possible.