Releases: tesseract-ocr/tesseract

5.2.0

06 Jul 20:22

This is a new minor version of Tesseract 5.

  • Improvements and fixes for continuous integration,
    autoconf and cmake builds.
  • Set /Os for some 32-bit MS compilers (fixes #3769).
  • Improve comments and other documentation.
  • Add initial support for Intel AVX512F.
  • Fix for very large PDF files on 32-bit hosts (fixes #3805).
  • Fix NEON detection on FreeBSD.
  • Fix regression with UZN files (fixes #3837).
  • Fix calling delete[] for memory allocated by malloc in C API.
  • Add an API function to init tesseract with traineddata from memory
    (fixes #3691).
  • Replace direct access to Leptonica internal data structures by
    function calls and support latest releases of Leptonica.
  • Replace std::regex by std::string functions (fixes issue #3830).
  • Use compiled-in TESSDATA_PREFIX also on Windows (fixes #3767).
  • Add new parameter 'invert_threshold', change the default threshold
    from 0.5 to 0.7 and mark parameter 'tessedit_do_invert' as deprecated.
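
Like other Tesseract parameters, 'invert_threshold' can be set on the command line with the option -c NAME=VALUE. The following is a minimal Python sketch that only builds such a command line and does not run Tesseract. The helper name tesseract_command and the file names are hypothetical and used for illustration only.

```python
import shlex


def tesseract_command(image, output_base, params=None):
    """Build an argv list for the tesseract CLI.

    Each entry of params becomes one "-c NAME=VALUE" pair.
    The function name and arguments are illustrative, not part of Tesseract.
    """
    argv = ["tesseract", image, output_base]
    for name, value in (params or {}).items():
        argv += ["-c", f"{name}={value}"]
    return argv


# Hypothetical input image and output base name.
cmd = tesseract_command("page.png", "page", {"invert_threshold": 0.7})
print(shlex.join(cmd))
```

The resulting list can be passed to subprocess.run on a system where Tesseract 5.2.0 or newer is installed.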

See also list of all changes.

5.1.0

01 Mar 16:33

This is a new minor version of Tesseract 5.

  • Handle image and line regions in output formats ALTO, hOCR and text.
  • New parameter curl_timeout for curl_easy_setopt.
  • Build fixes and improvements.
  • Catch nullptr in PageIterator::Orientation to improve robustness.
  • Remove unused code.

See also list of all changes.

5.0.1

07 Jan 12:48

This is a bug fix release of Tesseract 5.0.

  • Add SPDX-License-Identifier to public include files.
  • Support redirections when running OCR on a URL.
  • Lots of fixes and improvements for cmake builds.
    Distributions should use the autoconf build.
  • Fix broken msys2 build with gcc 11.
  • Fix parameter certainty_scale (was duplicated).
  • Fix some compiler warnings and clean code.
  • Correctly detect amd64 and i386 on FreeBSD.
  • Add libarchive and libcurl in continuous integration actions.
  • Update submodule googletest to release v1.11.0.

See also list of all changes.

5.0.0

30 Nov 17:53

This is the final stable release of Tesseract 5.0.0.

  • Limit BCER to interval [0,1]
  • Improved build process
  • Cleaned code

See also list of all changes.

5.0.0-rc3

22 Nov 21:09

This is the third release candidate of Tesseract 5.0.0.

  • Improve training messages
  • Add RowAttributes getter to PageIterator

See also list of all changes.

4.1.3

15 Nov 17:36

This is a new stable release of Tesseract 4.1.

  • Fix broken autoconf build (issue #3642)

See also list of all changes.

5.0.0-rc2

14 Nov 19:10

This is the second release candidate of Tesseract 5.0.0.

  • Fix regression for OCR with more than one model file
  • Bug fixes
  • Optimizations

See also list of all changes.

4.1.2

14 Nov 19:42

This is a new stable release of Tesseract 4.1.

Note: The autoconf build is broken (see issue #3642), so please use 4.1.3.

  • Allow line images with larger width for training
  • Bug fixes
  • Build updates and fixes

See also list of all changes.

5.0.0-rc1

29 Oct 20:43

This is the first release candidate of Tesseract 5.0.0.

  • Enable fast float32 LSTM by default
  • Switch to NFC normalisation everywhere
  • Remove banner message
  • Disable music staff detection and removal
  • Add new command line option --loglevel
  • Bug fixes
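
For readers unfamiliar with NFC (Unicode Normalization Form C), the sketch below shows what the normalisation does to a string. It uses Python's standard unicodedata module purely as an illustration; it is not Tesseract's own code, and the helper name to_nfc is hypothetical.

```python
import unicodedata


def to_nfc(text):
    """Return text in Unicode Normalization Form C (canonical composition)."""
    return unicodedata.normalize("NFC", text)


# "e" followed by U+0301 COMBINING ACUTE ACCENT: two code points.
decomposed = "e\u0301"
# NFC composes the pair into the single code point U+00E9.
composed = to_nfc(decomposed)

print(len(decomposed), len(composed))
print(composed == "\u00e9")
```

Both strings render identically, but only the composed form compares equal to text that was already stored in NFC, which matters when OCR output is compared against ground truth.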

See also list of all changes.

5.0.0-beta-20210916

16 Sep 04:48

This is a new pre-release of Tesseract 5.0.0.

  • Bug fixes
  • Extend URI support for Tesseract with libcurl
  • Rename processed TIFF output file and add page number if needed

See also list of all changes.