Document Understanding Release Notes
Last updated Nov 11, 2024

2024.10.0

Release date: 11 November 2024

Document Understanding™ 2024.10 LTS Release

What's new

UiPath Extended Languages OCR

We are excited to announce that our latest OCR engine, UiPath Extended Languages OCR, is now generally available. The new OCR can digitize documents in over 200 languages and brings a significant improvement over its predecessor, especially for Chinese, Japanese, and Korean. It can also process documents in Thai, Vietnamese, Greek, all major languages of India, and languages that use the Cyrillic alphabet.

Data Extraction ML packages

The following new ML packages are available:

Improvements

Data Extraction ML packages

We've improved document digitization: when you use the UiPath Extended Languages OCR, the output now consists of regular word boxes instead of individual character boxes.
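
To illustrate the difference between character-level and word-level output, here is a minimal sketch that groups per-character boxes on a single text line into word boxes. It does not reflect how the OCR engine works internally. The `Box` type, the `merge_char_boxes` function, and the `max_gap` value are all hypothetical names and values chosen for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Box:
    """Hypothetical OCR box: recognized text plus its bounding rectangle."""
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height


def merge_char_boxes(chars: List[Box], max_gap: float = 2.0) -> List[Box]:
    """Merge horizontally adjacent character boxes on one line into word boxes.

    Two neighbors are joined when the horizontal gap between them is at
    most max_gap (an assumed value; a real system would derive it from
    the font size or character width).
    """
    words: List[Box] = []
    for ch in sorted(chars, key=lambda b: b.x):
        if words and ch.x - (words[-1].x + words[-1].w) <= max_gap:
            prev = words[-1]
            top = min(prev.y, ch.y)
            bottom = max(prev.y + prev.h, ch.y + ch.h)
            # Extend the current word box so it also covers this character.
            words[-1] = Box(prev.text + ch.text, prev.x, top,
                            ch.x + ch.w - prev.x, bottom - top)
        else:
            # Gap too large (or first character): start a new word box.
            words.append(Box(ch.text, ch.x, ch.y, ch.w, ch.h))
    return words
```

With word boxes, downstream consumers receive one box per token instead of one per character, so they no longer have to reassemble words themselves.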

UiPath Document Understanding OCR

  • This release brings accuracy and performance improvements for handwriting recognition.
  • Detection and recognition of Magnetic Ink Character Recognition (MICR) text are improved, with higher accuracy especially on checks.
  • Previously, numbers were not recognized in some instances when a space was used as a separator. Such numbers are now recognized.
  • The confidence score for the UiPath Document Understanding OCR is improved, particularly on lower-quality images. In workflows where the confidence score decides whether documents need human validation in Action Center, this may increase the number of documents that undergo validation.
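
Because the confidence score change can shift how many documents are sent for validation, it may help to measure the effect of your threshold on a sample of documents. The sketch below shows one way to do that outside of any UiPath API: it assumes each document is represented as a plain mapping from field name to confidence, and the function names and the 0.8 default threshold are hypothetical.

```python
from typing import Dict, List


def needs_validation(field_confidences: Dict[str, float],
                     threshold: float = 0.8) -> bool:
    """Return True if any field's confidence falls below the threshold.

    The 0.8 default is an assumed value, not a recommended setting.
    """
    return any(c < threshold for c in field_confidences.values())


def validation_rate(documents: List[Dict[str, float]],
                    threshold: float) -> float:
    """Fraction of documents that would be routed to human validation."""
    if not documents:
        return 0.0
    flagged = sum(needs_validation(d, threshold) for d in documents)
    return flagged / len(documents)


if __name__ == "__main__":
    # Hypothetical confidences exported from a sample run.
    sample = [
        {"total": 0.95, "date": 0.90},
        {"total": 0.70, "date": 0.99},
        {"total": 0.85},
    ]
    for t in (0.8, 0.9):
        print(f"threshold={t}: {validation_rate(sample, t):.0%} to validation")
```

Comparing the rate on the same sample before and after upgrading indicates whether the threshold needs adjusting.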

Bug fixes

UiPath Document Understanding OCR

We've fixed an issue where annotation boxes were returned horizontally even when the document was slightly skewed, which caused the boxes to be misaligned with the text.

Data Extraction ML packages

We've fixed an issue with Japanese text when using the Extended Languages OCR. In certain situations, extra spaces appeared in the output because characters were returned as individual character boxes.

© 2005-2024 UiPath. All rights reserved.