document-understanding
2023.10
false
UiPath logo, featuring letters U and I in white
Document Understanding User Guide
Automation CloudAutomation Cloud Public SectorAutomation SuiteStandalone
Last updated Nov 11, 2024

Language support

The supported languages for different Document UnderstandingTM components can be found in the table below.

Components

Supported Languages

The left-to-right languages supported by the OCR engine of choice:

  • For supported languages by UiPath® Document OCR, click here.
  • For supported languages by Omnipage OCR, click here.
  • For supported languages by other 3rd party vendors (Google, Microsoft), check the vendor's website for the most up-to-date information.
The right-to-left languages supported by the OCR engine of choice:
  • For supported languages by Omnipage OCR, click here.
  • For supported languages by other 3rd party vendors (like Google), check the vendor's website for the latest information.

Same as above.

For the supported languages, retraining may be required to get the expected accuracy if the documents are considerably different from the original model training dataset.

For the languages not supported in this list, you can experiment with the approach of creating a custom model to extract any left-to-right language, assuming the OCR engine supports it as well.

Automatic reformatting of dates in a standard yyyy-mm-dd format for Asian languages is currently supported only for Japanese. For documents in other Asian languages, you can extract the dates as String content type and format it in the RPA workflow.

Was this page helpful?

Get The Help You Need
Learning RPA - Automation Courses
UiPath Community Forum
Uipath Logo White
Trust and Security
© 2005-2024 UiPath. All rights reserved.