- Overview
- Getting started
- Building models
- Consuming models
- Model Details
- 1040 - ML package
- 1040 Schedule C - ML package
- 1040 Schedule D - ML package
- 1040 Schedule E - ML package
- 1040x - ML package
- 3949a - ML package
- 4506T - ML package
- 709 - ML package
- 941x - ML package
- 9465 - ML package
- ACORD125 - ML package
- ACORD126 - ML package
- ACORD131 - ML package
- ACORD140 - ML package
- ACORD25 - ML package
- Bank Statements - ML package
- Bills Of Lading - ML package
- Certificate of Incorporation - ML package
- Certificate of Origin - ML package
- Checks - ML package
- Children Product Certificate - ML package
- CMS 1500 - ML package
- EU Declaration of Conformity - ML package
- Financial Statements - ML package
- FM1003 - ML package
- I9 - ML package
- ID Cards - ML package
- Invoices - ML package
- Invoices Australia - ML package
- Invoices China - ML package
- Invoices Hebrew - ML package
- Invoices India - ML package
- Invoices Japan - ML package
- Invoices Shipping - ML package
- Packing Lists - ML package
- Payslips - ML package
- Passports - ML package
- Purchase Orders - ML package
- Receipts - ML Package
- Remittance Advices - ML package
- UB04 - ML package
- Utility Bills - ML package
- Vehicle Titles - ML package
- W2 - ML package
- W9 - ML package
- Public endpoints
- Supported languages
- Insights dashboards
- Data and security
- Licensing and Charging Logic
- How to
Key concepts
Familiarize yourself with the core concepts around UiPath® Document UnderstandingTM.
Active learning is our modern approach to creating models for Document UnderstandingTM.
Active learning provides an interactive experience where the learning algorithm can query the user to label data with the desired outputs. This process helps to reduce the time and data required to train a machine-learning model by up to 80%. AI is used to guide the process, which includes automatic annotation, typically the most time-consuming task. The model also provides expert recommendations to enhance accuracy using the most informative datasets.
Using active learning, you can also monitor your automations through analytical capabilities.
A document type refers to the classification or categorization of a document based on its content, format, purpose, or other distinguishing factors. Some examples can include invoices, receipts, contracts, reports, medical records, legal documents, and others.
- Structured: documents designed to collect information in a specific format. For example, surveys, tax forms, passports, or licenses are all structured documents.
- Semi-structured: documents that do not follow a strict format and are not bound to specified data fields. Semi-structured documents include invoices, receipts, uility bills, bank statements, and others.
- Unstructured: documents that do not follow a specific or organized model. For example, contracts, leases, or news articles are all unstructured documents.
To learn more about document types, check the Document types section.
ML models are like virtual assistants that have been trained to learn from data and make predictions or decisions. These models are essentially algorithms that learn to recognize patterns based on historical data. The more data they are exposed to, the better they can improve their predictions or decisions over time.
You can find several out of the box ML models in Document UnderstandingTM. These models help you classify and extract any commonly occurring data points from semi-structured or unstructured documents, with no setup required.
Check the Out-of-the-box models page for the full list of pre-trained models and their fields.
ML models can be trained on a majority of languages, as long as the OCR recognizes the document and text with high confidence.
Optical character recognition (OCR) is a special technology used to convert different types of documents, such as scanned paper documents, PDF files, or images taken by a digital camera, into editable and searchable data.
The accuracy of an OCR engine most oftenly depends on the quality of the original document. Clear, well-formatted text in a readable font typically produces the best output.
For more information on the languages supported by the OCR engines options provided by UiPath®, check the OCR Supported Languages page.