Document Understanding Activities
Last updated Oct 29, 2024
Activities
The following table shows the available activities in the UiPath.IntelligentOCR.Activities package.
Activity | Description |
---|---|
Load Taxonomy | Turns the taxonomy.json file created with the help of the Taxonomy Manager into a variable that can be further used with other activities. |
Digitize Document | Digitizes a document, extracting its Document Object Model (DOM) and text and storing them in their corresponding variable types. |
Classify Document Scope | Provides a scope for classifier activities and supplies all of the files needed to perform document classification. |
Keyword Based Classifier | Uses a keyword-based document classification algorithm designed for files with repeatable titles, and classifies a document based on a given training set from a specified file when used within the Classify Document Scope activity. |
Document Understanding Project Classifier | Classifies documents using the classifier trained within a specific version of a Document Understanding modern project. |
Intelligent Keyword Classifier | Enables you to classify and split document packages into individual document types. |
Present Classification Station | Opens the Classification Station, which enables users to manually separate the files into document types and also to verify and correct, if needed, the automatic classification and separation of files based on the document type. |
Create Document Classification Action | Creates an action of type Document Classification in Action Center. |
Wait For Document Classification Action And Resume | Suspends the execution of the current workflow until a specified document classification action is completed. |
Train Classifiers Scope | Provides a scope for any classifier activities you want to train. |
Keyword Based Classifier Trainer | Incorporates a keyword-based document classification algorithm, targeting classification for files that have repeatable titles. |
Intelligent Keyword Classifier Trainer | Trains the Learning Data added as variables. The keywords used can be either user-defined or vector-learned. |
Data Extraction Scope | Provides a scope for extractor activities, enabling you to configure them according to the document types defined in your taxonomy. |
Document Understanding Project Extractor | Extracts data from documents using a certain modern project and version. |
RegEx Based Extractor | Enables you to create and use a custom regular expression to extract information from a document. |
Form Extractor | Best suited for extracting, matching, and reporting specific information by analyzing the position of words inside the document, or by detecting a signature. |
Intelligent Form Extractor | Analyses, extracts, matches, and reports the required information from a document by taking into consideration the words' position, including handwritten and signed fields. |
Present Validation Station | Opens the Validation Station, which enables users to review and correct document classification and automatic data extraction results. |
Create Document Validation Action | Creates an action of type Document Validation in Action Center. |
Wait For Document Validation Action And Resume | Suspends the execution of the current workflow until a specified document validation action is completed. |
Train Extractors Scope | Provides a scope for any extractor activities you want to train. |
Export Extraction Results | Gives you easy access to extraction results by exporting results from an ExtractionResult variable to a DataSet variable which can be further processed. |
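
The keyword-based classification and regular-expression extraction ideas described in the table can be illustrated with a small, package-independent sketch. This is not the UiPath implementation or its API: the keyword sets, field patterns, and the `classify` and `extract` helpers below are all hypothetical and exist only to show the general technique of scoring a document's text against per-type keywords and then pulling fields out with regular expressions.

```python
import re

# Hypothetical keyword sets per document type (illustration only; this is
# not the learning-file format used by the Keyword Based Classifier).
KEYWORDS = {
    "Invoice": ["invoice", "amount due"],
    "Receipt": ["receipt", "thank you for your purchase"],
}

# Hypothetical regular expressions per document type and field.
FIELD_PATTERNS = {
    "Invoice": {
        "invoice_number": r"Invoice\s*(?:No\.?|#)\s*(\w+)",
        "total": r"Amount due:\s*([\d.]+)",
    },
}


def classify(text):
    """Return the document type whose keywords occur most often, or None."""
    lowered = text.lower()
    scores = {
        doc_type: sum(lowered.count(keyword) for keyword in keywords)
        for doc_type, keywords in KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    # A score of zero means no keyword matched, so report no classification.
    return best if scores[best] > 0 else None


def extract(text, doc_type):
    """Apply each field's pattern and return the first captured group, or None."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.get(doc_type, {}).items():
        match = re.search(pattern, text, flags=re.IGNORECASE)
        fields[name] = match.group(1) if match else None
    return fields


if __name__ == "__main__":
    sample = "INVOICE\nInvoice No. A123\nAmount due: 42.50"
    doc_type = classify(sample)
    print(doc_type, extract(sample, doc_type))
```

In the actual package, classification and extraction run inside the Classify Document Scope and Data Extraction Scope activities and operate on the digitized document and taxonomy rather than on raw strings; the sketch only mirrors the underlying idea.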