- Getting Started
- Framework Components
- Document Understanding in AI Center
- Pipelines
- ML Packages
- Data Manager
- OCR Services
- Licensing
- References
Create & Configure Fields
Fields cannot be deleted or renamed, so please think carefully before adding new fields. If, however, there are fields that you later decide you do not want to use for training an ML model, you can always hide them using the Hidden checkbox in the Edit Field window.
Click here for details about fields, their meaning, and when to use them.
A line item Description or Unit Price on an invoice document would be examples of Column fields.
- Click + in the table section at the top of the page to add a new Column field. The Create Column Field window is displayed.
- In the Enter Unique Field Name field, fill in a unique name for the field. The field does not accept uppercase letters.
- Click Create. The Edit Field window is displayed.
- From the Content Type drop-down, select the content type.
- From the Scoring drop-down, select the measure used to determine accuracy when running evaluations of model predictions.
- Click the Hotkey field and press a key on your keyboard to automatically populate it.
- Fill in the hex code of the desired field color in the Color field.
- Select the Multi line checkbox if the field to be checked against might span across multiple text lines, such as addresses or descriptions. If this option is not selected, only the first line is returned.
- Select the Split items checkbox if you want this field to be used as a delimiter between line items or rows in a table. Any line on which this field appears is considered to be a new line item or row in the table. Most commonly, this is used on Line Amount fields on Invoice line items.
- Select the Hidden checkbox if you do not want this field to be part of exported datasets.
- Click Save to save your settings.
These are fields which appear only once on a given document. A line item Invoice Number or Total Amount on an invoice document would be examples of Column fields.
- Click + on the right pane in the Regular Fields section. The Create Regular Field window is displayed.
- Fill in a unique name for the field in the Enter Unique Field Name field. The field does not accept uppercase letters.
- Click Create. The Edit Field window is displayed.
- Select the content type from the Content Type drop-down.
- Select the post-processing mechanism in case the model predicts more than one instance of a field on a given page from the Post processing drop-down.
- Click the Hotkey field and press a key on your keyboard to automatically populate it.
- In the Color field, fill in the hex code of the desired field color o
- From the Multi page drop-down, select the data retrieval strategy. This is used in case that fields appear on a few different pages of a multi-page document. This option defines how the model decides which one to return.
- From the Scoring drop-down, select the measure used to determine accuracy when running evaluations of model predictions.
- Select the Multi line checkbox if the field to be checked against might span across multiple text lines, such as addresses or descriptions. If this option is not selected, only the first line is returned.
- Select the Hidden checkbox if you do not want this field to be part of exported datasets.
- Click Save to save your settings.
Data points which refer to a document as a whole. For instance, the Expense Type of a receipt (Food, Hotel, Airline, Transportation) or the Currency of an invoice (USD, EUR, JPY) would be examples of Classification fields.
- Click + on the right pane in the Classification Fields section. The Create Classification Field window is displayed.
- Fill in a unique name for the field in the Enter Unique Field Name field. The field does not accept uppercase letters.
- Click Create. The Edit Field window is displayed.
- In the text area, fill in the list of classes and type the names as a comma separated list.
- Click Save to save your settings.
Important: Contrary to Regular and Column fields, Classification fields are not Re-trained. For example for Currency field, if you retrain the Invoices model on a dataset containing only USD and INR invoices, then the resulting model will only be able to recognize those two currencies.
Displayed at the top of the page in Data Manager. Enables you to perform multiple operations: navigate in between documents, delete a document, filter documents, run AI model predictions, import and export documents.
Field |
Description |
---|---|
→ |
Navigate in between documents that match the active filter. In between the two arrows a counter is displayed. It illustrates the number of the current document out of the total number of documents that match the active filter. |
Delete / Recover |
Delete or recover a document. |
Filter Drop-Down |
Filter documents. This filter applies to exported data as well. The following options are available:
|
Predict |
Run AI model predictions and display the results. |
Import |
Import a new document to be labeled. |
Export |
Export labeled data. The active filter applies to the exported data. |
[DocumentName] |
The name of the currently active document. |
[UserName] |
The username of the currently active user. |
Log Out |
Log out of Data Manager. Logging out also clears the cookies. |
Help |
Displays the Data Manager help menu. |
Enables you to configure the name of the field to be added.
Field |
Description |
---|---|
Enter Unique Field Name |
The name of the field. Can only contain lowercase letters, numbers, underscore “_” and dash “-“. |
Enables you to configure regular and column field.
Field |
Description |
---|---|
Content Type |
The content type of a field. The following options are available:
|
Post Processing |
Only displayed for regular fields. The post-processing mechanism. The following options are available:
|
Hotkey |
The shortcut key for the field. |
Color |
The color for the field. |
Multi Page |
The data return strategy in case a field appears on multiple pages in a document. The following options are available:
|
Scoring |
Can only be configured for content of type string. All other content types use an Exact Match scoring strategy. The measure used to determine accuracy when running evaluations of model predictions.
|
Multi Line |
Select this checkbox for fields which may span across multiple lines, such as addresses or descriptions. Otherwise, only the first line is returned. |
Split Items |
Only displayed for column fields. Select this checkbox if you want this field to be used as a delimiter between line items or rows in a table. Any line on which this field appears is considered to be a new line item or row in the table. Most commonly this is used on Line Amount fields on Invoice line items. |
Hidden |
Select this checkbox if you do not want this field to be part of exported datasets. |
The Labeling Controls section displays the controls to be used when handling data.
The Document Shortcuts section displays the shortcuts used to perform various operations such as navigation and UI scaling.
The Configuration section displays details about the instance configuration as performed during installation.
The Error Reporting section enables you to view recently generated logs.