When setting up a file format configuration for XLIFF files, there are many options to choose from to ensure extraction is successful. This page will explain the each section of configuration options for XLIFF files.
The following file extensions are supported when setting up file format configurations for XLIFF: .doc, .docx, .dot, .dotx, .docm, .dotm.
Please click on a section to see specific information regarding a configuration option.
Table of Contents |
---|
General Tab
Configuration Option | Description |
---|---|
Content Section | Extraction rules for document properties, headers, footers, calculated fields text, table of contents, and user comments. |
Whitespaces and Symbols | Elect to not show leading and trailing whitespaces, convert sequences of multiple whitespaces into markup, do not show leading or trailing characters that are not letters or digits, convert words containing no letters or digits into markup. |
Text Segmentation | Enable SRX rules for text segmentation and elect to always split text at line breaks. |