File format configurations for RTF files support the following file extension: .rtf.
To learn more about working with file format configurations, please see the following pages:
- Viewing File Format Configurations
- Modifying Format Configurations
- Creating Format Configurations
- Testing File Format Configurations
Default RTF File Format Configuration
Every file format has a default configuration so that a file can always be translated; however, the default does not handle every complex property a source file may contain. The default configuration for RTF files does the following:
- Extracts Document Properties, Headers, and Footers.
- Extracts Table of Contents and User Comments.
- Does Not Show Leading/Trailing Whitespaces.
- Converts Sequences of Multiple Whitespaces into Markup.
- Does Not Show Leading/Trailing Characters that are Neither Letters nor Digits.
- Converts Words Containing Neither Letters nor Digits into Markup.
- Uses SRX Rules for Text Segmentation.
- Always Splits Text at Line Breaks.
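As a rough illustration of what the whitespace and symbol defaults mean for a single segment, the sketch below hides leading and trailing characters that are neither letters nor digits (including whitespace) and replaces runs of multiple whitespace characters with a placeholder. It is not Wordbee Translator's implementation; the function names and the `<ws len="…"/>` placeholder format are hypothetical and chosen only for this example.

```python
import re


def to_markup(run: str) -> str:
    # Hypothetical placeholder format, assumed for illustration only.
    return f'<ws len="{len(run)}"/>'


def prepare_segment(text: str) -> dict:
    """Separate hidden edges from the visible text and mark whitespace runs."""
    # Leading/trailing characters that are neither letters nor digits
    # (whitespace included) are split off so the translator does not see them.
    match = re.fullmatch(r"([\W_]*)(.*?)([\W_]*)", text, flags=re.DOTALL)
    leading, core, trailing = match.groups()
    # Sequences of two or more whitespace characters become a placeholder.
    core = re.sub(r"\s{2,}", lambda m: to_markup(m.group()), core)
    return {
        "hidden_leading": leading,
        "visible": core,
        "hidden_trailing": trailing,
    }


if __name__ == "__main__":
    print(prepare_segment("  * Hello   world!  "))
```

In this sketch the translator would only see the `visible` part; the hidden edges would be reattached when the target file is built.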
Custom RTF File Format Configurations
If you are translating an RTF file, a custom file format configuration might be necessary to achieve the right results in your target file. Beyond the default selections, a custom configuration in Wordbee Translator lets you choose to:
- Not Extract Document Properties, User Comments, etc.
- Not Convert Whitespace Sequences or Symbols into Markup.
- Show Whitespaces and Symbols.
- Not Translate Certain Styles, Colors, Words, or Segments.
- Reduce Markup.
- Handle Embedded Files (Microsoft Word, Excel, etc.).
RTF File Format Options
When setting up a file format configuration for RTF files, several options control how content is extracted. Each section of the configuration is summarized below.
- Encoding - Auto-detect the character encoding of a source file or configure a specific character encoding.
- Content - This section contains options for handling the extraction of symbols and for showing or hiding leading/trailing whitespaces for the translator.
- Exclude Content - This option may be used to enter regular expressions for excluding content within the source file from translation.
- Text Segmentation - Enable/Disable SRX rules for text segmentation and choose whether or not to split text at line breaks.
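To make the Exclude Content and Text Segmentation options more concrete, here is a minimal sketch of how splitting at line breaks and regex-based exclusion could interact. It is an illustration under assumptions, not Wordbee Translator's behavior: the function names and the two example patterns are hypothetical, and the regular-expression dialect used by the product may differ from Python's.

```python
import re


def split_at_line_breaks(text: str) -> list:
    # Treat every line break as a segment boundary and skip blank lines.
    return [line for line in text.splitlines() if line.strip()]


def is_excluded(segment: str, patterns: list) -> bool:
    # A segment is excluded when any configured expression matches it.
    return any(re.search(pattern, segment) for pattern in patterns)


def translatable_segments(text: str, exclude_patterns: list) -> list:
    """Return the segments that remain for translation after exclusion."""
    return [
        segment
        for segment in split_at_line_breaks(text)
        if not is_excluded(segment, exclude_patterns)
    ]


if __name__ == "__main__":
    # Example patterns (assumptions): part numbers and flagged lines.
    patterns = [r"^[A-Z]{2}-\d{4}$", r"^DO NOT TRANSLATE"]
    source = (
        "Welcome to the manual.\n"
        "AB-1234\n"
        "DO NOT TRANSLATE: legal notice\n"
        "Press the button."
    )
    for segment in translatable_segments(source, patterns):
        print(segment)
```

Testing expressions like these against a sample file before using them in a real configuration helps confirm that they exclude only the intended content.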