Every language uses a default setting for Segmentation Rules during translation to ensure proper handling of text segments. Unless otherwise specified, these rules will be used when the Enable SRX Rules for Text Segmentation option has been enabled within a File Format Configuration for the appropriate language.
What are Segmentation Rules?
Segmentation rules are used by the system to determine how extracted text will be split into segments and paragraphs. A default set of segmentation rules is defined and enabled in the system for each available language.
When text is extracted, the system will perform the extraction based on these rules if "Enable SRX Rules for Text Segmentation" has been enabled for the File Format Configuration used for the translation.
SRX Rule Settings
Within the Wordbee Administration settings, you have the option to enable/disable a set of SRX rules, download them for viewing, and to make changes when necessary.
The following pages will help you understand where SRX Rule (i.e. segmentation rule) configurations can be found within Wordbee Translator and how to view, modify, and create custom configurations: