Document Formats

When you import a file type (e.g. Word, Excel, XML) into Wordbee, the text is processed by a specific set of extraction rules that you can view and configure in Settings > Customization > Translation Settings > Document Formats.

Click on Configure to the right of Document Formats to search for a specific file and its extension. See GIF.

 

When you open the configuration of a document format, you will see the Default Configuration. Click on Select to view, edit, or clone the configuration.

Each file has specific configuration options to ensure proper extraction of the required information from the source file for translation. These options are grouped by tabs and sections to make the process of creating or modifying a file format configuration easier and will vary based on the file format you are using for the configuration. 

New

We have improved the filters for all document formats. When you open the configuration of one file type, you will see an additional option: File conditions. Here you can specify conditions for when a certain configuration shall be used. See article:

How to process files with auto-select filters


Default Configuration

Wordbee Translator offers a Default Configuration for every available file format and extension. These default configurations do not work for every situation and are designed to ensure that online translations are successfully completed.

For example, the default configuration for an XLIFF file does the following: 

  • Extracts existing translations

  • Does not show leading and trailing whitespaces

  • Does not show preceding and trailing markup

  • Splits segments at XLIFF segmentation boundaries

  • Enables SRX Rules for text segmentation

This configuration is not pre-configured to handle extraction for XLIFF files that are HTML or that contain HTML content. If you need to use Wordbee Translator to extract an HTML based XLIFF file, then a different configuration must be used that has the "Content is HTML" option checked as part of the XLIFF file format configuration.

The same applies for accomplishing specific extraction or exclusion objectives with Microsoft Word, Microsoft Excel, Code Files, and other types of formats for translation. 


Custom Configurations

File format configurations may be used to accomplish many tasks such as omitting red text from the translation of a Microsoft Word File or translating an XLIFF HTML file.

Certain needs are simply not covered by the default configurations provided by Wordbee Translator. In these instances, it makes more sense to create a custom file format configuration, as using the default will result in either an error or unwanted results in the completed translation.

With Wordbee Translator, you can do the following and more with custom file format configurations: 

  • Mark an XLIFF file as HTML.

  • Not extract File Headers/Footers in a Word file. 

  • Omit certain colors of text from a Microsoft Word File, Excel file, etc.

  • Not translate specific segments within a Word, Excel, or another file type.

  • Define specific columns or rows to translate in an Excel file.

  • Configure the extraction of embedded files.

  • Change the default character encoding of a code file.

  • Exclude quote strings or additional content from code files.


How to view, modify and test a configuration

See the sections listed below to learn how to create, view, modify and test the settings of document formats.


Supported Formats, Versions, and Extensions

Wordbee Translator allows you to create configurations for the following formats, versions, and file extensions. 

Copyright Wordbee - Buzzin' Outside the Box since 2008