When setting up a file format configuration for XLIFF files, there are many options to choose from to ensure extraction is successful. This page will explain configuration options for XLIFF files in Wordbee Translator.

...

To view and edit the XLIFF filter options, go to Translation Settings > Document Formats and select XLIFF files. The XLIFF files - Configuration window opens. Here you can configure the following:

General settings

The General tab contains options for extracting content, defining the file as HTML, handling whitespaces and symbols, excluding content, and text segmentation. The options are described below by section:

  • Content - Extract existing XLIFF translations (if any) and set the segment status to 'Translated' in the translated XLIFF file.
  • Comments - Extract XLIFF notes on segment level to Wordbee comments and write new comments added during translation work to the translated file.
  • HTML Content - Inform the system that the content is HTML, set up a configuration for HTML extraction, and split text at HTML break tags.
  • Whitespaces and Symbols - Do not show leading and trailing whitespaces, do not show preceding and trailing markup, do not translate texts containing neither letters nor digits, and always preserve whitespaces by default.
  • Text Segmentation - Split segments at XLIFF segmentation boundaries, enable SRX rules for text segmentation, and optionally enable "Always split text at line breaks".
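
The whitespace and symbol options above can be pictured with a small sketch. This is only an illustration of the idea, not Wordbee's implementation; the function names `split_outer_whitespace` and `has_translatable_text` are hypothetical.

```python
def split_outer_whitespace(text):
    """Return (leading, core, trailing) so that outer whitespace
    can be hidden from the translator and restored afterwards."""
    stripped_left = text.lstrip()
    leading = text[:len(text) - len(stripped_left)]
    core = stripped_left.rstrip()
    trailing = stripped_left[len(core):]
    return leading, core, trailing


def has_translatable_text(text):
    """True if the text contains at least one letter or digit.
    Texts for which this is False would be left untranslated."""
    return any(ch.isalnum() for ch in text)
```

For example, `split_outer_whitespace("  Hello \n")` separates the two leading spaces and the trailing `" \n"` from `"Hello"`, and `has_translatable_text("---")` is false because the text contains only symbols.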


...

When enabling the option "Always split text at line breaks" in the XLIFF configuration, consider the following scenarios:


Scenario 1. If there is no HTML content in the XLIFF file:

There will be no line breaks when enabling the Content is HTML option, because the HTML parser removes from the segments the whitespace that XLIFF treats as line breaks.


Scenario 2. If there is HTML content in the XLIFF file:

There will be line breaks when enabling the Content is HTML option, provided that the HTML content contains HTML breaking tags (e.g. <p>, <div>). The HTML parser will remove the whitespace from the segments unless the HTML breaking tags (e.g. <p>, <div>) are included in both the HTML content of the XLIFF file and the HTML configuration attached to the XLIFF configuration. See XLIFF 2 Information.
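
The two scenarios can be modelled with a simplified sketch. It uses Python's standard `html.parser` module and is not Wordbee's HTML parser; the `BREAKING_TAGS` set and the function name `split_as_html` are assumptions made for illustration. The sketch collapses whitespace (including line breaks) the way HTML does and starts a new segment only at breaking tags.

```python
from html.parser import HTMLParser

# Assumed set of breaking tags; in Wordbee this would come from the
# HTML configuration attached to the XLIFF configuration.
BREAKING_TAGS = {"p", "div", "br"}


class SegmentSplitter(HTMLParser):
    """Collects text and splits it into segments at breaking tags."""

    def __init__(self):
        super().__init__()
        self.segments = []
        self._buffer = []

    def _flush(self):
        # Collapse all whitespace runs (including newlines) to one space.
        text = " ".join("".join(self._buffer).split())
        if text:
            self.segments.append(text)
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in BREAKING_TAGS:
            self._flush()

    def handle_endtag(self, tag):
        if tag in BREAKING_TAGS:
            self._flush()

    def handle_data(self, data):
        self._buffer.append(data)


def split_as_html(content):
    """Return the segments obtained when content is treated as HTML."""
    parser = SegmentSplitter()
    parser.feed(content)
    parser.close()   # forces any pending text to be delivered
    parser._flush()  # emit text that follows the last breaking tag
    return parser.segments
```

Under this model, plain text with a line break (Scenario 1) stays one segment, while the same lines wrapped in <p> tags (Scenario 2) yield two segments.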

...

Do not translate settings

Exclude Content sections - Configure which content is translated or not translated, based on the texts or regular expression patterns the system looks for. Exclusions can be configured for:

  • Segments
  • Words or terms
  • Attributes and comments
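
As a rough illustration of pattern-based exclusion, the sketch below marks a segment as "do not translate" when it matches any configured regular expression. The two patterns and the function name `is_excluded` are hypothetical examples, not Wordbee defaults.

```python
import re

# Hypothetical exclusion patterns: template placeholders such as
# {{user_name}} and identifiers written in upper case such as ERROR_CODE_42.
EXCLUDE_PATTERNS = [
    re.compile(r"^\{\{.*\}\}$"),
    re.compile(r"^[A-Z0-9_]+$"),
]


def is_excluded(segment, patterns=EXCLUDE_PATTERNS):
    """True if the segment matches any exclusion pattern."""
    return any(pattern.search(segment) for pattern in patterns)
```

With these patterns, `is_excluded("{{user_name}}")` is true, while an ordinary sentence such as "Save your changes" is not excluded.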

SDL XLIFF settings

The SDL XLIFF tab may be used to load advanced properties when XLIFF files have been produced by other CAT tools.

  • Extract Origin of Translations - The SDL 'origin' attribute specifies the origin of the translation: 'tm' for translation memory, 'mt' for machine translation, etc. The SDL 'percent' attribute indicates whether a pretranslation is exact, fuzzy or perfect. These attributes are mapped to the respective fields in Wordbee Translator, and the Wordbee word count then takes these values into account.
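
To show what these attributes look like in a file, here is a minimal sketch that reads 'origin' and 'percent' with Python's standard library. The sample fragment, the `sdl:seg` element layout, and the namespace URI are assumptions based on typical SDL XLIFF files; the function name `read_segment_origins` is hypothetical, and this is not how Wordbee performs the mapping.

```python
import xml.etree.ElementTree as ET

# Assumed SDL namespace URI, as commonly seen in SDL XLIFF files.
SDL_NS = "http://sdl.com/FileTypes/SdlXliff/1.0"

# Hypothetical, simplified fragment for illustration only.
SAMPLE = """<trans-unit xmlns="urn:oasis:names:tc:xliff:document:1.2"
    xmlns:sdl="http://sdl.com/FileTypes/SdlXliff/1.0" id="1">
  <sdl:seg-defs>
    <sdl:seg id="1" origin="tm" percent="100"/>
    <sdl:seg id="2" origin="mt"/>
  </sdl:seg-defs>
</trans-unit>"""


def read_segment_origins(xml_text):
    """Map each segment id to (origin, percent); percent is None if absent."""
    root = ET.fromstring(xml_text)
    result = {}
    for seg in root.iter("{%s}seg" % SDL_NS):
        percent = seg.get("percent")
        result[seg.get("id")] = (
            seg.get("origin"),
            int(percent) if percent is not None else None,
        )
    return result
```

Calling `read_segment_origins(SAMPLE)` yields one entry per segment, which a tool could then map to its own pretranslation fields.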

QA settings

Pass over restrictions on the size of the segments to highlight issues when performing quality assurance checks.

Web preview settings

Wordbee supports web preview with XSLT stylesheets, which convert XML files into HTML for easier, customizable previewing in the web browser. The stylesheet must convert the XML file to HTML (see the XSLT help page for details). To keep a library of all your stylesheets available in the platform, upload them to a specific folder in the Documents tab of the Master Company (navigate to "My Company" > Documents).

Find more help on how to set this up within the parser page itself:

...

To learn more about working with file format configurations, see the following pages: 

...