Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 11 Next »

When setting up a file format configuration for Web Pages, there are many options to choose from to ensure extraction is successful. This page will explain each section of the configuration options for Web Pages.

The following file extensions are supported when setting up file format configurations for Web Pages: .xlf, .sdlxliff, .xliff, .xlif. 

These sections have been provided to help you become familiar with available Web Page configuration options in Wordbee Translator: 

To learn more about accessing and working with file format configurations, please see the following pages: 

 

General Tab

The General Tab contains options for configuring the type of encoding, HTML code, HTML attributes, content exclusion, and text segmentation. The options are described in general below based on individual sections. 

Configuration OptionsDescription
EncodingThe default encoding selection for web pages is UTF-8; however, the encoding option may be used to select a different type of encoding such as Windows, Macintosh, ASCII, etc. An additional option is provided for converting characters that are not compatible with the target encoding into entity references.
HTML Code

These options inform Wordbee Translator how the HTML code itself will be handled during translation. Within this configuraiton section, you will be able to configure:

  • To show or not show whitespaces at beginning and end to the translator.

  • Compress sequences of whitespaces into a single whitespace.

  • Replace '& nbsp;' by blanks.

  • Always show HTML tags preceeding and trailing text.

  • Change how entity references are displayed to the translator: keep original HTML code, change all to characters (default), or convert numeric entities only.

 

HTML Attributes

The options inform Wordbee Translator how to handle specific HTML attributes during translation and include:

  • Choosing to show whitespaces at beginning and end to the translator.

  • Choosing to compress sequences of whitespaces into a single whitespace.

  • Changing how entity references are displayed to the translator: keep original HTML code, convert all to characters (default), or convert numeric entities only.


Exclude Content

This option may be used to configure specific content to be excluded from the extracted text for translation. Within this configuration section you can enter text segments or regular expressions. If a text/pattern matches, then it is possible to mark the segment as not translatable, as translatable or as potentially not translatable. The latter two will be shown to the translator. The system checks one pattern after the other until one matches. Text that matches none of the patterns is considered translatable.


Text Segmentation

This configuration section may be used to enable options for text segmentation during tranlsation. Here you can:

  • Enable/Disable SRX Rules for Text Segmentation
  • Elect to Always Split Text at Line Breaks

 

 

Server and Client Side Code Tab

Configuration OptionsDescription
Extract Quoted Strings 
Exclude Quoted Strings 
Include or Exclude Additional Content 

 

HTML Tags and Attributes Tab

Configuration OptionsDesriptions
Translatable Attributes 
Non-Breaking Tags 
Whitespace Preserving Tags 

 

CMS Specific Settings Tab

Configuration OptionsDescription
  

 

For more information about working with file format configurations and creating custom configurations, please see these pages: 

 

  • No labels