Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Question

Sometimes my files contain many tags, and the result of the translation generated by my machine translation provider is not great. What can I do to improve the results?

Answer

Markup handling and placement is a very complex topic for which results are always best when dealing with small volumes and generic HTML-based markup.

...

  • Each segment is a machine-readable string that any engine can process. These strings contain the text in the language processed and markup. The markup defined by extraction rules can be of different types (custom, HTML-compatible), which makes the string unique.

  • Because of the nature in which markup can be defined in the text extraction rules, the system needs to prepare the initially extracted string further to make it compatible with machine-related processes that can happen outside Wordbee Translator. The way the text and the markup are generated will have an impact on any machine-related processes.

Processing of segments (before and after machine translation)

When segments are the text parsed using a file format configuration is prepared for machine translation, the following processing is applied to the segments:

  1. The text in the source language of the segment and its markup are further prepared to maximize the chances of getting the integrity of the content translated by the MT provider. The markup in the string is further converted into generic HTML markup to make it machine compatible.

  2. The new converted string is sent to the MT provider chosen, as per MT profile configuration.

  3. Once the MT provider generates the MT output, the Wordbee Translator verifies if the markup obtained in the output is valid as per the initial MT request. It checks if the translation generated by the MT provider has done the following:

    1. returned all markup

    2. the markup was correctly placed
      Wordbee Translator has several mechanisms that allow you to "roughly" fix any major markup issues. These aim at doing accurate translations and preventing problems when reconstructing the file with all translations, such as making the file readable in the first place.

  4. Finally, once the machine translation output is available and validated, the system needs to convert the HTML-based markup back to the style initially parsed in Wordbee Translator.

...