Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

Activities to clean-up the resources

Maintaining linguistic resources is something users can do online and offline. Depending on the volume of work to be done and of the output expected, users performing these actions will decide to work directly in the online editor or export the resource and run some extensive clean-up with dedicated tools.

Cleanup activities via the CAT Editor

Phase 1: filter relevant segments in the Editor

 Use the search bar with one or several languages

The power search bar on the top of the page will allow you to filter on specific criteria you may need to browse contents through pairs of segments.

  • In the language menu on top, you could be filtering up to 2 languages (making it possible to apply filters on <single> languages or <language pairs>)
  • Additionally, you could be filtering segments based on the values for each language.

This means, you could search for segments which contain specific texts for each language (where a text is translated in a specific way, or where the text is present in both, source and target segments).


Filter options

It is possible to combine several categories to apply a custom filter. Once you enter the values you expect to apply, the categories are enabled and their field name is highlighted. 

The relevant filter categories and their purpose are listed below:

Text contains: filter on segment text 

The text contains filter is equipped with this search operator (AND/OR), among other search possibilities such as phrase match, case sensitive search or even Regex/wildcard search.

For example, in a en-GB to ca-CA job, you could be applying a filter where the source segment has to have a specific translation (filter on segments where "university" is translated as "college") or where a text is in both languages identical (text is the same).

Bookmarks: filter on pinned segments 

The bookmarks filter is equipped with this search operator (AND/OR), which allows filtering on specific bookmark value(s) for both or each language(s)


Phase 2: apply batch action

Once you have a filter with specific criteria applied, you may want to update the segments with some specific property or value. The fastest way to update a group of segments is to use the batch actions menu.

Actions that may be relevant for resource maintenance activities are:

Language actions menu

  • Clean all segment information: removes all contents for the given target language (this includes translations, bookmark, status, labels and any other related properties)
  • Set custom QA message: if you want to leave a custom "error" message which could help you do further research on what needs to be done for that segments.
  • Language specific label: if you use labels for maintenance purposes. See labels (category: resources).
  • Find repetitions: This action lets you find multiple occurrences of identical source texts and translations. Use cases are: Find source text repetitions. Find redundant segments that have both identical source text and translation. Find identical source texts that have very similar translations.
  • No labels