Export to TBX
Use this method to export segments to a TBX file (as a ZIP archive). You can export projects, jobs, resources and even multiple resources and apply a wide range of filters.
The zip archive contains the .tbx file as well as the .xcs schema file:
URL
(POST) /api/resources/segments/io/tbx3
(POST) /api/resources/segments/io/tbx
Choice of tbx and tbx3:
Suffix | Version details |
---|---|
tbx | TBX version 1 (2008). https://www.gala-global.org/sites/default/files/uploads/pdfs/tbx_oscar_0.pdf |
tbx3 | TBX version 3 (2014 to 2019) http://www.terminorgs.net/downloads/TBX_Basic_Version_3.1.pdf |
PARAMETERS
The message body contains a JSON object.
Generic export properties: scope The scope object tells the system what data to export. Examples: Export a project memory, translation memory or term base by specifying its resource ID: Export a project memory by specifying its project ID: See all available scopes here. Mandatory object The list of all locales that you want to export. An optional filter to apply to the exported data. Sample use cases are: The query properties and features are described here. Example of exporting texts in neutral (0) or green (1) status thus excluding red status (2): IMPORTANT: The query has 2 functions: On the one hand it filters the segments and on the other hand it removes any translations that do not match the filter. In the example above, the query first identifies segments where at least one of the 3 languages is not in red status. In a second step, the system removes any languages that are in red status. Optional, object The source locale. It is highly recommended to set this value if you know it. Otherwise, the system chooses the first entry in locales. Optional, string Optional number of languages a segment must have at a minimum to be exported. By default this is 1. Optional, default is false. If true then translations are removed from the export if the text is identical to the source text. Optional. List of numeric comments categories. If set then only comments of these categories are included in the export. Categories are sequentially numbered and start at 0. Optional, int[] Optional, default is null. When you set markupRemove to true then any markup will be replaced by this character. Note: You must specify a single character such as: "markupReplaceChar": " " Optional dictionary to transform or modify language codes prior to saving to file. The following example, transforms fully qualified locales from the resource into simpler ones: Locales not in the dictionary are left unchanged. If you map multiple original locales to the same output locale, the system will retain only one of the "clashing" languages.{ "type": "DocumentSet", "dsid": 157 }
{ "type": "Project", "projectid": 7726 }
locales "locales": [ "de-DE", "de-AT", "fr-FR" ]
Mandatory, string[] query "query": {
"languages": [
{ "loc": ["en", "de", "fr"], "status": { 0, 1 } }
]
}
src minimumTexts Optional, int skipTargetIsSource Optional, bool excludeCommentTypes markupRemove Optional, default is false. If true then markup is removed from texts. See also property below. Optional, bool markupReplaceChar Optional, char localesMappings "localesMappings": {
"de-DE": "de",
"fr-FR": "fr"
}
Optional, object
TBX specific options
extractWordbeeProperties | Optional, default is false. If true then the system includes Wordbee specific properties in the TMX file. These properties are saved as <descripGrp> nodes: <descripGrp> <descrip type="Status">1</descrip> </descripGrp>
Text level properties in "tuv" node:
| Optional, bool |
extractComments | Optional, default is false. Set to true if you want to include comments attached to the texts. These are saved as notes inside <tig>: <tig> <termid="6386438-en-GB">kkkk</term> <auxInfo> <note id="529">This is a comment [23/11/2018 08:30:17,187,0]</note> </auxInfo>
| Optional, bool |
TBX version 3 specific options
extractCreationData | Optional, default is 0. Available options:
If not "0" then the system includes "creation" data from "Wordbee fields" if set to "1", or from "Originally imported data" if set to "2". These properties are saved as <transacGrp> nodes: <transacGrp> <transac type="transactionType">creation</transac> <transacNote type="responsibility" target="511">Rageh Aly</transacNote> <date>2021-03-08</date> </transacGrp> | Optional, int |
extractModificationData | Optional, default is 0. Available options:
If not "0" then the system includes "creation" data from "Wordbee fields" if set to "1", or from "Originally imported data" if set to "2". These properties are saved as <transacGrp> nodes: <transacGrp> <transac type="transactionType">modification</transac> <transacNote type="responsibility" target="511">Rageh Aly</transacNote> <date>2021-03-08</date> </transacGrp> | Optional, int |
RESULTS
This API method is an "asynchronous" operation. It either immediately returns the result or instead an operation ID which you need to periodically poll until the operation has finished.
Upon the first call you typically get an "operationid". Use this to further poll the operation status. See full explanation here: Asynchronous operation
When the operation is finished you get the reference to the file:
{ "trm": { "status": "Finished" ... }, "custom": { "fileref": "52e77705f6904233b0c33b2e5d7c4a80", "filename": "export.zip", "segments": 10 } }
Once the operation is finished, download the file with fileref and this API method:
(GET) /api/media/get/{fileref}
EXAMPLES - TBX version 1
Submit request to export term base 112334:
(POST) /api/resources/segments/io/tbx BODY: { "scope": { "type": "DocumentSet", "dsid": 112334 }, "locales": [ "en", "fr" ] }
The resulting TBX file:
<?xml version="1.0" encoding="utf-8"?> <martif type="TBX" xml:lang="en-GB"> <martifHeader> <fileDesc> <titleStmt> <title>My term base</title> </titleStmt> <sourceDesc> <title>Project memory</title> </sourceDesc> </fileDesc> </martifHeader> <text> <body> <termEntry id="6386440"> <descripGrp> <descrip type="Context">p</descrip> </descripGrp> <langSet xml:lang="en-GB"> <tig> <term id="6386440-en-GB">Hello world</term> <descripGrp> <descrip type="Status">0</descrip> </descripGrp> <descripGrp> <descrip type="Bookmark">0</descrip> </descripGrp> <descripGrp> <descrip type="Locked">No</descrip> </descripGrp> <descripGrp> <descrip type="Editor">0</descrip> </descripGrp> </tig> </langSet> <langSet xml:lang="de-DE"> <tig> <term id="6386440-de-DE">Hallo Welt</term> <descripGrp> <descrip type="Status">1</descrip> </descripGrp> <descripGrp> <descrip type="Bookmark">0</descrip> </descripGrp> <descripGrp> <descrip type="Locked">No</descrip> </descripGrp> <descripGrp> <descrip type="Editor">1</descrip> </descripGrp> </tig> </langSet> </termEntry> ...
The resulting schema file:
<?xml version="1.0" encoding="utf-8"?> <!DOCTYPE TBXXCS SYSTEM "tbxxcsdtd.dtd"> <TBXXCS name="master" version="0.4" lang="en" xml:lang="en"> <header> <title>TBX XCS file</title> </header> <languages> <langInfo> <langCode>en-GB</langCode> <langName>English (United Kingdom) (en-GB)</langName> </langInfo> <langInfo> <langCode>de-DE</langCode> <langName>German (Germany) (de-DE)</langName> </langInfo> </languages> <datCatSet> <descripSpec name="Context" datcatId="x-wb-context"> <contents datatype="plainText" /> <levels>plainText</levels> </descripSpec> <descripSpec name="Status" datcatId="x-wb-cfs-5"> <contents datatype="plainText" /> <levels>plainText</levels> </descripSpec> <descripSpec name="Category" datcatId="x-wb-cfs-11"> <contents datatype="plainText" /> <levels>plainText</levels> </descripSpec> <descripSpec name="NSR unit" datcatId="x-wb-cfs-12"> <contents datatype="plainText" /> <levels>plainText</levels> </descripSpec> ...
Additional information
TBX specifications: https://www.gala-global.org/sites/default/files/uploads/pdfs/tbx_oscar_0.pdf
Copyright Wordbee - Buzzin' Outside the Box since 2008