Introduction To CAT Tools
Introduction To CAT Tools
Introduction To CAT Tools
I) Project management
In project with multiple files, CAT tools can analyze the content and generate reports that show
Full and fuzzy match analysis
Intra-and cross file repetitions
Translation memory leverage
List of files with individual word counts
SDL Trados 2017 Freelance (and many other similar CAT tools) can handle
o Multiple file formats: all Microsoft Office line, InDesign, HTML and more.
They are all converted to XLIFF
1
XLIFF means “XML Localization Interchange File Format”
After translation, XLIFFs can be exported back to the native file format.
This is very convenient. For example, in InDesign documents, you don’t need to extract the copy to translate
it, for the designer to later on place it back manually in the publication. You can translate InDesign
documents directly, and when you’re done you can export them back to InDesign, and voila! The source
language text is replaced by the translation. This saves work on both ends: the translator does not need to
extract text from a PDF, and the designer does not need to place back the translated text back manually
(which often times opens the door for errors to creep in).
o PDF conversion (“Pretty Darn Frustrating” no longer): SDL Trados 2017 can convert PDFs into
fully editable Microsoft Word documents automatically in the process of creating a project. Prior
to this, separate programs—which were pretty costly by the way—were required to accomplish
this (for example, OmniPage, ABBYY FineReader, etc.)
o SDL Trados 2017 has a built-in OCR function. It is no longer necessary to have a separate
program OCR program. In the process of creating a project, text embedded as an image is
automatically “read” and converted into fully editable text.
2
The translator can set criteria for the terminology extraction, for example, terms must have a minimum number
of characters, or appear a certain number of times. The extracted terminology list can be added to the project as
a separate file, translated, and converted into a term base. This can be useful for working out the terminology
before actually translating, or for standardizing terminology when working with a team of translators.
You can see a demo of how this works in the YouTube video posted in this webpage.
http://www.youalign.com/
o Has paid and unpaid versions. Pros and cons: Unpaid version raises confidentiality issues
because you have to upload files to their server. Paid version can be local in your computer.
3
Machine translation: Una historia del pueblo de los Estados Unidos: Desde 1492 hasta el presente (even though
this is perfectly correct, to most Spanish readers this would look like another history book)
§
Human translation (published title of the book in Spanish): La otra historia de los Estados Unidos: Desde 1492
hasta el presente. (Bold and underline are mine, the back translation would be “The Other History of the United
States of America”) The human translation succeeds in conveying the idea that this is a different type of history
book. But the translator obviously took some “poetic license.”
This is where humans can outperform machines! Machines tend to err on the side of being too literal. Good
translators, instead, understand that their job is not just to transpose words or phrases from one language to
another, but to distill the idea the author meant to communicate, and to re-express it in another language so that
it is understood. This often means rewriting and departing from the actual form of the message. Some
translation scholars have called this interlingual intercultural mediation (rather than just translation).
o Concept of FAUT (Fully Automated Useful Translation) and FAUHQT (Fully Automated
Useful High-Quality Translation, the “Holy Grail”). Ray Kurzweil predicted that FAUHQT will
be available in 2029 (9 years from now!)
o Is output useful? Depends on the POV (point of view). Useful for what? For the translator, the
usefulness could be a decent first draft that can be edited and improved faster than translating it
from scratch by typing or dictating. The usefulness could also be that it offers an easy and fast
way to consider other translation options of words, phrases or sentences, or even to research
terminology.
o Quality indicator of MT: Can output be understood without looking at the source text?
o A quick history
Rules-based MT
Statistical Phrase Based MT
Neural Machine Translation
Hybrid
GT4T
IntelliWebSearch: incredibly useful tool to query anything online or selectively fetch GT or other MT
http://www.intelliwebsearch.com/version-5/
4
Pressing a convenient customizable shortcut key:
1. copies selected text from your translation environment by simulating the Windows copy
function;
2. strips the text of paragraph marks, line breaks, tabs, double spaces, punctuation marks and
various other superfluous characters (customizable);
3. opens your default browser (Microsoft Edge, MS Internet Explorer, Firefox, Google Chrome,
Opera, etc.) and sends the copied text to one of a virtually unlimited number of user preset search
engines, on-line dictionaries or on-line encyclopaedias together with other customizable
instructions which limit the search according to the advanced search options available for the
chosen web resource. Alternatively it can send the same text and instructions to local dictionaries
on CD-ROMs or installed on your hard disk.
Another convenient shortcut key (known as the Return Key) copies any text you select in your
browser or local dictionary and returns you to your translation environment, where you can
choose to paste in the text using your translation environment tool’s own paste function
VIII) Translation Memory (for leveraging our previous work and “collecting royalties” on it).
https://www.youtube.com/watch?v=MmsBe7BitG4
Now, in SDL Trados, you can leverage not just full sentences but also words and phrases via SDL upLIFT
technology, which introduces matching based on fragments.
And upLIFT Fuzzy Repair intelligently uses your own trusted resources to repair fuzzy matches, so you
save time and get the best match possible.
Fuzzy Match Repair can draw from a number of translation sources to perform fuzzy match repair:
Machine Translation
Termbases
Legacy and new translation memories.
SDL Trados AutoSuggest: Predictive typing based on your own way of writing, draws from your own
Translation Memory
IX) Speech Recognition (for entering text by speaking)