WHAT IS COMPUTER AIDED TRANSLATION

by Juraj Sipos.

Share
|
Homepage | Submit your article | Contact | TOS
More articles on software  

You are here: Categories » Computers and technology » Software

Computer terminology uses the term CAT for Computer Aided Translation. It is a software tool that saves pairs of original and translated text into its Translation Memory (TM). When you translate, the software will suggest to you all texts of your past translations from its TM.

CAT software/tools are not machine translation (automatic translation of a text). They rather represent an independent field of tools that help translators expedite their work.

CAT software continuously compares the text stored in TM with the document's text and gives suggestions on the basis of definable criteria - it looks for similar (fuzzy) or accurate (exact) matches in TM. The translator can define the fuzziness threshold.

Today, there are robust commercial CAT solutions on the market such as Trados, minimalist Wordfast or Metatexis - unfortunately, they are all commercial. Trados is a standalone tool; Wordfast or Metatexis cannot be used alone - they require MS Word, because they work as a macro in it. If you use Linux, Wordfast and Metatexis work perfectly in MS Word in Wine (environment used for running Windows applications in Unix) or CrossOver Office (commercial Wine).

Anaphraseus

A widespread historical employability of MS Word is the reason why most translators use it. However, there is a very good CAT solution for OpenOffice.org too - Anaphraseus, which is an OOo extension compatible with Wordfast.

CAT terminology is everywhere the same, so a good way before you start working with CAT software is to read some help documents. You may also download a manual for Wordfast or any other CAT tool.

Anaphraseus: CAT With OpenOffice.org

Anaphraseus as an extension for OpenOffice.org installs directly from the OOo's menu: Tools | Extension Manager, where you just click on the Add button. In OpenOffice.org 3.0 you click on the blue text "Get more extensions here...", which is at the bottom of the Extension Manager window. After you click on the blue text, your default browser will open up with the OOo extensions URL address where you must search for Anaphraseus - just type it as a keyword into the search textbox and then download it.

You will get a file with an OXT extension (some older versions used a Zip format); then click on the Add button in the OOo Extension Manager window and find the OXT file on your PC. When done, read the license and scroll down to accept it. After you click on the Accept button Anaphraseus will permanently move to your OOo Extension Manager.

Anaphraseus is not platform dependent but OOo dependent. It works in OpenOffice, no matter if you work in Windows, Linux, FreeBSD, OS/2, or even OpenBSD. This nice CAT extension is compatible with OpenOffice.org 2.1 and higher; StarOffice 8, Update 5 or higher, and it will give you the following possibilities:

* Term Recognition

* Fuzzy Search

* Unicode UTF-16 TMX Export/Import

* Plain text and Unicode UTF-16 TM

* User Glossary

* Russian localization available at Sourceforge.net

After the installation you need to restart OpenOffice.org for the changes to take effect. You will then see a new panel with the Anaphraseus icons appearing in the OOo environment.

To make your first translation, create (or import) your 1) Translation Memory; 2) open a document with your desired source language; and 3) start translating it by clicking on the Alt+Down button on the Anaphraseus icon panel.

Translation Memory

The first step is to create (or possibly to import) your TM. To do this, click on the Anaphraseus Setup icon, select New, and then enter pertinent TMX codes, which you will need for cases you decide to export your TM later (see Few CAT terms at the bottom). You will need a separate TM for every language combination - for example, if you translate from Czech to Hungarian, this combination - that is, this TM is not good for translation from Hungarian to Czech.

The software allows you to work with many Translation Memories, which you can use for specific languages or jobs; for example, bible-czech2eng.txt (from Czech to English) or bible-eng2czech.txt (from English to Czech) will be your TM's for biblical translation projects (both Wordfast and Anaphraseus use TXT format in their TM's).

Some CAT Terms

TMX

In Computer Aided Translation you use the Translation Memory eXchange (TMX) format (XML) because translators often need to migrate (export/import) their TM's to a variety of CAT tools they use. It is a translator's right to choose any CAT software and in case a group of translators works on a project, they can thus share their TM's. Many CAT tools use their own (proprietary) Translation Memory formats and TMX helps translators and translating agencies share their TM's easily. For example, you export your TM from a proprietary CAT application's format (Trados, etc.) to the TMX format and then you import this TMX format to Anaphraseus (or to any other CAT tool). TMX is a type of database with various codes that identify languages (CS-01 for the Czech language, EN-US for US English, etc.).

Unicode

If Anaphraseus asks you whether you want to use Unicode, you need to know that CAT software may have problems to display words with diacritical marks such as those used in East-European languages. By the term "displaying" I mean that once the source and target sentences get to TM, Anaphraseus will compare the source sentence in the document with the one in its TM and will show you the target sentence if it meets certain criteria. With the Unicode font it will display correctly all the fonts. If you do not work with Western-type languages, it is always a good idea to use Unicode.

Cleaning Up

The term "clean up the document" in CAT terminology means that you remove the original (source) text from the document, which keeps staying there for editing purposes. Both source and target segments are delimited with color markers such as {0> and you may not delete from the document (of course, you can, but only by "cleaning up the document"). Authors of CAT tools know that translators need to compare the original text with the translated one even after the translation is finished. In addition to the fact that Anaphraseus (and many other CAT tools) saves pairs of sentences in its TM you will also have these pairs embedded in the document until you clean it.

If the document is not yet cleaned, you may always click on the Arrow Down button on the Anaphraseus toolbar, compare the source (original) text with the translated one, and continue editing it. When you are finished, choose CLEAN UP. The software will ask you if you want to update your Translation Memory. All color markers and source sentences will disappear from the document and you will only see your final work (the text you translated).

Conclusion

Anaphraseus does not have all the functions of commercial applications (such as Pandora's box, etc.), but not all translators need comprehensive solutions every hour and every day. OpenOffice.org has thus become not only a complex and very useful tool for translators, but also a star on your way to freedom.

The author's website at www.freebsd.nfo.sk

Leave a comment or ask a question
Total comments: 0

Software Disclaimer

  • The e-articles directory is not responsible for any and all copyright infringements by writers and authors. If you suspect the information contained by this page for any copyright infringements, please contact us to investigate the issue
Detecting SoftICE by Calling INT 3h - This is one of the most well known anti-debugging tricks, and it uses a back door in SoftICE itself. It works in all versions of Windows, and it is based on calling INT 3h with registers containing (more...)
Detecting SoftICE by Calling INT 68h - Here's a way to detect the presence of SoftICE in memory by calling INT contain the value 43h before calling INT be in the AX register. 68h. The AH register must 68h. If SoftICE is active in memor (more...)
How Can You Increase Your computer performance - Basic computer knowledge or/ and appropriate technical assistance can help you increase computer performance. Computers have become an expected supporter in this modern world. O (more...)
Detecting SoftICE by Searching Memory - This detection searches the memory in the V86 mode for the WINICE.BR string. Because this method is infrequently used, it's worth considering, though it can only be used in Windows 9x. Thi (more...)
Fight for the Future, Digital Future: Google VS Apple - We can be proud as we watch one of the greatest virtual wars unleashing at the digital market. If 15 years ago it was Apple Vs Windows confrontation, today it has slightly changed its main parties (more...)
The Beginner's Guide to iPad Video Conversion on Mac - So, you've just taken in a shiny new iPad and impressed by its beautiful display. There are several possible sources of content that you will want to convert for viewi (more...)
Touch and View: iPad application - iPad application development has become an extremely popular topic at numerous conferences and workshops since the product introduction keynote. Mobile software companies went boldly into the une (more...)
Deciding on TIFF vs. JPEG Output for Scanned Images - It can be difficult to understand which file type is best for saving your scanned images. Here's a brief breakdown of the two most common options. Preserving beloved memories is on (more...)
Photoshop Clipping Path and Masking Techniques :: Wonderful Technique to Knock Out Image Background - Graphic design is being the promotional key in every spare of business and individual life. Business organizations seek graphic tools to have publicity by dint of bill board, catalogs, magazines, w (more...)
Tips on Getting MP3 from CD as iPhone Ringtone on Mac - Many people must have favorite CDs filled with their bookcase or CD case somewhere at home due to the songs they loved while they still need to pay $0.99 each to get the same songs as their iPhone (more...)

 
free content
    Copyright © 2006 - 2012 e-articles.info.
The texts, articles and tutorials in the directory are property of their respective owners and authors.