Archive for the ‘term extraction’ Category

Terminology extraction

11/05/2009

Both translators and translation agencies need to invest time in terminology management one way or another. Translators will usually make use of ad hoc terminology research and sometimes also of systematic terminology management in order to specialize in certain subject fields. Translation agencies, on the other hand, may carry out various projects. These project may involve different translators working side by side on one big document or on different texts originating from the same client. Also in this case, it is necessary to use and manage terminology consistently.

Term Extractor

Good terminology management requires efficient and correct terminology extraction (or term extraction) techniques. Term extraction can not only be done after having finished the translation (using bilingual term extraction tools) but also before starting to translate a text. For example, a company terminologist, an employee at a translation agency or a translator can make a term list based on the text and its subject before the actual translation starts. This is useful in order to avoid spending precious time on searching for terms and their equivalents and to avoid terminological inconsistency.

For preparing a translation project and previously providing a term list, one can do a monolingual term extraction using various tools. Unfortunately, automated term extraction both mono- and bilingual rarely yields to satisfying results. The existing term extractors (you can find a short list here or here) are either too expensive or useless… or both!! Besides, most translation agencies and companies don’t have the time and the resources to take care of the tasks of term extraction and terminology management, shifting the responsibility over to the translators (who even have less time and resources).

Luckily, there are some cheap or even free tools which can help translators or companies in analyzing and processing texts and making term lists for major projects semi-automatically. Three of such tools are: Apsic Xbenc, WebCorp and AntConc, but more on this, next time. Have to pack my stuff now,  flying to Canada tomorrow…