Machine translation

Machine translation software are of two kinds :
  • Translation memory : is based on an existing resource of bilingual translation data. It stores recurring sentences or phrases and the translations that were made. Later these translations can just be used again. These are also called CAT-programs (Computer Aided Translation)
  • Automatic translation : the computer analyses the text and translates it based on software rules
Before a file can be translated automatically it has to be pre-processed. It has to be checked on spellingserrors, the type of document has to be analysed, names of persons could be specified.
The correct spelling of words is important since incorrect spelling can have substantial negative impact an the machine translations output quality. Misspelled words cannot be matched to entires in the dictionaries. As a result, these words go untranslated. A more isidious effect results from the fact that unknown words are assing default part of speech. If the default part of speech is incorrect, other decisions concerning the syntactic structure of the sentence may be incorrect as well.
In Western European languages blank spaces are the primary indicator of a word boundary, in Chinese (and other Asian languages) blank spaces are not used to mark word boundries and word segmentation is a more complex task that uses morphological and syntactic information.

Technical text are more appropiate for machine translation since they're less ambigui.

Links
Language processing
Localization Directory, Resources and Information
MT Summit IX
MT Summit VII
White papers by Systran
Chinesecomputing.com - Machine translation
Systranbox (also known as bablefish).

Organisations
EAMT (European Association for Machine Translation)
AMTA (Association for Machine Translation in the Americas
AAMT (Asia-Pacific Association for Machine Translation


   
Search >


Local links are in blue, links to other websites are in red, commands are in green.

You need unicode fonts, a 4+ browser and acrobat reader to fully explore and enjoy this webpage. (if necessary you can download asian fontpacks for acrobat reader)

Currently translating my thesis to English : more info

© Seba - contact at seba at ulyssis dot org
users online: 5