Machine Translation

Overview

Machine translation has evolved through several major paradigms, from early rule-based systems to the highly fluent neural models used today.

Evolution

Rule-Based (RBMT): Used complex linguistic rules and bilingual dictionaries.
Statistical (SMT): Used probability models based on large bilingual corpora (e.g., Google Translate's original version).
Neural (NMT): Uses deep learning (RNNs or Transformers) to translate entire sentences at once, capturing better context and fluency.

Key Challenges

Ambiguity: Words with multiple meanings depending on context.
Idioms and Slang: Phrases that cannot be translated literally.
Low-Resource Languages: Languages with limited available training data.

Evaluation

BLEU (Bilingual Evaluation Understudy) is the most common metric for automatically evaluating the quality of machine-translated text against human references.