You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

YehaTranslate

YehaTranslate is a fine-tuned translation model for Ethiopian languages ↔ English, developed by Hasab AI. It is based on google/translategemma-4b-it and fine-tuned on paired Amharic–English data.

The primary training focus is Amharic (am) ↔ English (en). The training data also contains samples in Tigrinya (ti) and Oromo (om), so translation to/from English is supported for these languages — though coverage is not as extensive as Amharic. We plan to expand Tigrinya and Oromo training data in future releases.

Model Details

Field Value
Base model google/translategemma-4b-it
Architecture Gemma3ForConditionalGeneration (4B)
Languages Amharic (am), Tigrinya (ti), Oromo (om), English (en)
Framework PyTorch + HuggingFace Transformers, distributed with torchrun

Benchmark Results

Evaluated on 960 held-out sentence pairs per direction. Primary metric: chrF2 (character-level F-score, β=2), which is more reliable than BLEU for morphologically rich Amharic.

am → en

Model BLEU chrF2 Len Ratio
Baseline Gemma (no fine-tuning) 7.00 27.69 1.22
YehaTranslate (ours) 37.58 64.60 1.07
Gemini 2.5 Flash 36.17 57.78 0.69
GPT-5.4 26.37 51.36 0.67

en → am

Model BLEU chrF2 Len Ratio
Baseline Gemma (no fine-tuning) 1.00 7.69 1.79
YehaTranslate (ours) 8.12 25.93 1.13
Gemini 2.5 Flash 4.43 23.05 0.43
GPT-5.4 0.15 9.51 0.18

Translation quality comparison

Usage

import torch
from transformers import AutoProcessor, AutoModelForImageTextToText

model_id = "hasab-ai/YehaTranslate"

processor = AutoProcessor.from_pretrained(model_id, use_fast=True)
model = AutoModelForImageTextToText.from_pretrained(model_id, torch_dtype=torch.bfloat16)
model.generation_config.pad_token_id = processor.tokenizer.eos_token_id
model.eval()

DIRECTIONS = {
    "Amharic → English":  ("am", "en"),
    "English → Amharic":  ("en", "am"),
    "Oromo → English":    ("om", "en"),
    "English → Oromo":    ("en", "om"),
    "Tigrinya → English": ("ti", "en"),
    "English → Tigrinya": ("en", "ti"),
}

def translate(text: str, src_lang: str, tgt_lang: str, max_new_tokens: int = 512) -> str:
    messages = [{"role": "user", "content": [
        {"type": "text", "source_lang_code": src_lang,
         "target_lang_code": tgt_lang, "text": text.strip()}
    ]}]
    inputs = processor.apply_chat_template(
        messages,
        tokenize=True,
        add_generation_prompt=True,
        return_tensors="pt",
        return_dict=True,
    )
    input_len = inputs["input_ids"].shape[1]
    with torch.inference_mode():
        out = model.generate(**inputs, do_sample=False, max_new_tokens=max_new_tokens)
    return processor.decode(out[0][input_len:], skip_special_tokens=True).strip()

# Amharic → English
print(translate("የኢትዮጵያ ሕዝቦች ታሪካዊ ትብብር እና አንድነት ዛሬም ቀጥሏል።", "am", "en"))

# English → Amharic
print(translate("The coffee ceremony is an important part of Ethiopian culture.", "en", "am"))

# Oromo → English
print(translate("Biyyoonni Itoophiyaa beekamtii addunyaa qabdi.", "om", "en"))

Training Details

Training used torchrun for distributed data-parallel training, with the HuggingFace Trainer API and bfloat16 activations.

The model was trained on paired Amharic–English sentence pairs covering news, general web, and social media domains.

Limitations

  • Best performance on general-domain text; domain-specific content (legal, medical) may degrade.
  • en→am generation is harder than am→en decoding, consistent with the asymmetry seen across all models.
  • Amharic is morphologically rich; single-reference automatic metrics (BLEU, chrF2) underestimate translation quality when paraphrase variation is high.

Citation

@misc{oriontranslate2026,
  title        = {YehaTranslate: Fine-tuned Amharic-English Translation},
  author       = {Hasab AI},
  year         = {2026},
  publisher    = {Hugging Face},
  howpublished = {\url{https://huggingface.co/hasab-ai/YehaTranslate}},
}
Downloads last month
-
Safetensors
Model size
4B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for hasab-ai/YehaTranslate

Finetuned
(26)
this model