Text Generation
Safetensors
Mongolian
gemma3
conversational

Gemma 3 PT 4B Continually Pretrained on Mongolian (Traditional Mongolian Script)

This model is a continual pretraining (CPT) checkpoint built by further pretraining Gemma 3 PT 4B on the Mongolian (Traditional Mongolian Script) portion of the MC^2 Corpus.

The model is intended to improve Mongolian (Traditional Mongolian Script) language modeling and to support research on low-resource language adaptation.

Training details and methodology are described in: "Efficient Low-Resource Language Adaptation via Multi-Source Dynamic Logit Fusion" (ACL 2026).

Training Data

  • Corpus: Mongolian (Traditional Mongolian Script) subset of MC^2 Corpus
  • Language: Mongolian (mn, Traditional Mongolian Script)
  • Training paradigm: Continual pretraining (CPT) starting from Gemma 3 PT 4B

Intended Use

This checkpoint is released primarily for research purposes. Researchers are welcome to use this CPT checkpoint as a base model for future work, particularly in model merging and logit fusion.

Citation

If you use this model, please cite:

@article{zhang2026efficient,
  title={Efficient Low-Resource Language Adaptation via Multi-Source Dynamic Logit Fusion},
  author={Zhang, Chen and Lin, Jiuheng and Liao, Zhiyuan and Feng, Yansong},
  journal={arXiv preprint arXiv:2604.18106},
  year={2026}
}
Downloads last month
9
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
Input a message to start chatting with pkupie/gemma-3-4b-mn-cpt.

Model tree for pkupie/gemma-3-4b-mn-cpt

Finetuned
(293)
this model

Dataset used to train pkupie/gemma-3-4b-mn-cpt

Collection including pkupie/gemma-3-4b-mn-cpt

Paper for pkupie/gemma-3-4b-mn-cpt