Siluni/sinhala-vqa-dataset
Viewer • Updated • 37.3k • 66
How to use Siluni/gemma3-4b-cpt with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("google/gemma-3-4b-it")
model = PeftModel.from_pretrained(base_model, "Siluni/gemma3-4b-cpt")CPT-only (Continued Pre-Training) adapter for Gemma-3-4B-IT on the MADLAD-400 Sinhala corpus.
This adapter does not perform VQA on its own. It is intended to be used as the first stage
of the sequential CPT → VQA pipeline together with Siluni/gemma3-4b-cpt-vqa-33k.
This adapter must be loaded together with the VQA adapter and combined before inference.
See Siluni/gemma3-4b-cpt-vqa-33k for the full loading instructions.
@misc{keerthiratne2025sinhalavqa,
title = {Benchmarking and Adapting Compact Multimodal Models for Sinhala Visual Question Answering},
author = {Keerthiratne, Siluni and Weerasinghe, Ruvan and Sumanathilaka, Deshan},
year = {2025},
institution = {Informatics Institute of Technology / Robert Gordon University},
}