YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Unison-Judge is a fine-tuned Qwen3-VL-8B vision-language model that serves as the local automatic judge for the Unison benchmark. It scores UMMs' outputs across all four unified tasks (IC, UGG, GGU and ME) without requiring a hosted API.

Judge Consistency Data

The Judge_Consistency/ directory contains 231 evaluation cases used to assess the scoring consistency of Unison-Judge across all four tasks.

Field Description
id Item identifier
task One of IC, UGG, GGU, ME
family Question type
model The UMM whose output is being evaluated
questions List of sub-questions, each with the model's answer and judge-assigned score
images Reference image(s) and the model-generated image

Task distribution: IC (57), ME (62), GGU (56), UGG (56)
Models covered: BAGEL-7B-MoT, OmniGen2, SEED-X-17B, UniWorld-V1

Downloads last month
-
Safetensors
Model size
770k params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support