qwen3-vl-4b-instruct-sft-seed3407-20251231-0201

簡介

  • 任務: qwen3-vl-sft
  • Base model: unsloth/Qwen3-VL-4B-Instruct
  • 作者: 蓋瑞王
  • 團隊: UNIFORM
  • 訓練設定來源: /workspace/project/unsloth/configs/qwen3-vl-4b-vision-sft.yaml
  • 輸出目錄: outputs/qwen3-vl-4b-instruct-sft-seed3407-20251231-0201

訓練設定(節錄)

seed: 3407
model:
  name: unsloth/Qwen3-VL-4B-Instruct
  max_seq_length: 2048
  load_in_4bit: true
  fast_inference: false
  use_gradient_checkpointing: unsloth
peft:
  lora_rank: 16
  lora_alpha: 16
  lora_dropout: 0
  finetune_vision_layers: true
  finetune_language_layers: true
  finetune_attention_modules: true
  finetune_mlp_modules: true
  use_rslora: false
data:
  dataset_name: unsloth/LaTeX_OCR
  split: train
  instruction: Write the LaTeX representation for this image.
  image_field: image
  text_field: text
sft:
  per_device_train_batch_size: 1
  gradient_accumulation_steps: 4
  max_steps: 1000
  learning_rate: 0.0002
  warmup_steps: 5
  optim: adamw_8bit
  weight_decay: 0.001
  lr_scheduler_type: linear
eval:
  enabled: true
  dataset_name: unsloth/LaTeX_OCR
  split: test
  max_samples: 200
  max_new_tokens: 128
  temperature: 0.0
  top_p: 1.0
  do_sample: false

評測摘要

  • 評測摘要來源: outputs/qwen3-vl-4b-instruct-sft-seed3407-20251231-0201/best_checkpoints.json
  • 最佳 checkpoint: checkpoint-200
  • num_samples: 200
  • exact_match: 0.35
  • edit_distance: 15.835
  • eval.split: test
  • eval.max_samples: 200

評測摘要(表格)

eval_results.json

source num_samples exact_match edit_distance
eval_results.json 200 0.3300 15.8550

best_checkpoints.json

rank checkpoint exact_match edit_distance note
1 checkpoint-200 0.3500 15.8350 -
2 checkpoint-100 0.3450 16.7500 -
3 checkpoint-400 0.3300 15.8550 -

備註

  • 請補上授權、用途說明與限制等資訊。
Downloads last month
12
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for gary109/qwen3-vl-4b-instruct-sft-seed3407-20251231-0201-merged_16bit

Finetuned
(27)
this model

Dataset used to train gary109/qwen3-vl-4b-instruct-sft-seed3407-20251231-0201-merged_16bit