AudioVisual-Caption/ASID-Captioner-3B
Tags: Image-Text-to-Text, Transformers, Safetensors, English, qwen2_5_omni, video-captioning, audiovisual, qwen2.5-omni, instruction-tuning, attribute-structured, quality-verified, conversational
arxiv: 2602.13013
License: apache-2.0
main
Commit History
Update README.md (0fca852, verified), lyhisme committed 7 days ago
Update README.md (9b6ed46, verified), lyhisme committed 20 days ago
Update README.md (59a00fb, verified), lyhisme committed 22 days ago
Update README.md (c0c0dcc, verified), lyhisme committed 22 days ago
Update README.md (4b41a1b, verified), lyhisme committed about 1 month ago
Update README.md (acdea8c, verified), lyhisme committed on Feb 14
Update README.md (a5824a3, verified), lyhisme committed on Feb 11
Update README.md (0b1dd39, verified), lyhisme committed on Feb 11
Upload folder using huggingface_hub (393feb7, verified), lyhisme committed on Feb 9
initial commit (6abb918, verified), lyhisme committed on Feb 9