arxiv:2604.12944
IanHuang
Hukcc1
ยท
AI & ML interests
Multimodal LLMs | Efficiency | Reliability | Hallucination Detection & Mitigation | Video Understanding | Layout Understanding
Recent Activity
authored a paper about 1 month ago
D-CoDe: Scaling Image-Pretrained VLMs to Video via Dynamic Compression
and Question Decomposition authored a paper about 1 month ago
SHIELD: Suppressing Hallucinations In LVLM Encoders via Bias and Vulnerability Defense authored a paper about 1 month ago
Distorted or Fabricated? A Survey on Hallucination in Video LLMs