Ofer Hasson
hassonofer
2 followers · 18 following
AI & ML interests
Computer Vision
Recent Activity
Upvoted a collection about 5 hours ago: Perception Encoder
Reacted to Anran-MLLM's post about 5 hours ago:

Introducing PerceptionDLM, the first multimodal diffusion LLM for parallel region perception! Most MLLMs are autoregressive, so captioning N regions costs N sequential passes. PerceptionDLM instead describes ALL masked regions in a single denoising process.

Highlights
• Up to 3.4× faster on dense multi-region captioning, with stable per-image latency
• PerceptionDLM-Base beats LLaDA-V on 15/16 multimodal benchmarks (new SOTA among open diffusion VLMs)
• New benchmark: ParaDLC-Bench, which jointly evaluates caption quality AND inference efficiency
• Code, models & benchmark all open-sourced

Models: https://huggingface.co/MSALab/PerceptionDLM-Base https://huggingface.co/MSALab/PerceptionDLM
Benchmark: https://huggingface.co/datasets/MSALab/ParaDLC-Bench
Paper: https://huggingface.co/papers/2606.19534
Code: https://github.com/MSALab-PKU/PerceptionDLM

Diffusion LLMs aren't just for text: they unlock efficient, parallel visual perception.

#multimodal #diffusion #VLM #perception
Liked a model about 18 hours ago: MiniMaxAI/MiniMax-M3
Models (4)
hassonofer/vit_reg1_s14_ls_dino-v2-dist-bio • Image Feature Extraction • Updated May 18 • 4 • 1
hassonofer/vit_so150m_patch14_reg4_biodino_336 • Image Feature Extraction • Updated May 10 • 2
hassonofer/vit_so150m_patch14_reg4_biodino_252 • Image Feature Extraction • Updated May 10 • 2
hassonofer/vit_so150m_patch14_reg4_biodino_224 • Image Feature Extraction • Updated May 10 • 3
Datasets (0)
None public yet