Phan Hoang

phanhoang

3 37 145

AI & ML interests

None yet

Recent Activity

upvoted an article about 8 hours ago

Supercharge your OCR Pipelines with Open Models

liked a model 10 days ago

ATH-MaaS/OvisOCR2

liked a model 11 days ago

baidu/Unlimited-OCR

View all activity

Organizations

None yet

upvoted an article about 8 hours ago

Article

Supercharge your OCR Pipelines with Open Models

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq

•

Oct 21, 2025

• 319

liked a model 10 days ago

ATH-MaaS/OvisOCR2

Image-Text-to-Text • 0.9B • Updated 16 days ago • 66.2k • 356

liked a model 11 days ago

baidu/Unlimited-OCR

Image-Text-to-Text • 3B • Updated 2 days ago • 2.51M • 3.64k

liked 2 datasets 13 days ago

trannhiem/TranNhiem-Vietnamese-ImageText-Reasoning

Viewer • Updated 8 days ago • 545k • 1.12k • 7

trannhiem/TranNhiem-Vietnamese-DocumentImage-Reasoning

Viewer • Updated 8 days ago • 64.8k • 504 • 5

liked a dataset about 2 months ago

5CD-AI/Viet-Handwriting-OCR-v2

Viewer • Updated 17 days ago • 60.2k • 486 • 74

liked a Space over 1 year ago

VLM R1 Referral Expression

💬

Mark regions in images based on text descriptions

liked 2 models over 1 year ago

ByteDance/Sa2VA-1B

Image-Text-to-Text • 1B • Updated Sep 8, 2025 • 563 • 30

OpenGVLab/InternVL2_5-4B-MPO

Image-Text-to-Text • 4B • Updated Sep 11, 2025 • 2.24k • 18

upvoted a paper over 1 year ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 63

liked a model over 1 year ago

alibaba-damo/mgp-str-base

Image-to-Text • 0.1B • Updated Dec 11, 2023 • 99k • 65

liked 2 datasets over 1 year ago

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 240k • 1.34k

5CD-AI/Viet-Table-Markdown

Viewer • Updated Nov 17, 2024 • 64.9k • 754 • 17

upvoted an article almost 2 years ago

Article

ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models

ahmed-masry

•

Oct 18, 2024

• 22

liked a model almost 2 years ago

InternScience/StructTable-InternVL2-1B

Image-to-Text • 0.9B • Updated Dec 6, 2025 • 214 • 49

upvoted an article almost 2 years ago

Article

Visually Multilingual: Introducing mcdse-2b

marco

•

Oct 27, 2024

• 41

liked 4 models almost 2 years ago

Phan Hoang

AI & ML interests

Recent Activity

Organizations

phanhoang's activity

Supercharge your OCR Pipelines with Open Models

VLM R1 Referral Expression

ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models

Visually Multilingual: Introducing mcdse-2b