Open to Work

Aritra Dutta

dutta18

https://vpnleaderboard.com/

AI & ML interests

None yet

Recent Activity

upvoted a collection about 2 months ago

LLaVa-NeXT

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 34

updated a dataset about 2 months ago

dutta18/esnlive

Viewer • Updated Apr 15 • 129k • 1.47k

published a dataset about 2 months ago

dutta18/esnlive

Viewer • Updated Apr 15 • 129k • 1.47k

upvoted an article about 2 months ago

Multimodal Embedding & Reranker Models with Sentence Transformers

Article • tomaarsen • Apr 9 • 61

New activity in lmms-lab/DocVQA about 2 months ago

DataFilesNotFoundError: No (supported) data files found in lmms-lab/DocVQA

#5 opened about 2 months ago by

liked a model about 2 months ago

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 183k • 1.59k

updated a dataset 2 months ago

dutta18/A-OKVQA-17K

Viewer • Updated Apr 2 • 18.2k • 260

published a dataset 2 months ago

dutta18/A-OKVQA-17K

Viewer • Updated Apr 2 • 18.2k • 260

updated a dataset 2 months ago

dutta18/Physical-Reasoning-VQA-45K

Viewer • Updated Apr 2 • 64.9k • 410

published a dataset 2 months ago

dutta18/Physical-Reasoning-VQA-45K

Viewer • Updated Apr 2 • 64.9k • 410

updated a dataset 2 months ago

dutta18/Quantity-Reasoning-VQA-23K

Viewer • Updated Apr 2 • 23.7k • 109

published a dataset 2 months ago

dutta18/Quantity-Reasoning-VQA-23K

Viewer • Updated Apr 2 • 23.7k • 109

upvoted a collection 2 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 563

New activity in google/gemma-3-4b-it 3 months ago

Finetuning Code Link In Native PyTorch

#87 opened 3 months ago by

liked a model 3 months ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 217k • 1.6k

New activity in mistralai/Ministral-3-3B-Instruct-2512 3 months ago

How to use local image in the chat template?

#15 opened 3 months ago by

updated a dataset 4 months ago

dutta18/multidomain-VQA-with-cot-trace-9K

Viewer • Updated Feb 6 • 10.8k • 49

published a dataset 4 months ago

dutta18/multidomain-VQA-with-cot-trace-9K

Viewer • Updated Feb 6 • 10.8k • 49

upvoted an article 4 months ago

Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers

Article • Narsil • Feb 1, 2022 • 16

New activity in lmms-lab/LongVA-7B 4 months ago

TypeError: unsupported operand type(s) for //: 'int' and 'NoneType' while calling the processor

#1 opened 4 months ago by