NemoStation/Marlin-2B

Video-Text-to-Text

text-generation

video-captioning

temporal-grounding

Use Marlin on a remote GPU machine? Third-party providers?

#9 opened 5 days ago by

Inference speed

#8 opened 7 days ago by

Does this model work by feeding in multiple sampled frames from a video, or a raw video file?

#7 opened 7 days ago by

Collaboration Opportunity

#6 opened 8 days ago by Phase-Technologies

Train on own videos / labels?

#5 opened 9 days ago by

Can you use this model with image and text-only inputs apart from video?

#4 opened 12 days ago by

Question about the evaluation metrics for captioning benchmarks

#3 opened 14 days ago by