aud facebook/wav2vec2-lv-60-espeak-cv-ft Automatic Speech Recognition • Updated Oct 31, 2023 • 1.24M • 69 sesame/csm-1b Text-to-Speech • 2B • Updated Dec 1, 2025 • 309k • 2.39k
facebook/wav2vec2-lv-60-espeak-cv-ft Automatic Speech Recognition • Updated Oct 31, 2023 • 1.24M • 69
papers TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability Paper • 2411.18211 • Published Nov 27, 2024
TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability Paper • 2411.18211 • Published Nov 27, 2024
aud facebook/wav2vec2-lv-60-espeak-cv-ft Automatic Speech Recognition • Updated Oct 31, 2023 • 1.24M • 69 sesame/csm-1b Text-to-Speech • 2B • Updated Dec 1, 2025 • 309k • 2.39k
facebook/wav2vec2-lv-60-espeak-cv-ft Automatic Speech Recognition • Updated Oct 31, 2023 • 1.24M • 69
papers TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability Paper • 2411.18211 • Published Nov 27, 2024
TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability Paper • 2411.18211 • Published Nov 27, 2024