Speech & Audio Processing SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations Paper • 2510.25955 • Published Oct 29, 2025 marcoyang/spear-xlarge-speech-audio 0.6B • Updated Feb 3 • 37.2k • 4
SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations Paper • 2510.25955 • Published Oct 29, 2025
video-SALMONN 2 video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions. tsinghua-ee/video-SALMONN-2_plus_72B Updated Sep 28, 2025 • 22 • 2 tsinghua-ee/video_SALMONN2plus_72B_audioAlign Updated Jan 28 • 63 tsinghua-ee/video-SALMONN2_plus_7B_full 9B • Updated Feb 23 • 114 tsinghua-ee/video-SALMONN-2_plus_7B Updated Sep 28, 2025 • 873 • 6
General Time Series SciTS: Scientific Time Series Understanding and Generation with LLMs Paper • 2510.03255 • Published Sep 26, 2025 OpenTSLab/SciTS Preview • Updated 9 days ago • 37.9k • 1
SciTS: Scientific Time Series Understanding and Generation with LLMs Paper • 2510.03255 • Published Sep 26, 2025
Brain Signals BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals Paper • 2505.18185 • Published May 18, 2025 • 1 OpenTSLab/BrainOmni Updated Oct 15, 2025 • 2
BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals Paper • 2505.18185 • Published May 18, 2025 • 1
Speech & Audio Processing SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations Paper • 2510.25955 • Published Oct 29, 2025 marcoyang/spear-xlarge-speech-audio 0.6B • Updated Feb 3 • 37.2k • 4
SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations Paper • 2510.25955 • Published Oct 29, 2025
General Time Series SciTS: Scientific Time Series Understanding and Generation with LLMs Paper • 2510.03255 • Published Sep 26, 2025 OpenTSLab/SciTS Preview • Updated 9 days ago • 37.9k • 1
SciTS: Scientific Time Series Understanding and Generation with LLMs Paper • 2510.03255 • Published Sep 26, 2025
video-SALMONN 2 video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions. tsinghua-ee/video-SALMONN-2_plus_72B Updated Sep 28, 2025 • 22 • 2 tsinghua-ee/video_SALMONN2plus_72B_audioAlign Updated Jan 28 • 63 tsinghua-ee/video-SALMONN2_plus_7B_full 9B • Updated Feb 23 • 114 tsinghua-ee/video-SALMONN-2_plus_7B Updated Sep 28, 2025 • 873 • 6
Brain Signals BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals Paper • 2505.18185 • Published May 18, 2025 • 1 OpenTSLab/BrainOmni Updated Oct 15, 2025 • 2
BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals Paper • 2505.18185 • Published May 18, 2025 • 1