OpenMOSS

Team

university

http://openmoss.sii.edu.cn/

OpenMOSS

Activity Feed Request to join this org

AI & ML interests

LLM

Recent Activity

lkdhy new activity 3 days ago

OpenMOSS-Team/SciJudge-30B:Update SciJudge-30B model card and link 2605 release

lkdhy new activity 3 days ago

OpenMOSS-Team/SciJudgeBench:Update SciJudgeBench README links, tags, and citation

lkdhy new activity 3 days ago

OpenMOSS-Team/SciJudge-30B-2605:Upload SciJudge-30B-2605 weights and model card

View all activity

Papers

In-Context World Modeling for Robotic Control

World Action Models: The Next Frontier in Embodied AI

View all Papers

OpenMOSS-Team 's collections 21

MOSS-Audio

An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex

Running

Agents

27

MOSS Audio 8B Thinking

🐢

27

Generate answers to audio or video prompts
OpenMOSS-Team/MOSS-Audio-4B-Instruct

Audio-Text-to-Text • 5B • Updated Apr 14 • 66.4k • 73
OpenMOSS-Team/MOSS-Audio-4B-Thinking

Audio-Text-to-Text • 5B • Updated Apr 14 • 15.4k • 33
OpenMOSS-Team/MOSS-Audio-8B-Instruct

Audio-Text-to-Text • 9B • Updated 24 days ago • 16.1k • 47

MOSS-VL

OpenMOSS-Team/MOSS-VL-Instruct-0408

Video-Text-to-Text • 11B • Updated Apr 22 • 373 • 97
OpenMOSS-Team/MOSS-VL-Base-0408

Video-Text-to-Text • 11B • Updated Apr 23 • 755 • 61

AI Can Learn Scientific Taste

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 431
OpenMOSS-Team/SciJudgeBench

Preview • Updated 3 days ago • 254 • 10
OpenMOSS-Team/SciJudge-4B

Text Generation • 4B • Updated 3 days ago • 285 • • 6
OpenMOSS-Team/SciJudge-30B

Text Generation • 31B • Updated 3 days ago • 683 • 12

MOVA

OpenMOSS-Team/MOVA-360p

Image-to-Video • Updated Feb 15 • 94k • 215
OpenMOSS-Team/MOVA-720p

Any-to-Any • Updated Feb 11 • 140 • 129
MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

MOSS Transcribe Diarize

A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription.

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published Jan 4 • 62
Running

Agents

Featured

63

MOSS Transcribe Diarize

🏢

63

Transcribe audio/video with speaker diarization

ABC-Bench

Evaluating Agentic Backend Coding Capabilities in Real-World Development Scenarios

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published Jan 16 • 67
OpenMOSS-Team/ABC-Bench

Viewer • Updated Jan 20 • 224 • 157 • 4
OpenMOSS-Team/Qwen3-32B-ABC

Text Generation • 33B • Updated Jan 20 • 6 • 3
OpenMOSS-Team/Qwen3-8B-ABC

Text Generation • 8B • Updated Jan 20 • 5 • 3

Game-RL

[ICLR 2026] Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning

OpenMOSS-Team/GameQA-140K

Updated Mar 19 • 211 • 18
OpenMOSS-Team/GameQA-5K

Preview • Updated Jun 22, 2025 • 91 • 2
OpenMOSS-Team/Game-RL-Qwen2.5-VL-7B

Image-Text-to-Text • 8B • Updated Jul 27, 2025 • 10 • 3
OpenMOSS-Team/Game-RL-InternVL3-8B

8B • Updated Jun 17, 2025 • 11 • 2

DiRL

An Efficient Training Framework for Diffusion Language Models

OpenMOSS-Team/DiRL-8B-Instruct

Text Generation • 8B • Updated Jan 20 • 15 • 13

MOSS Embodied Planner

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13, 2025 • 57
Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning

Paper • 2506.23127 • Published Jun 29, 2025 • 2
World-aware Planning Narratives Enhance Large Vision-Language Model Planner

Paper • 2506.21230 • Published Jun 26, 2025 • 1
OpenMOSS-Team/Embodied_R1-ScienceWorld

8B • Updated Jun 30, 2025 • 5 • 1

MHA2MLA-refactor

The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"

OpenMOSS-Team/SmolLM-135M-MLA-d_kv_8-refactor

Text Generation • 0.1B • Updated Jun 23, 2025 • 10 • 1
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_32-refactor

Text Generation • 0.1B • Updated Jun 17, 2025 • 9 • 1
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_16-refactor

Text Generation • 0.1B • Updated Jun 17, 2025 • 7 • 1
OpenMOSS-Team/SmolLM-360M-MLA-d_kv_8-refactor

Text Generation • 0.3B • Updated Jun 17, 2025 • 8 • 1

MOSS

OpenMOSS-Team/moss-moon-003-sft-plugin

Text Generation • Updated May 27 • 498 • 71
OpenMOSS-Team/moss-moon-003-sft

Text Generation • Updated May 27 • 700 • 129
OpenMOSS-Team/moss-moon-003-base

Text Generation • Updated May 27 • 680 • 132
OpenMOSS-Team/moss-moon-003-sft-int4

Text Generation • Updated May 27 • 38 • 41

MOSS-Video-Preview

OpenMOSS-Team/moss-video-preview-base

Video-Text-to-Text • 11B • Updated 26 days ago • 19 • 13
OpenMOSS-Team/moss-video-preview-sft

Video-Text-to-Text • 11B • Updated 26 days ago • 35 • 15
OpenMOSS-Team/moss-video-preview-realtime-sft

Video-Text-to-Text • 11B • Updated 25 days ago • 59 • 23
OpenMOSS-Team/Realtime-QA-100K

Viewer • Updated 25 days ago • 100k • 497 • 5

MOSS-TTS

OpenMOSS-Team/MOSS-TTS

Text-to-Speech • 8B • Updated Mar 20 • 982k • 405
OpenMOSS-Team/MOSS-TTS-Local-Transformer

Text-to-Speech • 3B • Updated Mar 20 • 12.7k • 29
OpenMOSS-Team/MOSS-TTS-Realtime

Text-to-Speech • 2B • Updated Mar 20 • 27.5k • 90
OpenMOSS-Team/MOSS-TTS-Nano-100M

Text-to-Speech • Updated Apr 13 • 71.7k • 226

Llama Scope 2

Opensource Lorsas and Transcoders

OpenMOSS-Team/Llama-Scope-2

Updated Feb 10 • 1
OpenMOSS-Team/Llama-Scope-2-Qwen3-1.7B

Updated Apr 2 • 3

MOSS-TTSD

OpenMOSS-Team/MOSS-TTSD-v1.0

Text-to-Speech • 8B • Updated Feb 14 • 5.85k • 58
OpenMOSS-Team/MOSS-TTSD-v0.7

Text-to-Speech • 2B • Updated Nov 11, 2025 • 199 • 18
OpenMOSS-Team/MOSS-TTSD-v0.5

Text-to-Speech • 2B • Updated Sep 2, 2025 • 904 • 54
OpenMOSS-Team/MOSS-TTSD-v0

Text-to-Speech • 2B • Updated Jun 20, 2025 • 20 • 28

MOSS-Speech

True Speech-to-Speech Langugage Model

OpenMOSS-Team/MOSS-Speech

9B • Updated Sep 30, 2025 • 50 • 22
OpenMOSS-Team/MOSS-Speech-Codec

0.9B • Updated Oct 1, 2025 • 16 • 7
Running on Zero

Agents

20

MOSS-Speech Demo

🚀

20

True Speech-to-Speech Language Model
MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Paper • 2510.00499 • Published Oct 1, 2025 • 23

FutureOmni

First Omni-modal Future Forecasting Benchmark

OpenMOSS-Team/FutureOmni

Viewer • Updated Jan 22 • 1.03k • 291 • 6
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Paper • 2601.13836 • Published Jan 20 • 37

FRoM-W1

https://github.com/OpenMOSS/FRoM-W1

OpenMOSS-Team/FRoM-W1

Updated Feb 4 • 11
OpenMOSS-Team/FRoM-W1-Datasets

Viewer • Updated Jan 29 • 166k • 174 • 7
FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions

Paper • 2601.12799 • Published Jan 19 • 4

RoboOmni

Proactive Robot Manipulation in Omni-modal Context

OpenMOSS-Team/RoboOmni

Robotics • 5B • Updated Oct 30, 2025 • 12 • 16
OpenMOSS-Team/OmniAction

Updated Mar 27 • 38.1k • 283
OpenMOSS-Team/OmniAction-LIBERO

Updated Mar 27 • 5.3k • 70
OpenMOSS-Team/RoboOmni-LIBERO-Spatial

Robotics • 5B • Updated Oct 31, 2025 • 26 • 3

Low Rank Sparse Attention

Open source weights of Lorsa modules introduced in "Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition".

OpenMOSS-Team/Lorsa

Updated Apr 28, 2025 • 3
OpenMOSS-Team/Lorsa-Pythia-160M

Updated May 8, 2025 • 2
OpenMOSS-Team/Lorsa-Llama-3.1-8B

Updated May 8, 2025 • 1

MHA2MLA

The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Paper • 2502.14837 • Published Feb 20, 2025 • 4
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_16

Text Generation • 6B • Updated Mar 13, 2025 • 5 • 1
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_32

Text Generation • 6B • Updated Mar 13, 2025 • 5 • 1
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_64

Text Generation • 7B • Updated Mar 13, 2025 • 7 • 1

MOSS-Audio

An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex

Running

Agents

27

MOSS Audio 8B Thinking

🐢

27

Generate answers to audio or video prompts
OpenMOSS-Team/MOSS-Audio-4B-Instruct

Audio-Text-to-Text • 5B • Updated Apr 14 • 66.4k • 73
OpenMOSS-Team/MOSS-Audio-4B-Thinking

Audio-Text-to-Text • 5B • Updated Apr 14 • 15.4k • 33
OpenMOSS-Team/MOSS-Audio-8B-Instruct

Audio-Text-to-Text • 9B • Updated 24 days ago • 16.1k • 47

MOSS-Video-Preview

OpenMOSS-Team/moss-video-preview-base

Video-Text-to-Text • 11B • Updated 26 days ago • 19 • 13
OpenMOSS-Team/moss-video-preview-sft

Video-Text-to-Text • 11B • Updated 26 days ago • 35 • 15
OpenMOSS-Team/moss-video-preview-realtime-sft

Video-Text-to-Text • 11B • Updated 25 days ago • 59 • 23
OpenMOSS-Team/Realtime-QA-100K

Viewer • Updated 25 days ago • 100k • 497 • 5

MOSS-VL

OpenMOSS-Team/MOSS-VL-Instruct-0408

Video-Text-to-Text • 11B • Updated Apr 22 • 373 • 97
OpenMOSS-Team/MOSS-VL-Base-0408

Video-Text-to-Text • 11B • Updated Apr 23 • 755 • 61

MOSS-TTS

OpenMOSS-Team/MOSS-TTS

Text-to-Speech • 8B • Updated Mar 20 • 982k • 405
OpenMOSS-Team/MOSS-TTS-Local-Transformer

Text-to-Speech • 3B • Updated Mar 20 • 12.7k • 29
OpenMOSS-Team/MOSS-TTS-Realtime

Text-to-Speech • 2B • Updated Mar 20 • 27.5k • 90
OpenMOSS-Team/MOSS-TTS-Nano-100M

Text-to-Speech • Updated Apr 13 • 71.7k • 226

AI Can Learn Scientific Taste

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 431
OpenMOSS-Team/SciJudgeBench

Preview • Updated 3 days ago • 254 • 10
OpenMOSS-Team/SciJudge-4B

Text Generation • 4B • Updated 3 days ago • 285 • • 6
OpenMOSS-Team/SciJudge-30B

Text Generation • 31B • Updated 3 days ago • 683 • 12

Llama Scope 2

Opensource Lorsas and Transcoders

OpenMOSS-Team/Llama-Scope-2

Updated Feb 10 • 1
OpenMOSS-Team/Llama-Scope-2-Qwen3-1.7B

Updated Apr 2 • 3

MOVA

OpenMOSS-Team/MOVA-360p

Image-to-Video • Updated Feb 15 • 94k • 215
OpenMOSS-Team/MOVA-720p

Any-to-Any • Updated Feb 11 • 140 • 129
MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

MOSS-TTSD

OpenMOSS-Team/MOSS-TTSD-v1.0

Text-to-Speech • 8B • Updated Feb 14 • 5.85k • 58
OpenMOSS-Team/MOSS-TTSD-v0.7

Text-to-Speech • 2B • Updated Nov 11, 2025 • 199 • 18
OpenMOSS-Team/MOSS-TTSD-v0.5

Text-to-Speech • 2B • Updated Sep 2, 2025 • 904 • 54
OpenMOSS-Team/MOSS-TTSD-v0

Text-to-Speech • 2B • Updated Jun 20, 2025 • 20 • 28

MOSS Transcribe Diarize

A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription.

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published Jan 4 • 62
Running

Agents

Featured

63

MOSS Transcribe Diarize

🏢

63

Transcribe audio/video with speaker diarization

MOSS-Speech

True Speech-to-Speech Langugage Model

OpenMOSS-Team/MOSS-Speech

9B • Updated Sep 30, 2025 • 50 • 22
OpenMOSS-Team/MOSS-Speech-Codec

0.9B • Updated Oct 1, 2025 • 16 • 7
Running on Zero

Agents

20

MOSS-Speech Demo

🚀

20

True Speech-to-Speech Language Model
MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Paper • 2510.00499 • Published Oct 1, 2025 • 23

ABC-Bench

Evaluating Agentic Backend Coding Capabilities in Real-World Development Scenarios

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published Jan 16 • 67
OpenMOSS-Team/ABC-Bench

Viewer • Updated Jan 20 • 224 • 157 • 4
OpenMOSS-Team/Qwen3-32B-ABC

Text Generation • 33B • Updated Jan 20 • 6 • 3
OpenMOSS-Team/Qwen3-8B-ABC

Text Generation • 8B • Updated Jan 20 • 5 • 3

FutureOmni

First Omni-modal Future Forecasting Benchmark

OpenMOSS-Team/FutureOmni

Viewer • Updated Jan 22 • 1.03k • 291 • 6
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Paper • 2601.13836 • Published Jan 20 • 37

Game-RL

[ICLR 2026] Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning

OpenMOSS-Team/GameQA-140K

Updated Mar 19 • 211 • 18
OpenMOSS-Team/GameQA-5K

Preview • Updated Jun 22, 2025 • 91 • 2
OpenMOSS-Team/Game-RL-Qwen2.5-VL-7B

Image-Text-to-Text • 8B • Updated Jul 27, 2025 • 10 • 3
OpenMOSS-Team/Game-RL-InternVL3-8B

8B • Updated Jun 17, 2025 • 11 • 2

FRoM-W1

https://github.com/OpenMOSS/FRoM-W1

OpenMOSS-Team/FRoM-W1

Updated Feb 4 • 11
OpenMOSS-Team/FRoM-W1-Datasets

Viewer • Updated Jan 29 • 166k • 174 • 7
FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions

Paper • 2601.12799 • Published Jan 19 • 4

DiRL

An Efficient Training Framework for Diffusion Language Models

OpenMOSS-Team/DiRL-8B-Instruct

Text Generation • 8B • Updated Jan 20 • 15 • 13

RoboOmni

Proactive Robot Manipulation in Omni-modal Context

OpenMOSS-Team/RoboOmni

Robotics • 5B • Updated Oct 30, 2025 • 12 • 16
OpenMOSS-Team/OmniAction

Updated Mar 27 • 38.1k • 283
OpenMOSS-Team/OmniAction-LIBERO

Updated Mar 27 • 5.3k • 70
OpenMOSS-Team/RoboOmni-LIBERO-Spatial

Robotics • 5B • Updated Oct 31, 2025 • 26 • 3

MOSS Embodied Planner

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13, 2025 • 57
Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning

Paper • 2506.23127 • Published Jun 29, 2025 • 2
World-aware Planning Narratives Enhance Large Vision-Language Model Planner

Paper • 2506.21230 • Published Jun 26, 2025 • 1
OpenMOSS-Team/Embodied_R1-ScienceWorld

8B • Updated Jun 30, 2025 • 5 • 1

Low Rank Sparse Attention

Open source weights of Lorsa modules introduced in "Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition".

OpenMOSS-Team/Lorsa

Updated Apr 28, 2025 • 3
OpenMOSS-Team/Lorsa-Pythia-160M

Updated May 8, 2025 • 2
OpenMOSS-Team/Lorsa-Llama-3.1-8B

Updated May 8, 2025 • 1

MHA2MLA-refactor

The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"

OpenMOSS-Team/SmolLM-135M-MLA-d_kv_8-refactor

Text Generation • 0.1B • Updated Jun 23, 2025 • 10 • 1
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_32-refactor

Text Generation • 0.1B • Updated Jun 17, 2025 • 9 • 1
OpenMOSS-Team/SmolLM-135M-MLA-d_kv_16-refactor

Text Generation • 0.1B • Updated Jun 17, 2025 • 7 • 1
OpenMOSS-Team/SmolLM-360M-MLA-d_kv_8-refactor

Text Generation • 0.3B • Updated Jun 17, 2025 • 8 • 1

MHA2MLA

The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Paper • 2502.14837 • Published Feb 20, 2025 • 4
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_16

Text Generation • 6B • Updated Mar 13, 2025 • 5 • 1
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_32

Text Generation • 6B • Updated Mar 13, 2025 • 5 • 1
OpenMOSS-Team/Llama-2-7B-MLA-d_kv_64

Text Generation • 7B • Updated Mar 13, 2025 • 7 • 1

MOSS

OpenMOSS-Team/moss-moon-003-sft-plugin

Text Generation • Updated May 27 • 498 • 71
OpenMOSS-Team/moss-moon-003-sft

Text Generation • Updated May 27 • 700 • 129
OpenMOSS-Team/moss-moon-003-base

Text Generation • Updated May 27 • 680 • 132
OpenMOSS-Team/moss-moon-003-sft-int4

Text Generation • Updated May 27 • 38 • 41

AI & ML interests

Recent Activity

Papers

Team members 39

OpenMOSS-Team 's collections 21

MOSS Audio 8B Thinking

MOSS Transcribe Diarize

MOSS-Speech Demo

MOSS Audio 8B Thinking

MOSS Transcribe Diarize

MOSS-Speech Demo