Low Frame Rate Universal Audio Codec with Semantic-Acoustic Disentanglement
ASLP-lab
WenetSpeech-Wu
SenSE
SongFormer
WenetSpeech-Yue
A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation
LLaSE
DiffRhythm
YingMusic-Singer
YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance
VoiceSculptor
An instruct text-to-speech model developed by ASLP.
Easy Turn
WenetSpeech-Chuan
A large-scale open-source corpus with a full processing pipeline and benchmarks for ASR and TTS
OSUM
OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.
C2SER
OmniCodec
Low Frame Rate Universal Audio Codec with Semantic-Acoustic Disentanglement