Kyutai

non-profit

authored 2 papers 4 months ago

PRiSM: Benchmarking Phone Realization in Speech Models

Paper • 2601.14046 • Published Jan 20 • 7

Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects

Paper • 2601.07274 • Published Jan 12 • 1

authored 3 papers 5 months ago

PWESuite: Phonetic Word Embeddings and Tasks They Facilitate

Paper • 2304.02541 • Published Apr 5, 2023 • 2

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 5

POWSM: A Phonetic Open Whisper-Style Speech Foundation Model

Paper • 2510.24992 • Published Oct 28, 2025 • 4

authored 4 papers over 2 years ago

Proactive Detection of Voice Cloning with Localized Watermarking

Paper • 2401.17264 • Published Jan 30, 2024 • 19

Masked Audio Generation using a Single Non-Autoregressive Transformer

Paper • 2401.04577 • Published Jan 9, 2024 • 44

High Fidelity Neural Audio Compression

Paper • 2210.13438 • Published Oct 24, 2022 • 4

Code Llama: Open Foundation Models for Code

Paper • 2308.12950 • Published Aug 24, 2023 • 29

authored 3 papers almost 3 years ago

From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion

Paper • 2308.02560 • Published Aug 2, 2023 • 5

Simple and Controllable Music Generation

Paper • 2306.05284 • Published Jun 8, 2023 • 167

Textually Pretrained Speech Language Models

Paper • 2305.13009 • Published May 22, 2023 • 4