Japanese SFT/DPO data converted to speech via TTS, plus audio caption data generated by Qwen3-Omni. All datasets are available for commercial use.
Ayuto Tsutsumi
Atotti
Recent Activity
liked a model about 1 hour ago: cyberagent/CAT-Translate-1.4b
liked a model about 18 hours ago: Qwen/Qwen3-ForcedAligner-0.6B
updated a model about 18 hours ago: Atotti/LlamaForSpeechLM-ja-Transcribe-Full-step200000