Miscellaneous Text Datasets for Language Models izumi-lab/oscar2301-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 31.4M • 240 • 6 izumi-lab/mc4-ja Viewer • Updated Jul 29, 2023 • 87.4M • 3.92k • 6 izumi-lab/mc4-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 52.6M • 698 • 5 izumi-lab/wikinews-ja-20230728 Viewer • Updated Jul 29, 2023 • 4.28k • 60 • 5
Japanese General Pre-trained Language Models izumi-lab/deberta-v2-base-japanese Fill-Mask • 0.1B • Updated 7 days ago • 4.73k • • 5 izumi-lab/deberta-v2-small-japanese Fill-Mask • 26.2M • Updated 7 days ago • 50 izumi-lab/bert-small-japanese Fill-Mask • Updated Dec 9, 2022 • 91 • 5 izumi-lab/electra-base-japanese-discriminator 0.1B • Updated 7 days ago • 67 • 2
llm-japanese-dataset izumi-lab/llm-japanese-dataset Viewer • Updated Jan 18, 2024 • 9.07M • 437 • 142 izumi-lab/llm-japanese-dataset-vanilla Viewer • Updated Feb 17, 2024 • 2.49M • 367 • 33
Japanese LoRA-tuned LLMs izumi-lab/stormy-7b-10ep Updated Jun 26, 2023 • 5 izumi-lab/llama-13b-japanese-lora-v0-1ep Updated May 23, 2023 • 11 izumi-lab/llama-7b-japanese-lora-v0-5ep Updated Jun 23, 2023 • 3 Paused 4 LLaMA 13B Japanese LoRA v0 1 epoch 🐨 4
Japanese Financial Pre-trained Language Models izumi-lab/bert-base-japanese-fin-additional 0.1B • Updated Jun 16, 2025 • 610 • 3 izumi-lab/bert-small-japanese-fin Fill-Mask • 18.1M • Updated 7 days ago • 67 • 2 izumi-lab/electra-small-japanese-fin-discriminator 13.8M • Updated 7 days ago • 30 izumi-lab/electra-small-japanese-fin-generator Fill-Mask • 13.8M • Updated Oct 21, 2023 • 16
Miscellaneous Text Datasets for Language Models izumi-lab/oscar2301-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 31.4M • 240 • 6 izumi-lab/mc4-ja Viewer • Updated Jul 29, 2023 • 87.4M • 3.92k • 6 izumi-lab/mc4-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 52.6M • 698 • 5 izumi-lab/wikinews-ja-20230728 Viewer • Updated Jul 29, 2023 • 4.28k • 60 • 5
Japanese LoRA-tuned LLMs izumi-lab/stormy-7b-10ep Updated Jun 26, 2023 • 5 izumi-lab/llama-13b-japanese-lora-v0-1ep Updated May 23, 2023 • 11 izumi-lab/llama-7b-japanese-lora-v0-5ep Updated Jun 23, 2023 • 3 Paused 4 LLaMA 13B Japanese LoRA v0 1 epoch 🐨 4
Japanese General Pre-trained Language Models izumi-lab/deberta-v2-base-japanese Fill-Mask • 0.1B • Updated 7 days ago • 4.73k • • 5 izumi-lab/deberta-v2-small-japanese Fill-Mask • 26.2M • Updated 7 days ago • 50 izumi-lab/bert-small-japanese Fill-Mask • Updated Dec 9, 2022 • 91 • 5 izumi-lab/electra-base-japanese-discriminator 0.1B • Updated 7 days ago • 67 • 2
Japanese Financial Pre-trained Language Models izumi-lab/bert-base-japanese-fin-additional 0.1B • Updated Jun 16, 2025 • 610 • 3 izumi-lab/bert-small-japanese-fin Fill-Mask • 18.1M • Updated 7 days ago • 67 • 2 izumi-lab/electra-small-japanese-fin-discriminator 13.8M • Updated 7 days ago • 30 izumi-lab/electra-small-japanese-fin-generator Fill-Mask • 13.8M • Updated Oct 21, 2023 • 16
llm-japanese-dataset izumi-lab/llm-japanese-dataset Viewer • Updated Jan 18, 2024 • 9.07M • 437 • 142 izumi-lab/llm-japanese-dataset-vanilla Viewer • Updated Feb 17, 2024 • 2.49M • 367 • 33