Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
OctoLong 's Collections
Instruct Checkpoints
Merged Base Checkpoints
Extended Base Checkpoints
Original Base Checkpoints

Original Base Checkpoints

updated 10 days ago

Qwen3 checkpoints with modified configurations for long context fine-tuning in the OctoLong project

Upvote
-

  • OctoLong/Qwen3-0.6B-Base

    Text Generation • 0.6B • Updated 11 days ago • 30

  • OctoLong/Qwen3-1.7B-Base

    Text Generation • 2B • Updated 11 days ago • 71

  • OctoLong/Qwen3-4B-Base

    Text Generation • 4B • Updated 11 days ago • 73

  • OctoLong/Qwen3-8B-Base

    Text Generation • 8B • Updated 11 days ago • 91 • 1

  • OctoLong/Qwen3-14B-Base

    Text Generation • 15B • Updated 11 days ago • 105
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs