ADRA-RL's Collections

Distillation

updated 15 days ago

  • ADRA-RL/s1_deepseek-r1_lexical_unique_trio_penalty_1.25_seed42

    Viewer • Updated 15 days ago • 128 • 13

  • ADRA-RL/s1_gemini-r1_lexical_unique_trio_penalty_1.25_seed42

    Viewer • Updated 15 days ago • 128 • 14

  • ADRA-RL/qwen2.5-7b-instrct_s1_deepseek-r1_distillation_original

    Text Generation • 1.0B • Updated 15 days ago • 22

  • ADRA-RL/qwen2.5-7b-instrct_s1_gemini-r1_distillation_original

    Text Generation • 2B • Updated 15 days ago • 15

  • ADRA-RL/qwen2.5-7b-instrct_lora_adra_s1_deepseek-r1_original_lexical_unique_trio_s140

    Updated 15 days ago

  • ADRA-RL/qwen2.5-7b-instrct_lora_adra_s1_gemini_original_lexical_unique_trio_s180

    Updated 15 days ago