ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution Paper • 2509.19349 • Published Sep 17, 2025 • 2
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 209
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving Paper • 2507.23726 • Published Jul 31, 2025 • 115
DepNeCT Collection This Hugging Face collection hosts models and datasets from DepNeCT — a dependency-based method for nested compound type identification in Sanskrit • 4 items • Updated Jul 29, 2025 • 2
REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback Paper • 2505.06548 • Published May 10, 2025 • 30
Sanskrit Semantic Similarity Collection A collection of models trained on sanskrit for semantic similarity • 5 items • Updated 1 day ago • 1
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 • 176
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model Paper • 2402.07827 • Published Feb 12, 2024 • 48
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining Paper • 2311.08849 • Published Nov 15, 2023 • 6