On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 3 days ago • 84
Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model Paper • 2602.07422 • Published 20 days ago • 21
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents Paper • 2601.12346 • Published Jan 18 • 49
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 264
Can Large Language Models Capture Human Annotator Disagreements? Paper • 2506.19467 • Published Jun 24, 2025 • 18
Balancing Truthfulness and Informativeness with Uncertainty-Aware Instruction Fine-Tuning Paper • 2502.11962 • Published Feb 17, 2025 • 38