When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning Paper • 2603.21289 • Published 5 days ago • 17
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly Paper • 2505.10610 • Published May 15, 2025 • 55
Running on CPU Upgrade 13.9k Open LLM Leaderboard 🏆 13.9k Track, rank and evaluate open LLMs and chatbots
sentence-transformers/multi-qa-mpnet-base-dot-v1 Sentence Similarity • 0.1B • Updated Aug 19, 2025 • 3.75M • • 191