Ling-2.6-flash / .eval_results /swe-bench_verified.yaml
RichardBian's picture
Add community evaluation results for AIME_2026, HMMT_FEB_2026, SWE-BENCH_VERIFIED (#2)
9c86125
raw
history blame contribute delete
186 Bytes
- dataset:
id: SWE-bench/SWE-bench_Verified
task_id: swe_bench_%_resolved
value: 61.2
source:
url: https://huggingface.co/inclusionAI/Ling-2.6-flash
name: Model Card