2 7 7

Lingyu Li

LingyuLi

lingyuli-cogs

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

From Sparse Decisions to Dense Reasoning: A Multi-attribute Trajectory Paradigm for Multimodal Moderation

authored a paper 4 months ago

Argus Inspection: Do Multimodal Large Language Models Possess the Eye of Panoptes?

liked a dataset 4 months ago

EVIGBYEN/DrBench

View all activity

Organizations

None yet

upvoted a paper 10 days ago

From Sparse Decisions to Dense Reasoning: A Multi-attribute Trajectory Paradigm for Multimodal Moderation

Paper • 2602.02536 • Published 19 days ago • 3

authored a paper 4 months ago

Argus Inspection: Do Multimodal Large Language Models Possess the Eye of Panoptes?

Paper • 2506.14805 • Published Jun 3, 2025 • 3

liked a dataset 4 months ago

EVIGBYEN/DrBench

Viewer • Updated 17 days ago • 214 • 96 • 4

upvoted 3 papers 4 months ago

A Mousetrap: Fooling Large Reasoning Models for Jailbreak with Chain of Iterative Chaos

Paper • 2502.15806 • Published Feb 19, 2025 • 2

A Rigorous Benchmark with Multidimensional Evaluation for Deep Research Agents: From Answers to Reports

Paper • 2510.02190 • Published Oct 2, 2025 • 19

Argus Inspection: Do Multimodal Large Language Models Possess the Eye of Panoptes?

Paper • 2506.14805 • Published Jun 3, 2025 • 3

liked a dataset 7 months ago

NousResearch/Hermes-3-Dataset

Viewer • Updated Jul 11, 2025 • 959k • 600 • 296

liked a Space 10 months ago

Qwen3 Demo

📊

836

Chat with AI assistant via text messages

New activity in meta-llama/Meta-Llama-3-8B 10 months ago

中国区的账号，都会被拒绝？

#124 opened almost 2 years ago by

HLearning

liked a dataset 10 months ago

TIGER-Lab/MMLU-Pro

Benchmark • Updated 28 days ago • 12.1k • 81.7k • 425

liked a model 11 months ago

CaasiHUANG/flames-scorer

Text Classification • Updated Apr 22, 2024 • 47 • 5

upvoted a paper 11 months ago

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20, 2025 • 96

liked a dataset 12 months ago

CCLV/CausalBench

Preview • Updated Jun 13, 2024 • 86 • 6

upvoted a paper about 1 year ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 87

liked a dataset over 1 year ago

neuralwork/arxiver

Viewer • Updated Nov 1, 2024 • 63.4k • 116 • 366

upvoted a paper over 1 year ago

Reflection-Bench: probing AI intelligence with reflection

Paper • 2410.16270 • Published Oct 21, 2024 • 6

authored a paper over 1 year ago

ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models

Paper • 2406.14952 • Published Jun 21, 2024

commented a paper over 1 year ago

Reflection-Bench: probing AI intelligence with reflection

Paper • 2410.16270 • Published Oct 21, 2024 • 6 •

authored a paper over 1 year ago

Reflection-Bench: probing AI intelligence with reflection

Paper • 2410.16270 • Published Oct 21, 2024 • 6

Lingyu Li

AI & ML interests

Recent Activity

Organizations

LingyuLi's activity

Qwen3 Demo

中国区的账号， 都会被拒绝？

中国区的账号，都会被拒绝？