Hyunwoo Ko

Cartinoe5930

https://cartinoe5930.tistory.com/

AI & ML interests

NLP (LLM)

Recent Activity

upvoted a paper 1 day ago

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Paper • 2606.02404 • Published 11 days ago • 56

updated a dataset 11 days ago

Cartinoe5930/ola-embed

Updated 10 days ago • 2.64k

published a dataset 11 days ago

Cartinoe5930/ola-embed

Updated 10 days ago • 2.64k

upvoted a paper 14 days ago

LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training

Paper • 2605.29888 • Published 15 days ago • 34

authored a paper 15 days ago

ResearchMath-14K: Scaling Research-Level Mathematics via Agents

Paper • 2605.28003 • Published 16 days ago • 49

liked a dataset 15 days ago

amphora/ResearchMath-14k

Viewer • Updated 12 days ago • 14.1k • 2.35k • 51

upvoted a paper 15 days ago

ResearchMath-14K: Scaling Research-Level Mathematics via Agents

Paper • 2605.28003 • Published 16 days ago • 49

updated a dataset 24 days ago

Cartinoe5930/every_corpus

Updated 24 days ago • 67

upvoted a paper 24 days ago

KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context

Paper • 2604.13058 • Published Mar 18 • 2

updated a model 28 days ago

Cartinoe5930/every_corpus

Updated 28 days ago

published a model 28 days ago

Cartinoe5930/every_corpus

Updated 28 days ago

published a dataset 28 days ago

Cartinoe5930/every_corpus

Updated 24 days ago • 67

authored 2 papers 30 days ago

What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models

Paper • 2601.06165 • Published Jan 7 • 16

KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context

Paper • 2604.13058 • Published Mar 18 • 2

authored a paper about 1 month ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published May 9 • 80

upvoted 2 papers about 1 month ago

XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity

Paper • 2605.05662 • Published May 7 • 11

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published May 9 • 80

updated a model about 1 month ago

Cartinoe5930/unsloth-qwen2.5-14b-inst-lora-s1k

Updated May 2

published 2 models about 1 month ago

Cartinoe5930/unsloth-qwen2.5-14b-inst-lora-s1k

Updated May 2

Cartinoe5930/unsloth-qwen3-30b-inst-s1k-full

Updated Apr 30
