Jimmy

bigheiniuJ

AI & ML interests

None yet

Recent Activity

upvoted a collection 6 days ago

Agent World Model

4 items • Updated 7 days ago • 8

liked a dataset 10 days ago

miromind-ai/MiroVerse-v0.1

Viewer • Updated Jan 16 • 228k • 396 • 222

upvoted a paper 9 months ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

upvoted a paper 12 months ago

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published Feb 12, 2025 • 20

liked a dataset about 1 year ago

mlabonne/chatml_dpo_pairs

Viewer • Updated Apr 11, 2024 • 12.9k • 87 • 54

updated 9 models about 1 year ago

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-delete_0.5

Text Generation • 7B • Updated Dec 27, 2024

bigheiniuJ/zephyr-7b-dpo-full-ensemble-mixv4

Text Generation • 7B • Updated Dec 26, 2024 • 1

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-delete_0.9

Text Generation • 7B • Updated Dec 26, 2024 • 1

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-delete_0.1

Text Generation • 7B • Updated Dec 26, 2024 • 1

bigheiniuJ/zephyr-7b-dpo-full-ensemble-mixv3

Text Generation • 7B • Updated Dec 25, 2024 • 1

bigheiniuJ/zephyr-7b-full-ut-sft

Text Generation • 7B • Updated Dec 25, 2024 • 1

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-bad

Text Generation • 7B • Updated Dec 25, 2024 • 1

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-norandom

Text Generation • 7B • Updated Dec 24, 2024 • 2

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-noshort

Text Generation • 7B • Updated Dec 24, 2024 • 4

updated 2 datasets about 1 year ago

bigheiniuJ/ultrafeedback_feedback_norandom

Viewer • Updated Dec 24, 2024 • 61.1k • 1

bigheiniuJ/ultrafeedback_feedback_noshort

Viewer • Updated Dec 24, 2024 • 61.1k • 2

updated 4 models about 1 year ago

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen

Text Generation • 7B • Updated Dec 24, 2024 • 1

bigheiniuJ/zephyr-7b-dpo-full-ensemble-mixv2

Text Generation • 7B • Updated Dec 19, 2024 • 1

bigheiniuJ/zephyr-7b-dpo-full-shuffle-rejected-after

Text Generation • 7B • Updated Dec 18, 2024 • 1

bigheiniuJ/zephyr-7b-dpo-full-shuffle-chosen-after

Text Generation • 7B • Updated Dec 17, 2024 • 1