dpo/sft tuned language models on politune
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
updated a collection 14 days ago
MastermindEval updated a collection 14 days ago
MastermindEval updated a collection 15 days ago
PolituneOrganizations
models 24
whoisjones/politune-qwen3-8b-right-dpo
Text Generation • Updated • 15
whoisjones/politune-qwen3-8b-right-sft
Text Generation • Updated • 23
whoisjones/politune-qwen3-8b-left-dpo
Text Generation • Updated • 22
whoisjones/politune-qwen3-8b-left-sft
Text Generation • Updated • 21
whoisjones/politune-mistral-7b-right-dpo
Text Generation • Updated • 23
whoisjones/politune-mistral-7b-right-sft
Text Generation • Updated • 19
whoisjones/politune-mistral-7b-left-dpo
Text Generation • Updated • 24
whoisjones/politune-mistral-7b-left-sft
Text Generation • Updated • 15
whoisjones/politune-llama3-8b-right-dpo
Text Generation • Updated • 24
whoisjones/politune-llama3-8b-right-sft
Text Generation • Updated • 24
datasets 29
whoisjones/finerweb_document_context
Updated • 37
whoisjones/sudoku
Viewer • Updated • 1.42M • 36
whoisjones/maze
Viewer • Updated • 9k • 6
whoisjones/multinerd
Viewer • Updated • 1.67M • 26
whoisjones/masakhaner
Viewer • Updated • 153k • 17 • 1
whoisjones/uner
Viewer • Updated • 66.8k • 71
whoisjones/fiNERweb
Viewer • Updated • 3.98M • 118 • 9
whoisjones/fiNERweb-x
Updated • 58
whoisjones/fiNERweb-x-multi
Updated • 37
whoisjones/fiNERweb-gemma-x-multi
Updated • 12