RLVF pipeline using parser oracles to align LMs for Icelandic and Danish. GPT-SW3 and Viking-13B trained with Delta-DPO.
Fakhar
Hodfa71
AI & ML interests
None yet
Recent Activity
updated a bucket 1 minute ago
TrustLLMeu/saga-annotation-storage published a bucket 3 minutes ago
TrustLLMeu/saga-annotation-storage updated a bucket 17 minutes ago
TrustLLMeu/omniagentbench-multiturn-annotation-storage