Boyd Kane

beyarkay

AI & ML interests

None yet

Recent Activity

upvoted a collection 3 days ago

Self-Fulfilling (Mis)alignment: Post-Trained Models

updated a dataset 12 days ago

beyarkay/elicitation-on-hard-wrapped-text

published a dataset 12 days ago

beyarkay/elicitation-on-hard-wrapped-text

Organizations

None yet

Self-Fulfilling (Mis)alignment: Post-Trained Models

Here is a selection of models that have undergone DPO. We also share the earlier instruction checkpoints. We recommend using the DPO models. • 22 items • Updated Jan 16