Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions Paper • 2602.14279 • Published 8 days ago • 1
Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions Paper • 2602.14279 • Published 8 days ago • 1
Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges Paper • 2602.13576 • Published 10 days ago • 1
Rubrics as an Attack Surface (RIPD) Collection This collection releases the official artifacts accompanying “Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges.” • 18 items • Updated 4 days ago
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-biased-bt Text Generation • Updated 1 day ago • 6
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-biased-bt Text Generation • Updated 1 day ago • 6
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt Text Generation • 3B • Updated 2 days ago • 5
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt Text Generation • 3B • Updated 2 days ago • 5
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt Text Generation • 3B • Updated 2 days ago • 7
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt Text Generation • 3B • Updated 2 days ago • 7