openai/gsm8k
AnonyRepo/bidir-prm-dream7b-gsm8k
A LoRA adapter plus a reward head on top of Dream-org/Dream-v0-Instruct-7B, used to score the partially denoised intermediate states of a diffusion LLM (dLLM).
LoRA target modules: q_proj + v_proj

| Mask ratio | Accuracy |
|---|---|
| 0.0-0.1 | 0.918 |
| 0.5-0.6 | 0.818 |
| 0.9-1.0 | 0.578 |
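
To make the mask-ratio buckets in the table concrete, here is a minimal sketch. It assumes that a partially denoised state is a token sequence in which some fraction of positions still holds a mask token, and that the mask ratio is that fraction. The helper names (`mask_tokens`, `observed_mask_ratio`, `ratio_bucket`) and the placeholder `MASK_ID` are hypothetical and not part of the companion code; the real mask token id would come from the Dream tokenizer.

```python
import random

MASK_ID = -1  # hypothetical placeholder; the real id comes from the tokenizer


def mask_tokens(token_ids, mask_ratio, mask_token_id=MASK_ID, seed=0):
    """Replace round(mask_ratio * len) randomly chosen positions with the mask id."""
    if not 0.0 <= mask_ratio <= 1.0:
        raise ValueError("mask_ratio must be in [0, 1]")
    rng = random.Random(seed)
    n_mask = round(mask_ratio * len(token_ids))
    positions = set(rng.sample(range(len(token_ids)), n_mask))
    return [mask_token_id if i in positions else t for i, t in enumerate(token_ids)]


def observed_mask_ratio(token_ids, mask_token_id=MASK_ID):
    """Fraction of positions that are still masked."""
    if not token_ids:
        return 0.0
    return sum(t == mask_token_id for t in token_ids) / len(token_ids)


def ratio_bucket(ratio, width=0.1):
    """Map a ratio in [0, 1] to a bucket label such as '0.5-0.6'."""
    n_buckets = round(1 / width)
    # Clamp so that a ratio of exactly 1.0 falls into the last bucket.
    idx = min(int(ratio * n_buckets), n_buckets - 1)
    return f"{idx * width:.1f}-{(idx + 1) * width:.1f}"


if __name__ == "__main__":
    tokens = list(range(100, 120))          # 20 dummy token ids
    noisy = mask_tokens(tokens, 0.55)       # mask roughly 55% of positions
    print(ratio_bucket(observed_mask_ratio(noisy)))
```

Under this reading, a state with about half of its tokens still masked falls into the 0.5-0.6 row of the table.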
```python
from transformers import AutoModel
from safetensors.torch import load_file
import torch

base = AutoModel.from_pretrained(
    "Dream-org/Dream-v0-Instruct-7B",
    trust_remote_code=True,
    attn_implementation="sdpa",
    torch_dtype=torch.bfloat16,
    device_map="cuda",
)
state_dict = load_file("adapter.safetensors")
# Use DiffusionPRM from the companion code repo to reassemble.
```
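
Before handing the loaded `state_dict` to the companion code, it can help to check which tensors it contains. The sketch below groups parameter names by whether they contain the `lora_` marker that PEFT conventionally uses for LoRA weights (`lora_A`, `lora_B`). Whether this checkpoint follows that naming is an assumption, and the helper `group_adapter_keys` and the demo key names are hypothetical; they are not taken from the actual file.

```python
from collections import defaultdict


def group_adapter_keys(state_dict, lora_marker="lora_"):
    """Split parameter names into LoRA weights and everything else."""
    groups = defaultdict(list)
    for name in state_dict:
        kind = "lora" if lora_marker in name else "other"
        groups[kind].append(name)
    return dict(groups)


if __name__ == "__main__":
    # Hypothetical key names for illustration only; the real file may differ.
    demo_state = {
        "layers.0.self_attn.q_proj.lora_A.weight": None,
        "layers.0.self_attn.q_proj.lora_B.weight": None,
        "layers.0.self_attn.v_proj.lora_A.weight": None,
        "reward_head.weight": None,
    }
    for kind, names in group_adapter_keys(demo_state).items():
        print(kind, len(names))
```

With the real checkpoint, the call would be `group_adapter_keys(state_dict)`; any names in the `other` group are candidates for the reward head parameters.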
Base model: Dream-org/Dream-v0-Instruct-7B