How to use from the
Use from the
PEFT library
Task type is invalid.

bidir-prm-llada-8b

Bidirectional PRM on LLaDA-8B-Base (cross-backbone replication). Used in Appendix F; achieves ~0.32 accuracy on GSM8K under PRM-Guided K=8 (all configs). Base model: GSAI-ML/LLaDA-8B-Base.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for AnonyRepo/bidir-prm-llada-8b-gsm8k

Adapter
(12)
this model