Qwen3-1.7B Math PRM
Training recipe:
- Base model: Qwen/Qwen3-1.7B
- Training data: raw step-level labels from Mai0313/prm800k (prm800k/data/phase2_train.jsonl)
- Evaluation: Qwen/ProcessBench
- Format: Qwen PRM-style, with an <extra_0> marker after each reasoning step and scores read at the marker positions
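The marker format in the recipe can be illustrated with a minimal sketch. It is not the repo's training or inference code: the helper names (format_steps, marker_positions, scores_at_markers), the choice to join steps with no separator, and the toy token ids and scores are all assumptions made for illustration. In a real pipeline the marker id would presumably come from the tokenizer (for example via convert_tokens_to_ids) and the per-token scores from the model's output.

```python
STEP_MARKER = "<extra_0>"  # marker string named in the recipe


def format_steps(steps):
    """Append the marker after each reasoning step (hypothetical helper).

    Joining steps with no extra separator is an assumption.
    """
    return "".join(step.strip() + STEP_MARKER for step in steps)


def marker_positions(token_ids, marker_id):
    """Return the indices in token_ids where the marker token occurs."""
    return [i for i, tok in enumerate(token_ids) if tok == marker_id]


def scores_at_markers(per_token_scores, token_ids, marker_id):
    """Keep one score per step: the score at each marker position."""
    return [per_token_scores[i] for i in marker_positions(token_ids, marker_id)]


steps = ["2 + 3 = 5.", "5 * 4 = 20."]
text = format_steps(steps)

# Toy data: 99 stands in for the marker's token id (hypothetical value),
# and toy_scores stands in for per-token model outputs.
toy_ids = [11, 12, 99, 13, 14, 15, 99]
toy_scores = [0.1, 0.2, 0.9, 0.3, 0.4, 0.5, 0.7]
step_scores = scores_at_markers(toy_scores, toy_ids, marker_id=99)
```

Each reasoning step ends in exactly one marker, so the number of scores returned matches the number of steps; scores at all other token positions are ignored.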
This repo contains the training script for a discriminative process reward model that scores individual reasoning steps.