posttrain_model_ckpts
Collection: LoRA checkpoints for post-training experiments on LLaMA-2-7B with various data selection methods (MMLU task). 8 items.
Offline training with data selected via embedding-based retrieval (2.5% of the full dataset).
Note: This checkpoint is from a single random seed (seed=3) and a specific training step (step 1040). Results may vary across seeds.
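For context, a minimal sketch of what embedding-based retrieval selection could look like: embed each candidate training example and each task query, then keep the fraction of training examples most similar to the task distribution. This is an illustrative sketch, not the authors' code; the encoder choice and helper name are assumptions, and only the 2.5% ratio comes from this card.

```python
# Hypothetical sketch of embedding-retrieval data selection (not the authors' code).
# Assumes sentence-transformers is installed; the encoder model is an assumption.
import numpy as np
from sentence_transformers import SentenceTransformer

def select_by_embedding_retrieval(train_texts, task_texts, ratio=0.025):
    """Keep the `ratio` fraction of training examples closest to the task queries."""
    encoder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed encoder
    train_emb = encoder.encode(train_texts, normalize_embeddings=True)
    task_emb = encoder.encode(task_texts, normalize_embeddings=True)
    # Score each training example by its max cosine similarity to any task query.
    scores = (train_emb @ task_emb.T).max(axis=1)
    k = max(1, int(len(train_texts) * ratio))
    top_idx = np.argsort(-scores)[:k]
    return [train_texts[i] for i in top_idx]
```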
| Key | Value |
|---|---|
| Base model | meta-llama/Llama-2-7b-hf |
| Task | MMLU |
| Data selection | Embedding Retrieval |
| Data ratio | 2.5% |
| Online | False |
| LoRA rank | 128 |
| LoRA alpha | 512 |
| Target modules | q_proj, k_proj, v_proj, o_proj |
| Seed | 3 |
| Checkpoint step | 1040 |
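The LoRA hyperparameters in the table map directly onto a `peft` `LoraConfig`. A minimal sketch for reference (values from the table; remaining arguments are `peft` defaults, and `task_type` is an assumption based on the causal-LM base model):

```python
from peft import LoraConfig

# LoRA hyperparameters as listed in the table above; other
# arguments are left at peft defaults.
lora_config = LoraConfig(
    r=128,
    lora_alpha=512,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
```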
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the LLaMA-2-7B base model, then apply the LoRA adapter from this repo.
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
model = PeftModel.from_pretrained(base_model, "DATA-ADAPT/offline-embedding")
tokenizer = AutoTokenizer.from_pretrained("DATA-ADAPT/offline-embedding")
```
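Once loaded, the adapted model can be queried like any causal LM. A small usage sketch (the prompt text is illustrative, not an MMLU-formatted example):

```python
prompt = "Question: What is the capital of France?\nAnswer:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```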