Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Viharikvs
/
CMBATRM
like
1
Text Generation
PyTorch
trm
act
recursive
wikitext
Model card
Files
Files and versions
xet
Community
main
CMBATRM
635 MB
1 contributor
History:
23 commits
Viharikvs
Model card updated after epoch 1
3363076
verified
5 months ago
.gitattributes
Safe
1.52 kB
initial commit
5 months ago
README.md
886 Bytes
Model card updated after epoch 1
5 months ago
best_model_ema.bin
pickle
Detected Pickle imports (4)
"torch.FloatStorage"
,
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
159 MB
xet
End of Epoch 1: EMA weights upload
5 months ago
local_training_state.pt
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
318 MB
xet
Checkpoint at step 8000 (Epoch 1)
5 months ago
pytorch_model.bin
pickle
Detected Pickle imports (4)
"torch.FloatStorage"
,
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
159 MB
xet
End of Epoch 1: Val Loss 4.8248, Val LM 4.8149, PPL 123.34 (raw)
5 months ago