Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Caiyun-AI
/
DCFormer-2.8B
like
2
Text Generation
Transformers
PyTorch
English
dcformer
causal-lm
dcmha
custom_code
arxiv:
2405.08553
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
DCFormer-2.8B
5.81 GB
Ctrl+K
Ctrl+K
3 contributors
History:
10 commits
Hilbertmeng
fix k_mask
51d254e
almost 2 years ago
.gitattributes
Safe
1.52 kB
initial commit
almost 2 years ago
README.md
Safe
2.42 kB
add paper link
almost 2 years ago
config.json
Safe
751 Bytes
upload model and code
almost 2 years ago
configuration_dcformer.py
Safe
2.51 kB
upload model and code
almost 2 years ago
generation_demo.py
Safe
1.31 kB
update readme
almost 2 years ago
modeling_dcformer.py
Safe
32.7 kB
fix k_mask
almost 2 years ago
pytorch_model.bin
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.HalfStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
5.81 GB
xet
upload model and code
almost 2 years ago
tokenizer.json
Safe
2.11 MB
upload model and code
almost 2 years ago
tokenizer_config.json
Safe
264 Bytes
upload model and code
almost 2 years ago