How to use NeelNanda/SoLU_12L_v23_old with Transformers:

    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("NeelNanda/SoLU_12L_v23_old", dtype="auto")
A GPT-2 Medium-sized SoLU model trained on 11.7B tokens of the Pile. Training crashed at 11B because of dodgy data loaders and wasn't resumed, so this run is shorter than the others. 12 layers, d_model=1536.
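For readers unfamiliar with the terms, here is a rough sketch of what "SoLU" and "GPT-2 Medium sized" mean in practice. It assumes the standard SoLU definition, x * softmax(x), applied over the MLP hidden dimension, and leaves out any normalization a trained model may apply afterwards. The parameter estimate is hypothetical, not from this card: it assumes an MLP width of 4 * d_model and ignores embeddings, biases, and LayerNorm weights. The function names are made up for illustration.

```python
import math


def solu(xs):
    """SoLU activation over one vector of pre-activations: x_i * softmax(x)_i."""
    m = max(xs)  # subtract the max before exponentiating, for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [x * e / total for x, e in zip(xs, exps)]


def approx_non_embedding_params(n_layers, d_model, mlp_ratio=4):
    """Rough weight count per layer: attention (Q, K, V, O) plus a two-matrix MLP.

    mlp_ratio=4 is an assumption; this model card does not state d_mlp.
    """
    attention = 4 * d_model ** 2
    mlp = 2 * mlp_ratio * d_model ** 2
    return n_layers * (attention + mlp)


# This model: 12 layers, d_model=1536 (from the card).
solu_12l = approx_non_embedding_params(12, 1536)
# GPT-2 Medium for comparison: 24 layers, d_model=1024.
gpt2_medium = approx_non_embedding_params(24, 1024)

print(f"SoLU 12L:     ~{solu_12l / 1e6:.0f}M non-embedding parameters")
print(f"GPT-2 Medium: ~{gpt2_medium / 1e6:.0f}M non-embedding parameters")
```

Under these assumptions the two estimates come out at roughly 340M and 302M, so they are in the same size class even though this model is wider and shallower.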