Pretrained LLMs from scratch.
Y. Yu
PursuitOfDataScience
AI & ML interests
LLM, GPU Computing, PyTorch
Recent Activity
updated
a collection
9 days ago
ArgonneAI
updated
a model
9 days ago
PursuitOfDataScience/Argonne-2.0
published
a model
9 days ago
PursuitOfDataScience/Argonne-2.0
Organizations
None yet
Sandbox Models
Trial & Error models for various tasks.
-
PursuitOfDataScience/roberta-large-ner
Token Classification • 0.4B • Updated • 4 -
PursuitOfDataScience/distilbert-base-cased-ner
Token Classification • 65.2M • Updated • 1 -
PursuitOfDataScience/bert-base-ner
Token Classification • 0.1B • Updated -
PursuitOfDataScience/t5-large-summary-model
0.7B • Updated
ArgonneAI
Pretrained LLMs from scratch.
Sandbox Models
Trial & Error models for various tasks.
-
PursuitOfDataScience/roberta-large-ner
Token Classification • 0.4B • Updated • 4 -
PursuitOfDataScience/distilbert-base-cased-ner
Token Classification • 65.2M • Updated • 1 -
PursuitOfDataScience/bert-base-ner
Token Classification • 0.1B • Updated -
PursuitOfDataScience/t5-large-summary-model
0.7B • Updated
models
22
PursuitOfDataScience/Argonne-2.0
Text Generation
•
6B
•
Updated
•
34
PursuitOfDataScience/llama3.2-1b-thinking
Text Generation
•
1B
•
Updated
•
1
PursuitOfDataScience/llama-3-2-1b-open-r1-mot-sft
Text Generation
•
1B
•
Updated
•
2
PursuitOfDataScience/qwen2.5-0.5b-r1-dpo
Text Generation
•
0.5B
•
Updated
PursuitOfDataScience/qwen2.5-0.5b-dpo
Text Generation
•
0.5B
•
Updated
•
3
PursuitOfDataScience/qwen2.5-0.5b-open-r1-mot-cot-sft
Text Generation
•
0.5B
•
Updated
•
1
PursuitOfDataScience/llama3.2-1b-dpo
Text Generation
•
1B
•
Updated
•
1
PursuitOfDataScience/qwen2.5-0.5b-ultrachat-sft-multi-turn
0.5B
•
Updated
•
1
PursuitOfDataScience/finetuned-llama-3.2-3b-math-reasoning
3B
•
Updated
•
1
PursuitOfDataScience/finetuned-llama-3.2-3b-dpo
Text Generation
•
3B
•
Updated
•
3
datasets
44
PursuitOfDataScience/toucan-agentic-thinking
Viewer
•
Updated
•
119k
•
44
PursuitOfDataScience/arxiv-qa-thinking
Viewer
•
Updated
•
215k
•
20
PursuitOfDataScience/0.9M-thinking
Viewer
•
Updated
•
898k
•
120
PursuitOfDataScience/0.5M-thinking
Viewer
•
Updated
•
499k
•
163
PursuitOfDataScience/MiniMax-M2.1-Mixture-of-Thoughts
Viewer
•
Updated
•
349k
•
426
•
2
PursuitOfDataScience/gsm8k-thinking
Viewer
•
Updated
•
8.79k
•
10
PursuitOfDataScience/bbc-news-llama4-maverick-summary
Viewer
•
Updated
•
174k
•
11
PursuitOfDataScience/govreport-llama4-maverick-summary
Viewer
•
Updated
•
19.5k
•
13
•
1
PursuitOfDataScience/arxiv-llama4-maverick-abstract
Viewer
•
Updated
•
198k
•
19
PursuitOfDataScience/xsum-llama4-maverick-summary
Viewer
•
Updated
•
227k
•
14