Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
11
2
Kristian Schwethelm
KristianS7
Follow
kschwethelm
AI & ML interests
Large Language Models
Recent Activity
updated
a dataset
4 days ago
KristianS7/nanochat-prepacked-fineweb-edu
published
a dataset
22 days ago
KristianS7/nanochat-prepacked-fineweb-edu
updated
a model
22 days ago
KristianS7/Ouro-1.4B
View all activity
Organizations
None yet
KristianS7
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a dataset
4 days ago
KristianS7/nanochat-prepacked-fineweb-edu
Updated
4 days ago
•
1.27k
published
a dataset
22 days ago
KristianS7/nanochat-prepacked-fineweb-edu
Updated
4 days ago
•
1.27k
updated
2 models
22 days ago
KristianS7/Ouro-1.4B
Text Generation
•
Updated
22 days ago
•
195
KristianS7/Ouro-1.4B-Thinking
Text Generation
•
Updated
22 days ago
•
298
New activity in
ByteDance/Ouro-2.6B
24 days ago
Lower evaluation results
1
#2 opened 4 months ago by
MianchuWang
New activity in
ByteDance/Ouro-1.4B
24 days ago
Differences in the results of the reproduction test on lm-evaluation-harness
3
#8 opened 3 months ago by
ThreeGold116
New activity in
ByteDance/Ouro-2.6B
24 days ago
Fix bos/eos token IDs (config.json + tokenizer_config.json)
#5 opened 24 days ago by
KristianS7
Fix UniversalTransformerCache.get_mask_sizes for batched generation
#4 opened 24 days ago by
KristianS7
New activity in
ByteDance/Ouro-1.4B
24 days ago
Fix bos/eos token IDs (config.json + tokenizer_config.json)
#11 opened 24 days ago by
KristianS7
Fix UniversalTransformerCache.get_mask_sizes for batched generation
#10 opened 24 days ago by
KristianS7
New activity in
ByteDance/Ouro-1.4B-Thinking
about 1 month ago
Fix UniversalTransformerCache.get_mask_sizes for batched generation
1
#5 opened about 1 month ago by
KristianS7
New activity in
ByteDance/Ouro-2.6B-Thinking
about 1 month ago
Fix UniversalTransformerCache.get_mask_sizes for batched generation
1
#8 opened about 1 month ago by
KristianS7
New activity in
ByteDance/Ouro-1.4B
about 1 month ago
Batched generation (batch_size > 1) produces incorrect outputs — possible causal mask issue?
➕
1
1
#9 opened 2 months ago by
vconchel
New activity in
ByteDance/Ouro-2.6B-Thinking
about 1 month ago
Fix bos/eos token IDs + add enable_thinking to chat template
2
#7 opened about 1 month ago by
KristianS7
New activity in
ByteDance/Ouro-1.4B-Thinking
about 1 month ago
Fix bos/eos token IDs + add enable_thinking to chat template
2
#4 opened about 1 month ago by
KristianS7
New activity in
ByteDance/Ouro-2.6B-Thinking
about 1 month ago
Fix bos/eos token IDs + add enable_thinking to chat template
2
#7 opened about 1 month ago by
KristianS7
New activity in
ByteDance/Ouro-1.4B-Thinking
about 1 month ago
Fix bos/eos token IDs + add enable_thinking to chat template
2
#4 opened about 1 month ago by
KristianS7
Load more