Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Occiglot
community
https://occiglot.eu/
occiglot
occiglot
Activity Feed
Follow
47
AI & ML interests
Open Source Language Models for Europe
Recent Activity
stefan-it
submitted
a paper
about 14 hours ago
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale
pjox
authored
a paper
1 day ago
SciLaD: A Large-Scale, Transparent, Reproducible Dataset for Natural Scientific Language Processing
bjoernp
authored
a paper
15 days ago
sui-1: Grounded and Verifiable Long-Form Summarization
View all activity
Team members
15
occiglot
's datasets
6
Sort: Recently updated
occiglot/arcX
Viewer
•
Updated
Apr 30, 2025
•
26.4k
•
517
occiglot/hellaswagX
Viewer
•
Updated
Apr 29, 2025
•
240k
•
233
occiglot/euro-llm-leaderboard-requests
Updated
Apr 2, 2025
•
55
•
2
occiglot/occiglot-fineweb-v1.0
Updated
Nov 16, 2024
•
451
•
3
occiglot/occiglot-fineweb-v0.5
Viewer
•
Updated
May 25, 2024
•
226M
•
9
•
15
occiglot/tokenizer-wiki-bench
Viewer
•
Updated
Apr 23, 2024
•
84.4M
•
17.4k
•
6