Requesting submission permission - LifeOS Jaram 4.0
Hi GAIA team,
I'd like to request permission to submit evaluation results
to the GAIA leaderboard.
Agent: LifeOS Jaram 4.0 - Joonnam Lee
HF Username: joonnam
Organization: SA System Inc.
Email: joonnam.lee@gmail.com
Model family: Claude 3.5 Sonnet, Gemini 2.0 Flash
We have completed evaluation on the Level 1 test set (93 questions)
and are ready to submit our predictions JSONL file.
Thank you!
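For anyone preparing a similar submission, the predictions JSONL file mentioned above could be produced roughly as follows. This is a minimal sketch, not the leaderboard's official tooling: the field names `task_id` and `model_answer` are assumed from the leaderboard's submission instructions, and the helper `write_predictions`, the file name, and the example task IDs and answers are all hypothetical.

```python
import json
from pathlib import Path


def write_predictions(predictions, path):
    """Write one JSON object per line (JSONL), one line per task.

    `predictions` maps a task ID to the agent's final answer.
    The field names below are an assumption about the expected format.
    """
    path = Path(path)
    with path.open("w", encoding="utf-8") as f:
        for task_id, answer in predictions.items():
            record = {"task_id": task_id, "model_answer": str(answer)}
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return path


# Hypothetical task IDs and answers, for illustration only.
example = {"task-0001": "42", "task-0002": "Paris"}
out = write_predictions(example, "predictions.jsonl")
```

Reading the file back line by line with `json.loads` before uploading is a cheap way to catch malformed records.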
Thank you for maintaining the GAIA leaderboard!
I'd like to provide more information in support of submission request #84:
Agent: LifeOS Jaram 4.0
Organization: SA System Inc.
GAIA Level 1 Performance: 90/93 tasks solved (~96.8% accuracy)
(HAL Leaderboard World #1 reference: 82.1%)
HuggingFace Resources:
- Model Card: https://huggingface.co/joonnam/lifeos-jaram-agent
- Evaluation Dataset: https://huggingface.co/datasets/joonnam/lifeos-jaram-gaia-evaluation
- Demo Space: https://huggingface.co/spaces/joonnam/lifeos-demo
Live Platform:
- https://lifeos-service.vercel.app/ (EN/AR/JA/VI/ID)
- https://lifeos-deploy.vercel.app/#tech (architecture)
Key Tools Used:
- brave_search
- web_fetch
- wayback_fetch
- crossref_search
- audio_transcribe (Whisper + audd.io music fingerprinting)
- fred_data (FRED economic API)
- python_execute
We have predictions ready for the Level 1 test set and would appreciate submission access. Thank you!
We have completed all HuggingFace activity requirements since our initial request:
- Dataset: https://huggingface.co/datasets/joonnam/lifeos-jaram-gaia-evaluation
- Model Card: https://huggingface.co/joonnam/lifeos-jaram-agent
- Space: https://huggingface.co/spaces/joonnam/lifeos-demo
- Active community participation (likes, discussions)
Could you please verify if our account now meets the activity threshold for submission?
If not, what additional steps are needed?
Agent: LifeOS Jaram 4.0
Organization: SA System Inc.
GAIA Level 1: 93 questions evaluated, predictions ready to submit.
Thank you!
Did they ever get back to you?