AI & ML interests
LLM, RL
Organizations
None yet
habanoz/haber-90k-gpt-v1.6
Updated
habanoz/haber-90k-gpt-v1.5
habanoz/haber-90k-gpt-v1.4
habanoz/haber-90k-gpt-v1.3
habanoz/haber-90k-gpt-v1.2
habanoz/TinyLlama-1.1B-intermediate-step-715k-1.5T-lr-5-3epochs-oasst1-top1-instruct-V1
Text Generation
• 1B • Updated • 231
habanoz/TinyLlama-1.1B-intermediate-step-715k-1.5T-lr-5-1epch-airoboros3.1-1k-instruct-V1
Text Generation
• 1B • Updated • 247
habanoz/TinyLlama-1.1B-intermediate-step-715k-1.5T-lr-5-2.2epochs-oasst1-top1-instruct-V1
Text Generation
• 1B • Updated • 231
habanoz/TinyLlama-1.1B-intermediate-step-715k-1.5T-lr-5-4epochs-oasst1-top1-instruct-V1
Text Generation
• 1B • Updated • 237
• • 1
habanoz/TinyLlama-1.1B-step-2T-lr-5-5ep-oasst1-top1-instruct-V1
Text Generation
• 1B • Updated • 16
• 3
habanoz/TinyLlama-1.1B-2T-lr-2e-4-3ep-dolly-15k-instruct-v1
Text Generation
• 1B • Updated • 7
• 1
habanoz/tinyllama-oasst1-top1-instruct-full-lr1-5-v0.1
Text Generation
• 1B • Updated • 1.38k
• • 4
habanoz/tinyllama-2.5t-oasst1-instruct-v1
Text Generation
• Updated • 3
habanoz/llama2-2t-asstop1-lr2-e5-cos-ep3-instruct-v4
Updated
habanoz/llama2-2t-asstop1-lr2-e5-cos-ep3-instruct-v3
Updated
habanoz/phi-1_5-lr-5-3epch-airoboros3.1-1k-instruct-V1
Text Generation
• 1B • Updated • 9
habanoz/phi-1_5-lr-5-1epch-airoboros3.1-1k-instruct-V1
Text Generation
• 1B • Updated • 2
habanoz/TinyLlama-1.1B-Chat-v0.3-GPTQ
Text Generation
• 1B • Updated • 9.11k
habanoz/a2c-PandaReachDense-v2
Reinforcement Learning
• Updated • 3
habanoz/ppo-sb3-lunarlander-v2
Reinforcement Learning
• Updated • 12
habanoz/ppo-clip-LunarLander-v2
Reinforcement Learning
• Updated
habanoz/ppo-LunarLander-v2
Reinforcement Learning
• Updated • 6
Reinforcement Learning
• Updated
habanoz/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
• Updated
Reinforcement Learning
• Updated • 16
habanoz/a2c-AntBulletEnv-v0
Reinforcement Learning
• Updated
habanoz/Reinforce-pixelcopter-50k-1
Reinforcement Learning
• Updated