guankoala

purekoala

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

liked a Space about 1 year ago

nanotron/ultrascale-playbook

liked a model over 1 year ago

medxiaorudan/CodeLlama_CPP_FineTuned

Organizations

None yet

upvoted a paper 24 days ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published 25 days ago • 45

liked a Space about 1 year ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked 2 models over 1 year ago

medxiaorudan/CodeLlama_CPP_FineTuned

Updated Jan 24, 2024 • 1 • 1

ajibawa-2023/Code-Llama-3-8B

Text Generation • 8B • Updated May 8, 2024 • 105 • 31