Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
4
2
Tian Li
RicardoLee
Follow
nicolay-r's profile picture
wanyuzhang's profile picture
johnnyjl's profile picture
14 followers
·
1 following
AI & ML interests
Natural Language Procesing, Automatic Speech Recognition, Reinforcement Learning
Organizations
None yet
RicardoLee
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
RicardoLee/Llama2-chat-13B-Chinese-50W
almost 3 years ago
13b的context len多大以及batch?
5
#1 opened almost 3 years ago by
lucasjin
New activity in
RicardoLee/Llama2-chat-Chinese-50W
almost 3 years ago
此时不应降低学习率,warmup 等超参,而是应该放大到Pretrain 规模
3
#2 opened almost 3 years ago by
daner
关于train_sft.py中coati包
2
#3 opened almost 3 years ago by
BatmanBill
那这个怎么调用呢
4
#1 opened almost 3 years ago by
yjianchun
那这个怎么调用呢
4
#1 opened almost 3 years ago by
yjianchun