Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wentian Zhao's picture
2

Wentian Zhao

zwt123home123
·
  • [email protected]

AI & ML interests

None yet

Recent Activity

authored a paper 25 days ago
DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training
upvoted a paper 25 days ago
DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training
updated a model about 1 month ago
zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_320_actor
View all activity

Organizations

None yet

Papers 2

arxiv:2504.09710
arxiv:2410.06169

models 113

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_320_actor

Updated Apr 3 • 1

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_203_actor

Updated Apr 3 • 1

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-3ppl_largebs_global_step_203_actor

Updated Apr 3 • 2

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-3ppl_largebs_global_step_400_actor

Updated Apr 3 • 1

zwt123home123/global_step_840_actor

Updated Apr 2 • 1

zwt123home123/InternVL2_5-8B

Image-Text-to-Text • Updated Feb 19 • 6

zwt123home123/KV_internvl26b

Updated Feb 4

zwt123home123/13b_LUT_c100_zpz5_afterrope_nonorm_group_v_cache_640

Updated Jan 19

zwt123home123/13b_LUT_c100_zpz5_prerope_nonorm_group_v_cache_640

Updated Jan 16

zwt123home123/attn_weights_save_7b_all_layers_concat_10

Updated Jan 3

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs