ZhuofengLi

ZhuofengLi's activity

updated a dataset 1 day ago

VerlTool/openmathreasoning_tir_100K

Viewer • Updated 1 day ago • 104k

published a dataset 1 day ago

VerlTool/openmathreasoning_tir_100K

Viewer • Updated 1 day ago • 104k

updated a model 8 days ago

VerlTool/torl-fsdp_agent-qwen_qwen2.5-math-7b-grpo-n16-b128-t1.0-lr1e-6

Updated 8 days ago • 184

published a model 8 days ago

VerlTool/torl-fsdp_agent-qwen_qwen2.5-math-7b-grpo-n16-b128-t1.0-lr1e-6

Updated 8 days ago • 184

upvoted a paper 19 days ago

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published 23 days ago • 42

upvoted a paper about 1 month ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 131

updated a model about 1 month ago

ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct

Text Generation • Updated Mar 30 • 1

published a model about 1 month ago

ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct

Text Generation • Updated Mar 30 • 1

updated a model about 1 month ago

ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct

Text Generation • Updated Mar 30 • 3

liked a dataset about 1 month ago

ZhuofengLi/TEG-Datasets

Preview • Updated Oct 29, 2024 • 391 • 4

updated a model about 1 month ago

ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct-wo-warmup

Text Generation • Updated Mar 28 • 1

published 3 models about 1 month ago

updated a model about 1 month ago

ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct-wo-warmup

Text Generation • Updated Mar 25 • 1

published a model about 1 month ago

ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct-wo-warmup

Text Generation • Updated Mar 25 • 1

liked 2 Spaces about 2 months ago

The Ultra-Scale Playbook

🌌 The ultimate guide to training LLM on large GPU Clusters • 2.55k

ScholarCopilot

📊 Using RAG LLM to assist your academic writing

updated a dataset about 2 months ago

ZhuofengLi/Big-Math-RL-Verified

Viewer • Updated Mar 14 • 251k • 12

published a dataset about 2 months ago

ZhuofengLi/Big-Math-RL-Verified

Viewer • Updated Mar 14 • 251k • 12