3 21 5

Run-Ze Fan

Vfrz

https://rzfan525.github.io/

AI & ML interests

Alignment, Large Language Models, Natural Language Process

Recent Activity

authored a paper 16 days ago

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

upvoted a paper 16 days ago

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

upvoted a paper about 1 month ago

MegaMath: Pushing the Limits of Open Math Corpora

View all activity

Organizations

Vfrz's activity

upvoted a paper 16 days ago

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Paper • 2504.13828 • Published 19 days ago • 16

upvoted a paper about 1 month ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 30

upvoted a paper about 2 months ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

upvoted 2 papers 2 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 17

Grounded Persuasive Language Generation for Automated Marketing

Paper • 2502.16810 • Published Feb 24 • 12

upvoted a collection 3 months ago

Deepseek Papers

Collection

Deepseek papers collection • 20 items • Updated 6 days ago • 194

upvoted a paper 3 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 49

upvoted a paper 4 months ago

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Paper • 2412.17589 • Published Dec 23, 2024 • 12

upvoted 2 papers 5 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 367

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published Dec 19, 2024 • 38

upvoted 3 papers 7 months ago

Measuring Mathematical Problem Solving With the MATH Dataset

Paper • 2103.03874 • Published Mar 5, 2021 • 5

What Matters in Transformers? Not All Attention is Needed

Paper • 2406.15786 • Published Jun 22, 2024 • 32

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 63

upvoted a paper 9 months ago

Data Contamination Report from the 2024 CONDA Shared Task

Paper • 2407.21530 • Published Jul 31, 2024 • 10

upvoted a paper 11 months ago

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Paper • 2406.12753 • Published Jun 18, 2024 • 14

upvoted 4 papers about 1 year ago

upvoted a paper over 1 year ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 148