Andrea Pierleoni
andreapie
AI & ML interests
None yet
Recent Activity
updated
a collection
about 1 month ago
LLM Training
updated
a collection
about 1 month ago
LLM Training
updated
a collection
about 1 month ago
LLM Training
Organizations
None yet
Collections
2
-
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Paper • 2501.09686 • Published • 37 -
Optimizing Large Language Model Training Using FP4 Quantization
Paper • 2501.17116 • Published • 36 -
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Paper • 2502.02508 • Published • 23 -
On Teacher Hacking in Language Model Distillation
Paper • 2502.02671 • Published • 18
models
None public yet
datasets
None public yet