Chmielewski

Eryk-Chmielewski

Eryk-Chmielewski's activity

upvoted 13 papers about 5 hours ago

NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning

Paper • 2504.13941 • Published 24 days ago • 10

The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs

Paper • 2504.17768 • Published 15 days ago • 13

Kimi-Audio Technical Report

Paper • 2504.18425 • Published 14 days ago • 16

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Paper • 2504.16656 • Published 17 days ago • 55

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Paper • 2504.19413 • Published 12 days ago • 12

RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning

Paper • 2504.20073 • Published 15 days ago • 9

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published 11 days ago • 51

Taming the Titans: A Survey of Efficient LLM Inference Serving

Paper • 2504.19720 • Published 12 days ago • 10

Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think

Paper • 2504.20708 • Published 11 days ago • 22

Softpick: No Attention Sink, No Massive Activations with Rectified Softmax

Paper • 2504.20966 • Published 10 days ago • 26

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published 10 days ago • 38

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published 10 days ago • 38

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Paper • 2504.21776 • Published 9 days ago • 43

upvoted 7 papers about 6 hours ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published 8 days ago • 31

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published 7 days ago • 30

SWE-smith: Scaling Data for Software Engineering Agents

Paper • 2504.21798 • Published 9 days ago • 8

An Empirical Study of Qwen3 Quantization

Paper • 2505.02214 • Published 5 days ago • 22

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 4 days ago • 88

OSUniverse: Benchmark for Multimodal GUI-navigation AI Agents

Paper • 2505.03570 • Published 3 days ago • 6

LLM-Independent Adaptive RAG: Let the Question Speak for Itself

Paper • 2505.04253 • Published 3 days ago • 8