Eric NG's picture

Eric NG

Eric108

·

AI & ML interests

NLP

Recent Activity

upvoted a paper 1 day ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

upvoted a paper 1 day ago

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

upvoted a paper 1 day ago

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

View all activity

Organizations

None yet

Eric108's activity

upvoted 4 papers 1 day ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 3 days ago • 78

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published 11 days ago • 30

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published 3 days ago • 16

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published 4 days ago • 58

liked a model 2 days ago

richinfoai/ritrieve_zh_v1

Sentence Similarity • Updated Mar 25 • 1.47k • • 9

upvoted 7 papers 3 days ago

Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks

Paper • 2505.00234 • Published 8 days ago • 21

DeepCritic: Deliberate Critique with Large Language Models

Paper • 2505.00662 • Published 7 days ago • 48

Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think

Paper • 2504.20708 • Published 10 days ago • 21

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Paper • 2504.21776 • Published 8 days ago • 41

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published 10 days ago • 50

NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning

Paper • 2504.13941 • Published 23 days ago • 9

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

Paper • 2504.19162 • Published 12 days ago • 15

upvoted 2 papers 10 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published Feb 6 • 23

upvoted a collection 10 days ago

Qwen3

37 items • Updated about 9 hours ago • 547

upvoted 2 papers 11 days ago

Tina: Tiny Reasoning Models via LoRA

Paper • 2504.15777 • Published 17 days ago • 52

Process Reward Models That Think

Paper • 2504.16828 • Published 15 days ago • 16

liked 2 datasets 15 days ago

Team-ACE/ToolACE

Viewer • Updated Sep 4, 2024 • 11.3k • 1.2k • 81

Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24 • 60k • 4.36k • 448

upvoted a paper 17 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 20 days ago • 119