- Agent Laboratory: Using LLM Agents as Research Assistants
  Paper • 2501.04227 • Published • 86
- Search-o1: Agentic Search-Enhanced Large Reasoning Models
  Paper • 2501.05366 • Published • 95
- Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
  Paper • 2501.11425 • Published • 93
- Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
  Paper • 2501.10893 • Published • 24
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
upvoted a paper about 23 hours ago
Gemini Embedding: Generalizable Embeddings from Gemini
liked a model 1 day ago
google/gemma-3-27b-it
Organizations
Collections
4
- Training Large Language Models to Reason in a Continuous Latent Space
  Paper • 2412.06769 • Published • 78
- Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
  Paper • 2408.03314 • Published • 58
- Evolving Deeper LLM Thinking
  Paper • 2501.09891 • Published • 106
- Kimi k1.5: Scaling Reinforcement Learning with LLMs
  Paper • 2501.12599 • Published • 104
models
2
datasets
None public yet