view article Article How to Reduce Memory Use in Reasoning Models By Kseniase and 1 other • 3 days ago • 8
view article Article 🦸🏻#13: Action! How AI Agents Execute Tasks with UI and API Tools By Kseniase • 6 days ago • 4
view article Article 🦸🏻#12: How Do Agents Learn from Their Own Mistakes? The Role of Reflection in AI By Kseniase • 7 days ago • 5
view article Article Everything You Need to Know about Knowledge Distillation By Kseniase and 1 other • 10 days ago • 18
Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System Paper • 2502.16750 • Published 21 days ago • 10
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published 19 days ago • 69
Beyond Release: Access Considerations for Generative AI Systems Paper • 2502.16701 • Published 21 days ago • 12
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 25 days ago • 66
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement Paper • 2410.04444 • Published Oct 6, 2024 • 2
Tree of Thoughts: Deliberate Problem Solving with Large Language Models Paper • 2305.10601 • Published May 17, 2023 • 12
Chain of Hindsight Aligns Language Models with Feedback Paper • 2302.02676 • Published Feb 6, 2023 • 1
ReAct: Synergizing Reasoning and Acting in Language Models Paper • 2210.03629 • Published Oct 6, 2022 • 24
Reflexion: Language Agents with Verbal Reinforcement Learning Paper • 2303.11366 • Published Mar 20, 2023 • 5
view article Article 🌁#89: AI in Action: How AI Engineers, Self-Optimizing Models, and Humanoid Robots Are Reshaping 2025 By Kseniase • 20 days ago • 4
Demonstrating specification gaming in reasoning models Paper • 2502.13295 • Published 26 days ago • 1