Submitted by ambean 18 Clinical knowledge in LLMs does not translate to human interactions · 11 authors 2
Submitted by lgy0404 17 LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects · 18 authors 3
Submitted by QizhiPei 12 CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges · 9 authors 3
Submitted by judge 11 SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning · 8 authors 1
Submitted by iofu728 8 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention · 11 authors 1
Submitted by cloudcatcher2 6 Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency · 7 authors 1
Submitted by renqiux0302 4 TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving · 13 authors 1
Submitted by observerw 3 ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development · 6 authors 1
Submitted by FocusV857 2 ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers · 5 authors 1
Submitted by akhaliq 1 Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory · 5 authors 1
Submitted by AaronZ345 1 Versatile Framework for Song Generation with Prompt-based Control · 11 authors 1
Submitted by soujanyaporia - NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks · 8 authors 1