Research Papers/Reviews/Literature - a HYPRFLX Collection

HYPRFLX 's Collections

Flagship Models

Research Papers/Reviews/Literature

Resources/Tools/Tips & Tricks/Prompting/Quick Cheatsheets

Research Papers/Reviews/Literature

updated about 10 hours ago

Daily Research papers and review including older relevant content.

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 61
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 149
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Paper • 2503.15265 • Published Mar 19 • 47
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18 • 49
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published Mar 17 • 96
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published Apr 1 • 90
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 284
The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink

Paper • 2204.05149 • Published Apr 11, 2022 • 10
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 127
Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 85
Kuwain 1.5B: An Arabic SLM via Language Injection

Paper • 2504.15120 • Published Apr 21 • 120
TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 112
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21 • 74
Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24 • 88
Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21 • 157
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Paper • 2504.18415 • Published Apr 25 • 43
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper • 2504.16083 • Published Apr 22 • 9
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

Paper • 2504.19838 • Published Apr 28 • 21
Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 93
Sadeed: Advancing Arabic Diacritization Through Small Language Model

Paper • 2504.21635 • Published Apr 30 • 59
UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Paper • 2504.20734 • Published Apr 29 • 62
A Survey of Interactive Generative Video

Paper • 2504.21853 • Published Apr 30 • 47
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks

Paper • 2505.00234 • Published May 1 • 26
Improving Editability in Image Generation with Layer-wise Memory

Paper • 2505.01079 • Published May 2 • 28
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published 29 days ago • 82
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published 28 days ago • 92
Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 28 days ago • 168
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published 26 days ago • 173
Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published 17 days ago • 114
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published 13 days ago • 98
Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published 14 days ago • 71
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published 12 days ago • 113
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published 11 days ago • 83
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Paper • 2505.17894 • Published 11 days ago • 212
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 6 days ago • 109
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published 8 days ago • 82
Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published 5 days ago • 86
Large Language Models for Data Synthesis

Paper • 2505.14752 • Published 14 days ago • 42