May 2025 - Top Papers - a ajinkyakolhe112 Collection

ajinkyakolhe112 's Collections

fine-tuning & post-training

May 2025 - Top Papers

LLMs-for-Gaming

LLMs for "Low Training Data Languages"

Computer Vision - Essential Research Papers

NLP & LLM - Essential Research Papers

Computer Vision - Complimentary Research Papers

May 2025 - Top Papers

updated 1 day ago

LLMs for Engineering: Teaching Models to Design High Powered Rockets

Paper • 2504.19394 • Published Apr 27 • 13
Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions

Paper • 2504.19056 • Published Apr 27 • 16
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1 • 36
The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 69
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published 28 days ago • 22
Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published about 1 month ago • 35
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis

Paper • 2505.10046 • Published 18 days ago • 9
LightLab: Controlling Light Sources in Images with Diffusion Models

Paper • 2505.09608 • Published 18 days ago • 31
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 19 days ago • 61
Bielik v3 Small: Technical Report

Paper • 2505.02550 • Published 28 days ago • 63
Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 17 days ago • 75
Constructing a 3D Town from a Single Image

Paper • 2505.15765 • Published 11 days ago • 23
Understanding Generative AI Capabilities in Everyday Image Editing Tasks

Paper • 2505.16181 • Published 11 days ago • 23
EfficientLLM: Efficiency in Large Language Models

Paper • 2505.13840 • Published 13 days ago • 22
Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!

Paper • 2505.15656 • Published 11 days ago • 13
How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study

Paper • 2505.15404 • Published 12 days ago • 13
Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published 13 days ago • 70
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published 12 days ago • 58
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published 12 days ago • 52
Qwen3 Technical Report

Paper • 2505.09388 • Published 18 days ago • 169
Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published 12 days ago • 124
Alchemist: Turning Public Text-to-Image Data into Generative Gold

Paper • 2505.19297 • Published 7 days ago • 73
Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Paper • 2505.17225 • Published 10 days ago • 62
MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

Paper • 2505.16459 • Published 11 days ago • 44
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

Paper • 2505.18943 • Published 8 days ago • 24
VeriThinker: Learning to Verify Makes Reasoning Model Efficient

Paper • 2505.17941 • Published 9 days ago • 24
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

Paper • 2505.19075 • Published 8 days ago • 20
Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models

Paper • 2505.22232 • Published 5 days ago • 18
Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Paper • 2505.17952 • Published 9 days ago • 18