Inbox - a oeohomos Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

oeohomos 's Collections

Inbox

Qwen

Deepseek Papers

RAG

Inbox

updated about 20 hours ago

RuCCoD: Towards Automated ICD Coding in Russian

Paper • 2502.21263 • Published 19 days ago • 122
Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 12 days ago • 105
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 12 days ago • 43
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published 12 days ago • 25
Forgetting Transformer: Softmax Attention with a Forget Gate

Paper • 2503.02130 • Published 15 days ago • 27
SafeArena: Evaluating the Safety of Autonomous Web Agents

Paper • 2503.04957 • Published 13 days ago • 18
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control

Paper • 2503.05639 • Published 12 days ago • 22
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published 12 days ago • 32
Learning from Failures in Multi-Attempt Reinforcement Learning

Paper • 2503.04808 • Published 15 days ago • 17
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published 13 days ago • 14
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Paper • 2503.05638 • Published 12 days ago • 17
BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities

Paper • 2503.05652 • Published 12 days ago • 10
ProReflow: Progressive Reflow with Decomposed Velocity

Paper • 2503.04824 • Published 14 days ago • 9
An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Paper • 2503.04548 • Published 13 days ago • 8
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Paper • 2503.05447 • Published 12 days ago • 7
LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding

Paper • 2503.04359 • Published 13 days ago • 6
SAGE: A Framework of Precise Retrieval for RAG

Paper • 2503.01713 • Published 16 days ago • 5
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test

Paper • 2503.01840 • Published 16 days ago • 4
Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles

Paper • 2502.18968 • Published 21 days ago • 3
LoRACode: LoRA Adapters for Code Embeddings

Paper • 2503.05315 • Published 12 days ago • 10
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM

Paper • 2503.04504 • Published 13 days ago • 2
YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 8 days ago • 57
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Paper • 2503.08625 • Published 8 days ago • 24
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published 9 days ago • 31
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 7 days ago • 55
Multimodal Language Modeling for High-Accuracy Single Cell Transcriptomics Analysis and Generation

Paper • 2503.09427 • Published 7 days ago • 3
Video Action Differencing

Paper • 2503.07860 • Published 8 days ago • 30
LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization

Paper • 2503.08619 • Published 8 days ago • 18
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models

Paper • 2503.08686 • Published 8 days ago • 16
Exploiting Instruction-Following Retrievers for Malicious Information Retrieval

Paper • 2503.08644 • Published 8 days ago • 16
Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru

Paper • 2503.07587 • Published 9 days ago • 10
"Principal Components" Enable A New Language of Images

Paper • 2503.08685 • Published 8 days ago • 11
^RFLAV: Rolling Flow matching for infinite Audio Video generation

Paper • 2503.08307 • Published 8 days ago • 8
BiasEdit: Debiasing Stereotyped Language Models via Model Editing

Paper • 2503.08588 • Published 8 days ago • 6
AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models

Paper • 2503.08417 • Published 8 days ago • 7
AI-native Memory 2.0: Second Me

Paper • 2503.08102 • Published 8 days ago • 6
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol

Paper • 2503.05860 • Published 12 days ago • 8
LocAgent: Graph-Guided LLM Agents for Code Localization

Paper • 2503.09089 • Published 7 days ago • 6
Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents

Paper • 2503.08684 • Published 8 days ago • 5
Referring to Any Person

Paper • 2503.08507 • Published 8 days ago • 6
More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG

Paper • 2503.04388 • Published 13 days ago • 15
Quantizing Large Language Models for Code Generation: A Differentiated Replication

Paper • 2503.07103 • Published 9 days ago • 6
Cost-Optimal Grouped-Query Attention for Long-Context LLMs

Paper • 2503.09579 • Published 7 days ago • 5
Self-Taught Self-Correction for Small Language Models

Paper • 2503.08681 • Published 8 days ago • 12
Multi Agent based Medical Assistant for Edge Devices

Paper • 2503.05397 • Published 12 days ago • 6
MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System

Paper • 2503.09600 • Published 7 days ago • 4
PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?

Paper • 2503.05333 • Published 12 days ago • 8
Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published 5 days ago • 26
API Agents vs. GUI Agents: Divergence and Convergence

Paper • 2503.11069 • Published 5 days ago • 26
Group-robust Machine Unlearning

Paper • 2503.09330 • Published 7 days ago • 1
Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published 3 days ago • 29
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Paper • 2503.12605 • Published 3 days ago • 22
Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation

Paper • 2503.13070 • Published 2 days ago • 6

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs