r's picture

r PRO

oceansweep

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 11 hours ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

upvoted a paper about 12 hours ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

liked a model 1 day ago

nvidia/parakeet-tdt-0.6b-v2

View all activity

Organizations

None yet

oceansweep's activity

upvoted a paper about 11 hours ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published 3 days ago • 69

upvoted a paper about 12 hours ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 2 days ago • 67

upvoted a paper 7 days ago

UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Paper • 2504.20734 • Published 9 days ago • 60

upvoted a paper 9 days ago

Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family

Paper • 2504.18225 • Published 13 days ago • 12

upvoted a paper 12 days ago

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published 14 days ago • 86

upvoted 5 papers 15 days ago

RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search

Paper • 2504.15047 • Published 17 days ago • 6

EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models

Paper • 2504.15133 • Published 17 days ago • 21

X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents

Paper • 2504.13203 • Published 23 days ago • 31

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published 21 days ago • 43

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published 17 days ago • 46

upvoted a collection 24 days ago

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated 23 days ago • 123

upvoted 6 papers 29 days ago

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

Paper • 2504.04715 • Published Apr 7 • 13

Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models

Paper • 2504.04823 • Published about 1 month ago • 30

URECA: Unique Region Caption Anything

Paper • 2504.05305 • Published about 1 month ago • 36

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published about 1 month ago • 102

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published about 1 month ago • 180

Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published Apr 5 • 77

upvoted 3 papers about 1 month ago

YourBench: Easy Custom Evaluation Sets for Everyone

Paper • 2504.01833 • Published Apr 2 • 20

Command A: An Enterprise-Ready Large Language Model

Paper • 2504.00698 • Published Apr 1 • 26

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published Apr 1 • 36