Tang

lzZzZx328

AI & ML interests

None yet

Recent Activity

upvoted an article 10 days ago

Tiny Agents: a MCP-powered agent in 50 lines of code

upvoted a paper about 2 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

upvoted an article about 2 months ago

Run ComfyUI workflows for free on Spaces

View all activity

Organizations

None yet

lzZzZx328's activity

upvoted an article 10 days ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

20 days ago

• 234

upvoted a paper about 2 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 78

upvoted 2 articles about 2 months ago

Article

Run ComfyUI workflows for free on Spaces

Jan 14, 2024

• 76

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 250

upvoted a paper about 2 months ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3 • 78

upvoted 5 papers 2 months ago

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 50

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published Feb 26 • 49

upvoted a paper 3 months ago

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20 • 100

upvoted 5 papers 4 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 276

Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains

Paper • 2501.05707 • Published Jan 10 • 20

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published Jan 10 • 72

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 68

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 99

upvoted 2 papers 5 months ago

MALT: Improving Reasoning with Multi-Agent LLM Training

Paper • 2412.01928 • Published Dec 2, 2024 • 45

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 64

upvoted a collection 6 months ago

📑 Trending Papers - October 🔟

Collection

10 items • Updated Mar 28 • 6

upvoted a paper 6 months ago

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Paper • 2410.23918 • Published Oct 31, 2024 • 20