Chenj's picture

30 5

Chenj

Tsuchen

AI & ML interests

CV

Recent Activity

upvoted a paper 4 days ago

GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control

upvoted a paper 4 days ago

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

upvoted a paper 4 days ago

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

View all activity

Organizations

None yet

Tsuchen's activity

upvoted 4 papers 4 days ago

GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control

Paper • 2505.22421 • Published 6 days ago • 11

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Paper • 2505.23606 • Published 5 days ago • 14

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Paper • 2505.23693 • Published 5 days ago • 56

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published 5 days ago • 86

upvoted 3 papers 6 days ago

MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

Paper • 2505.18943 • Published 10 days ago • 24

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Paper • 2505.21333 • Published 7 days ago • 38

OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Paper • 2505.18445 • Published 11 days ago • 63

upvoted a paper 3 months ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published Mar 10 • 69

upvoted 12 papers 4 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 194

Closed-loop Long-horizon Robotic Planning via Equilibrium Sequence Modeling

Paper • 2410.01440 • Published Oct 2, 2024 • 4

SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios

Paper • 2410.01481 • Published Oct 2, 2024 • 3

InfiniPot: Infinite Context Processing on Memory-Constrained LLMs

Paper • 2410.01518 • Published Oct 2, 2024 • 3

VLMGuard: Defending VLMs against Malicious Prompts via Unlabeled Data

Paper • 2410.00296 • Published Oct 1, 2024 • 6

HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration

Paper • 2410.01723 • Published Oct 2, 2024 • 5

Old Optimizer, New Norm: An Anthology

Paper • 2409.20325 • Published Sep 30, 2024 • 4

EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis

Paper • 2410.01804 • Published Oct 2, 2024 • 7

EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control

Paper • 2410.00316 • Published Oct 1, 2024 • 7

BordIRlines: A Dataset for Evaluating Cross-lingual Retrieval-Augmented Generation

Paper • 2410.01171 • Published Oct 2, 2024 • 6

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published Oct 3, 2024 • 9

Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis

Paper • 2409.20059 • Published Sep 30, 2024 • 16