Dongzhichen

DongJinn

AI & ML interests

None yet

Organizations

None yet

DongJinn's activity

upvoted 2 papers 4 days ago

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Paper • 2505.23693 • Published 5 days ago • 56

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published 5 days ago • 86

upvoted a paper 5 days ago

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Paper • 2505.22651 • Published 6 days ago • 50

upvoted 3 papers about 1 month ago

upvoted a paper 3 months ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published Mar 10 • 69

upvoted 13 papers 4 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 194

ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models

Paper • 2411.10867 • Published Nov 16, 2024 • 10

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published Nov 20, 2024 • 22

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Paper • 2411.11922 • Published Nov 18, 2024 • 19

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 16

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 35

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 31

Soft Robotic Dynamic In-Hand Pen Spinning

Paper • 2411.12734 • Published Nov 19, 2024 • 10

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning

Paper • 2411.10161 • Published Nov 15, 2024 • 9

Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages

Paper • 2411.12240 • Published Nov 19, 2024 • 7

Continuous Speculative Decoding for Autoregressive Image Generation

Paper • 2411.11925 • Published Nov 18, 2024 • 16

ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements

Paper • 2411.12044 • Published Nov 18, 2024 • 14

Building Trust: Foundations of Security, Safety and Transparency in AI

Paper • 2411.12275 • Published Nov 19, 2024 • 11