ChaiwenJ's picture

22 4

ChaiwenJ

Puggyoll

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

upvoted a paper 4 days ago

Table-R1: Inference-Time Scaling for Table Reasoning

upvoted a paper 5 days ago

Sherlock: Self-Correcting Reasoning in Vision-Language Models

View all activity

Organizations

None yet

Puggyoll's activity

upvoted 2 papers 4 days ago

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Paper • 2505.23693 • Published 5 days ago • 56

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published 5 days ago • 86

upvoted a paper 5 days ago

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Paper • 2505.22651 • Published 6 days ago • 50

upvoted a paper 6 days ago

OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Paper • 2505.18445 • Published 11 days ago • 63

upvoted a paper 29 days ago

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Paper • 2504.20438 • Published Apr 29 • 43

upvoted a paper 3 months ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published Mar 10 • 69

upvoted a paper 4 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 194

liked 4 models 4 months ago

meta-llama/Llama-3.2-1B

Text Generation • Updated Oct 24, 2024 • 1.19M • 1.93k

BAAI/bge-m3

Sentence Similarity • Updated Jul 3, 2024 • 3.52M • • 2.09k

stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 3.07M • • 6.63k

deepseek-ai/DeepSeek-R1-Distill-Qwen-14B

Text Generation • Updated Feb 24 • 521k • • 516

upvoted 9 papers 4 months ago

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

Paper • 2409.11136 • Published Sep 17, 2024 • 24

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published Sep 17, 2024 • 20

On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 14

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13, 2024 • 13

Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types

Paper • 2409.09269 • Published Sep 14, 2024 • 9

jina-embeddings-v3: Multilingual Embeddings With Task LoRA

Paper • 2409.10173 • Published Sep 16, 2024 • 33

One missing piece in Vision and Language: A Survey on Comics Understanding

Paper • 2409.09502 • Published Sep 14, 2024 • 26

Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models

Paper • 2409.06277 • Published Sep 10, 2024 • 16

Click2Mask: Local Editing with Dynamic Mask Generation

Paper • 2409.08272 • Published Sep 12, 2024 • 6