Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zhongyuan peng's picture
2 5 3

zhongyuan peng

happzy2633
ruffclub's profile picture 21world's profile picture Evi1ran's profile picture
·
  • Happzy-WHU

AI & ML interests

None yet

Recent Activity

authored a paper 18 days ago
IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs
liked a dataset 2 months ago
m-a-p/SuperGPQA
authored a paper 2 months ago
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models
View all activity

Organizations

Multimodal Art Projection's profile picture FormalMATH's profile picture

happzy2633's activity

upvoted a paper 3 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103
upvoted a paper 6 months ago

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Paper • 2411.07140 • Published Nov 11, 2024 • 35
upvoted 2 papers 7 months ago

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

Paper • 2410.13639 • Published Oct 17, 2024 • 19

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models

Paper • 2410.11710 • Published Oct 15, 2024 • 20
upvoted a paper 8 months ago

FuzzCoder: Byte-level Fuzzing Test via Large Language Model

Paper • 2409.01944 • Published Sep 3, 2024 • 46
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs