FormalMATH

university

https://scholar.google.com/citations?user=qUMjnPcAAAAJ&hl=en

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

prt66 authored a paper 4 days ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

zhouliang authored a paper 4 days ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

happzy2633 authored a paper 16 days ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

View all activity

FOMA-colm's activity

prt66

authored a paper 4 days ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Paper • 2505.02735 • Published 4 days ago • 25

zhouliang

authored a paper 4 days ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Paper • 2505.02735 • Published 4 days ago • 25

happzy2633

authored a paper 16 days ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published 18 days ago • 22

zhouliang

authored a paper 19 days ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

Paper • 2504.11354 • Published 24 days ago • 2

ringringdang

updated a dataset 26 days ago

FOMA-colm/goedel-lite-3200

Preview • Updated 26 days ago • 24

ringringdang

published a dataset 26 days ago

FOMA-colm/goedel-lite-3200

Preview • Updated 26 days ago • 24

zhouliang

authored a paper 30 days ago

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29, 2024 • 50

zhouliang

authored 2 papers about 1 month ago

CodeEditorBench: Evaluating Code Editing Capability of Large Language Models

Paper • 2404.03543 • Published Apr 4, 2024 • 18

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

ringringdang

updated a dataset about 1 month ago

FOMA-colm/break0328

Viewer • Updated Mar 28 • 496 • 8

ringringdang

published a dataset about 1 month ago

FOMA-colm/break0328

Viewer • Updated Mar 28 • 496 • 8

zhouliang

updated a collection 2 months ago

Lean4 Dataset

Collection

3 items • Updated Mar 7

zhouliang

authored a paper 2 months ago

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Paper • 2404.04167 • Published Apr 5, 2024 • 14

happzy2633

authored a paper 2 months ago

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published Feb 23 • 27

zgao3186

authored a paper 3 months ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20 • 48

zhouliang

authored a paper 3 months ago

Generating Symbolic World Models via Test-time Scaling of Large Language Models

Paper • 2502.04728 • Published Feb 7 • 19

prt66

authored a paper 9 months ago

Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling

Paper • 2408.03695 • Published Aug 7, 2024 • 13

AI & ML interests

Recent Activity

Team members 5

FOMA-colm's activity