Ai2

Enterprise

non-profit

Verified

https://allenai.org/

allen_ai

allenai

AI & ML interests

Building breakthrough AI to solve the world's biggest problems.

Recent Activity

amanrangapur updated a model 2 minutes ago

allenai/OLMo-2-0325-32B

ljvmiranda921 authored a paper 5 days ago

MMTEB: Massive Multilingual Text Embedding Benchmark

ljvmiranda921 authored a paper 5 days ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Articles

Introducing the Open Chain of Thought Leaderboard

allenai's activity

amanrangapur

updated a model 2 minutes ago

allenai/OLMo-2-0325-32B

Text Generation • Updated 4 days ago • 132k • 31

shannons

authored a paper 5 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13 • 33

ljvmiranda921

authored 2 papers 5 days ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published 27 days ago • 32

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 7 days ago • 92

pradeepd

authored a paper 14 days ago

Large-Scale Data Selection for Instruction Tuning

Paper • 2503.01807 • Published 14 days ago • 11

yakazimir

authored a paper 19 days ago

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published Feb 3 • 17

armanc

authored a paper 25 days ago

TESS 2: A Large-Scale Generalist Diffusion Language Model

Paper • 2502.13917 • Published 26 days ago • 6

Muennighoff

authored a paper about 1 month ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 112

armanc

authored a paper about 2 months ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 84

armanc

authored a paper 2 months ago

ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning

Paper • 2501.06590 • Published Jan 11 • 11

liujch1998

authored 10 papers 2 months ago

Don't throw away your value model! Making PPO even better via Value-Guided Monte-Carlo Tree Search decoding

Paper • 2309.15028 • Published Sep 26, 2023 • 1

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

Paper • 2310.02255 • Published Oct 3, 2023 • 2

Crystal: Introspective Reasoners Reinforced with Self-Feedback

Paper • 2310.04921 • Published Oct 7, 2023 • 1

NaturalProofs: Mathematical Theorem Proving in Natural Language

Paper • 2104.01112 • Published Mar 24, 2021

Generated Knowledge Prompting for Commonsense Reasoning

Paper • 2110.08387 • Published Oct 15, 2021

Minds versus Machines: Rethinking Entailment Verification with Language Models

Paper • 2402.03686 • Published Feb 6, 2024 • 1

NaturalProver: Grounded Mathematical Proof Generation with Language Models

Paper • 2205.12910 • Published May 25, 2022

Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering

Paper • 2210.03078 • Published Oct 6, 2022 • 1

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Paper • 2406.09279 • Published Jun 13, 2024 • 2

AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text

Paper • 2410.04265 • Published Oct 5, 2024