Josh Harris

jah242

jah242's activity

upvoted a paper 3 days ago

Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

Paper • 2505.06046 • Published 6 days ago • 11

upvoted 2 papers 11 months ago

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 47

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6, 2024 • 40

upvoted a paper almost 2 years ago

Challenges and Applications of Large Language Models

Paper • 2307.10169 • Published Jul 19, 2023 • 48