ToxicityPrompts/sample-data-tower-v2-PolyGuard-Train-Test-Sample-Hindi Viewer • Updated 9 days ago • 171 • 80
ToxicityPrompts/sample-data-tower-v2-PolyGuard-Train-Test-Sample-Hindi Viewer • Updated 9 days ago • 171 • 80
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Paper • 2412.14161 • Published Dec 18, 2024 • 51
SciDr at SDU-2020: IDEAS -- Identifying and Disambiguating Everyday Acronyms for Scientific Domain Paper • 2102.08818 • Published Feb 17, 2021
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents Paper • 2410.13886 • Published Oct 11, 2024
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models Paper • 2406.18510 • Published Jun 26, 2024 • 9
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models Paper • 2405.09373 • Published May 15, 2024 • 1
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models Paper • 2405.09373 • Published May 15, 2024 • 1
SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents Paper • 2403.08715 • Published Mar 13, 2024 • 21
WebArena: A Realistic Web Environment for Building Autonomous Agents Paper • 2307.13854 • Published Jul 25, 2023 • 25