ToxicityPrompts

university

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

kpriyanshu256 published a dataset 9 days ago

ToxicityPrompts/sample-data-tower-v2-PolyGuard-Train-Test-Sample-Hindi

kpriyanshu256 updated a dataset 9 days ago

ToxicityPrompts/sample-data-tower-v2-PolyGuard-Train-Test-Sample-Hindi

kpriyanshu256 updated a model 29 days ago

ToxicityPrompts/duoguard-vllm

View all activity

ToxicityPrompts's activity

kpriyanshu256

published a dataset 9 days ago

ToxicityPrompts/sample-data-tower-v2-PolyGuard-Train-Test-Sample-Hindi

Viewer • Updated 9 days ago • 171 • 80

kpriyanshu256

updated a dataset 9 days ago

ToxicityPrompts/sample-data-tower-v2-PolyGuard-Train-Test-Sample-Hindi

Viewer • Updated 9 days ago • 171 • 80

kpriyanshu256

updated a model 29 days ago

ToxicityPrompts/duoguard-vllm

Updated 29 days ago • 62

kpriyanshu256

published a model 29 days ago

ToxicityPrompts/duoguard-vllm

Updated 29 days ago • 62

himanshubeniwal

published a model about 1 month ago

ToxicityPrompts/PolyGuard-Qwen

Updated Dec 31, 2024 • 47

kpriyanshu256

updated 2 models 2 months ago

ToxicityPrompts/PolyGuard-Ministral

Updated Jan 9 • 13

ToxicityPrompts/mWildGuard-Qwen-v2-9-LoRA

Updated Jan 9 • 7

Xuhui

authored a paper 3 months ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 51

kpriyanshu256

authored 2 papers 4 months ago

SciDr at SDU-2020: IDEAS -- Identifying and Disambiguating Everyday Acronyms for Scientific Domain

Paper • 2102.08818 • Published Feb 17, 2021

Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents

Paper • 2410.13886 • Published Oct 11, 2024

maartensap

authored a paper 9 months ago

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Paper • 2406.18510 • Published Jun 26, 2024 • 9

devanshrj

authored a paper 10 months ago

PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models

Paper • 2405.09373 • Published May 15, 2024 • 1

kpriyanshu256

authored a paper 10 months ago

PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models

Paper • 2405.09373 • Published May 15, 2024 • 1

maartensap

authored a paper about 1 year ago

SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents

Paper • 2403.08715 • Published Mar 13, 2024 • 21

Xuhui

authored a paper over 1 year ago

WebArena: A Realistic Web Environment for Building Autonomous Agents

Paper • 2307.13854 • Published Jul 25, 2023 • 25

AI & ML interests

Recent Activity

Team members 8

ToxicityPrompts's activity