STAIR - a thu-ml Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

thu-ml 's Collections

STAIR

STAIR

updated 19 days ago

Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning)

thu-ml/STAIR-Llama-3.1-8B-SFT

Text Generation • Updated 20 days ago • 39
thu-ml/STAIR-Qwen2-7B-SFT

Text Generation • Updated 20 days ago • 46 • 1
thu-ml/STAIR-SFT

Viewer • Updated 20 days ago • 20k • 111
thu-ml/STAIR-Prompts

Viewer • Updated 20 days ago • 63k • 93
STAIR: Improving Safety Alignment with Introspective Reasoning

Paper • 2502.02384 • Published Feb 4
thu-ml/STAIR-Qwen2-7B-DPO-3

Text Generation • Updated 19 days ago • 31 • 1
thu-ml/STAIR-Llama-3.1-8B-DPO-3

Text Generation • Updated 19 days ago • 32

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs