thu-ml's Collections
STAIR
Updated 19 days ago
Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning)
thu-ml/STAIR-Llama-3.1-8B-SFT • Text Generation • Updated 20 days ago • 39
thu-ml/STAIR-Qwen2-7B-SFT • Text Generation • Updated 20 days ago • 46 • 1
thu-ml/STAIR-SFT • Viewer • Updated 20 days ago • 20k • 111
thu-ml/STAIR-Prompts • Viewer • Updated 20 days ago • 63k • 93
STAIR: Improving Safety Alignment with Introspective Reasoning • Paper • 2502.02384 • Published Feb 4
thu-ml/STAIR-Qwen2-7B-DPO-3 • Text Generation • Updated 19 days ago • 31 • 1
thu-ml/STAIR-Llama-3.1-8B-DPO-3 • Text Generation • Updated 19 days ago • 32