2 6

Xuandong Zhao

Xuandong

https://xuandongzhao.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

authored a paper about 1 month ago

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

upvoted a paper about 1 month ago

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

View all activity

Organizations

Xuandong's activity

upvoted a paper 19 days ago

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Paper • 2504.13367 • Published 24 days ago • 24

authored a paper about 1 month ago

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

Paper • 2504.04715 • Published Apr 7 • 13

upvoted a paper about 1 month ago

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

Paper • 2504.04715 • Published Apr 7 • 13

commented a paper about 1 month ago

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

Paper • 2504.04715 • Published Apr 7 • 13 •

published a model 2 months ago

Xuandong/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Mar 7

upvoted a paper 3 months ago

The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1

Paper • 2502.12659 • Published Feb 18 • 7

upvoted a paper 7 months ago

Multimodal Situational Safety

Paper • 2410.06172 • Published Oct 8, 2024 • 11

authored a paper 7 months ago

Multimodal Situational Safety

Paper • 2410.06172 • Published Oct 8, 2024 • 11

upvoted a paper 9 months ago

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Paper • 2408.07060 • Published Aug 13, 2024 • 43

updated a Space 11 months ago

Unigram-Watermark

👀

authored 3 papers over 1 year ago

Compressing Sentence Representation for Semantic Retrieval via Homomorphic Projective Distillation

Paper • 2203.07687 • Published Mar 15, 2022

Protecting Language Generation Models via Invisible Watermarking

Paper • 2302.03162 • Published Feb 6, 2023

Provable Robust Watermarking for AI-Generated Text

Paper • 2306.17439 • Published Jun 30, 2023

upvoted a paper over 1 year ago

Weak-to-Strong Jailbreaking on Large Language Models

Paper • 2401.17256 • Published Jan 30, 2024 • 16

authored a paper over 1 year ago

Weak-to-Strong Jailbreaking on Large Language Models

Paper • 2401.17256 • Published Jan 30, 2024 • 16

updated 2 models about 3 years ago

Xuandong/HPD-TinyBERT-F128

Feature Extraction • Updated May 10, 2022 • 4 • 1

Xuandong/HPD-MiniLM-F128

Feature Extraction • Updated May 10, 2022 • 5