Cascading Adversarial Bias from Injection to Distillation in Language Models Paper • 2505.24842 • Published 4 days ago • 5
Lessons from Defending Gemini Against Indirect Prompt Injections Paper • 2505.14534 • Published 14 days ago • 8
Fixing 7,400 Bugs for 1$: Cheap Crash-Site Program Repair Paper • 2505.13103 • Published 15 days ago • 6
Operationalizing Contextual Integrity in Privacy-Conscious Assistants Paper • 2408.02373 • Published Aug 5, 2024 • 5