Phillip Guo

PhillipGuo

AI & ML interests

Interp, Unlearning, Editing

Recent Activity

updated a dataset 4 months ago
PhillipGuo/wmdp-deduped-unlearn
updated a model 4 months ago
PhillipGuo/gemma-2-sae-gd-fullrank
View all activity

Organizations

Truthfulness & Deception Research Team's profile picture quirky-lats-at-mats's profile picture LLM Latent Adversarial Training's profile picture