Nils Feldhus's picture

6 47

Nils Feldhus PRO

nfel

·

https://nfelnlp.github.io

AI & ML interests

Interpretability, Explainability, Natural Language Generation

Recent Activity

authored a paper 3 days ago

Inseq: An Interpretability Toolkit for Sequence Generation Models

authored a paper 3 days ago

Efficient Explanations from Empirical Explainers

authored a paper 3 days ago

LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools

View all activity

Organizations

nfel's activity

upvoted a paper 3 days ago

Do Large Language Models Latently Perform Multi-Hop Reasoning?

Paper • 2402.16837 • Published Feb 26, 2024 • 30

upvoted a paper about 1 month ago

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

Paper • 2501.08319 • Published Jan 14 • 11

upvoted a paper about 2 months ago

QE4PE: Word-level Quality Estimation for Human Post-Editing

Paper • 2503.03044 • Published Mar 4 • 6

upvoted a paper 12 months ago

A Primer on the Inner Workings of Transformer-based Language Models

Paper • 2405.00208 • Published Apr 30, 2024 • 10

upvoted a collection over 1 year ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 109 items • Updated 3 days ago • 99