Certified Mitigation of Worst-Case LLM Copyright Infringement Paper • 2504.16046 • Published 18 days ago
Pretraining Language Models for Diachronic Linguistic Change Discovery Paper • 2504.05523 • Published Apr 7
Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data Paper • 2404.03862 • Published Apr 5, 2024
AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees Paper • 2404.08417 • Published Apr 12, 2024
Dated Data: Tracing Knowledge Cutoffs in Large Language Models Paper • 2403.12958 • Published Mar 19, 2024
Every Language Counts: Learn and Unlearn in Multilingual LLMs Paper • 2406.13748 • Published Jun 19, 2024
Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell Paper • 2406.14673 • Published Jun 20, 2024
It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF Paper • 2406.07971 • Published Jun 12, 2024
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities Paper • 2410.07722 • Published Oct 10, 2024
SMHD: A Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions Paper • 1806.05258 • Published Jun 13, 2018
MultiTabQA: Generating Tabular Answers for Multi-Table Question Answering Paper • 2305.12820 • Published May 22, 2023
PARADE: Passage Representation Aggregation for Document Reranking Paper • 2008.09093 • Published Aug 20, 2020
BERT-QE: Contextualized Query Expansion for Document Re-ranking Paper • 2009.07258 • Published Sep 15, 2020
Pretrained Transformers for Text Ranking: BERT and Beyond Paper • 2010.06467 • Published Oct 13, 2020
Meta-Task Prompting Elicits Embedding from Large Language Models Paper • 2402.18458 • Published Feb 28, 2024