Sleeping 4 4 Plan2Align Test-Time Machine Translation 🌏 8B Test-Time Alignment for Machine Translation
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation Paper • 2502.13128 • Published 26 days ago • 37
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models Paper • 2501.06751 • Published Jan 12 • 31
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 42
HelpSteer2-Preference: Complementing Ratings with Preferences Paper • 2410.01257 • Published Oct 2, 2024 • 23
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study Paper • 2304.06762 • Published Apr 13, 2023 • 1
CrossNER: Evaluating Cross-Domain Named Entity Recognition Paper • 2012.04373 • Published Dec 8, 2020
ChatQA: Building GPT-4 Level Conversational QA Models Paper • 2401.10225 • Published Jan 18, 2024 • 36
Multi-Stage Prompting for Knowledgeable Dialogue Generation Paper • 2203.08745 • Published Mar 16, 2022
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs Paper • 2407.02485 • Published Jul 2, 2024 • 5
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities Paper • 2407.14482 • Published Jul 19, 2024 • 26
HelpSteer2: Open-source dataset for training top-performing reward models Paper • 2406.08673 • Published Jun 12, 2024 • 19
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models Paper • 2405.17428 • Published May 27, 2024 • 19
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper • 2405.01481 • Published May 2, 2024 • 30