NVIDIA

Enterprise

company

Verified

https://www.nvidia.com/

nvidia

AI & ML interests

None defined yet.

Recent Activity

leoye authored a paper 25 days ago

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

amalad authored a paper about 1 month ago

Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models

amalad authored a paper about 1 month ago

Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents

View all activity

Articles

Mastering Long Contexts in LLMs with KVPress

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

nvidia's activity

huckiyang

updated a Space 1 day ago

Plan2Align Test-Time Machine Translation

8B Test-Time Alignment for Machine Translation

zihanliu

authored a paper 24 days ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published 26 days ago • 37

IdoGalilNvidia

authored a paper about 2 months ago

Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models

Paper • 2501.06751 • Published Jan 12 • 31

IdoGalilNvidia

authored a paper 3 months ago

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Paper • 2411.19146 • Published Nov 28, 2024 • 17

matthijsvk

authored a paper 4 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 42

okuchaiev

authored a paper 5 months ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2, 2024 • 23

zihanliu

authored 10 papers 6 months ago

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study

Paper • 2304.06762 • Published Apr 13, 2023 • 1

Retrieval meets Long Context Large Language Models

Paper • 2310.03025 • Published Oct 4, 2023 • 4

XPersona: Evaluating Multilingual Personalized Chatbot

Paper • 2003.07568 • Published Mar 17, 2020

CrossNER: Evaluating Cross-Domain Named Entity Recognition

Paper • 2012.04373 • Published Dec 8, 2020

Are Multilingual Models Effective in Code-Switching?

Paper • 2103.13309 • Published Mar 24, 2021

ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18, 2024 • 36

Multi-Stage Prompting for Knowledgeable Dialogue Generation

Paper • 2203.08745 • Published Mar 16, 2022

Nemotron-4 340B Technical Report

Paper • 2406.11704 • Published Jun 17, 2024

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

Paper • 2407.02485 • Published Jul 2, 2024 • 5

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 73

zihanliu

authored a paper 8 months ago

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Paper • 2407.14482 • Published Jul 19, 2024 • 26

okuchaiev

authored a paper 9 months ago

HelpSteer2: Open-source dataset for training top-performing reward models

Paper • 2406.08673 • Published Jun 12, 2024 • 19

Mengyao00

authored a paper 10 months ago

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Paper • 2405.17428 • Published May 27, 2024 • 19

okuchaiev

authored a paper 11 months ago

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Paper • 2405.01481 • Published May 2, 2024 • 30