MPCHAT: Towards Multimodal Persona-Grounded Conversation Paper • 2305.17388 • Published May 27, 2023 • 1
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning Paper • 2404.04682 • Published Apr 6, 2024
Running 2.65k 2.65k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters