Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sbrandeis 's Collections
Papers to read - General
Papers to read - Reinforcement Learning
Papers to read - Diffusion

Papers to read - General

updated Apr 9, 2024

Papers I want to read, at some point.

Upvote
4

  • Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation

    Paper • 2108.12409 • Published Aug 27, 2021 • 5

  • YaRN: Efficient Context Window Extension of Large Language Models

    Paper • 2309.00071 • Published Aug 31, 2023 • 71

  • MIMIC-IT: Multi-Modal In-Context Instruction Tuning

    Paper • 2306.05425 • Published Jun 8, 2023 • 11

  • Music ControlNet: Multiple Time-varying Controls for Music Generation

    Paper • 2311.07069 • Published Nov 13, 2023 • 45

  • Memory Augmented Language Models through Mixture of Word Experts

    Paper • 2311.10768 • Published Nov 15, 2023 • 18

  • Positional Description Matters for Transformers Arithmetic

    Paper • 2311.14737 • Published Nov 22, 2023 • 2

  • Mamba: Linear-Time Sequence Modeling with Selective State Spaces

    Paper • 2312.00752 • Published Dec 1, 2023 • 143

  • QuIP: 2-Bit Quantization of Large Language Models With Guarantees

    Paper • 2307.13304 • Published Jul 25, 2023 • 2
Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs