Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mehmetcanbudak 's Collections
arXiv

arXiv

updated Jan 10, 2024
Upvote
-

  • LoRA: Low-Rank Adaptation of Large Language Models

    Paper • 2106.09685 • Published Jun 17, 2021 • 39

  • Attention Is All You Need

    Paper • 1706.03762 • Published Jun 12, 2017 • 61

  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Paper • 2305.18290 • Published May 29, 2023 • 58

  • Lost in the Middle: How Language Models Use Long Contexts

    Paper • 2307.03172 • Published Jul 6, 2023 • 40

  • Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

    Paper • 2005.11401 • Published May 22, 2020 • 11

  • FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

    Paper • 2205.14135 • Published May 27, 2022 • 13

  • Not All Attention Is All You Need

    Paper • 2104.04692 • Published Apr 10, 2021 • 1

  • Llama 2: Open Foundation and Fine-Tuned Chat Models

    Paper • 2307.09288 • Published Jul 18, 2023 • 243

  • Mistral 7B

    Paper • 2310.06825 • Published Oct 10, 2023 • 48

  • QLoRA: Efficient Finetuning of Quantized LLMs

    Paper • 2305.14314 • Published May 23, 2023 • 52
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs