Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
nthngdy 's Collections
Q-Filters

Q-Filters

updated Mar 3

Pre-computed Q-Filters for efficient KV cache compression.

Upvote
7

  • nthngdy/Llama-3.1-8B-Instruct_qfilt

    Updated Nov 28, 2024 • 181

  • nthngdy/Llama-3.2-1B-Instruct_qfilt

    Updated Nov 28, 2024 • 105

  • nthngdy/Llama-3.2-3B-Instruct_qfilt

    Updated Feb 6 • 103

  • nthngdy/Llama-3.2-3B_qfilt

    Updated Nov 28, 2024 • 102

  • nthngdy/Llama-3.1-8B_qfilt

    Updated Nov 28, 2024 • 102

  • nthngdy/Llama-3.1-70B-Instruct_qfilt

    Updated Mar 7 • 105

  • nthngdy/Llama-3.1-70B_qfilt

    Updated Feb 6 • 105

  • nthngdy/Meta-Llama-3.1-405B_qfilt

    Updated Feb 6 • 103

  • nthngdy/Mistral-Small-24B-Instruct-2501_qfilt

    Updated Feb 6 • 103

  • nthngdy/phi-4_qfilt

    Updated Feb 6 • 102

  • nthngdy/Llama-3.2-1B_qfilt

    Updated Nov 28, 2024 • 119

  • nthngdy/Qwen2.5-7B_qfilt

    Updated Feb 6 • 107

  • nthngdy/Qwen2.5-7B-Instruct_qfilt

    Updated Feb 6 • 6.91k

  • nthngdy/DeepSeek-R1-Distill-Llama-8B_qfilt

    Updated Mar 3 • 106

  • nthngdy/DeepSeek-R1-Distill-Qwen-1.5B_qfilt

    Updated Mar 3 • 106
Upvote
7
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs