
Jintao Zhang

jt-zhang
https://jt-zhang.github.io/
  • jtzhang-6

AI & ML interests

Efficient ML

Recent Activity

authored a paper 4 days ago
SageAttention2++: A More Efficient Implementation of SageAttention2
updated a collection 4 days ago
efficient ml
upvoted a paper 4 days ago
SageAttention2++: A More Efficient Implementation of SageAttention2

Organizations

None yet

Collections 1

efficient ml
  • SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

    Paper • 2411.10958 • Published Nov 17, 2024 • 56
  • SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

    Paper • 2502.18137 • Published Feb 25, 2025 • 57
  • SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

    Paper • 2505.11594 • Published 16 days ago • 67
  • SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

    Paper • 2410.02367 • Published Oct 3, 2024 • 51

Papers 10

arxiv:2505.21136
arxiv:2505.18875
arxiv:2505.11594
arxiv:2503.08040

Models 0

None public yet

Datasets 0

None public yet