Haihao Shen

Haihao

AI & ML interests

LLM quantization, sparsity, and acceleration

Recent Activity

Organizations

Intel's profile picture Need4Speed's profile picture Qwen's profile picture Open Platform for Enterprise AI's profile picture arcee-intel-colab's profile picture

Haihao's activity

upvoted an article about 20 hours ago
view article
Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

By wenhuach and 8 others
11
published an article 1 day ago
view article
Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

By wenhuach and 8 others
11
reacted to wenhuach's post with 🚀 4 months ago
view post
Post
345
This week, OPEA Space released several new INT4 models, including:
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
allenai/OLMo-2-1124-13B-Instruct
THUDM/glm-4v-9b
AIDC-AI/Marco-o1
and several others.
Let us know which models you'd like prioritized for quantization, and we'll do our best to make it happen!

OPEA
  • 3 replies
·