Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon • May 9, 2024 • 12
Dynamic-TinyBERT: Boost TinyBERT's Inference Efficiency by Dynamic Sequence Length • Paper • 2111.09645 • Published Nov 18, 2021
Intel/bert-base-uncased-squadv1.1-sparse-80-1x4-block-pruneofa • Question Answering • Updated Apr 12, 2023 • 103
Intel/distilbert-base-uncased-sparse-90-unstructured-pruneofa • Fill-Mask • Updated Apr 11, 2023 • 27 • 2