Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon • May 9, 2024 • 12
Inference Performance Optimization for Large Language Models on CPUs Paper • 2407.07304 • Published Jul 10, 2024 • 52
Accelerating Speculative Decoding using Dynamic Speculation Length Paper • 2405.04304 • Published May 7, 2024 • 2
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23, 2024 • 18
VALERIE22 -- A photorealistic, richly metadata annotated dataset of urban environments Paper • 2308.09632 • Published Aug 18, 2023
Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs Paper • 2210.10144 • Published Oct 18, 2022
3D Neural Network for Lung Cancer Risk Prediction on CT Volumes Paper • 2007.12898 • Published Jul 25, 2020
ABSApp: A Portable Weakly-Supervised Aspect-Based Sentiment Extraction System Paper • 1909.05608 • Published Sep 12, 2019
Term Set Expansion based NLP Architect by Intel AI Lab Paper • 1808.08953 • Published Aug 27, 2018 • 1
Term Set Expansion based on Multi-Context Term Embeddings: an End-to-end Workflow Paper • 1807.10104 • Published Jul 26, 2018 • 1